METHOD AND APPARATUS FOR QR-FACTORIZING MATRIX ON A MULTIPROCESSOR SYSTEM
    1.
    发明申请
    METHOD AND APPARATUS FOR QR-FACTORIZING MATRIX ON A MULTIPROCESSOR SYSTEM 失效
    用于在多处理器系统上进行QR加工矩阵的方法和装置

    公开(公告)号:US20090235049A1

    公开(公告)日:2009-09-17

    申请号:US12402780

    申请日:2009-03-12

    申请人: Hui Li Bai Ling Wang

    发明人: Hui Li Bai Ling Wang

    CPC分类号: G06F17/16

    摘要: The present invention provides a method and apparatus for QR-factorizing matrix on a multiprocessor system, wherein the multiprocessor system comprises at least one core processor and a plurality of accelerators, the method comprises the steps of: iteratively factorizing each panel in the matrix until the whole matrix is factorized; wherein in each iteration, the method comprises: partitioning an unprocessed matrix part in the matrix into a plurality of blocks according to a predetermined block size; partitioning a current processed panel in the unprocessed matrix part into at least two sub panels, wherein the current processed panel is composed of a plurality of blocks; and performing QR factorization one by one on the at least two sub panels with the plurality of accelerators, and updating the data of the sub panel(s) on which no QR factorization has been performed among the at least two sub panels by using the factorization result. The present invention enables a multiprocessor system having a high computing capability to be applied to the matrix QR factorization having a large amount of computation tasks.

    摘要翻译: 本发明提供了一种用于在多处理器系统上进行QR分解矩阵的方法和装置,其中所述多处理器系统包括至少一个核心处理器和多个加速器,所述方法包括以下步骤:对矩阵中的每个面板进行迭代因式分解,直到 整体矩阵被分解; 其中在每次迭代中,所述方法包括:根据预定的块大小将矩阵中未处理的矩阵部分划分成多个块; 将未处理的矩阵部分中的当前处理的面板划分为至少两个子面板,其中当前处理面板由多个块组成; 并且利用所述多个加速器在所述至少两个子面板上逐个地执行QR因式分解,并且通过使用因式分解来更新在所述至少两个子面板中未执行QR因子分解的子面板的数据 结果。 本发明能够将具有高计算能力的多处理器系统应用于具有大量计算任务的矩阵QR因式分解。

    Method and apparatus for QR-factorizing matrix on a multiprocessor system
    2.
    发明授权
    Method and apparatus for QR-factorizing matrix on a multiprocessor system 失效
    在多处理器系统上进行QR分解矩阵的方法和装置

    公开(公告)号:US08543626B2

    公开(公告)日:2013-09-24

    申请号:US13559885

    申请日:2012-07-27

    申请人: Hui Li Bai Ling Wang

    发明人: Hui Li Bai Ling Wang

    CPC分类号: G06F17/16

    摘要: A method and apparatus for QR-factorizing matrix on a multiprocessor system, wherein the multiprocessor system comprises at least one core processor and a plurality of accelerators, comprises the steps of: iteratively factorizing each panel in the matrix until the whole matrix is factorized; wherein in each iteration, the method comprises: partitioning an unprocessed matrix part in the matrix into a plurality of blocks according to a predetermined block size; partitioning a current processed panel in the unprocessed matrix part into at least two sub panels, wherein the current processed panel is composed of a plurality of blocks; and performing QR factorization one by one on the at least two sub panels with the plurality of accelerators, and updating the data of the sub panel(s) on which no QR factorization has been performed among the at least two sub panels by using the factorization result.

    摘要翻译: 一种用于在多处理器系统上QR分解矩阵的方法和装置,其中所述多处理器系统包括至少一个核心处理器和多个加速器,包括以下步骤:对所述矩阵中的每个面板进行迭代因式分解,直到整个矩阵被分解为止; 其中在每次迭代中,所述方法包括:根据预定的块大小将矩阵中未处理的矩阵部分划分成多个块; 将未处理的矩阵部分中的当前处理的面板划分为至少两个子面板,其中当前处理面板由多个块组成; 并且利用所述多个加速器在所述至少两个子面板上逐个地执行QR因式分解,并且通过使用因式分解来更新在所述至少两个子面板中未执行QR因子分解的子面板的数据 结果。

    Method and apparatus for QR-factorizing matrix on multiprocessor system
    3.
    发明授权
    Method and apparatus for QR-factorizing matrix on multiprocessor system 失效
    多因素系统QR分解矩阵的方法和装置

    公开(公告)号:US08296350B2

    公开(公告)日:2012-10-23

    申请号:US12402780

    申请日:2009-03-12

    申请人: Hui Li Bai Ling Wang

    发明人: Hui Li Bai Ling Wang

    CPC分类号: G06F17/16

    摘要: The present invention provides a method and apparatus for QR-factorizing matrix on a multiprocessor system, wherein the multiprocessor system comprises at least one core processor and a plurality of accelerators, the method comprises the steps of: iteratively factorizing each panel in the matrix until the whole matrix is factorized; wherein in each iteration, the method comprises: partitioning an unprocessed matrix part in the matrix into a plurality of blocks according to a predetermined block size; partitioning a current processed panel in the unprocessed matrix part into at least two sub panels, wherein the current processed panel is composed of a plurality of blocks; and performing QR factorization one by one on the at least two sub panels with the plurality of accelerators, and updating the data of the sub panel(s) on which no QR factorization has been performed among the at least two sub panels by using the factorization result. The present invention enables a multiprocessor system having a high computing capability to be applied to the matrix QR factorization having a large amount of computation tasks.

    摘要翻译: 本发明提供了一种用于在多处理器系统上进行QR分解矩阵的方法和装置,其中所述多处理器系统包括至少一个核心处理器和多个加速器,所述方法包括以下步骤:对矩阵中的每个面板进行迭代因式分解,直到 整体矩阵被分解; 其中在每次迭代中,所述方法包括:根据预定的块大小将矩阵中未处理的矩阵部分划分成多个块; 将未处理的矩阵部分中的当前处理的面板划分为至少两个子面板,其中当前处理面板由多个块组成; 并且利用所述多个加速器在所述至少两个子面板上逐个地执行QR因式分解,并且通过使用因式分解来更新在所述至少两个子面板中未执行QR因子分解的子面板的数据 结果。 本发明能够将具有高计算能力的多处理器系统应用于具有大量计算任务的矩阵QR因式分解。

    Invoking Multi-Library Applications on a Multiple Processor System
    4.
    发明申请
    Invoking Multi-Library Applications on a Multiple Processor System 失效
    在多处理器系统上调用多库应用程序

    公开(公告)号:US20120204003A1

    公开(公告)日:2012-08-09

    申请号:US13452129

    申请日:2012-04-20

    IPC分类号: G06F15/76 G06F9/02

    摘要: A mechanism is provided for invoking multi-library application on a multiple processor system, wherein the multiple processor system comprises a Power Processing Element (PPE) and a plurality of Synergistic Processing Element (SPE). Applications including multi-libraries run in the memory of the PPEs. The mechanism comprises maintaining the status of each SPE in the applications running on the PPE, where there are SPE agents for capturing the instructions from the PPE in the SPEs that have been started. In response to a request for invoking a library, the PPE determines whether the number of available SPEs for invoking the library is adequate based on the current status of SPEs. If the number of available SPEs is adequate, the PPE sends a run instruction to selected SPEs. After finishing the invocation of all libraries, the PPE sends termination instructions to all started SPEs.

    摘要翻译: 提供了一种用于在多处理器系统上调用多库应用的机制,其中所述多处理器系统包括功率处理元件(PPE)和多个协同处理元件(SPE)。 包含多个库的应用程序运行在PPE的内存中。 该机制包括在PPE上运行的应用程序中维护每个SPE的状态,其中有SPE代理用于捕获已经启动的SPE中的PPE中的指令。 响应于调用库的请求,PPE基于SPE的当前状态来确定用于调用库的可用SPE的数量是否足够。 如果可用SPE的数量足够,则PPE向选定的SPE发送运行指令。 完成所有库的调用后,PPE向所有启动的SPE发送终止指令。

    Invoking Multi-Library Applications on a Multiple Processor System
    5.
    发明申请
    Invoking Multi-Library Applications on a Multiple Processor System 审中-公开
    在多处理器系统上调用多库应用程序

    公开(公告)号:US20100058029A1

    公开(公告)日:2010-03-04

    申请号:US12549505

    申请日:2009-08-28

    IPC分类号: G06F15/76 G06F9/06

    摘要: A mechanism is provided for invoking a multi-library application on a multiple processor system, wherein the multiple processor system comprises a Power Processing Element (PPE) and a plurality of Synergistic Processing Element (SPE). Applications including multi-libraries run in the memory of the PPE. The mechanism comprises maintaining the status of each SPE in the application running on the PPE, where there are SPE agents for capturing the instructions from the PPE in the SPEs that have been started. In response to a request for invoking a library, the PPE determines whether the number of available SPEs for invoking the library is adequate based on the current status of SPEs. If the number of available SPEs is adequate, the PPE sends a run instruction to selected SPEs. After finishing the invocation of all libraries, the PPE sends termination instructions to all started SPEs. IBM confidential

    摘要翻译: 提供了一种用于在多处理器系统上调用多库应用的机制,其中所述多处理器系统包括功率处理元件(PPE)和多个协同处理元件(SPE)。 包含多个库的应用程序运行在PPE的内存中。 该机制包括在PPE上运行的应用程序中维护每个SPE的状态,其中有SPE代理用于捕获已经启动的SPE中的PPE中的指令。 响应于调用库的请求,PPE基于SPE的当前状态来确定用于调用库的可用SPE的数量是否足够。 如果可用SPE的数量足够,则PPE向选定的SPE发送运行指令。 完成所有库的调用后,PPE向所有启动的SPE发送终止指令。 IBM保密

    Starting other processing elements (OPEs) responsive to a determination that currently-running OPEs are inadequate to invoke a library for a multi-library application
    6.
    发明授权
    Starting other processing elements (OPEs) responsive to a determination that currently-running OPEs are inadequate to invoke a library for a multi-library application 失效
    响应于确定当前运行的OPE不足以为多库应用调用库的响应,开始其他处理元件(OPE)

    公开(公告)号:US08694756B2

    公开(公告)日:2014-04-08

    申请号:US13452129

    申请日:2012-04-20

    IPC分类号: G06F15/76

    摘要: A mechanism is provided for invoking multi-library application on a multiple processor system, wherein the multiple processor system comprises a Power Processing Element (PPE) and a plurality of Synergistic Processing Element (SPE). Applications including multi-libraries run in the memory of the PPEs. The mechanism comprises maintaining the status of each SPE in the applications running on the PPE, where there are SPE agents for capturing the instructions from the PPE in the SPEs that have been started. In response to a request for invoking a library, the PPE determines whether the number of available SPEs for invoking the library is adequate based on the current status of SPEs. If the number of available SPEs is adequate, the PPE sends a run instruction to selected SPEs. After finishing the invocation of all libraries, the PPE sends termination instructions to all started SPEs.

    摘要翻译: 提供了一种用于在多处理器系统上调用多库应用的机制,其中所述多处理器系统包括功率处理元件(PPE)和多个协同处理元件(SPE)。 包含多个库的应用程序运行在PPE的内存中。 该机制包括在PPE上运行的应用程序中维护每个SPE的状态,其中有SPE代理用于捕获已经启动的SPE中的PPE中的指令。 响应于调用库的请求,PPE基于SPE的当前状态来确定用于调用库的可用SPE的数量是否足够。 如果可用SPE的数量足够,则PPE向选定的SPE发送运行指令。 完成所有库的调用后,PPE向所有启动的SPE发送终止指令。

    METHOD AND APPARATUS FOR QR-FACTORIZING MATRIX ON A MULTIPROCESSOR SYSTEM
    7.
    发明申请
    METHOD AND APPARATUS FOR QR-FACTORIZING MATRIX ON A MULTIPROCESSOR SYSTEM 失效
    用于在多处理器系统上进行QR加工矩阵的方法和装置

    公开(公告)号:US20120296950A1

    公开(公告)日:2012-11-22

    申请号:US13559885

    申请日:2012-07-27

    申请人: Hui Li Bai Ling Wang

    发明人: Hui Li Bai Ling Wang

    IPC分类号: G06F7/32

    CPC分类号: G06F17/16

    摘要: A method and apparatus for QR-factorizing matrix on a multiprocessor system, wherein the multiprocessor system comprises at least one core processor and a plurality of accelerators, comprises the steps of: iteratively factorizing each panel in the matrix until the whole matrix is factorized; wherein in each iteration, the method comprises: partitioning an unprocessed matrix part in the matrix into a plurality of blocks according to a predetermined block size; partitioning a current processed panel in the unprocessed matrix part into at least two sub panels, wherein the current processed panel is composed of a plurality of blocks; and performing QR factorization one by one on the at least two sub panels with the plurality of accelerators, and updating the data of the sub panel(s) on which no QR factorization has been performed among the at least two sub panels by using the factorization result.

    摘要翻译: 一种用于在多处理器系统上QR分解矩阵的方法和装置,其中所述多处理器系统包括至少一个核心处理器和多个加速器,包括以下步骤:对所述矩阵中的每个面板进行迭代因式分解,直到整个矩阵被分解为止; 其中在每次迭代中,所述方法包括:根据预定的块大小将矩阵中未处理的矩阵部分划分成多个块; 将未处理的矩阵部分中的当前处理的面板划分为至少两个子面板,其中当前处理面板由多个块组成; 并且利用所述多个加速器在所述至少两个子面板上逐个地执行QR因式分解,并且通过使用因式分解来更新在所述至少两个子面板中未执行QR因子分解的子面板的数据 结果。

    Method and Device for Integrating Multiple Threat Security Services
    8.
    发明申请
    Method and Device for Integrating Multiple Threat Security Services 有权
    集成多个威胁安全服务的方法和设备

    公开(公告)号:US20090158428A1

    公开(公告)日:2009-06-18

    申请号:US12331912

    申请日:2008-12-10

    申请人: Bai Ling Wang

    发明人: Bai Ling Wang

    IPC分类号: G06F11/30 G06F21/22

    摘要: A method and device for integrating multiple threat security services are disclosed. The method may comprise parsing an incoming packet at a current layer and analyzing the packet with respect to multiple threat security services and so that one or more threat security services needed by the packet may be determined. According to an exemplary embodiment, the current layer may be a layer in a protocol stack constructed based on the multiple threat security services. With this method, integrated multiple threat security services may filter application data and parse network packet data via a single integrated entity, and thus the efficacy of filtering application data may be improved while computation overhead may be reduced.

    摘要翻译: 公开了一种用于集成多个威胁安全服务的方法和设备。 该方法可以包括在当前层解析输入分组并且相对于多个威胁安全服务分析分组,并且可以确定分组所需的一个或多个威胁安全服务。 根据示例性实施例,当前层可以是基于多个威胁安全服务构造的协议栈中的层。 利用这种方法,集成的多威胁安全服务可以通过单个集成实体过滤应用数据并解析网络分组数据,从而可以提高过滤应用数据的功效,同时降低计算开销。

    Performing Molecular Dynamics Simulation on a Multiprocessor System
    9.
    发明申请
    Performing Molecular Dynamics Simulation on a Multiprocessor System 审中-公开
    在多处理器系统上进行分子动力学模拟

    公开(公告)号:US20100185425A1

    公开(公告)日:2010-07-22

    申请号:US12686416

    申请日:2010-01-13

    IPC分类号: G06G7/48 G06F7/60

    CPC分类号: G16C10/00

    摘要: The present invention provides techniques for performing molecular dynamics simulation on a multiprocessor system. The method comprises: dividing a substance space on which molecular dynamics simulation are to be performed into a plurality of cells; storing data of molecules of the plurality of cells in the main memory of the multiprocessor system such that data of molecules of each cell are continuously stored in a memory area corresponding to the cell; and the plurality of accelerators repeatedly acquiring the data of molecules of the plurality of cells from the main memory and performing molecular dynamics simulation computations in parallel such that data of molecules of at least one cell are acquired in one DMA operation. By continuously storing data of molecules of each cell in a memory area corresponding to the cell, the present invention reduces the data exchanges between each accelerator and the main memory during simulation.

    摘要翻译: 本发明提供了用于在多处理器系统上进行分子动力学模拟的技术。 该方法包括:将要进行分子动力学模拟的物质空间划分成多个单元; 将多个单元的分子的数据存储在多处理器系统的主存储器中,使得每个单元的分子的数据被连续地存储在与单元相对应的存储区域中; 并且多个加速器从主存储器重复地获取多个单元的分子的数据,并且并行执行分子动力学模拟计算,使得在一个DMA操作中获取至少一个单元的分子的数据。 通过将每个单元的分子的数据连续地存储在与单元相对应的存储区域中,本发明在模拟期间减少了每个加速器和主存储器之间的数据交换。

    Method and device for integrating multiple threat security services
    10.
    发明授权
    Method and device for integrating multiple threat security services 有权
    集成多个威胁安全服务的方法和设备

    公开(公告)号:US08751787B2

    公开(公告)日:2014-06-10

    申请号:US12331912

    申请日:2008-12-10

    申请人: Bai Ling Wang

    发明人: Bai Ling Wang

    IPC分类号: H04L29/06

    摘要: A method and device for integrating multiple threat security services are disclosed. The method may comprise parsing an incoming packet at a current layer and analyzing the packet with respect to multiple threat security services and so that one or more threat security services needed by the packet may be determined. According to an exemplary embodiment, the current layer may be a layer in a protocol stack constructed based on the multiple threat security services. With this method, integrated multiple threat security services may filter application data and parse network packet data via a single integrated entity, and thus the efficacy of filtering application data may be improved while computation overhead may be reduced.

    摘要翻译: 公开了一种用于集成多个威胁安全服务的方法和设备。 该方法可以包括在当前层解析输入分组并且相对于多个威胁安全服务分析分组,并且可以确定分组所需的一个或多个威胁安全服务。 根据示例性实施例,当前层可以是基于多个威胁安全服务构造的协议栈中的层。 利用这种方法,集成的多威胁安全服务可以通过单个集成实体过滤应用数据并解析网络分组数据,从而可以提高过滤应用数据的功效,同时降低计算开销。