ASSOCIATION RULE MINING WITH THE MICRON AUTOMATA PROCESSOR

    Publication No.: US20170091287A1

    Publication date: 2017-03-30

    Application No.: US14871457

    Filing date: 2015-09-30

    CPC classification number: G06F16/24569 G06F16/2465

    Abstract: The present invention discloses a heterogeneous computation framework for Association Rule Mining (ARM) using Micron's Automata Processor (AP). This framework is based on the Apriori algorithm. Two automaton designs are proposed to match and count the individual itemsets. Several performance improvement strategies are proposed, including minimizing the number of reporting vectors and reducing reconfiguration delays. The experimental results show speedups of up to 94× for the proposed AP-accelerated Apriori on six synthetic and real-world datasets, compared with a single-core CPU implementation of Apriori. The proposed AP-accelerated Apriori solution also outperforms state-of-the-art multicore and GPU implementations of the Equivalence Class Transformation (Eclat) algorithm on big datasets.
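
    The Apriori loop that the abstract builds on can be sketched in plain software as follows. This is a minimal CPU-only illustration, not the patented automaton design: the function name `apriori`, its parameters, and the use of an absolute support count are assumptions made for the example. The counting step marked in the comments is the part the abstract describes as offloaded to the AP.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support_count} for every frequent itemset.

    Hypothetical helper for illustration; min_support is an absolute count.
    """
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count the individual items.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    frequent = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(frequent)

    k = 2
    while frequent:
        prev = set(frequent)
        # Candidate generation: join frequent (k-1)-itemsets and prune any
        # candidate that has an infrequent (k-1)-subset.
        candidates = set()
        for a in prev:
            for b in prev:
                union = a | b
                if len(union) == k and all(
                    frozenset(sub) in prev for sub in combinations(union, k - 1)
                ):
                    candidates.add(union)
        # Matching and counting: scan every transaction for every candidate.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        frequent = {s: c for s, c in counts.items() if c >= min_support}
        result.update(frequent)
        k += 1
    return result
```

    Calling `apriori(baskets, 3)` on a list of item lists returns a dictionary keyed by `frozenset`, so support for a given itemset can be looked up directly.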

    METHOD AND SYSTEM FOR DATA DISPATCH PROCESSING IN A BIG DATA SYSTEM
    Invention application (status: under examination, published)

    Publication No.: US20150186429A1

    Publication date: 2015-07-02

    Application No.: US14490685

    Filing date: 2014-09-19

    CPC classification number: G06F9/5066 G06F16/24569

    Abstract: A system and a method for data dispatch processing in a big data system are provided. The system includes a plurality of computing machines and a database cluster. The method includes disassembling a computing procedure into a plurality of processing elements. The method also includes identifying a database accessing point for accessing a target data node from one of the data nodes in the computing procedure. The method further includes configuring the processing elements to the computing machines according to the database accessing point, and transmitting a data tuple corresponding to the computing procedure according to the processing elements configured to the computing machines and a data transmitting cost between the computing machines. Accordingly, the method effectively improves system performance for transmitting the big data.


    DATABASE MANAGEMENT SYSTEM, COMPUTER, AND DATABASE MANAGEMENT METHOD
    Invention application (status: under examination, published)

    Publication No.: US20150169591A1

    Publication date: 2015-06-18

    Application No.: US14402878

    Filing date: 2012-05-24

    CPC classification number: G06F16/252 G06F16/24532 G06F16/2455 G06F16/24569

    Abstract: A database management system (DBMS) manages a database existing in a second storage device with an access speed lower than that of a first storage device. In an execution of a query, the DBMS dynamically generates tasks and executes two or more executable tasks in parallel. The DBMS generates task start information, which is information representing the content of the execution of a task, manages the task start information, and has the task execute the content represented by the task start information. The task start information includes a data address set existing in the second storage device. The DBMS controls movement of the data address sets between the first storage device and the second storage device based on a management state of the task start information. In addition, the DBMS selects the task start information based on whether or not the data address set exists in the first storage device.


    DATABASE SYSTEM AND DATABASE MANAGEMENT METHOD
    Invention application (status: under examination, published)

    Publication No.: US20140297697A1

    Publication date: 2014-10-02

    Application No.: US13576365

    Filing date: 2012-07-11

    CPC classification number: G06F3/061 G06F3/0655 G06F3/067 G06F16/24569

    Abstract: The method includes (A) acquiring storage location information that can identify a volume that stores data and access type information, (B) acquiring volume management information that can identify the storage unit that stores the volume, (C) identifying the volume of data to be accessed, identifying the storage unit storing the volume, and identifying the storage method of the storage unit, (D) identifying the type of access to the data to be accessed, (E) determining whether the data needs to be moved to another storage unit of a different storage method based on the storage method and the type of access, and (F) giving an indication of moving the data if it is determined that the data needs to be moved in (E).
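
    Steps (A) through (F) can be read as a lookup chain followed by a placement decision. The sketch below is one possible reading only: the dictionary layouts, the function name `plan_move`, and in particular the `PREFERRED_METHOD` policy (random access suits SSD, sequential access suits HDD) are assumptions for illustration, since the abstract does not state which storage method matches which access type.

```python
# Hypothetical policy: which storage method suits each access type.
# The abstract does not specify this mapping; it is assumed for the example.
PREFERRED_METHOD = {"random": "ssd", "sequential": "hdd"}


def plan_move(data_id, location_info, volume_info, unit_methods):
    """Return a move indication for data_id, or None if no move is needed.

    location_info: data_id -> (volume, access_type)   steps (A), (C), (D)
    volume_info:   volume  -> storage unit            steps (B), (C)
    unit_methods:  storage unit -> storage method     step (C)
    """
    volume, access_type = location_info[data_id]
    unit = volume_info[volume]
    method = unit_methods[unit]

    # Step (E): compare the current storage method with the one the access
    # type calls for under the assumed policy.
    wanted = PREFERRED_METHOD[access_type]
    if method == wanted:
        return None

    # Step (F): indicate that the data should move to a unit of another method.
    return {"data": data_id, "from_unit": unit, "to_method": wanted}
```

    Keeping the policy in a separate table means the decision in step (E) can change without touching the lookups in steps (A) through (D).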


    Stream engine using compressed bitsets

    Publication No.: US12126694B2

    Publication date: 2024-10-22

    Application No.: US17713938

    Filing date: 2022-04-05

    Inventor: Jonathan Colt

    CPC classification number: H04L67/535 G06F16/2237 G06F16/24568 G06F16/24569

    Abstract: Technologies are described for storing and reporting user activities within a computing environment. For example, bitsets (e.g., compressed and/or uncompressed bitsets) can be used to store activities (e.g., where each activity is a bit in the bitset in chronological order). Separate bitsets can be maintained for followable aspects of the activities (e.g., a separate bitset for each unique followable). Activity streams can be produced from the compressed bitsets (e.g., custom streams reflecting followables designated by users).
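
    The bitset idea in this abstract can be illustrated with a small sketch. It uses plain Python integers as uncompressed bitsets and makes no attempt at the compression the patent covers; the class name `ActivityStore` and its methods are hypothetical names chosen for the example.

```python
class ActivityStore:
    """Toy activity log with one bitset per followable (illustrative only)."""

    def __init__(self):
        self.activities = []  # activity i corresponds to bit i, in time order
        self.bitsets = {}     # followable -> int used as an uncompressed bitset

    def record(self, description, followables):
        """Append an activity and set its bit in each followable's bitset."""
        index = len(self.activities)
        self.activities.append(description)
        for followable in followables:
            self.bitsets[followable] = self.bitsets.get(followable, 0) | (1 << index)
        return index

    def stream(self, followed):
        """Build a custom stream: OR the followed bitsets, then read set bits."""
        combined = 0
        for followable in followed:
            combined |= self.bitsets.get(followable, 0)
        return [
            self.activities[i]
            for i in range(len(self.activities))
            if (combined >> i) & 1
        ]
```

    Because each followable has its own bitset, a user's custom stream is a bitwise OR over the followables that user has designated, and the result stays in chronological order without sorting.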

    QUERY PROCESSING ON ACCELERATED PROCESSING UNITS

    Publication No.: US20240311380A1

    Publication date: 2024-09-19

    Application No.: US18465764

    Filing date: 2023-09-12

    CPC classification number: G06F16/24569 G06F16/24542

    Abstract: Query processing systems and methods are disclosed herein. In an example system, query information is received over a network for processing a query. A first processing architecture loads a set of data associated with the query into a shared memory. A second processing architecture accesses the set of data from the shared memory. In one example, the first and second processing architectures and the shared memory are integrated in a hardware chip (e.g., a chiplet containing several processor architectures, such as CPU and a graphics processing unit (GPU)). The query is processed based on the set of data accessed from the shared memory using the second processing architecture to generate a query result. The query result is provided over the network. In this manner, a computing device may execute a query based on different processing systems contained therein.
