SYSTEM AND METHOD FOR INDEXING HIGH-DIMENSIONAL DATA IN CLUSTER SYSTEM
    1.
    Patent application
    Status: Under examination (published)

    Publication No.: US20090157624A1

    Publication date: 2009-06-18

    Application No.: US12207180

    Filing date: 2008-09-09

    IPC classification: G06F17/30 G06F7/06

    CPC classification: G06F16/2264 G06F16/2246

    Abstract: Provided are a system and a method for indexing high-dimensional data in parallel in a cluster environment. The system includes a Spill-tree creation means for creating a Spill-tree using a sampled N-dimensional feature vector, a feature vector division storage means for storing the N-dimensional feature vector in a distributed manner in a terminal node of the Spill-tree, and a local signature creation means for creating and managing a local signature for the N-dimensional feature vector distributed to each node of the Spill-tree.
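
The tree-creation and storage steps named in this abstract can be sketched roughly as follows. This is a minimal single-process illustration, not the patented method. It assumes an axis-aligned median split with a fixed overlap `tau` (a real Spill-tree may split along other directions), it uses an in-memory list per terminal node as a stand-in for a cluster node, and it leaves out the local signatures. The names `build_spill_tree`, `store`, `leaf_size`, and `tau` are hypothetical.

```python
def build_spill_tree(sample, leaf_size=2, tau=0.05, depth=0):
    """Build a tree skeleton from a sample of N-dimensional vectors.

    Assumption: axis-aligned median splits. Sample points within `tau`
    of the split value are passed to BOTH children, which is the
    overlapping ("spill") region.
    """
    if len(sample) <= leaf_size:
        return {"bucket": []}  # terminal node: empty storage bucket
    dim = depth % len(sample[0])
    values = sorted(p[dim] for p in sample)
    split = values[len(values) // 2]
    left = [p for p in sample if p[dim] < split + tau]
    right = [p for p in sample if p[dim] >= split - tau]
    if len(left) == len(sample) or len(right) == len(sample):
        # The overlap swallowed the whole sample, so stop splitting here.
        return {"bucket": []}
    return {
        "dim": dim,
        "split": split,
        "left": build_spill_tree(left, leaf_size, tau, depth + 1),
        "right": build_spill_tree(right, leaf_size, tau, depth + 1),
    }


def store(tree, vec):
    """Route one feature vector to a single terminal node and store it there."""
    node = tree
    while "bucket" not in node:
        go_left = vec[node["dim"]] < node["split"]
        node = node["left"] if go_left else node["right"]
    node["bucket"].append(vec)
    return node
```

In this sketch the sample only shapes the tree. The full data set is then pushed through `store`, so each terminal node ends up holding its own share of the vectors.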

    Stream data processing system and method for avoiding duplication of data process
    2.
    Patent application
    Status: In force

    Publication No.: US20070136239A1

    Publication date: 2007-06-14

    Application No.: US11607279

    Filing date: 2006-11-29

    IPC classification: G06F17/30

    Abstract: Provided are a stream data processing system and method for avoiding duplicate data processing. The system includes: an evaluation result storing unit for updating and storing a query condition evaluation result; a window evaluating unit for performing window evaluation; a data separating unit for separating data into new data and duplicate input data; a reuse result extracting unit for receiving the duplicate input data from the data separating unit and extracting a query condition evaluation result; a query condition evaluating unit for receiving the new data from the data separating unit, performing query condition evaluation, and creating a query condition evaluation result; and a result organizing unit for receiving the query condition evaluation results, merging and outputting them, and transmitting them to the evaluation result storing unit.
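
The separate, reuse, evaluate, and merge flow described in this abstract might look roughly like the sketch below. It is an illustration under assumptions, not the patented design: each stream tuple is assumed to carry a unique id, consecutive windows are assumed to overlap, and the names `evaluate_window`, `predicate`, and `cache` are hypothetical.

```python
def evaluate_window(window, predicate, cache):
    """Evaluate a query condition over one window of (item_id, value) tuples.

    `cache` plays the role of the evaluation result store: it maps
    item_id -> previously computed result. Tuples already in the cache are
    treated as duplicate input and reuse the stored result; only new tuples
    are passed to `predicate`.
    """
    results = {}
    evaluated = 0
    for item_id, value in window:
        if item_id in cache:
            results[item_id] = cache[item_id]     # duplicate input: reuse
        else:
            results[item_id] = predicate(value)   # new data: evaluate
            evaluated += 1
    # Store the merged results; entries that left the window are dropped.
    cache.clear()
    cache.update(results)
    return results, evaluated
```

With a window of three tuples that slides by one, the second call would evaluate only the single tuple that entered the window and reuse the other two results.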

    METHOD AND SYSTEM FOR INDEXING AND SEARCHING HIGH-DIMENSIONAL DATA USING SIGNATURE FILE
    3.
    Patent application
    Status: Lapsed

    Publication No.: US20090157601A1

    Publication date: 2009-06-18

    Application No.: US12107419

    Filing date: 2008-04-22

    IPC classification: G06F7/10 G06F17/30

    CPC classification: G06F17/3002

    Abstract: Provided are a content-based searching method and system for multimedia objects using high-dimensional feature vector data based on a 2-level signature. The method for searching high-dimensional data using a signature file includes calculating a first-level query signature and a second-level query signature from the query feature vector, performing a first filtering operation to obtain a primary candidate cell group by searching a second-level signature file, and performing a second filtering operation to obtain, from the primary candidate cell group, a secondary candidate cell group having a high similarity. Accordingly, the high-dimensional data searching method and system can process a query quickly and accurately and can increase the searching accuracy by using an enhanced signature of the query feature vector.
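
A two-stage filter over signatures of two resolutions could be sketched as below. The abstract does not say how the signatures are defined, so everything here is an assumption made for illustration: coordinates are assumed to lie in [0, 1), a signature is a tuple of grid-cell indices, the coarse stage requires an exact match, and the fine stage allows a gap of `max_cell_gap` cells per dimension. The names `signature` and `two_stage_search` are hypothetical, and the sketch does not try to reproduce which signature file the patent searches at each stage.

```python
def signature(vec, bits):
    """Quantize each coordinate in [0, 1) into one of 2**bits grid cells."""
    cells = 1 << bits
    return tuple(min(int(x * cells), cells - 1) for x in vec)


def two_stage_search(query, vectors, coarse_bits=1, fine_bits=3, max_cell_gap=1):
    """Return (primary, secondary) candidate index lists for `query`."""
    q_coarse = signature(query, coarse_bits)
    q_fine = signature(query, fine_bits)

    # Stage 1: cheap filter on the coarse signature (exact cell match).
    primary = [
        i for i, v in enumerate(vectors)
        if signature(v, coarse_bits) == q_coarse
    ]

    # Stage 2: within the primary candidates, keep vectors whose fine
    # signature is within `max_cell_gap` cells of the query in every dimension.
    secondary = [
        i for i in primary
        if all(
            abs(a - b) <= max_cell_gap
            for a, b in zip(signature(vectors[i], fine_bits), q_fine)
        )
    ]
    return primary, secondary
```

A production index would precompute and store the signatures rather than recomputing them per query; they are computed inline here only to keep the sketch short.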

    Method and system for indexing and searching high-dimensional data using signature file
    4.
    Granted patent
    Status: Lapsed

    Publication No.: US08032534B2

    Publication date: 2011-10-04

    Application No.: US12107419

    Filing date: 2008-04-22

    IPC classification: G06F17/30

    CPC classification: G06F17/3002

    Abstract: Provided are a content-based searching method and system for multimedia objects using high-dimensional feature vector data based on a 2-level signature. The method for searching high-dimensional data using a signature file includes calculating a first-level query signature and a second-level query signature from the query feature vector, performing a first filtering operation to obtain a primary candidate cell group by searching a second-level signature file, and performing a second filtering operation to obtain, from the primary candidate cell group, a secondary candidate cell group having a high similarity. Accordingly, the high-dimensional data searching method and system can process a query quickly and accurately and can increase the searching accuracy by using an enhanced signature of the query feature vector.

    Stream data processing system and method for avoiding duplication of data process
    5.
    Granted patent
    Status: In force

    Publication No.: US07490078B2

    Publication date: 2009-02-10

    Application No.: US11607279

    Filing date: 2006-11-29

    IPC classification: G06F7/00

    Abstract: Provided are a stream data processing system and method for avoiding duplicate data processing. The system includes: an evaluation result storing unit for updating and storing a query condition evaluation result; a window evaluating unit for performing window evaluation; a data separating unit for separating data into new data and duplicate input data; a reuse result extracting unit for receiving the duplicate input data from the data separating unit and extracting a query condition evaluation result; a query condition evaluating unit for receiving the new data from the data separating unit, performing query condition evaluation, and creating a query condition evaluation result; and a result organizing unit for receiving the query condition evaluation results, merging and outputting them, and transmitting them to the evaluation result storing unit.

    System and method for processing integrated queries against input data stream and data stored in database using trigger
    6.
    Patent application
    Status: Under examination (published)

    Publication No.: US20070136254A1

    Publication date: 2007-06-14

    Application No.: US11594641

    Filing date: 2006-11-08

    IPC classification: G06F17/30 G06F7/00

    CPC classification: G06F16/24568 G06F16/80

    Abstract: Provided are a system and a method for processing integrated queries against an input data stream and data stored in a database using a trigger. The system includes: a data stream manager for managing a continuously inputted data stream; a trigger result manager for registering a trigger in a database which interworks with the trigger result manager and forming a set of results obtained by executing the registered trigger, thereby providing the set of results in real time; and an executor for processing an integrated query against the data stream from the data stream manager and the data stored in the database, wherein the integrated query is processed by referring to the set of results from the trigger result manager for the data stored in the database.
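
One way to picture the trigger-based arrangement in this abstract is the sketch below, which uses Python's standard `sqlite3` module. It is an illustration under assumptions, not the patented system: the schema (`orders`, `big_orders`), the trigger `track_big`, the threshold of 100, and the function names `make_db` and `process_stream` are all hypothetical. The trigger keeps a result table up to date as stored data changes, and the stream side consults that table instead of re-querying the base table.

```python
import sqlite3


def make_db():
    """Create an in-memory database with a trigger-maintained result table."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, amount REAL);
        CREATE TABLE big_orders (customer TEXT, amount REAL);

        -- Registered trigger: every qualifying insert into the stored data
        -- is copied into the trigger result set as it happens.
        CREATE TRIGGER track_big AFTER INSERT ON orders
        WHEN NEW.amount >= 100
        BEGIN
            INSERT INTO big_orders VALUES (NEW.customer, NEW.amount);
        END;
        """
    )
    return conn


def process_stream(conn, events):
    """Integrated query: keep stream events whose customer has a big order.

    Each event is a (customer, event_name) tuple. The stored-data side of
    the query is answered from the trigger result table only.
    """
    matched = []
    for customer, event_name in events:
        row = conn.execute(
            "SELECT 1 FROM big_orders WHERE customer = ? LIMIT 1", (customer,)
        ).fetchone()
        if row is not None:
            matched.append((customer, event_name))
    return matched
```

For example, after inserting an order of 250 for one customer and 20 for another, only stream events from the first customer would pass the integrated query.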

    System and method for processing continuous integrated queries on both data stream and stored data using user-defined shared trigger
    7.
    Granted patent
    Status: Lapsed

    Publication No.: US07860884B2

    Publication date: 2010-12-28

    Application No.: US11838599

    Filing date: 2007-08-14

    IPC classification: G06F7/00

    CPC classification: G06F17/30516

    Abstract: Provided are a system and method for processing continuous integrated queries on both a data stream and stored data using a user-defined shared trigger. The system includes a data stream manager for managing a data stream input from outside; a continuous integrated queries manager for managing the continuous integrated queries input from an external application; a trigger manager for managing the user-defined shared trigger input from the external application and registering the user-defined shared trigger in an external database; a trigger result manager for forming and managing a trigger result set from an execution result of the user-defined shared trigger registered in the cooperation database; and a continuous integrated queries performer for processing the continuous integrated queries by referring to the transmitted data stream and the trigger result set.

    Apparatus and method for managing data stream distributed parallel processing service
    8.
    Granted patent
    Status: In force

    Publication No.: US08997109B2

    Publication date: 2015-03-31

    Application No.: US13585252

    Filing date: 2012-08-14

    IPC classification: G06F9/46 G06F9/50

    CPC classification: G06F9/5038 G06F9/505

    Abstract: Disclosed herein are an apparatus and method for managing a data stream distributed parallel processing service. The apparatus includes a service management unit, a Quality of Service (QoS) monitoring unit, and a scheduling unit. The service management unit registers a plurality of tasks constituting the data stream distributed parallel processing service. The QoS monitoring unit gathers information about the load of the plurality of tasks and information about the load of a plurality of nodes constituting a cluster which provides the data stream distributed parallel processing service. The scheduling unit arranges the plurality of tasks by distributing the plurality of tasks among the plurality of nodes based on the information about the load of the plurality of tasks and the information about the load of the plurality of nodes.
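
The abstract says tasks are distributed using task load and node load but does not name a policy. The sketch below assumes one simple possibility, a greedy heuristic that places the heaviest remaining task on the currently least-loaded node. The function name `schedule`, the load units, and the heuristic itself are assumptions made for illustration.

```python
import heapq


def schedule(task_loads, node_loads):
    """Assign tasks to nodes using monitored load figures.

    task_loads: dict mapping task name -> estimated load of that task
    node_loads: dict mapping node name -> load already on that node
    Returns a dict mapping task name -> chosen node name.
    """
    # Min-heap of (current load, node) so the least-loaded node pops first.
    heap = [(load, node) for node, load in node_loads.items()]
    heapq.heapify(heap)

    placement = {}
    # Heaviest tasks first; ties are broken by task name for determinism.
    ordered = sorted(task_loads.items(), key=lambda kv: (-kv[1], kv[0]))
    for task, load in ordered:
        node_load, node = heapq.heappop(heap)
        placement[task] = node
        heapq.heappush(heap, (node_load + load, node))
    return placement
```

A scheduler of the kind described would rerun a step like this as the monitoring unit reports new load figures; the sketch covers a single placement pass only.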

    APPARATUS AND METHOD FOR MANAGING DATA STREAM DISTRIBUTED PARALLEL PROCESSING SERVICE
    9.
    Patent application
    Status: In force

    Publication No.: US20130219405A1

    Publication date: 2013-08-22

    Application No.: US13585252

    Filing date: 2012-08-14

    IPC classification: G06F9/46

    CPC classification: G06F9/5038 G06F9/505

    Abstract: Disclosed herein are an apparatus and method for managing a data stream distributed parallel processing service. The apparatus includes a service management unit, a Quality of Service (QoS) monitoring unit, and a scheduling unit. The service management unit registers a plurality of tasks constituting the data stream distributed parallel processing service. The QoS monitoring unit gathers information about the load of the plurality of tasks and information about the load of a plurality of nodes constituting a cluster which provides the data stream distributed parallel processing service. The scheduling unit arranges the plurality of tasks by distributing the plurality of tasks among the plurality of nodes based on the information about the load of the plurality of tasks and the information about the load of the plurality of nodes.

    INCREMENTAL MAPREDUCE-BASED DISTRIBUTED PARALLEL PROCESSING SYSTEM AND METHOD FOR PROCESSING STREAM DATA
    10.
    Patent application
    Status: Under examination (published)

    Publication No.: US20110154339A1

    Publication date: 2011-06-23

    Application No.: US12968647

    Filing date: 2010-12-15

    IPC classification: G06F9/46 G06F15/76

    Abstract: Disclosed herein is a system for processing large-capacity data in a distributed parallel manner based on MapReduce using a plurality of computing nodes. The distributed parallel processing system is configured to provide an incremental MapReduce-based distributed parallel processing function for large-capacity stream data that is continuously collected even while the distributed parallel processing is running, as well as for large-capacity stored data that was collected previously.
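
The incremental idea in this abstract, processing newly collected data without recomputing over everything already processed, can be illustrated with a small word-count sketch. It runs in a single process and uses only the standard library. The names `map_batch` and `incremental_reduce` are hypothetical, and word counting is just a stand-in workload chosen because its reduce step can be merged batch by batch.

```python
from collections import Counter


def map_batch(records):
    """Map step for one batch: count the words in each line of text."""
    return Counter(word for line in records for word in line.split())


def incremental_reduce(state, partial):
    """Reduce step: fold one batch's partial counts into the running state.

    Only the new partial result is touched; earlier batches are not reread.
    """
    state.update(partial)  # Counter.update adds counts rather than replacing
    return state
```

Stored data can be fed in as the first batch and stream data as later batches. Because the merge is associative, the running state after every batch matches what a full recomputation over all data seen so far would give.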
