Parallel data processing architecture

Granted invention patent

US07454411B2 Parallel data processing architecture (in force)

Patent title: Parallel data processing architecture
Patent title (Chinese): 并行数据处理架构
Application number: US10767776

Filing date: 2004-01-30
Publication number: US07454411B2

Publication date: 2008-11-18
Inventors: John D. Birdwell, Tse-Wei Wang, Roger D. Horn, Puneet Yadav, David J. Icove
Applicants: John D. Birdwell, Tse-Wei Wang, Roger D. Horn, Puneet Yadav, David J. Icove
Applicant address: Knoxville, TN, US
Assignee: University of Tennessee Research Foundation
Current assignee: University of Tennessee Research Foundation
Current assignee address: Knoxville, TN, US
Agency: Smith, Gambrell & Russell, LLP
Main classification: G06F17/30
IPC classification: G06F17/30; G06F7/00; G06F17/00

Abstract:

A tree-structured index to multidimensional data is created using naturally occurring patterns and clusters within the data which permit efficient search and retrieval strategies in a database of DNA profiles. A search engine utilizes hierarchical decomposition of the database by identifying clusters of similar DNA profiles and maps to parallel computer architecture, allowing scale up past previously feasible limits. Key benefits of the new method are logarithmic scale up and parallelization. These benefits are achieved by identification and utilization of naturally occurring patterns and clusters within stored data. The patterns and clusters enable the stored data to be partitioned into subsets of roughly equal size. The method can be applied recursively, resulting in a database tree that is balanced, meaning that all paths or branches through the tree have roughly the same length. The method achieves high performance by exploiting the natural structure of the data in a manner that maintains balanced trees. Implementation of the method maps naturally to parallel computer architectures, allowing scale up to very large databases.
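
The recursive partitioning described in the abstract can be illustrated with a minimal sketch. It is not the patented method: a plain median split along the widest dimension stands in for the cluster-based partitioning, and the names Node, build, contains and depth, as well as the leaf_size parameter, are hypothetical. The sketch only shows how splitting the data into roughly equal subsets at every level yields a balanced tree whose depth grows logarithmically with the number of records.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Point = Tuple[float, ...]


@dataclass
class Node:
    # A leaf stores points; an internal node stores a split rule and two children.
    points: Optional[List[Point]] = None
    dim: int = 0
    threshold: float = 0.0
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build(points: Sequence[Point], leaf_size: int = 2) -> Node:
    """Recursively split the points into two roughly equal halves."""
    pts = list(points)
    if len(pts) <= leaf_size:
        return Node(points=pts)
    # Assumption: the widest dimension is a crude stand-in for cluster detection.
    dim = max(range(len(pts[0])),
              key=lambda d: max(p[d] for p in pts) - min(p[d] for p in pts))
    pts.sort(key=lambda p: p[dim])
    mid = len(pts) // 2  # median split keeps the two subsets within one point of each other
    return Node(dim=dim, threshold=pts[mid][dim],
                left=build(pts[:mid], leaf_size),
                right=build(pts[mid:], leaf_size))


def contains(node: Node, query: Point) -> bool:
    """Exact-match search; descends both sides only when the query ties the threshold."""
    if node.points is not None:
        return query in node.points
    found = False
    if query[node.dim] <= node.threshold:
        found = contains(node.left, query)
    if not found and query[node.dim] >= node.threshold:
        found = contains(node.right, query)
    return found


def depth(node: Node) -> int:
    """Length of the longest path from this node down to a leaf."""
    if node.points is not None:
        return 0
    return 1 + max(depth(node.left), depth(node.right))
```

Because the left and right subtrees are built and searched independently of each other, each could in principle be handed to a separate processor, which is the property the abstract relies on when it says the method maps onto parallel architectures.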

Published/granted documents

US20040186920A1 Parallel data processing architecture (publication/grant date: 2004-09-23)
