-
公开(公告)号:US20140025685A1
公开(公告)日:2014-01-23
申请号:US13942277
申请日:2013-07-15
Applicant: Ab Initio Technology LLC
Inventor: Vrishal Kulkarni , Stephen Schmidt , Craig W. Stanfill , Ephraim Meriwether Vishniac
IPC: G06F17/30
CPC classification number: G06F17/30312 , G06F17/30321 , G06F17/30418
Abstract: Managing data by: receiving a group of individually accessible data units, each data unit identified by a key value, with key values determined such that the key value identifying a first data unit received before a second data unit occurs earlier in a sort order than the key value identifying the second data unit; and processing the data units for storage in a data storage system. The processing includes: storing blocks of data, the blocks being generated by combining a plurality of the data units; providing an index with entries that enable location, based on a provided key value, of a block that includes a data unit corresponding to the provided key value; and generating one or more screening data structures associated with the blocks for determining, based on a given key value, whether to search the stored blocks for a data unit corresponding to the given key value.
Abstract translation: 通过以下操作来管理数据:接收一组单独可访问的数据单元,每个数据单元由键值标识,其中键值被确定为使得该键值识别在第二数据单元之前接收的第一数据单元以比该 识别第二数据单元的键值; 并处理数据单元以存储在数据存储系统中。 该处理包括:存储数据块,通过组合多个数据单元生成块; 向所述索引提供具有条目的条目,所述条目基于所提供的密钥值,所述条目包括与所提供的密钥值对应的数据单元的块; 以及生成与所述块相关联的一个或多个筛选数据结构,用于基于给定的密钥值来确定是否搜索所存储的块以获得对应于给定密钥值的数据单元。
-
2.
公开(公告)号:US08949189B2
公开(公告)日:2015-02-03
申请号:US13942277
申请日:2013-07-15
Applicant: Ab Initio Technology LLC
Inventor: Vrishal Kulkarni , Stephen Schmidt , Craig W. Stanfill , Ephraim Meriwether Vishniac
IPC: G06F17/30
CPC classification number: G06F17/30312 , G06F17/30321 , G06F17/30418
Abstract: Managing data by: receiving a group of individually accessible data units, each data unit identified by a key value, with key values determined such that the key value identifying a first data unit received before a second data unit occurs earlier in a sort order than the key value identifying the second data unit; and processing the data units for storage in a data storage system. The processing includes: storing blocks of data, the blocks being generated by combining a plurality of the data units; providing an index with entries that enable location, based on a provided key value, of a block that includes a data unit corresponding to the provided key value; and generating one or more screening data structures associated with the blocks for determining, based on a given key value, whether to search the stored blocks for a data unit corresponding to the given key value.
Abstract translation: 通过以下操作来管理数据:接收一组单独可访问的数据单元,每个数据单元由键值标识,其中键值被确定为使得该键值识别在第二数据单元之前接收的第一数据单元以比该 识别第二数据单元的键值; 并处理数据单元以存储在数据存储系统中。 该处理包括:存储数据块,通过组合多个数据单元生成块; 向所述索引提供具有条目的条目,所述条目基于所提供的密钥值,所述条目包括与所提供的密钥值对应的数据单元的块; 以及生成与所述块相关联的一个或多个筛选数据结构,用于基于给定的密钥值来确定是否搜索所存储的块以获得对应于给定密钥值的数据单元。
-
公开(公告)号:US09753751B2
公开(公告)日:2017-09-05
申请号:US14520588
申请日:2014-10-22
Applicant: Ab Initio Technology LLC
Inventor: Matthew Darcy Atterbury , H. Mark Bromley , Wayne Mesard , Arkadi Popov , Stephen Schmidt , Craig W. Stanfill , Joseph Skeffington Wholey
CPC classification number: G06F9/44521 , G06F9/44536 , G06F9/4494
Abstract: Processing data includes: receiving units of work that each include one or more work elements, and processing a first unit of work using a first compiled dataflow graph (160) loaded into a data processing system (100) in response to receiving the first unit of work. The processing includes: analysis to determine a characteristic of the first unit of work; identifying one or more compiled dataflow graphs from graphs stored in a data storage system (107) that include at least some that were compiled for processing a unit of work having the determined characteristic; loading one of the identified compiled dataflow graphs into the data processing system (100) as the first compiled dataflow graph (160); and generating one or more output work elements from at least one work element in the first unit of work.
-
公开(公告)号:US20150106818A1
公开(公告)日:2015-04-16
申请号:US14520588
申请日:2014-10-22
Applicant: Ab Initio Technology LLC
Inventor: Matthew Darcy Atterbury , H. Mark Bromley , Wayne Mesard , Arkadi Popov , Stephen Schmidt , Craig W. Stanfill , Joseph Skeffington Wholey
CPC classification number: G06F9/44521 , G06F9/44536 , G06F9/4494
Abstract: Processing data includes: receiving units of work that each include one or more work elements, and processing a first unit of work using a first compiled dataflow graph (160) loaded into a data processing system (100) in response to receiving the first unit of work. The processing includes: analysis to determine a characteristic of the first unit of work; identifying one or more compiled dataflow graphs from graphs stored in a data storage system (107) that include at least some that were compiled for processing a unit of work having the determined characteristic; loading one of the identified compiled dataflow graphs into the data processing system (100) as the first compiled dataflow graph (160); and generating one or more output work elements from at least one work element in the first unit of work.
Abstract translation: 处理数据包括:接收每个工作单元包括一个或多个工作单元,以及响应于接收到第一单元的第一单元来处理加载到数据处理系统(100)中的第一编译数据流图(160)来处理第一工作单元 工作。 处理包括:分析确定第一工作单元的特征; 从存储在数据存储系统(107)中的图形中识别一个或多个编译数据流图,所述图形包括至少一些被编译用于处理具有所确定的特征的工作单元的编译数据流图; 将所识别的编译数据流图中的一个加载到数据处理系统(100)中作为第一编译数据流图(160); 以及从所述第一工作单元中的至少一个工作元件生成一个或多个输出工作元件。
-
-
-