-
公开(公告)号:US20050114368A1
公开(公告)日:2005-05-26
申请号:US10941373
申请日:2004-09-15
申请人: Joel Gould , Carl Feynman , Paul Bay
发明人: Joel Gould , Carl Feynman , Paul Bay
CPC分类号: G06F17/30371 , G06F17/30466 , G06F17/30486 , G06F17/30489 , G06F17/30539 , G06F17/3056 , G06F17/30569 , G06F17/30598
摘要: Processing data includes accepting information characterizing values of a first field in records of a first data source and information characterizing values of a second field in records of a second data source. Quantities characterizing a relationship between the first field and the second field are computed based on the accepted information. Information relating the first field and the second field is presented.
摘要翻译: 处理数据包括接收表征第一数据源的记录中的第一字段的值的信息和第二数据源的记录中的第二字段的信息表征值的信息。 基于所接受的信息计算表征第一场和第二场之间的关系的量。 介绍与第一场和第二场有关的信息。
-
公开(公告)号:US08639674B2
公开(公告)日:2014-01-28
申请号:US13552706
申请日:2012-07-19
申请人: Ephraim Meriwether Vishniac , Marshall A. Isman , Paul Bay , H. Mark Bromley , John L. Richardson
发明人: Ephraim Meriwether Vishniac , Marshall A. Isman , Paul Bay , H. Mark Bromley , John L. Richardson
IPC分类号: G06F17/30
CPC分类号: G06F17/30312 , G06F17/30321
摘要: A method for managing data includes receiving individually accessible data units, each identified by a key value; storing a plurality of blocks of data, each of at least some of the blocks being generated by combining a plurality of the data units; and providing an index that includes an entry for each of the blocks. One or more of the entries enable location, based on a provided key value, of a block that includes data units corresponding to a range of key values that includes the provided key value.
摘要翻译: 一种用于管理数据的方法包括接收单独可访问的数据单元,每个单元由密钥值识别; 存储多个数据块,通过组合多个数据单元来生成至少一些块中的每一个; 并提供包括每个块的条目的索引。 一个或多个条目使得能够基于所提供的键值来定位包括与包括所提供的键值的键值的范围相对应的数据单元的块。
-
公开(公告)号:US08229902B2
公开(公告)日:2012-07-24
申请号:US11555458
申请日:2006-11-01
申请人: Ephraim Meriwether Vishniac , Marshall A. Isman , Paul Bay , H. Mark Bromley , John L. Richardson
发明人: Ephraim Meriwether Vishniac , Marshall A. Isman , Paul Bay , H. Mark Bromley , John L. Richardson
IPC分类号: G06F17/30
CPC分类号: G06F17/30312 , G06F17/30321
摘要: A method for managing data includes receiving individually accessible data units, each identified by a key value; storing a plurality of blocks of data, each of at least some of the blocks being generated by combining a plurality of the data units; and providing an index that includes an entry for each of the blocks. One or more of the entries enable location, based on a provided key value, of a block that includes data units corresponding to a range of key values that includes the provided key value.
摘要翻译: 一种用于管理数据的方法包括接收单独可访问的数据单元,每个单元由密钥值识别; 存储多个数据块,通过组合多个数据单元来生成至少一些块中的每一个; 并提供包括每个块的条目的索引。 一个或多个条目使得能够基于所提供的键值来定位包括与包括所提供的键值的键值的范围相对应的数据单元的块。
-
公开(公告)号:US20080104149A1
公开(公告)日:2008-05-01
申请号:US11555458
申请日:2006-11-01
申请人: Ephraim Meriwether Vishniac , Marshall A. Isman , Paul Bay , H. Mark Bromley , John L. Richardson
发明人: Ephraim Meriwether Vishniac , Marshall A. Isman , Paul Bay , H. Mark Bromley , John L. Richardson
IPC分类号: G06F12/02
CPC分类号: G06F17/30312 , G06F17/30321
摘要: A method for managing data includes receiving individually accessible data units, each identified by a key value; storing a plurality of blocks of data, each of at least some of the blocks being generated by combining a plurality of the data units; and providing an index that includes an entry for each of the blocks. One or more of the entries enable location, based on a provided key value, of a block that includes data units corresponding to a range of key values that includes the provided key value.
摘要翻译: 一种用于管理数据的方法包括接收单独可访问的数据单元,每个单元由密钥值识别; 存储多个数据块,通过组合多个数据单元来生成至少一些块中的每一个; 并提供包括每个块的条目的索引。 一个或多个条目使得能够基于所提供的键值来定位包括与包括所提供的键值的键值的范围相对应的数据单元的块。
-
公开(公告)号:US06584581B1
公开(公告)日:2003-06-24
申请号:US09608995
申请日:2000-06-30
申请人: Paul Bay , Ephraim Vishniac , Craig W. Stanfill
发明人: Paul Bay , Ephraim Vishniac , Craig W. Stanfill
IPC分类号: G06F1100
CPC分类号: G06F11/1458 , G06F2201/82
摘要: A data processing system and method that provides checkpointing and permits a continuous flow of data processing by allowing each process to return to operation after checkpointing, independently of the time required by other processes to checkpoint their state. Checkpointing in accordance with the invention makes use of a command message from a checkpoint processor that sequentially propagates through a process stage from data sources through processes to data sinks, triggering each process to checkpoint its state and then pass on a checkpointing message to connected “downstream” processes. This approach provides checkpointing and permits a continuous flow of data processing by allowing each process to return to normal operation after checkpointing, independently of the time required by other processes to checkpoint their state.
摘要翻译: 数据处理系统和方法,其提供检查点并允许数据处理的连续流程,允许每个进程在检查点之后返回到操作,而与其他进程检查点状态无关。 根据本发明的检查点利用来自检查点处理器的命令消息,其顺序地通过处理阶段从数据源通过进程传送到数据宿,触发每个进程检查其状态,然后将检查点消息传递到连接的“下游 “进程。 这种方法提供检查点,并允许数据处理的连续流程,允许每个进程在检查点之后返回正常操作,而与其他进程检查点状态无关。
-
公开(公告)号:US08868580B2
公开(公告)日:2014-10-21
申请号:US10941402
申请日:2004-09-15
申请人: Joel Gould , Carl Feynman , Paul Bay
发明人: Joel Gould , Carl Feynman , Paul Bay
IPC分类号: G06F17/30
CPC分类号: G06F17/30371 , G06F17/30466 , G06F17/30486 , G06F17/30489 , G06F17/30539 , G06F17/3056 , G06F17/30569 , G06F17/30598
摘要: Processing data includes profiling data from a data source, including reading the data from the data source, computing summary data characterizing the data while reading the data, and storing profile information that is based on the summary data. The data is then processed from the data source. This processing includes accessing the stored profile information and processing the data according to the accessed profile information.
摘要翻译: 处理数据包括从数据源分析数据,包括从数据源读取数据,在读取数据时计算表征数据的汇总数据,以及存储基于摘要数据的简档信息。 然后从数据源处理数据。 该处理包括访问所存储的简档信息并根据所访问的简档信息处理数据。
-
公开(公告)号:US20120284240A1
公开(公告)日:2012-11-08
申请号:US13552706
申请日:2012-07-19
申请人: Ephraim Meriwether Vishniac , Marshall A. Isman , Paul Bay , H. Mark Bromley , John L. Richardson
发明人: Ephraim Meriwether Vishniac , Marshall A. Isman , Paul Bay , H. Mark Bromley , John L. Richardson
IPC分类号: G06F17/30
CPC分类号: G06F17/30312 , G06F17/30321
摘要: A method for managing data includes receiving individually accessible data units, each identified by a key value; storing a plurality of blocks of data, each of at least some of the blocks being generated by combining a plurality of the data units; and providing an index that includes an entry for each of the blocks. One or more of the entries enable location, based on a provided key value, of a block that includes data units corresponding to a range of key values that includes the provided key value.
摘要翻译: 一种用于管理数据的方法包括接收单独可访问的数据单元,每个单元由密钥值识别; 存储多个数据块,通过组合多个数据单元来生成至少一些块中的每一个; 并提供包括每个块的条目的索引。 一个或多个条目使得能够基于所提供的键值来定位包括与包括所提供的键值的键值的范围相对应的数据单元的块。
-
公开(公告)号:US07849075B2
公开(公告)日:2010-12-07
申请号:US10941373
申请日:2004-09-15
申请人: Joel Gould , Carl Feynman , Paul Bay
发明人: Joel Gould , Carl Feynman , Paul Bay
IPC分类号: G06F17/30
CPC分类号: G06F17/30371 , G06F17/30466 , G06F17/30486 , G06F17/30489 , G06F17/30539 , G06F17/3056 , G06F17/30569 , G06F17/30598
摘要: Processing data includes accepting information characterizing values of a first field in records of a first data source and information characterizing values of a second field in records of a second data source. Quantities characterizing a relationship between the first field and the second field are computed based on the accepted information. Information relating the first field and the second field is presented.
摘要翻译: 处理数据包括接收表征第一数据源的记录中的第一字段的值的信息和第二数据源的记录中的第二字段的信息表征值的信息。 基于所接受的信息计算表征第一场和第二场之间的关系的量。 介绍与第一场和第二场有关的信息。
-
公开(公告)号:US07756873B2
公开(公告)日:2010-07-13
申请号:US10941401
申请日:2004-09-15
申请人: Joel Gould , Carl Feynman , Paul Bay
发明人: Joel Gould , Carl Feynman , Paul Bay
IPC分类号: G06F17/30
CPC分类号: G06F17/30371 , G06F17/30466 , G06F17/30486 , G06F17/30489 , G06F17/30539 , G06F17/3056 , G06F17/30569 , G06F17/30598
摘要: Processing data includes identifying a plurality of subsets of fields of data records of a data source, determining co-occurrence statistics for each of the plurality of subsets, and identifying one or more of the plurality of subsets as having a functional relationship among the fields of the identified subset.
摘要翻译: 处理数据包括识别数据源的数据记录的多个字段子集,确定多个子集中的每一个的同现统计,以及将所述多个子集中的一个或多个子集识别为在 识别的子集。
-
公开(公告)号:US20050114369A1
公开(公告)日:2005-05-26
申请号:US10941402
申请日:2004-09-15
申请人: Joel Gould , Carl Feynman , Paul Bay
发明人: Joel Gould , Carl Feynman , Paul Bay
CPC分类号: G06F17/30371 , G06F17/30466 , G06F17/30486 , G06F17/30489 , G06F17/30539 , G06F17/3056 , G06F17/30569 , G06F17/30598
摘要: Processing data includes profiling data from a data source, including reading the data from the data source, computing summary data characterizing the data while reading the data, and storing profile information that is based on the summary data. The data is then processed from the data source. This processing includes accessing the stored profile information and processing the data according to the accessed profile information.
摘要翻译: 处理数据包括从数据源分析数据,包括从数据源读取数据,在读取数据时计算表征数据的汇总数据,以及存储基于摘要数据的简档信息。 然后从数据源处理数据。 该处理包括访问所存储的简档信息并根据所访问的简档信息处理数据。
-
-
-
-
-
-
-
-
-