Data processing apparatus and method of processing data
    41.
    发明授权
    Data processing apparatus and method of processing data 有权
    数据处理装置及数据处理方法

    公开(公告)号:US08099573B2

    公开(公告)日:2012-01-17

    申请号:US12256329

    申请日:2008-10-22

    IPC分类号: G06F12/00 G06F13/00 G06F13/28

    CPC分类号: G06F11/1451

    摘要: Data processing apparatus comprising: a chunk store containing specimen data chunks, a manifest store containing at least one manifest that represents at least a part of a data set and that comprises at least one reference to at least one of said specimen data chunks, a sparse chunk index containing information on only those specimen data chunks having a predetermined characteristic, the processing apparatus being operable to process input data into input data chunks and to use the sparse chunk index to identify at least one of said at least one manifest that includes at least one reference to one of said specimen data chunks that corresponds to one of said input data chunks having the predetermined characteristic.

    摘要翻译: 数据处理装置包括:包含样本数据块的块存储器,包含至少一个表示数据集的至少一部分的清单的清单存储器,并且包括至少一个对所述样本数据块中的至少一个的引用,稀疏 块指数仅包含具有预定特征的样本数据块的信息,该处理装置可操作以将输入数据处理成输入数据块,并使用稀疏块指数来识别至少包括至少一个清单中的至少一个, 对与所述具有预定特征的所述输入数据块之一相对应的所述样本数据块之一的引用。

    SYSTEM AND METHOD FOR DISPLAYING DOCUMENTS
    42.
    发明申请
    SYSTEM AND METHOD FOR DISPLAYING DOCUMENTS 审中-公开
    用于显示文件的系统和方法

    公开(公告)号:US20110202886A1

    公开(公告)日:2011-08-18

    申请号:US12705585

    申请日:2010-02-13

    IPC分类号: G06F3/048

    CPC分类号: G06F16/353

    摘要: A computer system that includes a graphical user interface used to organize a group of documents is provided. The system includes a processor that is adapted to execute machine-readable instructions. The system also includes a storage device that is adapted to store data. The data includes a plurality of documents and instructions that are executable by the processor to generate the graphical user interface. The graphical user interface includes a cluster map that includes the results of a clustering algorithm applied to the documents. The graphical user interface also includes a principal documents screen that includes a principal document that is identified by weighting each of the documents in a cluster based, at least in part, on an occurrence of representative terms in the document. The representative terms are terms that have been identified by the clustering algorithm as being more effective for distinguishing between documents that belong to different clusters.

    摘要翻译: 提供了包括用于组织一组文档的图形用户界面的计算机系统。 该系统包括适于执行机器可读指令的处理器。 该系统还包括适于存储数据的存储设备。 数据包括可由处理器执行以生成图形用户界面的多个文档和指令。 图形用户界面包括包含应用于文档的聚类算法的结果的聚类映射。 图形用户界面还包括主文档屏幕,其包括通过至少部分地基于文档中的代表项的出现来对群集中的每个文档进行加权来标识的主文档。 代表性术语是由聚类算法识别为对区分属于不同簇的文档更有效的术语。

    BATCHING REQUESTS FOR ACCESSING DIFFERENTIAL DATA STORES
    43.
    发明申请
    BATCHING REQUESTS FOR ACCESSING DIFFERENTIAL DATA STORES 审中-公开
    批量访问不同数据存储的要求

    公开(公告)号:US20100281077A1

    公开(公告)日:2010-11-04

    申请号:US12432804

    申请日:2009-04-30

    IPC分类号: G06F17/30

    CPC分类号: G06F16/2471 G06F16/27

    摘要: Data objects are selectively stored across a plurality of differential data stores, where selection of the differential data stores for storing respective data objects is according to a criterion relating to compression of the data objects in each of the data stores, and where the differential data stores are stored in persistent storage media. Plural requests for accessing the differential data stores are batched, and one of the differential data stores is selected to page into temporary storage from the persistent storage media. The batched plural requests for accessing the selected differential data store that has been paged into the temporary storage are executed.

    摘要翻译: 数据对象被选择性地存储在多个差分数据存储器中,其中用于存储各个数据对象的差分数据存储的选择是根据与每个数据存储器中的数据对象的压缩相关的标准,并且差分数据存储 存储在持久存储介质中。 批量访问差异数据存储的多个请求被选择,并且选择差分数据存储中的一个来从永久存储介质寻入临时存储。 执行已经被分页到临时存储器中的批量复制的访问所选择的差分数据存储器的请求。

    IDENTIFYING SIMILAR FILES IN AN ENVIRONMENT HAVING MULTIPLE CLIENT COMPUTERS
    44.
    发明申请
    IDENTIFYING SIMILAR FILES IN AN ENVIRONMENT HAVING MULTIPLE CLIENT COMPUTERS 有权
    在具有多个客户端计算机的环境中识别类似文件

    公开(公告)号:US20100250480A1

    公开(公告)日:2010-09-30

    申请号:US12409978

    申请日:2009-03-24

    IPC分类号: G06N5/02 G06F17/30 G06Q10/00

    CPC分类号: G06N5/02 G06F17/3015

    摘要: To identify similar files in an environment having multiple client computers, a first client computer receives, from a coordinator computer, a request to find files located at the first client computer that are similar to at least one comparison file, wherein the request has also been sent to other client computers by the coordinator computer to request that the other client computers also find files that are similar to the at least one comparison file. In response to the request, the first client computer compares signatures of the files located at the first client computer with a signature of the at least one comparison file to identify at least a subset of the files located at the first client computer that are similar to the at least one comparison file according to a comparison metric. The first client computer sends, to the coordinator computer, a response relating to the comparing.

    摘要翻译: 为了在具有多个客户端计算机的环境中识别类似的文件,第一客户端计算机从协调器计算机接收查找位于第一客户端计算机上的文件的请求,其类似于至少一个比较文件,其中该请求也已被 由协调器计算机发送到其他客户端计算机,以请求其他客户端计算机还查找与至少一个比较文件类似的文件。 响应于该请求,第一客户端计算机将位于第一客户端计算机的文件的签名与至少一个比较文件的签名进行比较,以识别位于第一客户端计算机的文件的至少一个子集,其类似于 所述至少一个比较文件根据比较度量。 第一个客户端计算机向协调者计算机发送与比较相关的响应。

    System For And Method Of Data Cache Managment
    45.
    发明申请
    System For And Method Of Data Cache Managment 有权
    数据缓存管理系统与方法

    公开(公告)号:US20100082907A1

    公开(公告)日:2010-04-01

    申请号:US12243093

    申请日:2008-10-01

    IPC分类号: G06F12/08 G06F12/12

    摘要: The present invention provides a system for and a method of data cache management. In accordance with an embodiment, of the present invention, a method of cache management is provided. A request for access to data is received. A sample value is assigned to the request, the sample value being randomly selected according to a probability distribution. The sample value is compared to another value. The data is selectively stored in the cache based on results of the comparison.

    摘要翻译: 本发明提供了一种用于数据高速缓存管理的系统和方法。 根据本发明的实施例,提供了一种高速缓存管理方法。 接收到访问数据的请求。 将样本值分配给请求,根据概率分布随机选择样本值。 将样本值与另一个值进行比较。 基于比较的结果,将数据选择性地存储在高速缓存中。

    Managing Storage Of Data In A Data Structure
    46.
    发明申请
    Managing Storage Of Data In A Data Structure 有权
    管理数据结构中的数据存储

    公开(公告)号:US20100082562A1

    公开(公告)日:2010-04-01

    申请号:US12243103

    申请日:2008-10-01

    IPC分类号: G06F17/30

    CPC分类号: G06F17/3033

    摘要: To manage storing of data in a data structure, a particular data value is represented as a group of segments stored in corresponding entries of the data structure. Additional data values represented by corresponding groups of segments are written into the data structure. A probability of overwriting segments representing the particular data value increases as a number of the additional data values increase. A correct version of the particular data value is retrieved even though one or more segments representing the particular data value has been overwritten.

    摘要翻译: 为了管理数据结构中的数据存储,特定数据值被表示为存储在数据结构的对应条目中的一组段。 由对应的段组表示的附加数据值被写入数据结构。 覆盖表示特定数据值的段的概率随着附加数据值的数量增加而增加。 即使表示特定数据值的一个或多个段已经被覆盖,也检索特定数据值的正确版本。

    Self-referential integrity checking system and method
    47.
    发明申请
    Self-referential integrity checking system and method 有权
    自我参照完整性检查系统及方法

    公开(公告)号:US20070273516A1

    公开(公告)日:2007-11-29

    申请号:US11440650

    申请日:2006-05-24

    IPC分类号: G08B13/14

    摘要: An integrity checking system includes a tag programming device that generates a plurality of identifiers. Each identifier is associated with either a storage item or an item to be stored by the storage item. The programming device stores each of the identifiers in a plurality of readable tags, each readable tag being adapted to be attached to a corresponding item. A tag reading device reads the identifiers stored in the readable tags and, using only information from the read tags, provides information indicating whether any item supposed to be stored on the storage item is missing from the storage item. Also, methods for storing and reading the identifiers are disclosed along with storing additional information about the items in the tags, such as physical information like weight and/or volume of the items, and then using this information to determine whether any items have been altered.

    摘要翻译: 完整性检查系统包括生成多个标识符的标签编程装置。 每个标识符与存储项目或存储项目要存储的项目相关联。 编程设备将每个标识符存储在多个可读标签中,每个可读标签适于附加到相应的项目。 标签读取装置读取存储在可读标签中的标识符,并且仅使用来自读取标签的信息提供指示存储项目中应该存储在存储项目上的任何项目是否丢失的信息。 此外,公开了用于存储和读取标识符的方法以及存储关于标签中的项目的附加信息,诸如物体信息,如物品的重量和/或体积,然后使用该信息来确定是否有任何项目被改变 。

    Error correction code generation method and apparatus
    48.
    发明申请
    Error correction code generation method and apparatus 有权
    纠错码生成方法和装置

    公开(公告)号:US20060020873A1

    公开(公告)日:2006-01-26

    申请号:US10896217

    申请日:2004-07-21

    申请人: Vinay Deolalikar

    发明人: Vinay Deolalikar

    IPC分类号: H03M13/00

    CPC分类号: H03M13/17 H03M13/19

    摘要: A method and apparatus for generating an error correction code used in communicating over a channel, includes generating a set of candidate circulant blocks corresponding to a parity check matrix and a Hamming code wherein the Hamming code is initially unable to detect a predetermined error pattern without ambiguity due to one or more redundancies and eliminating columns of the parity check matrix and related redundancies in the detection of a predetermined error pattern as used by the resulting Hamming code.

    摘要翻译: 一种生成用于通过信道进行通信的纠错码的方法和装置,包括生成与奇偶校验矩阵和汉明码相对应的一组候选循环块,其中汉明码最初不能检测到预定的错误模式而没有模糊 由于一个或多个冗余并且消除了奇偶校验矩阵的列以及由所得到的汉明码所使用的预定错误模式的检测中的相关冗余。