Data processing apparatus and method of processing data
    31.
    发明授权
    Data processing apparatus and method of processing data 有权
    数据处理装置及数据处理方法

    公开(公告)号:US08332404B2

    公开(公告)日:2012-12-11

    申请号:US12257659

    申请日:2008-10-24

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30162 G06F11/1451

    摘要: Data processing apparatus comprising: a chunk store containing specimen data chunks, a manifest store containing a plurality of manifests, each of which represents at least a part of a data set and each of which comprises at least one reference to at least one of said specimen data chunks, a sparse chunk index containing information on only some specimen data chunks, the processor being operable to: process input data into input data chunks; identify manifests having at least one reference to one of said specimen data chunks that corresponds to one of said input data chunks and on which there is information contained in the sparse chunk index; and prioritize the identified manifests for subsequent operation.

    摘要翻译: 数据处理装置,包括:包含标本数据块的块存储器,包含多个清单的清单存储器,每个清单代表数据集的至少一部分,每个清单包括至少一个对所述样本 数据块,仅包含一些标本数据块的信息的稀疏组块索引,所述处理器可操作以:将输入数据处理成输入数据块; 识别具有至少一个对所述样本数据块中的一个的对应于所述输入数据块中的一个的清单,并且其中包含在所述稀疏块索引中的信息; 并将识别的清单优先于后续操作。

    Cache management using sampled values assigned to a request
    32.
    发明授权
    Cache management using sampled values assigned to a request 有权
    使用分配给请求的采样值进行缓存管理

    公开(公告)号:US08250302B2

    公开(公告)日:2012-08-21

    申请号:US12243093

    申请日:2008-10-01

    IPC分类号: G06F12/08 G06F12/12

    摘要: A system and method for data cache management are provided in which a request for access to data is, and a sample value is assigned to the request, the sample value being randomly selected according to a probability distribution. The sample value is compared to another value such as a previously stored sample value, and the data is selectively stored in the cache based on results of the comparison. If the requested data is not in the cache, the sample value may be compared with an extreme one of a plurality of sampled values such as the lowest sampled value. Each of the sampled values may be stored in a database, and the sampled values or the probability distribution may be changed over time to account for frequency of requests.

    摘要翻译: 提供了一种用于数据高速缓存管理的系统和方法,其中访问数据的请求是,并且样本值被分配给请求,根据概率分布随机选择样本值。 将采样值与诸如先前存储的采样值的另一值进行比较,并且基于比较的结果将数据选择性地存储在高速缓存中。 如果请求的数据不在高速缓存中,则可以将采样值与诸如最低采样值的多个采样值中的极端值进行比较。 每个采样值可以存储在数据库中,并且可以随时间改变采样值或概率分布以考虑请求频率。

    Managing storage of data in a data structure
    33.
    发明授权
    Managing storage of data in a data structure 有权
    管理数据结构中的数据存储

    公开(公告)号:US08180744B2

    公开(公告)日:2012-05-15

    申请号:US12243103

    申请日:2008-10-01

    IPC分类号: G06F7/00 G06F17/00

    CPC分类号: G06F17/3033

    摘要: A particular data value is represented as a group of segments stored in corresponding entries of a data structure. Additional data values represented by corresponding groups of segments are written into the data structure. A probability of overwriting segments representing the particular data value increases as a number of the additional data values increase. A correct version of the particular data value is retrieved even though one or more segments representing the particular data value has been overwritten.

    摘要翻译: 特定数据值被表示为存储在数据结构的对应条目中的一组段。 由对应的段组表示的附加数据值被写入数据结构。 覆盖表示特定数据值的段的概率随着附加数据值的数量增加而增加。 即使表示特定数据值的一个或多个段已经被覆盖,也检索特定数据值的正确版本。

    SYSTEM AND METHOD FOR IDENTIFYING FRESH INFORMATION IN A DOCUMENT SET
    34.
    发明申请
    SYSTEM AND METHOD FOR IDENTIFYING FRESH INFORMATION IN A DOCUMENT SET 审中-公开
    用于识别文件集中的新鲜信息的系统和方法

    公开(公告)号:US20110202528A1

    公开(公告)日:2011-08-18

    申请号:US12705586

    申请日:2010-02-13

    IPC分类号: G06F17/30

    CPC分类号: G06F16/355

    摘要: A method of identifying a fresh document in a document set is provided. The method may include obtaining a query document that is included in a document set comprising a plurality of documents. The method may also include grouping the plurality of documents into a plurality of fine clusters based on a textual similarity between the plurality of documents. The method may also include identifying a target fine cluster within the plurality of fine clusters, the target fine cluster including the query document. The method may also include ordering the documents included in the target fine cluster by time to identify the fresh document. The method may also include generating a query response that includes the fresh document.

    摘要翻译: 提供了一种识别文档集中的新文档的方法。 该方法可以包括获得包括在包括多个文档的文档集合中的查询文档。 该方法还可以包括基于多个文档之间的文本相似性将多个文档分组成多个精细集群。 该方法还可以包括识别多个精细集群内的目标精细集群,目标精细集群包括查询文档。 该方法还可以包括按时间排序包含在目标细集群中的文档以识别新文档。 该方法还可以包括生成包括新鲜文档的查询响应。

    COPYING A DIFFERENTIAL DATA STORE INTO TEMPORARY STORAGE MEDIA IN RESPONSE TO A REQUEST
    35.
    发明申请
    COPYING A DIFFERENTIAL DATA STORE INTO TEMPORARY STORAGE MEDIA IN RESPONSE TO A REQUEST 有权
    将不同数据存储复制到临时存储介质中以响应请求

    公开(公告)号:US20100280997A1

    公开(公告)日:2010-11-04

    申请号:US12432807

    申请日:2009-04-30

    IPC分类号: G06F17/30

    摘要: A plurality of differential data stores are stored in persistent storage media. In response to receiving a first request to store a particular data object, one of the differential data stores that are stored in the persistent storage media is selected, wherein selecting the one differential data store is according to a criterion relating to compression of data objects in the differential data stores. The selected differential data store is copied into temporary storage media, where the copying is not delayed after receiving the first request to await receipt of more requests. The particular data object is inserted into the copy of the selected differential data store in the temporary storage media, where the inserting is performed without having to retrieve more data from the selected differential store in the persistent storage media. The selected differential data store in the persistent storage media is replaced with the copy of the selected differential data store in the temporary storage media that has been modified.

    摘要翻译: 多个差分数据存储器存储在持久存储介质中。 响应于接收到存储特定数据对象的第一请求,选择存储在永久存储介质中的差分数据存储之一,其中选择一个差分数据存储是根据与数据对象的压缩有关的标准 差分数据存储。 所选择的差分数据存储被复制到临时存储介质中,其中在接收到等待接收更多请求的第一请求之后复制不被延迟。 将特定数据对象插入临时存储介质中所选择的差分数据存储的副本,其中执行插入,而不必从永久存储介质中的所选择的差分存储中检索更多的数据。 永久存储介质中所选择的差分数据存储被所修改的临时存储介质中所选差分数据存储的副本所替代。

    DATA PROCESSING APPARATUS AND METHOD OF PROCESSING DATA
    36.
    发明申请
    DATA PROCESSING APPARATUS AND METHOD OF PROCESSING DATA 有权
    数据处理装置和数据处理方法

    公开(公告)号:US20090113167A1

    公开(公告)日:2009-04-30

    申请号:US12256329

    申请日:2008-10-22

    IPC分类号: G06F12/08

    CPC分类号: G06F11/1451

    摘要: Data processing apparatus comprising: a chunk store containing specimen data chunks, a manifest store containing at least one manifest that represents at least a part of a data set and that comprises at least one reference to at least one of said specimen data chunks, a sparse chunk index containing information on only those specimen data chunks having a predetermined characteristic, the processing apparatus being operable to process input data into input data chunks and to use the sparse chunk index to identify at least one of said at least one manifest that includes at least one reference to one of said specimen data chunks that corresponds to one of said input data chunks having the predetermined characteristic.

    摘要翻译: 数据处理装置包括:包含样本数据块的块存储器,包含至少一个表示数据集的至少一部分的清单的清单存储器,并且包括至少一个对所述样本数据块中的至少一个的引用,稀疏 块指数仅包含具有预定特征的样本数据块的信息,该处理装置可操作以将输入数据处理成输入数据块,并使用稀疏块指数来识别至少包括至少一个清单中的至少一个 对与所述具有预定特征的所述输入数据块之一相对应的所述样本数据块之一的引用。

    Determining an approximate number of instances of an item for an organization
    37.
    发明申请
    Determining an approximate number of instances of an item for an organization 有权
    确定组织的项目的实例的大致数量

    公开(公告)号:US20090024682A1

    公开(公告)日:2009-01-22

    申请号:US11880135

    申请日:2007-07-20

    IPC分类号: G06F1/02

    CPC分类号: G06Q10/10

    摘要: Embodiments of the present invention pertain to determining an approximate number of instances of an item for an organization. According to one embodiment, instances of items that reside on computer systems associated with the organization are determined. Instances of the same item can reside on different computers and an identification uniquely identifies an item. Random numbers are associated with identifications of the items. An approximate number of instances of the item is determined based on a highest random number associated with the item. The highest random number is the highest of the random numbers that were generated for the instances of the item.

    摘要翻译: 本发明的实施例涉及确定组织的项目的实例的大致数量。 根据一个实施例,确定驻留在与组织相关联的计算机系统上的项目的实例。 相同项目的实例可以驻留在不同的计算机上,并且标识唯一地标识项目。 随机数与项目的标识相关联。 基于与该项目相关联的最高随机数来确定项目的实例的大致数量。 最高随机数是为项目实例生成的随机数中最高的。

    Method and system for power control in wireless portable devices using wireless channel characteristics
    38.
    发明申请
    Method and system for power control in wireless portable devices using wireless channel characteristics 有权
    使用无线通道特性的无线便携式设备中的功率控制方法和系统

    公开(公告)号:US20050170801A1

    公开(公告)日:2005-08-04

    申请号:US10769044

    申请日:2004-01-30

    IPC分类号: H04B7/005 H04B17/00 H04B7/01

    CPC分类号: H04W52/287 H04W52/24

    摘要: A method controls the operation of devices which communicate over a wireless communications channel. The method includes determining a parameter of a received signal communicated over the wireless communications channel and determining a minimum threshold value of the received signal. An average duration of fade is determined using the parameter and the minimum threshold. The method detects whether the received signal is less than the minimum threshold value. At least one of the devices is placed in a sleep mode for approximately the average duration of fade in response to the received signal being detected as less than the minimum threshold value. The determined parameter of the received signal may be the root mean square value of the received signal.

    摘要翻译: 一种方法控制通过无线通信信道进行通信的设备的操作。 该方法包括确定通过无线通信信道传送的接收信号的参数并确定接收信号的最小阈值。 使用参数和最小阈值确定褪色的平均持续时间。 该方法检测接收信号是否小于最小阈值。 响应于检测到的接收信号小于最小阈值,至少一个设备被置于休眠模式中大约平均的渐变持续时间。 所确定的接收信号的参数可以是接收信号的均方根值。

    Data processing apparatus and method of processing data
    39.
    发明授权
    Data processing apparatus and method of processing data 有权
    数据处理装置及数据处理方法

    公开(公告)号:US08959089B2

    公开(公告)日:2015-02-17

    申请号:US12988365

    申请日:2008-04-25

    IPC分类号: G06F17/30 G06F11/14

    CPC分类号: G06F11/1453

    摘要: One embodiment is a data processing apparatus that has a chunk store containing specimen data chunks, a manifest store containing a plurality of manifests, each of which represents at least a part of previously processed data and includes at least one reference to at least one of the specimen data chunks, and a sparse chunk index containing information on only some specimen data chunks. Input data is processed into a plurality of input data segments. Each manifest of the first set has at least one reference to one of said specimen data chunks that corresponds to one of the input data chunks of a first input data segment. Specimen data chunks corresponding to other input data chunks of the first input data segment are identified by using the identified first set of manifests and at least one manifest identified when processing previous data.

    摘要翻译: 一个实施例是具有包含标本数据块的块存储器的数据处理装置,包含多个清单的清单存储器,每个清单代表先前处理的数据的至少一部分,并且包括至少一个对至少一个 标本数据块,以及仅包含一些标本数据块的信息的稀疏块指数。 输入数据被处理成多个输入数据段。 第一组的每个清单具有对应于第一输入数据段的输入数据块中的一个的所述样本数据块中的一个的至少一个引用。 对应于第一输入数据段的其他输入数据块的样本数据块通过使用所识别的第一组清单和在处理先前数据时识别的至少一个清单来识别。

    SYSTEM AND METHOD FOR IDENTIFYING THE PRINCIPAL DOCUMENTS IN A DOCUMENT SET
    40.
    发明申请
    SYSTEM AND METHOD FOR IDENTIFYING THE PRINCIPAL DOCUMENTS IN A DOCUMENT SET 审中-公开
    用于识别文件集中主要文件的系统和方法

    公开(公告)号:US20120296902A1

    公开(公告)日:2012-11-22

    申请号:US13383592

    申请日:2010-02-13

    IPC分类号: G06F17/30

    CPC分类号: G06F16/34 G06F16/355

    摘要: A method (200) of identifying a principal document in a document set is provided. An exemplary method includes obtaining a document set comprising a plurality of documents (202) and grouping the plurality of documents into a plurality of clusters based, at least in part, on a textual similarity between each of the plurality of documents (204). The method also includes obtaining one or more descriptive terms corresponding to the plurality of documents, wherein the descriptive terms are terms within the plurality of documents that have been identified as being useful for discriminating between the clusters (206). The method also includes, for each cluster, identifying a subset of descriptive terms based, at least in part, on a prevalence of the descriptive terms within the documents of the cluster (208) and identifying the principal documents in the cluster based, at least in part, on a prevalence of the subset of descriptive terms within each of the documents in the cluster (210).

    摘要翻译: 提供了一种识别文档集中的主文档的方法(200)。 一种示例性方法包括:至少部分地基于所述多个文档(204)中的每一个之间的文本相似度,获得包括多个文档(202)的文档集合并将所述多个文档分组为多个集群。 所述方法还包括获得与所述多个文档相对应的一个或多个描述性术语,其中所述描述性术语是所述多个文档中已经被识别为有助于区分所述簇(206)的术语。 该方法还包括对于每个集群,至少部分地基于集群(208)的文档内的描述性条件的流行来识别描述性条款的子集,并且至少基于集群中的主要文档 部分地基于集群(210)中的每个文档内的描述性词语子集的流行。