摘要:
One embodiment of the present invention provides a distributed, differential electronic-data storage system (302, 304, 306-309, 310-316, and 318) that includes client computers (310-316), component data-storage systems, and a routing component. Client computers direct data objects (418) to component data-storage systems within the distributed, differential electronic-data storage system. Component data-storage systems provide data storage for the distributed, differential electronic-data storage system. The routing component directs data objects, received from the clients computers, through logical bins (324-338) to component data-storage systems by a compression-enhancing routing method.
摘要:
At least one term is extracted [202] from unstructured information. The at least one term is validated [204]. Then, a sense of the at least one extracted and validated term is determined [206]. The at least one extracted and validated term is clustered [208] into at least one group of terms according to the determined sense. A taxonomy is generated [210] based on the clustering and a mining of accessible taxonomies.
摘要:
One embodiment of the present invention provides a distributed, differential electronic-data backup and archiving system that includes client computers and cells. Client computers execute front-end-application components of the distributed, differential electronic-data backup and archiving system, the front-end application components receiving data objects from client computers and sending the received data objects to cells of the distributed, differential electronic-data backup and archiving system for storage. Cells within the distributed, differential electronic-data backup and archiving system store the data objects, each cell comprising at least one computer system with attached mass-storage and each cell storing entire data objects as lists that reference stored, unique data chunks within the cell, a cell storing all of the unique data chunks for all data objects stored in the cell.
摘要:
One embodiment of the present invention provides a distributed, differential electronic-data backup and archiving system that includes client computers and cells. Client computers execute front-end-application components of the distributed, differential electronic-data backup and archiving system, the front-end application components receiving data objects from client computers and sending the received data objects to cells of the distributed, differential electronic-data backup and archiving system for storage. Cells within the distributed, differential electronic-data backup and archiving system store the data objects, each cell comprising at least one computer system with attached mass-storage and each cell storing entire data objects as lists that reference stored, unique data chunks within the cell, a cell storing all of the unique data chunks for all data objects stored in the cell.
摘要:
One embodiment of the present invention includes a method for routing a data object (502), comprising a sequence of data units (404), to a particular component data-storage system (104-110), or particular group of component data-storage systems, within a distributed, differential electronic-data storage system (104-110, 112, 114, 116) by selecting one or more subsequences of data units (402) from the data object, computing a characteristic value from the selected subsequences, computing an index (420) from the characteristic value; and directing the data object (502) to the particular component data-storage system, or to the particular group component data-storage systems, identified by the computed index.