Locating potentially identical objects across multiple computers
    32.
    发明授权
    Locating potentially identical objects across multiple computers 有权
    在多个计算机上定位潜在的相同对象

    公开(公告)号:US07310644B2

    公开(公告)日:2007-12-18

    申请号:US09876377

    申请日:2001-06-06

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30156

    摘要: Potentially identical objects (such as files) across multiple computers are located. In one embodiment, a computer generates object information for an object stored on the computer. The object information can be generated in a variety of manners (e.g., based on hashing the object, based on characteristics of the object, and so forth). The object information is then transferred to one or more database server computers, where the object information can be compared to object information from other computers to determine whether the object is potentially identical to another object on one of the other computers.

    摘要翻译: 位于多个计算机上的潜在相同对象(如文件)。 在一个实施例中,计算机生成存储在计算机上的对象的对象信息。 可以以各种方式生成对象信息(例如,基于对象的散列,基于对象的特征等)。 然后将对象信息传送到一个或多个数据库服务器计算机,其中可以将对象信息与来自其他计算机的对象信息进行比较,以确定对象是否与其他计算机之一上的另一个对象潜在地相同。

    Encryption systems and methods for identifying and coalescing identical objects encrypted with different keys
    33.
    发明授权
    Encryption systems and methods for identifying and coalescing identical objects encrypted with different keys 失效
    用于识别和合并用不同密钥加密的相同对象的加密系统和方法

    公开(公告)号:US06983365B1

    公开(公告)日:2006-01-03

    申请号:US09565821

    申请日:2000-05-05

    IPC分类号: H04L9/00

    摘要: Cryptographic protocols and methods of employing the same are described. The described protocols advantageously enable two or more identical encryptable objects that are coded for encryption with different keys to be identified as identical without access to either the unencrypted objects or the keys that are used in the encryption process. Additionally, the protocols enable two or more identical encryptable objects to be processed with different encryption keys, yet be stored in a manner so that the total required storage space is proportional to the space that is required to store a single encryptable object, plus a constant amount for each distinct encryption key. In various embodiments, the encryptable objects comprise files and the cryptographic protocols enable encrypted files to be used in connection with single instance store (SIS) systems.

    摘要翻译: 描述了使用它们的密码协议和方法。 所描述的协议有利地使得被编码用于具有不同密钥的加密的两个或更多个相同的可加密对象被识别为相同,而不访问加密过程中使用的未加密对象或密钥。 另外,这些协议允许使用不同的加密密钥来处理两个或多个相同的可加密对象,但是以一定的方式存储,使得所需的总存储空间与存储单个可加密对象所需的空间成正比,加上常数 每个不同加密密钥的数量。 在各种实施例中,可加密对象包括文件,并且加密协议使加密文件能够与单实例存储(SIS)系统相关联使用。

    Continuous media file server system and method for scheduling network
resources to play multiple files having different data transmission
rates
    34.
    发明授权
    Continuous media file server system and method for scheduling network resources to play multiple files having different data transmission rates 失效
    连续媒体文件服务器系统和方法,用于调度网络资源,播放具有不同数据传输速率的多个文件

    公开(公告)号:US6134596A

    公开(公告)日:2000-10-17

    申请号:US934972

    申请日:1997-09-18

    摘要: A continuous media file server system has a controller connected to multiple data servers. Each data server supports at least one storage disk. Data files are distributed across the data servers so that data blocks of the data files are stored on each of the storage disks. The data files have different data transmission rates at which they are served over a network to clients in the form of data streams. A scheduling unit maintains a network schedule that provides a relative ordering of transmission times of requested data streams. The transmission times indicate when the data servers are to transmit corresponding data blocks of the requested data files over the network to stream the data to clients. The block play times for all data files have a fixed duration, with the size of the data blocks from data file to data file varying according to the data transmission rates of the files. When a transmission time for a data file block approaches, the scheduling unit instructs the appropriate data server to read a data block for that data file from the disk prior to the transmission time in the network schedule. In this manner, disk reads are scheduled implicitly according to the network schedule. The data is temporarily stored in buffer memory, and later transmitted over the network. When a request for a new data stream is received, the scheduling unit evaluates whether the new data stream can be inserted into the network schedule without overburdening the data servers' ability to serve the existing data streams on the network schedule.

    摘要翻译: 连续的媒体文件服务器系统具有连接到多个数据服务器的控制器。 每个数据服务器至少支持一个存储磁盘。 数据文件分布在数据服务器上,使数据文件的数据块存储在每个存储磁盘上。 数据文件具有不同的数据传输速率,它们以数据流的形式通过网络向客户端提供服务。 调度单元维护提供所请求数据流的传输时间的相对排序的网络调度。 传输时间表示数据服务器何时通过网络传输所请求的数据文件的相应数据块,以将数据流传输到客户端。 所有数据文件的块播放时间具有固定的持续时间,数据块的数据块的大小根据文件的数据传输速率而变化。 当数据文件块的传输时间接近时,调度单元指示适当的数据服务器在网络调度中的发送时间之前从盘读取该数据文件的数据块。 以这种方式,磁盘读取根据网络调度隐含调度。 数据临时存储在缓冲存储器中,然后通过网络传输。 当接收到对新的数据流的请求时,调度单元评估新数据流是否可以被插入到网络调度中,而不会使数据服务器在网络调度上服务于现有数据流的能力过高。

    Distributed scheduling in a multiple data server system
    35.
    发明授权
    Distributed scheduling in a multiple data server system 失效
    多数据服务器系统中的分布式调度

    公开(公告)号:US5867657A

    公开(公告)日:1999-02-02

    申请号:US684840

    申请日:1996-06-06

    摘要: The scheduling functionality for a distributed system having multiple data servers is spread across the data servers rather than being centralized at a single machine. The distribution of the scheduling across multiple servers enhances the scalability of the system and enhances the fault tolerance of the system. Viewer state records are employed that contain scheduling information for viewers. Scheduling occurs by circulating the viewer state records amongst the data servers that are to provide the requisite data. The data servers output data blocks in a predetermined sequence based on how the data blocks are striped across the data servers. Each viewer state record is forwarded to a first successor and second successors in the scheduling sequence. The task of adding a viewer to a schedule is also distributed to the data servers.

    摘要翻译: 具有多个数据服务器的分布式系统的调度功能分布在数据服务器上,而不是集中在单个机器上。 跨多个服务器的调度分配增强了系统的可扩展性,并增强了系统的容错能力。 采用查看器状态记录,其中包含查看者的调度信息。 通过在提供必需数据的数据服务器之间循环观看者状态记录来进行调度。 数据服务器基于数据块如何跨数据服务器进行条带化,以预定的顺序输出数据块。 每个查看器状态记录被转发到调度序列中的第一后继者和第二后继者。 将查看器添加到日程表的任务也被分发到数据服务器。

    Virtually infinite reliable storage across multiple storage devices and storage services
    36.
    发明授权
    Virtually infinite reliable storage across multiple storage devices and storage services 有权
    几乎无限可靠的存储跨多个存储设备和存储服务

    公开(公告)号:US08010498B2

    公开(公告)日:2011-08-30

    申请号:US11101610

    申请日:2005-04-08

    CPC分类号: G06F9/38 G06F11/2094

    摘要: A logical file system that distributes copies of files across various different physical storage resources yet provides a consistent view to the user of his or her data, regardless of which machine the user is accessing the files from, and even when the user's computer is offline. The distributed file system uses smart data redundancy to enable a virtually infinite amount of storage as long as additional storage resources are made available to the distributed file system. The result is a reliable storage system that does not necessarily tie the user's data to the user's particular computer. Instead, the user's data is associated with the user—for life—or for however long the user would like the data to be maintained, regardless of whether the user's computer or data storage components are replaced or destroyed.

    摘要翻译: 分配不同物理存储资源的文件副本的逻辑文件系统,但是为用户提供一致的视图,无论用户正在从哪个机器访问文件,甚至当用户的计算机脱机时也是如此。 只要额外的存储资源可用于分布式文件系统,分布式文件系统使用智能数据冗余来实现几乎无限量的存储。 结果是可靠的存储系统不一定将用户的数据与用户的特定计算机相结合。 相反,无论用户的计算机或数据存储组件是否被更换或销毁,用户的数据都会与用户生活相关联 - 或者长时间用户希望维护数据。

    Method and system for regulating background tasks using performance measurements

    公开(公告)号:US07765553B2

    公开(公告)日:2010-07-27

    申请号:US11010471

    申请日:2004-12-13

    IPC分类号: G06F9/46

    CPC分类号: G06F9/4881

    摘要: A method and system for regulating tasks of background processes so as to reduce interference with foreground processes. The progress rate of a background task (e.g., amount of work performed per unit time) is measured and evaluated against a target amount. If the progress rate appears degraded, the background task is suspended for a computed time interval so as to back off from its interference with a foreground process. Each time the progress rate appears degraded, the time interval is exponentially increased from its previous value up to a maximum, however if the performance appears normal, the time interval is reset to a minimum. Evaluation of the work is statistically based so as to eliminate variations in measurements, and automatic calibration of the target amount is provided, as is a mechanism for prioritizing multiple background tasks.

    Efficient processing of time-bounded messages
    38.
    发明授权
    Efficient processing of time-bounded messages 有权
    有效处理有时限的消息

    公开(公告)号:US07600126B2

    公开(公告)日:2009-10-06

    申请号:US11139705

    申请日:2005-05-27

    IPC分类号: H04L9/32

    摘要: Efficient processing of time-bound messages is described herein. In an implementation, messages are classified either time-bounded messages or non-time-bounded messages. The time-bounded messages are then processed separately from the non-time-bound messages. Examples of time-bounded messages can include spam e-mails or other types or classes of e-mails subject to a retention policy, such that the are retained only for some set period of time. Metadata relating to the time-bound messages are compiled and stored separately from the messages, thereby enabling optimization of the metadata storage and processing apart from the messages. The time-bounded messages are accumulated before they are processed in bulk, thereby reducing the number if I/O operations consumed by the time-bound messages, and reducing the amount and cost of resources supporting the process of the time-bounded messages.

    摘要翻译: 这里描述了时间有限的消息的有效处理。 在实现中,消息被分类为时间有限的消息或非有时限的消息。 然后将有时限的消息与非时间限制的消息分开处理。 有时限的消息的示例可以包括垃圾邮件或受保留策略限制的其他类型或类型的电子邮件,以便仅保留一些设定的时间段。 与时间有关的消息相关的元数据与消息分开地被编译和存储,从而能够优化除消息之外的元数据存储和处理。 时间有限的消息在批量处理之前被累积,从而减少由有时限的消息消耗的I / O操作的数量,并且减少支持有时限消息的处理的资源的数量和成本。

    On-disk file format for a serverless distributed file system

    公开(公告)号:US07539867B2

    公开(公告)日:2009-05-26

    申请号:US11014317

    申请日:2004-12-16

    IPC分类号: G06F11/30 H04K1/00

    摘要: A file format for a serverless distributed file system is composed of two parts: a primary data stream and a metadata stream. The data stream contains a file that is divided into multiple blocks. Each block is encrypted using a hash of the block as the encryption key. The metadata stream contains a header, a structure for indexing the encrypted blocks in the primary data stream, and some user information. The indexing structure defines leaf nodes for each of the blocks. Each leaf node consists of an access value used for decryption of the associated block and a verification value used to verify the encrypted block independently of other blocks. In one implementation, the access value is formed by hashing the file block and encrypting the resultant hash value using a randomly generated key. The key is then encrypted using the user's key as the encryption key. The verification value is formed by hashing the associated encrypted block using a one-way hash function. The file format supports verification of individual file blocks without knowledge of the randomly generated key or any user keys. To verify a block of the file, the file system traverses the tree to the appropriate leaf node associated with a target block to be verified. The file system hashes the target block and if the hash matches the access value contained in the leaf node, the block is authentic.