摘要:
A special-purpose file system is provided which includes a combination of services and features which improve the performance of applications employing caches, such as caching web proxies. This file system also provides a file system cache which can be used as a general high-performance file system cache for file systems which need a cache. The services and features include collocating files together on a disk; keeping file meta-data in memory; compressing file names to hash values for reference; using a flat naming scheme for files; providing background daemons that pack files into clusters and clean disk and main memory space; and implementing a user-level library. The file system reduces access to related disk files, eliminates disk operations that are needed to maintain directory and meta-data information, and avoids multiple buffering of file data, and saves and efficiently manages disk and main memory space.
摘要:
A data replication system is disclosed in which replication functionalities between a host computer, an interconnecting computer network, and a plurality of storage devices are separated into host elements and a plurality of storage elements. The host computer is connected to one or more host elements. The host element is responsible for replicating data between the storage devices, which are each connected to an associated storage element, and for maintaining data consistency. Further, the host element instructs a storage element whose associated storage device does not contain up-to-date data to recover from another one of the plurality of storage elements and its associated storage device. The storage elements and their associated storage devices may be located in any combination of diverse or same geographical sites in a manner to ensure sufficient replication in the event of a site or equipment failure. The storage elements are responsible for executing read and write requests received from the host computer and for maintaining data consistency. When a storage element and its associated storage device is determined not to contain up-to-date data, recovery is effected by data transfer from one of the other storage elements and its associated storage device, that is indicated by the host element. Such recovery is done directly between the respective storage elements and associated storage devices, without transiting the data through the host element.
摘要:
Systems, methods and computer readable media for identifying transient content in web pages. Transient content can be identified, for example, by parsing different versions of the same web page into tokens, and inserting fingerprints associated with the tokens into data structures. The data structures can be compared to each other to identify differences between the web pages, thereby identifying transient content associated with the web pages.
摘要:
A method and system for servicing disk I/O requests from domains which have been guaranteed minimum quality of disk service maintains I/O requests for each domain are in separate queues. The queues are serviced by a disk scheduler which selects requests from the queues in accordance with a fair queuing scheduling algorithm that considers the estimated time required to service the request at the head of a queue with regard for the size of the input or output associated with the request, the proportion of disk bandwidth assigned to the particular domain, and the state of the other queues.
摘要:
A uniform application programming interface (API) provides efficient generation and control of resource reservations so as to ensure a desired quality of service (QoS) for applications running on an operating system of a computer system. The uniform API is based on a hierarchical file system which associates resource reservations with references to shared objects, rather than with the objects themselves. The uniform API permits the use of a variety of different proportional share schedulers for controlling access to physical resources, e.g., CPU, memory, disk, network interfaces, etc. of the computer system. The hierarchical file system includes a separate directory for each independently-scheduled physical resource of the computer system. A parent of the resource reservation in the hierarchical file system is either a root node of the file system or another reservation for the same resource. The resource reservation represents either an internal reservation, corresponding to a directory in the file system which is permitted to have child nodes associated therewith in the file system, or a queue, corresponding to a directory which is not permitted to have child nodes associated therewith. A given request for a resource reservation is processed by a proportional share scheduler associated with the resource, and identifies a queue having a predetermined portion of the resource allocated thereto. The invention also provides mechanisms for request tagging by the operating system, parental limitation of resources used by a child process, and “garbage collection” of resource reservations that are no longer needed.
摘要:
Received packets are processed utilizing protocol processing stages in an operating system, in accordance with a technique referred to as signaled receiver processing (SRP). A given protocol processing stage uses a next stage submit (NSS) function to pass a packet to a subsequent protocol processing stage. Each protocol processing stage is associated with one or more sockets. Each socket is associated with the input function of a protocol processing stage, an unprocessed input queue (UIQ) that contains packets destined to the socket but not yet processed by the socket's input function, and a list of owner processes, which expect input from the socket. NSS uses a multi-stage early demultiplexing (MED) function, which determines a packet's next destination socket, based on the stage currently processing the packet and on information contained in the packet. If the amount of buffering used by a packet plus the amount of input buffering already used by the packet's destination socket exceeds a designated limit associated with that socket, NSS drops the packet. Otherwise, NSS enqueues the packet in that socket's UIQ and signals that socket's owner processes. The signal triggers a default action corresponding to performance of the required protocol processing by one of the destination socket's owner processes, but those processes may override the default action by, e.g., deferring the required protocol processing until execution of a subsequent receive call.
摘要:
Methods, systems, and apparatus, including computer program products, for clustering documents. A plurality of documents are identified from a set of documents, where the identified documents have the same top N terms by term frequency score for an integer N. A pattern string that is satisfied by at least a subset of the identified documents is identified. A document cluster is formed from at least the subset of the identified documents.
摘要:
A computer operating system that allows legacy applications to be run automatically with quality of service (QoS) guarantees matching required QoS performance levels. In accordance with the invention, files have QoS requirement attributes that can be set-by users. Additionally, users may interpose a requirement broker between a given legacy application and the operating system. The requirement broker may be in the form of a modified version of a library that is dynamically linked with applications at load time. The requirement broker intercepts certain system calls and automatically requests from the system QoS guarantees in accordance with the QoS requirement attributes of the accessed files, whether local or remote.
摘要:
A system for detecting the movement of a device that receives a signal broadcast from a relative fixed position, such as a set-top terminal in a Direct Broadcast Satellite (DBS) system, is disclosed, using the phase shift of the broadcast signal, as received by the device. A set-top terminal or similar mechanism restricts access to the transmitted multimedia information using stored decryption keys. If a set-top terminal is moved, the distance between the set-top terminal and the satellite will change, thereby causing a measurable phase shift in the signal received by the set-top terminal. The set-top terminal periodically or intermittently records a sequence of bits received from the satellite and provides the recorded bits to the service provider server for analysis. If the set-top terminal has been moved, the recorded bits will shift. The sequence of bits may be recorded at random intervals, triggered by an external event, shown, for example, by an accurate internal clock, a request sent by a terrestrial paging network, or a request sent by a low-earth-orbit (LEO) satellite messaging system. The service provider detects relative movements of the set-top terminals by instructing a group of set-top terminals in geographical proximity to start recording at the same time, and comparing the sequence of bits recorded by each set-top terminal.
摘要:
A recovery process allows a recovering replica in a replicated data storage system to recover from a current replica the changes to the data that it has missed during a failure while, at the same time, the system is processing new requests issued by the source. Sequence numbers, assigned by the source to write requests and stored by each replica in association with the data item that the write request modifies, are used by the recovery process to determine when a replica has missed one or more write requests and requires recovery. During recovery, the recovering replica ignores all requests directly received from the source, and replaces the contents of a data item with the data received from the current replica only if the newly received data item has a higher sequence number than the corresponding sequence number of the data item already stored in the recovering replica. It also updates data items in response to new write requests that are forwarded to it by the current replica. At the same time, the current replica continues to receive requests from the source, performs those requests, and forwards those write requests to the recovering replica. It also scans its data items and sends to the recovering replica those data items and associated sequence numbers that have a sequence number higher than the sequence number of the last consecutive write request sent by the source that the recovering replica received before missing a write request.