Abstract:
The rate staggering technique for efficient storage and retrieval of video data is embodied in a data processing system including a disk array based video server, which includes a disk array for storing video data, a processor for processing data, a memory buffer for storing video data, and a rate staggering controller. Data is transferred from the video server to a network through a network interface, and from the network to client stations employing devices having differing resolution capabilities. The processor executes tasks under control of the rate staggering controller. These tasks include a retrieval task which retrieves scalable video from the array of disks, a storage task which stores video temporarily in the memory buffer, and a transmission task which transmits video to client stations through the network.
Abstract:
A content based method is used to download/record digital video from networks to client stations and home VCRs. In this manner, the disadvantages associated with conventional time-based downloading of programs to analog video are avoided. The method includes the steps of receiving each of the video programs, identifying one of the received programs based upon frame signature matching, pre-loaded pattern searching, annotation signal searching, or program bar code searching, and saving the desired video program based upon the performed identification. Alternatively, a storage apparatus is included for storing selected signals. Furthermore, content based filtering is included for identifying selected signals and for transferring the identified selected signals to the storage device.
Abstract:
A dynamic, finite versioning scheme supports concurrent transaction and query processing in which there is no interference between transactions and queries and no quiescence of either transactions or queries for allowing queries to access a more up-to-date database. Only a finite number of logical versions are dynamically maintained on disk for a database page. Acquiring no locks, queries access appropriate query versions, according to their initiation times. Each corresponding query version of all the database pages constitutes a transaction-consistent, but perhaps slightly out-of-date, database snapshot. Through typical concurrency control mechanisms, different transactions access the most up-to-date versions, and their updates are allowed to be incrementally written into the database before they are committed. To save storage, a physical page copy may simultaneously represent multiple versions. The exact logical version(s) that a physical page copy represents changes dynamically and implicitly. A new mechanism using time-invariant and time-varying data structures is introduced to define query snapshots, to facilitate a new query snapshot to be taken without interrupting either the transaction or query processing, to identify dynamically appropriate versions for transaction and query accesses, and to allow efficient, on-the-fly garbage collection when it is recognized that only a single page copy is sufficient to represent the required logical versions.
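The idea that a lock-free query reads the page version matching its initiation time can be illustrated with a minimal sketch. It is not the mechanism of the abstract, which relies on time-invariant and time-varying data structures that are not detailed here. The sketch assumes that snapshots and page versions carry explicit timestamps; the names `snapshot_for_query` and `version_for_snapshot` and the timestamp representation are hypothetical.

```python
import bisect
from typing import List, Tuple


def snapshot_for_query(snapshot_times: List[int], query_start: int) -> int:
    """Return the most recent snapshot taken at or before the query began.

    Assumes snapshot_times is sorted in ascending order (hypothetical layout).
    """
    i = bisect.bisect_right(snapshot_times, query_start)
    if i == 0:
        raise LookupError("no snapshot exists for this query start time")
    return snapshot_times[i - 1]


def version_for_snapshot(page_versions: List[Tuple[int, str]], snapshot_time: int) -> str:
    """Return the newest committed page version visible in the given snapshot.

    page_versions is a short, finite list of (commit_time, payload) pairs
    sorted by commit_time; no lock is taken to read it.
    """
    commit_times = [t for t, _ in page_versions]
    i = bisect.bisect_right(commit_times, snapshot_time)
    if i == 0:
        raise LookupError("page has no version old enough for this snapshot")
    return page_versions[i - 1][1]


# Illustrative data: three snapshots and three versions of one page.
snapshots = [10, 20, 30]
page = [(5, "v1"), (18, "v2"), (27, "v3")]
chosen = version_for_snapshot(page, snapshot_for_query(snapshots, query_start=25))
```

A query that starts at time 25 is bound to the snapshot taken at time 20 and therefore reads `v2`, a transaction-consistent but slightly out-of-date copy, while transactions continue to work on `v3`.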
Abstract:
A communications system and method include an efficient cache invalidation technique which allows a computer to relocate and to disconnect without informing the server. The server partitions the entire database into a number of groups. The server also dynamically identifies recently updated objects in a group and excludes them from the group when checking the validity of the group. If these objects have already been included in the most recent invalidation broadcast, the remote computer can invalidate them in its cache before checking the group validity with the server. With the recently updated objects excluded from a group, the server can conclude that the cold objects in the group can be retained in the cache, and validate the rest of the group.
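The group validity check described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the claimed implementation: the server is assumed to keep a last-update timestamp per object, "recently updated" is assumed to mean "updated within a fixed broadcast window", and the function name `group_is_valid` and its parameters are hypothetical.

```python
from typing import Dict, Iterable


def group_is_valid(
    group: Iterable[str],
    last_update: Dict[str, int],
    client_last_valid: int,
    now: int,
    window: int,
) -> bool:
    """Decide whether a reconnecting client may keep its cached copy of a group.

    Objects updated inside the recent window are excluded from the check,
    on the assumption that the latest invalidation broadcast already told
    the client to drop them. The remaining (cold) objects are valid if none
    of them changed after the client's cache was last known to be valid.
    """
    recent_cutoff = now - window
    for obj in group:
        updated_at = last_update[obj]
        if updated_at > recent_cutoff:
            continue  # hot object: covered by the invalidation broadcast
        if updated_at > client_last_valid:
            return False  # cold object changed while the client was away
    return True


# Illustrative data: object "b" is hot, "a" and "c" are cold and unchanged.
updates = {"a": 5, "b": 95, "c": 10}
keep_group = group_is_valid(["a", "b", "c"], updates, client_last_valid=50, now=100, window=10)
```

Without the exclusion step, the single recent update to `b` would force the client to discard the whole group, including the cold objects `a` and `c`.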
Abstract:
An efficient method and system of communicating and accumulating global information for parallel data mining. Since each database is partitioned, a single node in a parallel environment will be unable to make decisions based on global information. A novel technique of progressive thresholding is devised to accumulate global information as a function of a transaction support threshold while minimizing transmission overhead. The transaction support threshold is varied as a function of the number of nodes contributing to the global information.
Abstract:
An efficient computer implemented method of mining path traversal patterns in a communications network. The method of the present invention comprises two steps. A method, called MF (standing for maximal forward references), is first used to convert an original sequence of log data into a set of traversal subsequences. Each traversal subsequence represents a maximal forward reference from the starting point of a user access. This step of converting the original log sequence into a set of maximal forward references will filter out the effect of backward references, which are mainly made for ease of traveling, and enable us to concentrate on mining meaningful user access sequences. Accordingly, when a backward reference occurs, a forward reference path terminates. This resulting forward reference path is termed a maximal forward reference. After a maximal forward reference is obtained, we backtrack to the starting point of the forward reference and begin a new forward reference path. In addition, the occurrence of a null source node also indicates the termination of an ongoing forward reference path and the beginning of a new one. Second, methods are developed to determine the frequent traversal patterns, termed large reference sequences, from the maximal forward references obtained above, where a large reference sequence is a reference sequence that appears a sufficient number of times in the database to exceed a predetermined threshold.
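The first step, extracting maximal forward references, can be sketched as below. The sketch assumes the log is a time-ordered list of (source, destination) pairs for one user, with a source of `None` standing for the null source node; the function name, the flag variable, and the sample log are illustrative and not taken from the abstract.

```python
from typing import List, Optional, Tuple


def maximal_forward_references(
    log: List[Tuple[Optional[str], str]]
) -> List[List[str]]:
    """Split a traversal log into maximal forward references.

    A path is emitted whenever forward movement is interrupted, either by a
    backward reference (the destination is already on the current path) or
    by a null source node that starts a new traversal.
    """
    results: List[List[str]] = []
    path: List[str] = []
    moving_forward = False

    for source, dest in log:
        if source is None:
            # Null source: close the ongoing forward path and start afresh.
            if moving_forward and path:
                results.append(list(path))
            path = [dest]
            moving_forward = True
        elif dest in path:
            # Backward reference: emit the path once, then backtrack to dest.
            if moving_forward:
                results.append(list(path))
            path = path[: path.index(dest) + 1]
            moving_forward = False
        else:
            # Forward reference: extend the current path.
            path.append(dest)
            moving_forward = True

    if moving_forward and path:
        results.append(list(path))
    return results


# Illustrative session: the user visits A, B, C, D, C, B, E, G, H, G, W, A, O, U, O, V.
visits = "ABCDCBEGHGWAOUOV"
example_log = [(None, visits[0])] + list(zip(visits, visits[1:]))
references = ["".join(p) for p in maximal_forward_references(example_log)]
# references == ["ABCD", "ABEGH", "ABEGW", "AOU", "AOV"]
```

The second step would then count how often each reference sequence occurs across these paths and keep those exceeding the predetermined threshold; that counting is not shown here.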
Abstract:
A system and method for resource adaptive classification of data streams. Embodiments provide for classifying data received in a computer, including discretizing the received data, constructing an intermediate data structure from said received data as training instances, performing subspace sampling on said received data as test instances, and adaptively classifying said received data based on statistics of said subspace sampling.
Abstract:
A computer implemented method, apparatus, and computer usable program code for processing multi-way stream correlations. Stream data are received for correlation. A task is formed for continuously partitioning a multi-way stream correlation workload into smaller workload pieces. Each of the smaller workload pieces may be processed by a single host. The stream data are sent to different hosts for correlation processing.
Abstract:
Techniques for classifying structural data with skewed distribution are disclosed. By way of example, a method for classifying structural input data comprises a computer system performing the following steps. Multiple classifiers are constructed, wherein each classifier is constructed on a subset of training data, using one or more selected composite features from the subset of training data. A consensus among the multiple classifiers is computed in accordance with a voting scheme such that at least a portion of the structural input data is assigned to a particular class in accordance with the computed consensus. Such techniques for structured data classification are capable of handling skewed class distribution and partial feature coverage issues.
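The consensus step can be illustrated with a minimal sketch that assumes simple majority voting; the abstract does not specify which voting scheme is used. Training is omitted: each toy classifier stands in for a model built on one training subset and tests only its own selected composite features. All names and feature labels (`make_feature_classifier`, `consensus_label`, `"ring"`, `"amine"`, `"halogen"`) are hypothetical.

```python
from collections import Counter
from typing import Callable, FrozenSet, Sequence

Classifier = Callable[[FrozenSet[str]], str]


def make_feature_classifier(required: FrozenSet[str]) -> Classifier:
    """Build a toy classifier that votes on its own composite features only."""

    def classify(features: FrozenSet[str]) -> str:
        # Vote "positive" only when every required feature is present.
        return "positive" if required <= features else "negative"

    return classify


def consensus_label(classifiers: Sequence[Classifier], features: FrozenSet[str]) -> str:
    """Assign the class that receives the most votes across all classifiers."""
    votes = Counter(clf(features) for clf in classifiers)
    label, _count = votes.most_common(1)[0]
    return label


# Three classifiers, each covering a different part of the feature space.
ensemble = [
    make_feature_classifier(frozenset({"ring", "amine"})),
    make_feature_classifier(frozenset({"ring"})),
    make_feature_classifier(frozenset({"halogen"})),
]
label = consensus_label(ensemble, frozenset({"ring", "amine"}))
```

Because each classifier sees only part of the feature space, no single one needs full feature coverage; the vote combines their partial views into one class assignment.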