摘要:
An apparatus and method for representing and retrieving multi-dimensional data such as large satellite images. Images are stored in forms that can be rapidly browsed and retrieved by remote client applications in a drill-down or roll-up fashion. The data can be represented and retrieved using a view element data structure that includes node elements and transition elements between nodes. The data is decomposed (in space or spatial-frequency to construct a tree-based or graph-based data structure) into view elements. A set of view elements is selected, compressed and stored without adversely impacting image view extraction or generation speed. View elements are placed into the node elements of the data structure and the transition elements indicate the processing to generate other view elements in the data structure. In a server-side view construction, the view elements are selectively retrieved from storage, decompressed, and processed to generate the views of the data. In a client-side progressive view construction, the client caches the view elements and processes them in combination with view elements retrieved from the server to generate views of the data. The data reuse at the client reduces data transmission in drill-down or roll-up browsing. Data can be ingested, read and written in units of spatial blocks and decomposed into view elements using the spatial block units. Thus, the ingestion, decomposition, compression, and view retrieval for large images can be done using computer devices that have limited storage and processing capabilities.
摘要:
The present invention provides an elegant solution for processing multi-feature queries, which considers the differing access costs associated with each feature. Access cost is a critical factor in determining how individual features should be processed in terms of retrieving through sorted or random access, and, hence, in minimizing the overall query response time. The present invention operates dynamically during query processing and seeks to minimize the total query cost in terms of number of features retrieved and cost for access. It works by evaluating different combinations of feature access plans (sorted and random access) according to the number of retrieved features and forward access costs, and it selects the lowest cost plan. Experimental results on practical data show a significant speed-up in multi-features queries using the proposed solution.
摘要:
A framework is provided for describing multimedia content and a system in which a plurality of multimedia storage devices employing the content description methods of the present invention can interoperate. In accordance with one form of the present invention, the content description framework is a description scheme (DS) for describing streams or aggregations of multimedia objects, which may comprise audio, images, video, text, time series, and various other modalities. This description scheme can accommodate an essentially limitless number of descriptors in terms of features, semantics or metadata, and facilitate content-based search, index, and retrieval, among other capabilities, for both streamed or aggregated multimedia objects.
摘要:
Linear optimization queries, which usually arise in various decision support and resource planning applications, are queries that retrieve top N data records (where N is an integer greater than zero) which satisfy a specific optimization criterion. The optimization criterion is to either maximize or minimize a linear equation. The coefficients of the linear equation are given at query time. Methods and apparatus are disclosed for constructing, maintaining and utilizing a multidimensional indexing structure of database records to improve the execution speed of linear optimization queries. Database records with numerical attributes are organized into a number of layers and each layer represents a geometric structure called convex hull. Such linear optimization queries are processed by searching from the outer-most layer of this multi-layer indexing structure inwards. At least one record per layer will satisfy the query criterion and the number of layers needed to be searched depends on the spatial distribution of records, the query-issued linear coefficients, and N, the number of records to be returned. When N is small compared to the total size of the database, answering the query typically requires searching only a small fraction of all relevant records, resulting in a tremendous speedup as compared to linearly scanning the entire dataset.