Abstract:
Similarity measurement has been one of the critical issues for successful content-based retrieval. Simple quadratic forms of distance are inadequate because they do not necessarily correspond to perceived similarity, nor are they adaptive to different applications. This patent application describes a new sequential query processing algorithm for evaluating content-based composite object queries. The composite objects consist of spatial and temporal arrangements of simple objects. The simple objects are defined in terms of spatial, temporal, feature and semantic attributes. The query method defines a process for executing a best-first search for the matches to the query, while providing a flexible framework for broadening the search space as required. The query method guarantees that there are no false dismissals of the candidate composite objects.
Abstract:
An iterative refinement algorithm is described for content-based retrieval of images based on low-level features, such as textures, color histograms, and shapes, that can be described by feature vectors. This technique adjusts the original feature space to the new application by performing nonlinear multidimensional scaling. Consequently, the transformed distance between feature vectors that are considered similar is minimized in the new feature space. Meanwhile, the distances among clusters are maintained. User feedback is utilized to refine the query by dynamically adjusting the similarity measure and modifying the linear transform of features, along with revising the feature vectors.
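The idea of adjusting the similarity measure from user feedback can be illustrated with a minimal sketch. It does not implement the nonlinear multidimensional scaling named in the abstract; it swaps in a simpler, commonly used heuristic, inverse-variance re-weighting of feature dimensions, and the names `feedback_weights` and `weighted_distance` are hypothetical.

```python
import math
from statistics import pvariance


def feedback_weights(relevant, eps=1e-6):
    """Derive per-dimension weights from feature vectors the user marked relevant.

    Assumption (not from the abstract): a dimension on which the relevant
    examples agree (low variance) matters more, so it gets a larger weight.
    """
    dims = list(zip(*relevant))  # transpose: one tuple per feature dimension
    raw = [1.0 / (pvariance(d) + eps) for d in dims]
    total = sum(raw)
    return [w / total for w in raw]  # normalize so the weights sum to 1


def weighted_distance(x, y, weights):
    """Weighted Euclidean distance between two feature vectors."""
    return math.sqrt(sum(w * (a - b) ** 2 for w, a, b in zip(weights, x, y)))


# Usage: the relevant examples agree on dimension 0 and disagree on dimension 1.
relevant = [(1.0, 0.0), (1.0, 10.0), (1.0, 5.0)]
weights = feedback_weights(relevant)
```

After this feedback round, differences along the first dimension dominate the distance, so candidates that match the relevant examples on that dimension rank higher.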
Abstract:
An apparatus and method for generating a view element representation of multiple-attribute tabular data are provided, including converting tabular data into a multidimensional lattice form whereby each functional attribute of the relational data is mapped to a dimension in the lattice, and each cell in the lattice corresponds to an aggregation over records in the data table. The mechanism further provides for generating a view element representation of multidimensional lattice data by decomposing the multidimensional data into view elements such that the view elements retain sufficient information to reconstruct the original lattice data. Alternatively, the mechanism for generating a view element data representation includes iteratively decomposing the lattice data into aggregated and residual view elements, or spatially partitioning the lattice data. Further taught are an apparatus and a method for generating a view element data representation including the decomposition of the lattice data by iteratively and jointly aggregating and spatially partitioning the lattice data. Under the system for representing the data using view element sets, costs and benefits are assigned to the view elements, and the view element sets are formed on the basis of the costs and benefits. A view of the data is synthesized from a set of view elements by selecting view elements from the view element sets and assembling them to construct the view.
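The decomposition into aggregated and residual view elements can be sketched in one dimension. This is an illustration under stated assumptions, not the patented method: it assumes integer cell values (for example, record counts), an even number of cells, and a pairwise sum/difference split; `decompose` and `reconstruct` are hypothetical names.

```python
def decompose(cells):
    """Split a 1-D lattice into an aggregated and a residual view element.

    Assumption: adjacent cells are paired; the aggregated element holds each
    pair's sum and the residual element holds each pair's difference.
    """
    if len(cells) % 2 != 0:
        raise ValueError("this sketch assumes an even number of cells")
    pairs = list(zip(cells[0::2], cells[1::2]))
    aggregated = [a + b for a, b in pairs]
    residual = [a - b for a, b in pairs]
    return aggregated, residual


def reconstruct(aggregated, residual):
    """Rebuild the original cells from the two view elements."""
    cells = []
    for s, d in zip(aggregated, residual):
        # s = a + b and d = a - b, so a = (s + d) / 2 and b = (s - d) / 2.
        # Both numerators are even for integer inputs, so // is exact.
        cells.extend([(s + d) // 2, (s - d) // 2])
    return cells


# Usage: the two view elements together reproduce the original lattice.
lattice = [4, 2, 7, 1]
aggregated, residual = decompose(lattice)
```

Applying `decompose` again to the aggregated element gives the iterative form of the decomposition, with each level producing a coarser aggregate and one more residual.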
Abstract:
A method and apparatus are disclosed that receive an input narrative in a particular media format and produce therefrom a multimedia-enriched version of the input narrative. In one embodiment, a method is provided for enriching an input narrative with multimedia content, wherein the method includes the step of selectively segmenting the input narrative to form queries pertaining to different portions or elements of the input narrative. The method further comprises retrieving multimedia artifacts from one or more multimedia repositories, wherein each retrieved multimedia artifact is associated with one of the queries, and selecting a subset of the retrieved multimedia artifacts, wherein the subset includes artifacts comprising different types of multimedia content. The input narrative is then combined with the respective multimedia artifacts of the subset to provide an enriched and enhanced narrative.
Abstract:
Methods and apparatus for optimizing resource allocation among data analysis functions in a classification system are provided. Each of the data analysis functions is characterized as a set of operating points in accordance with at least one of resource requirements and analysis quality. An operating point for each of the data analysis functions is selected in accordance with one or more constraints. The analysis functions are applied at selected operating points to optimize resource allocation among the data analysis functions in the classification system.
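Selecting one operating point per analysis function under a constraint can be sketched as a small exhaustive search. The assumptions here are not from the abstract: each operating point is a `(cost, quality)` pair, costs and qualities add across functions, and the only constraint is a total cost budget. The name `select_operating_points` is hypothetical.

```python
from itertools import product


def select_operating_points(functions, budget):
    """Pick one (cost, quality) operating point per analysis function.

    Assumption: maximize total quality subject to total cost <= budget.
    Returns a dict mapping function name to its chosen point, or None when
    no combination fits within the budget.
    """
    names = list(functions)
    best_choice, best_quality = None, float("-inf")
    for combo in product(*(functions[name] for name in names)):
        cost = sum(point[0] for point in combo)
        quality = sum(point[1] for point in combo)
        if cost <= budget and quality > best_quality:
            best_choice, best_quality = dict(zip(names, combo)), quality
    return best_choice


# Usage: two hypothetical analysis functions, each with a cheap and a costly setting.
functions = {
    "face_detector": [(1, 5), (3, 9)],
    "text_detector": [(1, 4), (2, 7)],
}
choice = select_operating_points(functions, budget=4)
```

Exhaustive search is exponential in the number of functions, so it only suits small configurations; it is used here because it makes the selection criterion explicit.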
Abstract:
A method and apparatus for classifying or annotating image and video content by assigning each class of event, action, region or object a unique symbol and then building symbol strings to represent sequences in space or time. The symbol strings can be decoded using a library of annotated Precedence Template (PT) descriptors to automatically label the image and video content. Furthermore, the PT descriptors can be used for searching by sketch or searching by example, where the search query input is converted to symbol strings, which are efficiently compared based on the presence and relative counts of PT descriptors mapped into PT matrices. The PT descriptor can be used for classifying and querying video based on the spatial and temporal orderings of regions, objects, actions or events. Applied to video, the PT descriptors provide a way to compare the temporal order of events, actions, or objects such as those represented in a scene transition graph, key-frame list, or event string. Applied to images, the PT descriptors provide a way to compare the spatial arrangement of image regions or objects. By capturing the spatial and temporal relationships statistically, the PT descriptors provide a robust way to measure similarity in the presence of insertions, deletions, substitutions, replications and relocations of events, actions, regions or objects.
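Comparing symbol strings by the presence and relative counts of precedence relations can be sketched as follows. The abstract does not define the PT matrix precisely, so this assumes one plausible reading: an entry counts how often symbol `a` occurs somewhere before symbol `b`. The names `pt_counts` and `pt_similarity`, and the overlap score, are hypothetical.

```python
from collections import Counter
from itertools import combinations


def pt_counts(symbols):
    """Count every ordered pair (a, b) in which a occurs before b.

    combinations() preserves positional order, so each pair is a precedence
    relation; the Counter acts as a sparse precedence matrix.
    """
    return Counter(combinations(symbols, 2))


def pt_similarity(x, y):
    """Fraction of precedence relations shared by two symbol strings.

    Assumption: similarity is the size of the multiset intersection divided
    by the larger total count. Strings shorter than two symbols have no
    precedence relations, so the score is 0.0 in that case.
    """
    cx, cy = pt_counts(x), pt_counts(y)
    total = max(sum(cx.values()), sum(cy.values()))
    if total == 0:
        return 0.0
    shared = sum((cx & cy).values())  # & keeps the minimum count per pair
    return shared / total


# Usage: deleting one event keeps half of the precedence relations,
# while reversing the order keeps none.
deleted = pt_similarity("ABCD", "ABD")
reversed_order = pt_similarity("ABC", "CBA")
```

Because the score degrades gradually rather than failing on the first mismatch, a deletion or insertion lowers similarity without eliminating it, which matches the robustness the abstract describes.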
Abstract:
An invention for generating standard description records from multimedia information is described. The invention utilizes fundamental entity-relation models for the Generic AV DS that classify the entities, the entity attributes, and the relationships into relevant types to describe visual data. It also involves classification of entity attributes into syntactic and semantic attributes. Syntactic attributes can be categorized into different levels: type/technique, global distribution, local structure, and global composition. Semantic attributes can likewise be discretely categorized: generic object, generic scene, specific object, specific scene, abstract object, and abstract scene. The invention further classifies entity relationships into syntactic and semantic categories. Syntactic relationship categories include spatial, temporal, and visual categories. Semantic relationship categories include lexical and predicative categories. Spatial and temporal relationships can be topological or directional; visual relationships can be global, local, or composition; lexical relationships can be synonymy, antonymy, hyponymy/hypernymy, or meronymy/holonymy; and predicative relationships can be actions (events) or states.
Abstract:
A system and method for uniformly describing space and frequency views of multi-dimensional lattice data, including regions, tilings and hierarchical decompositions of image, video, audio content, and time series data in space, time, frequency and resolution. The space and frequency view description scheme provides a way to specify regions in space, time, frequency and resolution in terms of space and frequency views. This allows specification of concepts such as “half resolution”, “upper right quadrant” or “high pass band”, such as when referring to views of an image. The space and frequency view description scheme also provides for SFTilings, which comprise non-redundant sets of views; SFPartitionings, which are SFTilings that completely cover the space; and SFHierarchical decompositions, which comprise hierarchies of SFViews where there are processing dependencies among views. The inventive system and method also provide a way to index these views, which is important in applications of progressive retrieval in which a client application can send requests for views of images stored at a server in terms of an index value. The space and frequency view description scheme handles the details concerning the access and relationship of views with different resolutions, spatial locations and sizes.
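Two of the views named above, "upper right quadrant" and "half resolution", can be illustrated on a small image. This is only a sketch of what such views select, not the description scheme itself: it assumes a grayscale image stored as a list of rows with even dimensions, and it assumes half resolution means averaging 2x2 blocks. The function names are hypothetical.

```python
def upper_right_quadrant(image):
    """Spatial view: the top half of the rows and the right half of the columns."""
    height, width = len(image), len(image[0])
    return [row[width // 2:] for row in image[: height // 2]]


def half_resolution(image):
    """Resolution view: each output pixel is the mean of one 2x2 input block.

    Assumption: block averaging stands in for the low-pass filtering a real
    frequency decomposition would use.
    """
    height, width = len(image), len(image[0])
    return [
        [
            (image[r][c] + image[r][c + 1] + image[r + 1][c] + image[r + 1][c + 1]) / 4
            for c in range(0, width - 1, 2)
        ]
        for r in range(0, height - 1, 2)
    ]


# Usage: a 4x4 image yields a 2x2 result for either view.
image = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
    [13, 14, 15, 16],
]
quadrant = upper_right_quadrant(image)
half = half_resolution(image)
```

Both views have the same size but carry different information: the quadrant keeps full detail for a quarter of the area, while the half-resolution view covers the whole area at reduced detail.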