摘要:
Systems and methods for describing image content establish image description records which include an object set (24), an object hierarchy (26) and entity relation graphs (28). For image content, image objects can include global objects (O0 8) and local objects (O1 2 and O2 6). The image objects are further defined by a number of features of different classes (36, 38 and 40), which in turn are further defined by a number of feature descriptors. The relationships between and among the objects in the object set are defined by the object hierarchy (26) and entity relation graphs (28). The image description records provide a standard vehicle for describing the content and context of image information for subsequent access and processing by computer applications such as search engines, filters, and archive systems.
摘要:
Systems and methods for describing video content establish video description records which include an object set (24), an object hierarchy (26) and entity relation graphs (28). Video objects can include global objects, segment objects and local objects. The video objects are further defined by a number of features organized in classes, which in turn are further defined by a number of feature descriptors (36, 38, and 40). The relationships (44) between and among the objects in the object set (24) are defined by the object hierarchy (26) and entity relation graphs (28). The video description records provide a standard vehicle for describing the content and context of video information for subsequent access and processing by computer applications such as search engines, filters and archive systems.
摘要:
The present invention relates to a system for generating a description record from multimedia information including, e.g., video data. A multimedia information input interface is used to receive multimedia information. A computer processor receives the multimedia information, processes the video information by performing video object extraction processing to generate video object descriptions from the video information, processes the generated video object descriptions by object hierarchy construction and extraction processing to generate video object hierarchy descriptions, and processes the generated video object descriptions by entity relation graph descriptions.
摘要:
The invention provides a next generation decision support system that includes highly adaptive sensors, transport, and models to enable time critical decision making. The decision support model of the system is progressive so that not all input parameters need to be available to produce model prediction output. Input parameters of the model with the largest first order derivative values are given highest priority in data acquisition and transmission. The parameters acquired at the sensors are stored in progressive data representation formats, which later on are progressively transmitted to the decision support server. Progressive data representation, progressive transmission, and progressive model are thus three key elements that need to be integrated together. The present invention presents a method and an apparatus that form an optimized process including the three elements subject to the decision making utility function.
摘要:
An invention for generating standard description records from multimedia information. The invention utilizes fundamental entity-relation models for the Generic AV DS that classify the entities, the entity attributes, and the relationships in relevant types to describe visual data. It also involves classification of entity attributes into syntactic and semantic attributes. Syntactic attributes can be categorized into different levels: type/technique, global distribution, local structure, and global composition. Semantic attributes can be likewise discretely categorized: generic object, generic scene, specific object, specific scene, abstract object, and abstract scene. The invention further classifies entity relationships into syntactic/semantic categories. Syntactic relationship categories include spatial, temporal, and visual categories. Semantic relationship categories include lexical and predicative categories. Spatial and temporal relationships can be topological or directional; visual relationships can be global, local, or composition; lexical relationships can be synonymy, antonymy, hyponymy/hypernymy, or meronymy/holonymy; and predicative relationships can be actions (events) or states.
摘要:
Described is a method and apparatus for obtaining accurate, timely information for event detection and prediction based on autonomous opportunism. The objective is to make the best possible use of all available resources at the time of acquisition, including historical data, multiple sensors, and multiresolution acquisition capabilities, under a given set of processing and communication bandwidth constraints. This method (and the corresponding apparatus) fuses multiple adaptively acquired data sources to prepare information for use by decision support models. The onboard data acquisition schedule is constructed to maximize the prediction accuracy of the decision models, which are designed to operate progressively, utilizing data representations consisting of multiple abstraction levels and multiple resolutions. Due to the progressive nature of these models, they can be executed onboard even with the use of substantially summarized (or compressed) datasets delivered from the ground or from other satellite platforms. Models are formulated to accept data with less than complete certainty, thus allowing real-time decisions to be made on locations where additional data is to be acquired based on predicted likelihood of the event of interest and uncertainties. Multi-abstraction-level multi-resolution data is expressed using standard-compliant representations, and progressively transmitted to the ground or other platforms. More detailed calculations can then be performed on the ground using all of the available real time and historical data.
摘要:
Methods and apparatus for providing a multi-tier object-relational database architecture are disclosed. In one illustrative embodiment of the present invention, a multi-tier database architecture comprises an object-relational database engine as a top tier, one or more domain-specific extension modules as a bottom tier, and one or more universal extension modules as a middle tier. The individual extension modules of the bottom tier operationally connect with the one or more universal extension modules which, themselves, operationally connect with the database engine. The domain-specific extension modules preferably provide such functions as search, index, and retrieval services of images, video, audio, time series, web pages, text, XML, spatial data, etc. The domain-specific extension modules may include one or more IBM DB2 extenders, Oracle data cartridges and/or Informix datablades, although other domain-specific extension modules may be used.
摘要:
A multimedia archive description scheme is provided for characterizing a multimedia archive having records and associated record descriptions. The multimedia archive description scheme provides a data structure which relates records by similarity measures. The principle data structure in the multimedia archive description scheme is a cluster. A cluster includes one or more attributes of the records in the archive and can include one or more cluster relationships. Cluster attributes can include feature space attributes, semantic attributes, media attributes and meta attributes of the records in the archive. The cluster relationships can relate records to clusters or clusters to clusters. Cluster relationships can include feature space (syntactic) relationships, semantic relationships, media relationships and meta relationships. The multimedia archive description scheme provides an efficient form for describing a collection of records.
摘要:
A method and apparatus is disclosed that receives an input narrative in a particular media format, and produces therefrom a multimedia enriched version of the input narrative. In one embodiment, a method is provided for enriching an input narrative with multimedia content, wherein the method includes the step of selectively segmenting the input narrative to form queries pertaining to different portions or elements of the input narrative. The method further comprises retrieving multimedia artifacts from one or more multimedia repositories, wherein each retrieved multimedia artifact is associated with one of the queries, and selecting a subset of the retrieved multimedia artifacts, wherein the subset includes artifacts comprising different types of multimedia content. The input narrative is then combined with respective multimedia artifacts of the subset, in order to provide an enriched and enhanced narrative.
摘要:
Methods and apparatus for optimizing resource allocation among data analysis functions in a classification system are provided. Each of the data analysis functions is characterized as a set of operating points in accordance with at least one of resource requirements and analysis quality. An operating point for each of the data analysis functions is selected in accordance with one or more constraints. The analysis functions are applied at selected operating points to optimize resource allocation among the data analysis functions in the classification system.