摘要:
Techniques for generating utility-based descriptors from compressed multimedia information are disclosed. A preferred method includes the steps of receiving least a segment of compressed multimedia information, determining two or more portions of utility based descriptor information based on one or more adaptation operations, each corresponding to a unique target rate, adapting the compressed multimedia segment by each the portions of utility based descriptor information to generate adapted multimedia segments, using a quality management method to generate measurement for each adapted multimedia segment, and generating a utility based descriptors based on the portions of utility based descriptor information and corresponding quality measurements.
摘要:
A system and method is provided for editing and parsing compressed digital information. The compressed digital information may include visual information which is edited and parsed in the compressed domain. In a preferred embodiment, the present invention provides a method for detecting moving objects in a compressed digital bitstream which represents a sequence of fields or frames of video information for one or more captured scenes of video.
摘要:
A context-based concept fusion method detects a first concept in an image record. The method includes automatically determining at least one other concept in the image record which has a contextual relationship with the first concept and which is to be labeled by a user of the method; and labeling the at least one other concept by the user with a ground truth label to be used in the context-based concept fusion method to improve detection of the first concept in the image record.
摘要:
The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted, integrated, using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content. The multimedia description is then stored into a database. As a result, a user may query a search engine which then retrieves the multimedia content from the database whose integration description matches the query criteria specified by the user. The search engine can then provide the user a useful search result based on the multimedia integration description.
摘要:
Object-oriented methods and systems for permitting a user to locate one or more video objects from one or more video clips over an interactive network are disclosed. The system includes one or more server computers (110) comprising storage (111) for video clips and databases of video object attributes, a communications network (120), and a client computer (130). The client computer contains a query interface to specify video object attribute information, including motion trajectory information (134), a browser interface to browse through stored video object attributes within the server computers, and an interactive video player.
摘要:
A system for authentication of a digital image includes a signature generator for creating a robust digital signature for an original image based on instrument features of the image. An authentication processor extracts a set of invariant features for the original image from the digital signature, generates a corresponding set of invariant features for the present image to be authenticated and compares the two sets of invariant features to determine whether the image has been subjected to malicious manipulation. The invariant features include the polarity and magnitude of the difference between discrete cosine transform coefficients at corresponding coefficient locations in selected image block pairs. The intensity of the original image is also authenticated by comparing a mean value of coefficient of the original image to the mean value of the coefficient of the present image.
摘要:
A method and system for maintaining the quality of video transported over wireless channels uses a transcoder to modify and maintain the optical resilience of an encoded bitstream. The transcoder increases the spatial resilience by reducing the number of blocks per slice, and increases the temporal resilience by increasing the proportion of I-blocks that are transmitted in each frame. Also, the transcoder maintains the same input bit rate by dropping less significant coefficients as it increases resilience. The transcoder of the present invention maintains the resilience at an optimal level to accommodate the prevailing channel conditions as measured by the BER of the wireless channel. Rate distortion theory is applied to determine the optimal allocation of bit rate among spatial resilience, temporal resilience and source rate, where it is has been found that the optimal allocation of the present invention (which occurs in near-real time) provides nearly the same result as doing an exhaustive search.