摘要:
An algorithm for finding regions of interest (ROI) in synthetic images based on an information driven approach in which sub-blocks of a set of synthetic image are analyzed for information content or compressibility based on textural and color features. A DCT may be used to analyze the textural features of a set of images and a color histogram may be used to analyze the color features of the set of images. Sub-blocks of low compressibility are grouped into ROIs using a type of morphological technique. Unlike other algorithms that are geared for highly specific types of ROI (e.g. OCR text detection), the method of the present invention is generally applicable to arbitrary synthetic images. The present invention can be used with several other image applications, including Stained-Glass collages presentations.
摘要:
An algorithm for finding regions of interest (ROI) in images and photos based on an information driven approach in which sub-blocks of an image are analyzed for information content or compressibility based on the discrete cosine transform. The sub-blocks of low compressibility are grouped into ROIs using a morphological technique. Unlike other algorithms that are geared for highly specific types of ROI (e.g. face detection), the method of the present invention is generally applicable to arbitrary images and photos. A center-weighted variation of the algorithm can produce better results for certain photo applications. The algorithm can be used with several other image applications, including Stained-Glass collages and Pan-and-Scan presentations.
摘要:
In one embodiment, the present invention extracts video regions of interest from one or more videos and generates a highly condensed visual summary of the videos. The video regions of interest are extracted based on to energy, movement, face or other object detection methods, associated data or external input, or some other feature of the video. In another embodiment, the present invention extracts regions of interest from images and generates highly condensed visual summaries of the images. The highly condensed visual summary is generated by laying out germs on a canvas and then filling the spaces between the germs. The result is a visual summary that resembles a stained glass window having cells of varying shape. The germs may be laid out by temporal order, color histogram, similarity, according to a desired pattern, size, or some other manner. The people, objects and other visual content in the germs appear larger and become easier to see. The visual summary of the present invention utilizes important regions within the key frames, leading to more condensed summaries that are well suitable for small screens.
摘要:
A computer-based method is provided for enabling navigation of video using a keyframe-based video browser on a display device with a limited screen size, for a video segmented into video shots. The video shots are clustered by similarity, while temporal order of the video shots is maintained. A hierarchically organized navigation tree is produced for the clusters of video shots, while the path lengths of the tree are minimized.
摘要:
A hypervideo summary comprised of multiple levels of related content and appropriate navigational links can be automatically generated from a media file such as a linear video. A number of algorithms and selection criteria can be used to modify how such a summary is generated. Viewers of an automatically-generated hypervideo summary can interactively select the amount of detail displayed for each portion of the summary. This selection can be done by following explicit navigational links, or by changing between media channels that are mapped to the various levels of related content.This description is not intended to be a complete description of, or limit the scope of, the invention. Other features, aspects, and objects of the invention can be obtained from a review of the specification, the figures, and the claims.
摘要:
A method for creating multimedia meeting minutes is disclosed. In an embodiment, the method receives notations from a user. As each notation is received, context information is recorded with the notation. The context information is used to select pertinent portions of multimedia information received concurrently with the notations. An association is then created between the notation and each selected portion of the multimedia information. These associations may be used to access the selected portions of the multimedia information from the notations. The notations and their respective associations are deposited for future retrieval. The deposited notations may be revised by receiving an altered copy of the notations from a user. The deposited notations are modified in accordance with the altered copy.
摘要:
Data organizing systems and methods organize a plurality of data files using meta data or other data relating to a plurality of data files by extracting the related data for at least some of the data files, organizing the extracted related data and dividing at least some of the data files into groups based on the extracted related data and an input parameter value.
摘要:
A heuristically derived unsuitability score is computed and used as an input for metaphorical springs in which each selected video segment from recorded video of a video camera is associated with a metaphorical spring that maintains the selected segment at an optimal length while being responsive to a global system spring whose spring strength determines a final length of a final edited output video. Accordingly, user-specified changes to the final length of the final output video automatically lengthen or shorten the lengths of individual segments in such a way that high quality video segments having low unsuitability scores are emphasized over low quality video segments having high unsuitability scores.
摘要:
A method for generating content links between a first digital file and a second digital file by detecting a content feature of a first digital file segment of the first digital file during playback of the first digital file segment of the first digital file, searching an index of a plurality of content features for a plurality of segments including a second digital file segment of the second digital file, and dynamically generating a link between the first digital file one segment of the first digital file and the second digital file segment of the second digital file when a content feature of the first digital file segment of the first digital file is related to the content feature of the at least one segment of the second digital file.
摘要:
Systems and methods provide for determining a location and size of a visual link to digital media on physical media such as a paper document. An authoring tool for creating a link on a paper document, such as an Embedded Media Marker (EMM), identifies and scores other EMMs and related keypoints on the document to determine similarities between a newly-created EMM and other EMMs and keypoints on the paper document. The scores are visualized for a user on a display in order to position and size the newly-created EMM in a location on the paper document that will avoid confusion with other EMMs and related content. The location and size of the newly-created EMM may be automatically adjusted based on the scoring of the keypoints and related EMMs.