Abstract:
A method is provided for improving the consistency of color and brightness across the boundaries of multicamera and/or multidisplay composite images whose source images overlap or nearly overlap. The method reduces objectionable artifacts and color and intensity mismatches at image seams, adjusts color and brightness/intensity on either side of a seam, and applies color correction across the composite image, including outside the overlap regions, so that color changes gradually, seams are less discernible, and sharp color changes across the composite or mosaic image are reduced or avoided. The color difference between two source images that form a composite is estimated by taking color pixels from each source image region, determining the centroid of the cluster those pixels form in a color space, and computing the difference between the cluster centroids of overlapping or nearly overlapping source image regions; this difference is a measure of the vector color difference between the regions. The vector color difference between at least two overlapping source image regions is then interpolated or mapped across those images.
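The centroid-difference estimate described above can be illustrated with a minimal sketch. It assumes pixels are (r, g, b) tuples and that the correction fades linearly with distance from the seam; the function names and the linear ramp are illustrative assumptions, since the abstract says only that the difference is "interpolated or mapped" across the images.

```python
def centroid(pixels):
    """Centroid of the color cluster formed by a list of (r, g, b) pixels."""
    n = len(pixels)
    return tuple(sum(p[c] for p in pixels) / n for c in range(3))


def color_difference(region_a, region_b):
    """Vector color difference between two overlapping regions (B minus A)."""
    ca, cb = centroid(region_a), centroid(region_b)
    return tuple(b - a for a, b in zip(ca, cb))


def blend_correction(diff, t):
    """Correction to add to a pixel of image A.

    t is the normalized distance from the seam (0 at the seam, 1 far away).
    Assumption: image A absorbs half the difference at the seam and the
    correction fades linearly to zero; image B would get the opposite sign.
    """
    w = 0.5 * (1.0 - t)
    return tuple(w * d for d in diff)


# Pixels sampled from the overlap region of each source image (made-up values).
overlap_a = [(100, 100, 100), (110, 90, 100)]
overlap_b = [(120, 100, 90), (130, 110, 110)]
diff = color_difference(overlap_a, overlap_b)
```

Because the correction is nonzero away from the seam, color is altered outside the overlap region as well, which is what lets the change across the seam be gradual.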
Abstract:
Video recordings of meetings and scanned paper documents are natural digital documents that come out of a meeting. These can be placed on the Internet for easy access, with links generated between them by matching each scanned document to the segment of the video that references it. Furthermore, annotations made on the paper documents during the meeting can be extracted and used as indexes to the video. An orthonormal transform, such as a Discrete Cosine Transform (DCT), is used to compare scanned documents to video frames.
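As a rough illustration of comparing a scanned document to video frames with an orthonormal transform, the sketch below computes a 2-D orthonormal DCT-II of small grayscale images, keeps the low-frequency coefficients as a signature, and picks the frame with the nearest signature. The signature size, the Euclidean distance, and all function names are assumptions made for this example, not details taken from the abstract.

```python
import math


def dct_1d(x):
    """Orthonormal DCT-II of a 1-D sequence."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out


def dct_2d(img):
    """Separable 2-D DCT: transform every row, then every column."""
    rows = [dct_1d(r) for r in img]
    cols = [dct_1d(list(c)) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]


def signature(img, keep=2):
    """Low-frequency keep x keep block of DCT coefficients (assumed feature)."""
    coeffs = dct_2d(img)
    return [coeffs[u][v] for u in range(keep) for v in range(keep)]


def distance(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def best_matching_frame(doc, frames, keep=2):
    """Index of the frame whose signature is closest to the document's."""
    sig = signature(doc, keep)
    return min(range(len(frames)),
               key=lambda i: distance(sig, signature(frames[i], keep)))


# Toy 4x4 grayscale images: the second frame is a noisy view of the document.
doc = [[0, 0, 9, 9]] * 4
frames = [
    [[9, 9, 9, 9], [9, 9, 9, 9], [0, 0, 0, 0], [0, 0, 0, 0]],
    [[0, 1, 9, 8], [0, 0, 9, 9], [1, 0, 8, 9], [0, 0, 9, 9]],
]
match = best_matching_frame(doc, frames)
```

In practice the document and the frames would first be reduced to a common size so their coefficients are comparable; that preprocessing step is omitted here.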
Abstract:
A media browser, graphical user interface and method for browsing a media file wherein a user selects at least one feature in a media file and is provided with information regarding the existence of the selected feature in the media file. Based on the information, the user can identify and play back portions of interest in a media file. Features in a media file, such as a speaker's identity, applause, silence, motion, or video cuts, are preferably automatically time-wise evaluated in the media file using known methods. Metadata generated based on the time-wise feature evaluation are preferably mapped to confidence score values that represent a probability of a corresponding feature's existence in the media file. Confidence score information is preferably presented graphically to a user as part of a graphical user interface, and is used to interactively browse the media file.
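One way a browser might turn time-wise confidence scores into portions a user can jump to is sketched below. It assumes one score per fixed-length time slice and a user-chosen threshold; the function name, the threshold, and the slice length are hypothetical, since the abstract does not specify how portions of interest are derived from the scores.

```python
def segments_above(scores, threshold=0.5, step=1.0):
    """Return (start, end) times of runs where the confidence >= threshold.

    scores: one confidence value per time slice of `step` seconds (assumed).
    """
    segments = []
    start = None
    for i, s in enumerate(scores):
        if s >= threshold and start is None:
            start = i                      # a run of interest begins
        elif s < threshold and start is not None:
            segments.append((start * step, i * step))
            start = None                   # the run has ended
    if start is not None:                  # run extends to the end of the file
        segments.append((start * step, len(scores) * step))
    return segments


# Hypothetical per-second confidence that the feature "applause" is present.
applause = [0.1, 0.8, 0.9, 0.2, 0.7]
portions = segments_above(applause, threshold=0.5)
```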
Abstract:
A system and method for detecting useful images and for ranking images in order of usefulness based on a vignette score describing how closely each one resembles a “vignette,” or a central object or image surrounded by a featureless or deemphasized background. Several methods for determining an image's vignette score are disclosed as examples. Variance ratio analysis entails calculation of the ratio of variance between the edge region of the image and the entire image. Statistical model analysis entails developing a statistical classifier capable of determining a statistical model of each image class based on pre-entered training data. Spatial frequency analysis involves estimating the energy at different spatial frequencies in the central and edge regions and in the image as a whole. A vignette score is calculated as the ratio of mid-frequency energies in the edge region to the mid-frequency energies of the entire image.
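The variance ratio analysis can be illustrated with a short sketch. It assumes a grayscale image stored as a list of rows and an edge region defined as a border of fixed width; the border width, the handling of a completely flat image, and the function names are assumptions made for this example.

```python
def variance(values):
    """Population variance of a list of numbers."""
    n = len(values)
    mean = sum(values) / n
    return sum((v - mean) ** 2 for v in values) / n


def variance_ratio(img, border=1):
    """Ratio of edge-region variance to whole-image variance.

    A low ratio suggests a featureless border around a central object,
    i.e. a vignette-like image.
    """
    h, w = len(img), len(img[0])
    edge = [img[y][x]
            for y in range(h) for x in range(w)
            if y < border or y >= h - border or x < border or x >= w - border]
    total = variance([v for row in img for v in row])
    if total == 0:
        return 1.0  # assumption: a completely flat image is not a vignette
    return variance(edge) / total


# A bright central object on a flat background versus a checkerboard texture.
vignette = [[0, 0, 0, 0], [0, 10, 10, 0], [0, 10, 10, 0], [0, 0, 0, 0]]
checker = [[10 * ((x + y) % 2) for x in range(4)] for y in range(4)]
scores = (variance_ratio(vignette), variance_ratio(checker))
```

Ranking images by this ratio in ascending order would put the most vignette-like images first. The spatial frequency variant described above has the same shape, with mid-frequency energy in place of variance.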
Abstract:
A system, method and apparatus for remotely annotating an object. An embodiment of the present invention includes a video camera projector that captures video images of a local object and projects annotations made by a user at a remote location onto said local object.
Abstract:
Embodiments of the present invention provide a method for producing a summary of a digital file on one or more computers. The method includes segmenting the digital file into a plurality of segments, clustering said segments into a plurality of clusters, and selecting a cluster from said plurality of clusters wherein said selected cluster includes segments representative of said digital file. Upon selection of a cluster, a segment of the cluster is provided as a summary of said digital file.
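The segment, cluster, and select steps can be sketched as follows. This is a minimal illustration under several assumptions that are not in the abstract: the file is a list of numbers, segments have a fixed length, each segment is described by its mean, clustering is a small deterministic 1-D k-means, the largest cluster is taken as the representative one, and the segment nearest that cluster's center is returned.

```python
def mean(xs):
    return sum(xs) / len(xs)


def segment(values, length):
    """Split the file into fixed-length segments (assumed segmentation)."""
    return [values[i:i + length] for i in range(0, len(values), length)]


def kmeans_1d(points, k=2, iters=10):
    """Tiny deterministic 1-D k-means; centers start evenly spaced."""
    lo, hi = min(points), max(points)
    centers = ([lo + (hi - lo) * j / (k - 1) for j in range(k)]
               if k > 1 else [mean(points)])
    labels = [0] * len(points)
    for _ in range(iters):
        labels = [min(range(k), key=lambda j: abs(p - centers[j]))
                  for p in points]
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:
                centers[j] = mean(members)
    return labels, centers


def summarize(values, length=2, k=2):
    """Return one segment from the largest cluster as the summary."""
    segs = segment(values, length)
    feats = [mean(s) for s in segs]
    labels, centers = kmeans_1d(feats, k)
    best = max(range(k), key=lambda j: labels.count(j))
    members = [i for i, l in enumerate(labels) if l == best]
    rep = min(members, key=lambda i: abs(feats[i] - centers[best]))
    return segs[rep]


summary = summarize([1, 1, 1, 2, 9, 9, 1, 1])
```

For real media, the per-segment mean would be replaced by a feature vector (for example, a color histogram per video shot), and the cluster selection rule could weigh duration or other criteria instead of size alone.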
Abstract:
Optimal summaries of a linear media source are automatically produced by parameterizing the linear media source. The parameterized media signal is used to create a similarity array in which each array element holds the value of a similarity measurement between two portions of the parameterized media signal. A segment fitness function, adapted for measuring the similarity between a segment of the parameterized media signal and the entire parameterized media signal, is optimized to find an optimal segment location. The portion of the linear media source corresponding to the optimal segment location is selected as the optimal summary. This method produces optimal summaries of any type of linear media, such as video, audio, or text information.
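The similarity array and segment fitness function can be illustrated with a small sketch. It assumes the parameterized signal is a list of scalar features, uses exp(-|a - b|) as the similarity measurement, defines fitness as the average similarity between the segment's elements and every element of the signal, and searches all start positions for a fixed segment length. Each of these choices is an assumption for illustration; the abstract does not fix the similarity measure or the optimization procedure.

```python
import math


def similarity_matrix(features):
    """sim[i][j] is the similarity between portions i and j of the signal."""
    return [[math.exp(-abs(a - b)) for b in features] for a in features]


def segment_fitness(sim, start, length):
    """Average similarity between the segment and the entire signal."""
    n = len(sim)
    total = sum(sim[i][j]
                for i in range(start, start + length)
                for j in range(n))
    return total / (n * length)


def best_segment(features, length):
    """Start index of the fixed-length segment with the highest fitness."""
    sim = similarity_matrix(features)
    starts = range(len(features) - length + 1)
    return max(starts, key=lambda q: segment_fitness(sim, q, length))


# Toy signal: the value 5 dominates, so the best summary lies inside that run.
signal = [0, 0, 5, 5, 5, 5, 0, 9]
start = best_segment(signal, length=2)
optimal_summary = signal[start:start + 2]
```

An exhaustive search is quadratic in the signal length per candidate; precomputing the row sums of the similarity array once would make each fitness evaluation a sum over the segment only.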