摘要:
Music videos are automatically produced from source audio and video signals. The music video contains edited portions of the video signal synchronized with the audio signal. An embodiment detects transition points in the audio signal and the video signal. The transition points are used to align in time the video and audio signals. The video signal is edited according to its alignment with the audio signal. The resulting edited video signal is merged with the audio signal to form a music video.
摘要:
Data organizing systems and methods organize a plurality of data files using meta data or other data relating to a plurality of data files by extracting the related data for at least some of the data files, organizing the extracted related data and dividing at least some of the data files into groups based on the extracted related data and an input parameter value.
摘要:
A computer assisted meeting capture system in which camera selection, camera control and sensor notification of candidate activity event for camera image changes are integrated. The information is displayed on a representation of a room layout. Camera switch suggestions are notified to the operator through the use of low-overhead cognitive cues such as changeable human sensible display characteristics.
摘要:
Video recordings of meetings and scanned paper documents are natural digital documents that come out of a meeting. These can be placed on the Internet for easy access, with links generated between them by matching scanned documents to a segment of the video referencing the scanned document. Furthermore, annotations made on the paper documents during the meeting can be extracted and used as indexes to the video. An orthonormal transform, such as a Digital Cosine Transform (DCT) is used to compare scanned documents to video frames.
摘要:
A stream of ordered information, such as, for example, audio, video and/or text data, can be windowed and parameterized. A similarity between the parameterized and windowed stream of ordered information can be determined, and a probabilistic decomposition or probabilistic matrix factorization, such as non-negative matrix factorization, can be applied to the similarity matrix. The component matrices resulting from the decomposition indicate major components or segments of the ordered information. Excerpts can then be extracted from the stream of ordered information based on the component matrices to generate a summary of the stream of ordered information.
摘要:
Methods and systems for transferring media between media source devices and media sink devices are disclosed. Remote control units are used to indicate the media sink and media source devices for transferring media data between these elements.
摘要:
Embodiments of the present invention provide a system and method for discriminatively selecting keyframes that are representative of segments of a source digital media and at the same time distinguishable from other keyframes representing other segments of the digital media. The method and system, in one embodiment, includes pre-processing the source digital media to obtain feature vectors for frames of the media. Discriminatively selecting a keyframe as a representative for each segment of a source digital media wherein said discriminative selection includes determining a similarity measure for each candidate keyframe and determining a dis-similarity measure for each candidate keyframe and selecting the keyframe with the highest goodness value computing from the similarity and dis-similarity measures.
摘要:
Systems and methods in accordance with the present invention can be applied to generate a personal media library of media segments from a media stream. A method in accordance with one embodiment can comprise receiving the media stream, identifying one or more novelty points within the media stream and creating a plurality of media segments based on said one or more novelty points. The method can further be applied to compile a playlist or substitute media stream organizing such stream as desired, eliminating redundant media clips and discarding advertisements.
摘要:
A system for providing a dynamic audio-visual environment using an eSurface situated in a room environment; a projector situated for projecting images onto the eSurface; a camera situated to picture the room environment; a central processor coupled to the eSurface, the projector and the camera. The processor receives pictures from the camera for detecting the location of the eSurface; and controls the projector to aim its projection beam onto the eSurface. The eSurface is a sheet-like surface having the property of accepting optically projected image when powered, and retaining the projected image after the power is turned off.
摘要:
Systems and methods for providing a status of a teleconference by determining an approximate delay time and providing a status signal in view of the determined approximate delay time are provided. An approximate delay time is approximately the amount of time that will elapse before an occurrence occurring at a first time, which is captured into an occurrence signal by a source unit, will be experienced at a second time after the occurrence signal is received by at least one receiving unit.