Abstract:
A stream of ordered information, such as, for example, audio, video and/or text data, can be windowed and parameterized. A similarity between the parameterized and windowed stream of ordered information can be determined, and a probabilistic decomposition or probabilistic matrix factorization, such as non-negative matrix factorization, can be applied to the similarity matrix. The component matrices resulting from the decomposition indicate major components or segments of the ordered information. Excerpts can then be extracted from the stream of ordered information based on the component matrices to generate a summary of the stream of ordered information.
Abstract:
Methods and systems for transferring media between media source devices and media sink devices are disclosed. Remote control units are used to indicate the media sink and media source devices for transferring media data between these elements.
Abstract:
Data organizing systems and methods organize a plurality of data files using meta data or other data relating to a plurality of data files by extracting the related data for at least some of the data files, organizing the extracted related data and dividing at least some of the data files into groups based on the extracted related data and an input parameter value.
Abstract:
Embodiments of the present invention provide a system and method for discriminatively selecting keyframes that are representative of segments of a source digital media and at the same time distinguishable from other keyframes representing other segments of the digital media. The method and system, in one embodiment, includes pre-processing the source digital media to obtain feature vectors for frames of the media. Discriminatively selecting a keyframe as a representative for each segment of a source digital media wherein said discriminative selection includes determining a similarity measure for each candidate keyframe and determining a dis-similarity measure for each candidate keyframe and selecting the keyframe with the highest goodness value computing from the similarity and dis-similarity measures.
Abstract:
Music videos are automatically produced from source audio and video signals. The music video contains edited portions of the video signal synchronized with the audio signal. An embodiment detects transition points in the audio signal and the video signal. The transition points are used to align in time the video and audio signals. The video signal is edited according to its alignment with the audio signal. The resulting edited video signal is merged with the audio signal to form a music video.
Abstract:
Systems and methods in accordance with the present invention can be applied to generate a personal media library of media segments from a media stream. A method in accordance with one embodiment can comprise receiving the media stream, identifying one or more novelty points within the media stream and creating a plurality of media segments based on said one or more novelty points. The method can further be applied to compile a playlist or substitute media stream organizing such stream as desired, eliminating redundant media clips and discarding advertisements.
Abstract:
Systems and methods determine the location of a microphone with an unknown location, given the location of a number of other microphones by determining a difference in an arrival time between a first audio signal generated by and microphone with a known location and a second audio signal generated by another microphone with an unknown location, wherein the first and second audio signals are a representation of a substantially same sound emitted from an acoustic source with a known location; determining, based on at least the determined difference in arrival time, a distance between the acoustic source with the known location and the microphone with the unknown location; and determining, based on the determined distance between the acoustic source with the known location and the microphone with the unknown location, the location of the unknown microphone.
Abstract:
Provides a system for detecting an intersection between more than one panoramic video sequence and detecting the orientation of the sequences forming the intersection. Video images and corresponding location data are received. If required, the images and location data is processed to ensure the images contain location data. An intersection between two paths is then derived from the video images by deriving a rough intersection between two images, determining a neighborhood for the two images, and dividing each image in the neighborhood into strips. An identifying value is derived from each strip to create a row of strip values which are then converted to the frequency domain. A distance measure is taken between strips in the frequency domain, and the intersection is determined from the images having the smallest distance measure between them. The orientation between the two paths may also be determined in the frequency domain by using the phases of signals representing the images in the Fourier domain or performing a circular cross correlation of two vectors representing the images.
Abstract:
Algorithms to show multiple images at the maximum possible resolution are proposed. Rather than reducing the resolution of each image, the portion of each image that is actually shown is reduced. The algorithms select which part of each image is to be shown. In one embodiment of the invention, changing the parameters over time further increases the information displayed.
Abstract:
Systems and methods determine the location of a microphone with an unknown location, given the location of a number of other microphones by determining a difference in an arrival time between a first audio signal generated by and microphone with a known location and a second audio signal generated by another microphone with an unknown location, wherein the first and second audio signals are a representation of a substantially same sound emitted from an acoustic source with a known location; determining, based on at least the determined difference in arrival time, a distance between the acoustic source with the known location and the microphone with the unknown location; and determining, based on the determined distance between the acoustic source with the known location and the microphone with the unknown location, the location of the unknown microphone.