Abstract:
A method and apparatus for processing object-based audio signals for reproduction through a playback system is provided. The apparatus receives a plurality of object-based audio signals in at least one audio frame. In addition, the apparatus receives at least one audio object command associated with at least one object-based audio signal of the plurality of object-based audio signals. In addition, the apparatus processes the at least one object-based audio signal based on the received at least one audio object command. Further, the apparatus renders a set of object-based audio signals of the plurality of object-based audio signals to a set of output signals based on the at least one audio object command. The at least one audio frame may be received from one of a set top box, an OD player, or a television. The apparatus may be an AV receiver or a television.
Abstract:
Multiple video recordings may be synchronized using audio features of the recordings. A synchronization process may compare energy tracks of each recording within a multi-resolution framework to correlate audio features of one recording to another.
Abstract:
A system and method is provided for generating summaries of video clips and then utilizing a source of data indicative of the consumption by viewers of those video summaries. In particular, summaries of videos are published and audience data is collected regarding the usage of those summaries, including which summaries are viewed, how they are viewed, the duration of viewing and how often. This usage information may be utilized in a variety of ways. In one embodiment, the usage information is fed into a machine learning algorithm that identifies, updates and optimizes groupings of related videos and scores of significant portions of those videos in order to improve the selection of the summary. In this way the usage information is used to find a summary that better engages the audience. In another embodiment usage information is used to predict popularity of videos. In still another embodiment usage information is used to assist in the display of advertising to users.
Abstract:
Systems and methods for determining the location of advertisements in multimedia assets are disclosed. A method includes obtaining an audio signature corresponding to a time period of a multimedia asset, identifying a match between the obtained audio signature and one or more stored audio signatures, comparing programming data of the multimedia assets of the obtained audio signature and the matching audio signatures, and determining whether the time period of the multimedia asset contains an advertisement based on the comparison of the programming data of the multimedia assets of the obtained audio signature and the one or more matching audio signatures. Another method includes identifying matches between a plurality of obtained audio signatures and a plurality of stored audio signatures, and determining whether consecutive time periods of the multimedia asset contain an advertisement based on a number of consecutive matching audio signatures of the plurality of stored audio signatures.
Abstract:
Systems and methods for content and program type detection, including identification of true boundaries between content segments. A broadcast provider sends a broadcast as an encoded stream. During a switch between content types, an automation system sends identifying metadata indicative of an approximate boundary between content types. A mediacast generation system receives the encoded stream of content and metadata, processes the metadata, time corrects the metadata, and slices the content on the exact boundary where the content change occurs. The mediacast generation system decodes an audio stream directly into Waveform Audio File Format (WAVE) while using an envelope follower to measure amplitude. When the system detects a metadata marker, an analyzer may look inside a buffered time window. The WAVE data may be analyzed to look for a period most likely to be the true boundary or split point between content segments. The content may then be split up on the new true boundary.
Abstract:
Technologies to generate multimedia data are generally described. In some examples, a multimedia generator may receive initial audio data that may include audio rhythm data. The audio rhythm data may be effective to indicate a pattern of a set of beats. The multimedia generator may also compare the audio rhythm data with video rhythm data, where the video rhythm data may be effective to indicate a change of direction of a set of points in a video segment. The multimedia generator may also identify the video segment based on the comparison of the audio rhythm data with the video rhythm data. The multimedia generator may also map the video segment to at least a portion of the initial audio data to generate the multimedia data.