Scalable Media Fingerprint Extraction
    31.
    Patent application
    Status: In force

    Publication No.: US20110268315A1

    Publication date: 2011-11-03

    Application No.: US13142355

    Filing date: 2010-01-07

    IPC classification: G06K9/00

    Abstract: Derivation of a fingerprint includes generating feature matrices based on one or more training images, generating projection matrices based on the feature matrices in a training process, and deriving a fingerprint for one or more images by, at least in part, projecting a feature matrix based on the one or more images onto the projection matrices generated in the training process.
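
The train-then-project flow in this abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the block-mean features, the use of principal directions (found by power iteration) as the projection matrices, the sign quantization, and the hypothetical names `block_means`, `train_projections`, and `fingerprint`.

```python
import random

def block_means(image, grid=2):
    """Feature vector: mean intensity of each cell in a grid x grid partition."""
    h, w = len(image), len(image[0])
    feats = []
    for gy in range(grid):
        for gx in range(grid):
            rows = range(gy * h // grid, (gy + 1) * h // grid)
            cols = range(gx * w // grid, (gx + 1) * w // grid)
            cell = [image[y][x] for y in rows for x in cols]
            feats.append(sum(cell) / len(cell))
    return feats

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def train_projections(features, k=2, iters=200, seed=0):
    """Training step (assumed PCA-like): return the mean and k projection directions."""
    n, d = len(features), len(features[0])
    mean = [sum(f[i] for f in features) / n for i in range(d)]
    centered = [[f[i] - mean[i] for i in range(d)] for f in features]
    cov = [[sum(c[i] * c[j] for c in centered) / n for j in range(d)]
           for i in range(d)]
    rng = random.Random(seed)
    directions = []
    for _ in range(k):
        v = [rng.uniform(-1.0, 1.0) for _ in range(d)]
        for _ in range(iters):
            # Deflation: strip components along directions already found.
            for u in directions:
                scale = dot(v, u)
                v = [a - scale * b for a, b in zip(v, u)]
            w = [dot(row, v) for row in cov]
            norm = dot(w, w) ** 0.5
            if norm == 0.0:
                break  # no variance left in the remaining subspace
            v = [x / norm for x in w]
        directions.append(v)
    return mean, directions

def fingerprint(image, mean, directions, grid=2):
    """Project the image's centered features onto each direction; keep the sign bit."""
    feats = block_means(image, grid)
    centered = [f - m for f, m in zip(feats, mean)]
    return [1 if dot(centered, u) >= 0 else 0 for u in directions]
```

A caller would run `train_projections` once over the features of the training images and then call `fingerprint` for each query image, so the per-image cost is only a handful of dot products.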

    Alignment and re-association of metadata for media streams within a computing device
    32.
    Granted patent
    Status: In force

    Publication No.: US09075806B2

    Publication date: 2015-07-07

    Application No.: US13402718

    Filing date: 2012-02-22

    Abstract: Techniques for re-associating dynamic metadata with media data are provided. A media processing system creates, with a first media processing stage, binding information comprising dynamic metadata and a time relationship between the dynamic metadata and media data. The binding information may be derived from the media data. While the first media processing stage delivers the media data to a second media processing stage in a first data path, the first media processing stage passes the binding information to the second media processing stage in a second data path. The media processing system re-associates, with the second media processing stage, the dynamic metadata and the media data using the binding information.
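
As a rough sketch of the two-path idea described in this abstract, the example below binds metadata to frames through a key derived from the frame bytes. The SHA-256 key, the fixed frame duration, and the names `Binding`, `first_stage`, and `second_stage` are hypothetical choices for illustration only; the abstract does not specify how the binding information is derived.

```python
import hashlib
from dataclasses import dataclass

@dataclass(frozen=True)
class Binding:
    key: str          # derived from the media data (assumed: hash of frame bytes)
    timestamp: float  # time relationship between the metadata and the media
    metadata: dict    # the dynamic metadata itself

def frame_key(frame: bytes) -> str:
    return hashlib.sha256(frame).hexdigest()

def first_stage(frames, metadata_by_index, frame_duration=0.04):
    """Return (media path, binding path); metadata travels separately from frames."""
    bindings = [
        Binding(frame_key(frames[i]), i * frame_duration, md)
        for i, md in sorted(metadata_by_index.items())
    ]
    return list(frames), bindings

def second_stage(frames, bindings):
    """Re-associate metadata with frames by recomputing each frame's key."""
    lookup = {b.key: b for b in bindings}
    result = []
    for frame in frames:
        b = lookup.get(frame_key(frame))
        result.append((frame,
                       b.metadata if b else None,
                       b.timestamp if b else None))
    return result
```

An exact hash only matches when the frame bytes arrive unmodified; a stage that alters the media would need a more tolerant key, such as a robust fingerprint.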

    Ranking Representative Segments in Media Data
    33.
    Patent application
    Status: In force

    Publication No.: US20130289756A1

    Publication date: 2013-10-31

    Application No.: US13997866

    Filing date: 2011-12-15

    IPC classification: G06F17/00

    Abstract: Techniques for ranking representative segments in media data are provided. Media features of many different types may be extracted from the media data. A plurality of ranking scores may be assigned to a plurality of candidate representative segments. Each individual candidate representative segment in the plurality of candidate representative segments comprises at least one scene in one or more statistical patterns in media features of the media data based on one or more types of features extractable from the media data. Each individual ranking score in the plurality of ranking scores may be assigned to an individual candidate representative segment in the plurality of candidate representative segments. A representative segment to be played to an end user may be selected from the candidate representative segments, based on the plurality of ranking scores.
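
The score-then-select step in this abstract might look like the sketch below. The feature names, the weighted-sum scoring rule, and the names `Candidate`, `ranking_score`, `rank_candidates`, and `select_representative` are assumptions for illustration; the abstract does not say how ranking scores are computed.

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    start: float    # segment start time in seconds
    end: float      # segment end time in seconds
    features: dict  # hypothetical per-segment features, e.g. {"repetitions": 3}

def ranking_score(candidate, weights):
    """Assumed scoring rule: weighted sum of the candidate's feature values."""
    return sum(weights.get(name, 0.0) * value
               for name, value in candidate.features.items())

def rank_candidates(candidates, weights):
    """Assign one score per candidate and sort from best to worst."""
    scored = [(ranking_score(c, weights), c) for c in candidates]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)

def select_representative(candidates, weights):
    """Pick the top-ranked candidate as the segment to play to the end user."""
    return rank_candidates(candidates, weights)[0][1]
```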

    Robust Media Fingerprints
    36.
    Patent application
    Status: In force

    Publication No.: US20110153050A1

    Publication date: 2011-06-23

    Application No.: US13060032

    Filing date: 2009-08-26

    IPC classification: G06F17/00

    CPC classification: G10L19/018

    Abstract: Robust media fingerprints are derived from a portion of audio content. A portion of content in an audio signal is categorized. The audio content is characterized based, at least in part, on one or more of its features. The features may include a component that relates to one of several sound categories, e.g., speech and/or noise, which may be mixed with the audio signal. Upon categorizing the audio content as free of the speech or noise related components, the audio signal component is processed. Upon categorizing the audio content as including the speech related component and/or the noise related components, the speech or noise related components are separated from the audio signal. The audio signal is processed independent of the speech related component and/or the noise related component. Processing the audio signal includes computing the audio fingerprint, which reliably corresponds to the audio signal.

    Deriving Video Signatures That Are Insensitive to Picture Modification and Frame-Rate Conversion
    37.
    Patent application
    Status: Lapsed

    Publication No.: US20100238350A1

    Publication date: 2010-09-23

    Application No.: US12600466

    Filing date: 2008-05-01

    IPC classification: H04N5/14

    Abstract: A signature that can be used to identify video content in a series of video frames is generated by first calculating the average and variance of picture elements in a low-resolution composite image that represents a temporal and spatial composite of the video content in the series of frames. The signature is generated by applying a hash function to values derived from the average and variance composite representations. The video content of a signal can be represented by a set of signatures that are generated for multiple series of frames within the signal. A set of signatures can provide reliable identifications despite intentional and unintentional modifications to the content.
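
The pipeline in this abstract (low-resolution composite, then average and variance, then hash) can be sketched as follows. The grid size, the median-threshold bit hash, and the names `downsample`, `composite`, and `signature` are assumptions for illustration, not the hash function of the patent. Pixels are assumed to be integers, and exact rational arithmetic is used so that comparisons against the threshold are not affected by rounding.

```python
from fractions import Fraction
from statistics import median

def downsample(frame, grid=4):
    """Spatial composite: mean of each cell in a grid x grid partition of a frame."""
    h, w = len(frame), len(frame[0])
    cells = []
    for gy in range(grid):
        for gx in range(grid):
            rows = range(gy * h // grid, (gy + 1) * h // grid)
            cols = range(gx * w // grid, (gx + 1) * w // grid)
            values = [frame[y][x] for y in rows for x in cols]
            cells.append(Fraction(sum(values), len(values)))
    return cells

def composite(frames, grid=4):
    """Temporal composite: per-cell average and variance across the frame series."""
    low = [downsample(f, grid) for f in frames]
    n = len(low)
    columns = list(zip(*low))  # one tuple of values per low-resolution cell
    means = [sum(col) / n for col in columns]
    variances = [sum((v - m) ** 2 for v in col) / n
                 for col, m in zip(columns, means)]
    return means, variances

def signature(frames, grid=4):
    """Assumed hash: one bit per cell, set when the value exceeds the median."""
    bits = []
    for values in composite(frames, grid):
        threshold = median(values)
        bits.extend(1 if v > threshold else 0 for v in values)
    return "".join(str(b) for b in bits)
```

Because each bit depends only on how a cell compares with the median of its own composite, adding a constant brightness offset to every pixel leaves this particular signature unchanged.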

    Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols
    39.
    Granted patent
    Status: In force

    Publication No.: US09378743B2

    Publication date: 2016-06-28

    Application No.: US14009503

    Filing date: 2012-04-05

    CPC classification: G10L19/002 G10L19/167

    Abstract: In a class of embodiments, an audio encoding system (typically, a perceptual encoding system) is configured to generate a single (“unified”) bitstream that is compatible with (i.e., decodable by) a first decoder configured to decode audio data encoded in accordance with a first encoding protocol (e.g., the multichannel Dolby Digital Plus, or DD+, protocol) and a second decoder configured to decode audio data encoded in accordance with a second encoding protocol (e.g., the stereo AAC, HE AAC v1, or HE AAC v2 protocol). The unified bitstream can include both encoded data (e.g., bursts of data) decodable by the first decoder (and ignored by the second decoder) and encoded data (e.g., other bursts of data) decodable by the second decoder (and ignored by the first decoder). In effect, the second encoding format is hidden within the unified bitstream when the bitstream is decoded by the first decoder, and the first encoding format is hidden within the unified bitstream when the bitstream is decoded by the second decoder. The format of the unified bitstream generated in accordance with the invention may eliminate the need for transcoding elements throughout an entire media chain and/or ecosystem. Other aspects of the invention are an encoding method performed by any embodiment of the inventive encoder, a decoding method performed by any embodiment of the inventive decoder, and a computer readable medium (e.g., disc) which stores code for implementing any embodiment of the inventive method.
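
The "each decoder ignores the other's bursts" behavior can be shown with a toy sketch. The tagged-tuple framing below is purely hypothetical and does not reflect real DD+ or AAC bitstream syntax; `build_unified` and `decode` are illustrative names.

```python
def build_unified(bursts_a, bursts_b):
    """Interleave bursts from two encoders, tagging each with its protocol."""
    stream = []
    for i in range(max(len(bursts_a), len(bursts_b))):
        if i < len(bursts_a):
            stream.append(("A", bursts_a[i]))
        if i < len(bursts_b):
            stream.append(("B", bursts_b[i]))
    return stream

def decode(stream, protocol):
    """A decoder keeps the bursts for its own protocol and skips all others."""
    return b"".join(payload for tag, payload in stream if tag == protocol)
```

Both decoders read the same stream, so no transcoding step is needed between them; each simply sees the other protocol's bursts as data to skip.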

    Ranking representative segments in media data
    40.
    Granted patent
    Status: In force

    Publication No.: US09313593B2

    Publication date: 2016-04-12

    Application No.: US13997866

    Filing date: 2011-12-15

    Abstract: Techniques for ranking representative segments in media data are provided. Media features of many different types may be extracted from the media data. A plurality of ranking scores may be assigned to a plurality of candidate representative segments. Each individual candidate representative segment in the plurality of candidate representative segments comprises at least one scene in one or more statistical patterns in media features of the media data based on one or more types of features extractable from the media data. Each individual ranking score in the plurality of ranking scores may be assigned to an individual candidate representative segment in the plurality of candidate representative segments. A representative segment to be played to an end user may be selected from the candidate representative segments, based on the plurality of ranking scores.
