Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols
    43.
    发明授权
    Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols 有权
    音频编码方法和系统,用于通过实现不同解码协议的解码器生成统一的比特流解码

    公开(公告)号:US09378743B2

    公开(公告)日:2016-06-28

    申请号:US14009503

    申请日:2012-04-05

    CPC分类号: G10L19/002 G10L19/167

    摘要: In a class of embodiments, an audio encoding system (typically, a perceptual encoding system that is configured to generate a single (“unified”) bitstream that is compatible with (i.e., decodable by) a first decoder configured to decode audio data encoded in accordance with a first encoding protocol (e.g., the multichannel Dolby Digital Plus, or DD+, protocol) and a second decoder configured to decode audio data encoded in accordance with a second encoding protocol (e.g., the stereo AAC, HE AAC v1, or HE AAC v2 protocol). The unified bitstream can include both encoded data (e.g., bursts of data) decodable by the first decoder (and ignored by the second decoder) and encoded data (e.g., other bursts of data) decodable by the second decoder (and ignored by the first decoder). In effect, the second encoding format is hidden within the unified bitstream when the bitstream is decoded by the first decoder, and the first encoding format is hidden within the unified bitstream when the bitstream is decoded by the second decoder. The format of the unified bitstream generated in accordance with the invention may eliminate the need for transcoding elements throughout an entire media chain and/or ecosystem. Other aspects of the invention are an encoding method performed by any embodiment of the inventive encoder, a decoding method performed by any embodiment of the inventive decoder, and a computer readable medium (e.g., disc) which stores code for implementing any embodiment of the inventive method.

    摘要翻译: 在一类实施例中,音频编码系统(通常是感知编码系统,其被配置为生成与第一解码器兼容的(即可解码的)单个(“统一”)比特流,第一解码器被配置为对 根据第一编码协议(例如,多频道杜比数字+或DD +协议)和被配置为对根据第二编码协议(例如立体声AAC,HE AAC v1或HE)编码的音频数据进行解码的第二解码器 统一比特流可以包括可由第一解码器解码(并由第二解码器忽略)的可编码数据(例如,数据突发)和由第二解码器解码的编码数据(例如,其他数据突发) 并且被第一解码器忽略),实际上,当第一解码器对比特流进行解码时,第二编码格式被隐藏在统一比特流内,并且当比特流中第一编码格式被隐藏在统一比特流内时 令牌由第二解码器解码。 根据本发明生成的统一比特流的格式可以消除在整个媒体链和/或生态系统中对代码转换元素的需要。 本发明的其他方面是由本发明编码器的任何实施例执行的编码方法,由本发明解码器的任何实施例执行的解码方法,以及存储用于实现本发明的任何实施例的代码的计算机可读介质(例如,盘) 方法。

    Ranking representative segments in media data
    44.
    发明授权
    Ranking representative segments in media data 有权
    在媒体数据中排列代表性细分

    公开(公告)号:US09313593B2

    公开(公告)日:2016-04-12

    申请号:US13997866

    申请日:2011-12-15

    摘要: Techniques for ranking representative segments in media data are provided. Media features of many different types may be extracted from the media data. A plurality of ranking scores may be assigned to a plurality of candidate representative segments. Each individual candidate representative segment in the plurality of candidate representative segments comprises at least one scene in one or more statistical patterns in media features of the media data based on one or more types of features extractable from the media data. Each individual ranking score in the plurality of ranking scores may be assigned to an individual candidate representative segment in the plurality of candidate representative segments. A representative segment to be played to an end user may be selected from the candidate representative segments, based on the plurality of ranking scores.

    摘要翻译: 提供了在媒体数据中排列代表片段的技术。 可以从媒体数据中提取许多不同类型的媒体特征。 可以将多个排名得分分配给多个候选代表段。 基于从媒体数据可提取的一种或多种类型的特征,多个候选代表段中的每个候选代表段包括媒体数据的媒体特征中的一个或多个统计模式中的至少一个场景。 可以将多个排名得分中的每个个体排名分数分配给多个候选代表段中的个人候选代表段。 可以基于多个排名得分从候选代表段中选择要向最终用户播放的代表段。

    Method, apparatus, and medium for detecting frequency extension coding in the coding history of an audio signal
    45.
    发明授权
    Method, apparatus, and medium for detecting frequency extension coding in the coding history of an audio signal 有权
    用于在音频信号的编码历史中检测频率扩展编码的方法,装置和介质

    公开(公告)号:US09117440B2

    公开(公告)日:2015-08-25

    申请号:US14116113

    申请日:2012-04-30

    摘要: The present document relates to audio forensics, notably the blind detection of traces of parametric audio encoding/decoding. In particular, the present document relates to the detection of parametric frequency extension audio coding, such as spectral band replication (SBR) or spectral extension (SPX), from uncompressed waveforms such as PCM (pulse code modulation) encoded waveforms. A method for detecting frequency extension coding history in a time domain audio signal is described. The method may comprise transforming the time domain audio signal into a frequency domain, thereby generating a plurality of subband signals in a corresponding plurality of subbands comprising low and high frequency subbands; determining a degree of relationship between subband signals in the low frequency subbands and subband signals in the high frequency subbands; wherein the degree of relationship is determined based on the plurality of subband signals; and determining frequency extension coding history if the degree of relationship is greater than a relationship threshold.

    摘要翻译: 本文件涉及音频取证,特别是盲目检测参数音频编码/解码的痕迹。 特别地,本文件涉及从诸如PCM(脉冲编码调制)编码波形的未压缩波形检测参数频率扩展音频编码,例如频谱带复制(SBR)或频谱扩展(SPX)。 描述了用于检测时域音频信号中的频率扩展编码历史的方法。 该方法可以包括将时域音频信号变换成频域,从而在包括低频和高频子带的相应多个子带中产生多个子带信号; 确定低频子带中的子带信号与高频子带中的子带信号之间的关系程度; 其中所述关系度基于所述多个子带信号来确定; 以及如果所述关系度大于关系阈值,则确定频率扩展编码历史。

    AUDIO ENCODING METHOD AND SYSTEM FOR GENERATING A UNIFIED BITSTREAM DECODABLE BY DECODERS IMPLEMENTING DIFFERENT DECODING PROTOCOLS
    46.
    发明申请
    AUDIO ENCODING METHOD AND SYSTEM FOR GENERATING A UNIFIED BITSTREAM DECODABLE BY DECODERS IMPLEMENTING DIFFERENT DECODING PROTOCOLS 有权
    音视频编码方法和系统,用于生成由解码器实现的不同解码协议解码的统一的双绞线

    公开(公告)号:US20140358554A1

    公开(公告)日:2014-12-04

    申请号:US14009503

    申请日:2012-04-05

    IPC分类号: G10L19/002

    CPC分类号: G10L19/002 G10L19/167

    摘要: In a class of embodiments, an audio encoding system (typically, a perceptual encoding system that is configured to generate a single (“unified”) bitstream that is compatible with (i.e., decodable by) a first decoder configured to decode audio data encoded in accordance with a first encoding protocol (e.g., the multichannel Dolby Digital Plus, or DD+, protocol) and a second decoder configured to decode audio data encoded in accordance with a second encoding protocol (e.g., the stereo AAC, HE AAC v1, or HE AAC v2 protocol). The unified bitstream can include both encoded data (e.g., bursts of data) decodable by the first decoder (and ignored by the second decoder) and encoded data (e.g., other bursts of data) decodable by the second decoder (and ignored by the first decoder). In effect, the second encoding format is hidden within the unified bitstream when the bitstream is decoded by the first decoder, and the first encoding format is hidden within the unified bitstream when the bitstream is decoded by the second decoder. The format of the unified bitstream generated in accordance with the invention may eliminate the need for transcoding elements throughout an entire media chain and/or ecosystem. Other aspects of the invention are an encoding method performed by any embodiment of the inventive encoder, a decoding method performed by any embodiment of the inventive decoder, and a computer readable medium (e.g., disc) which stores code for implementing any embodiment of the inventive method.

    摘要翻译: 在一类实施例中,音频编码系统(通常是感知编码系统,其被配置为生成与第一解码器兼容的(即可解码的)单个(“统一”)比特流,第一解码器被配置为对 根据第一编码协议(例如,多频道杜比数字+或DD +协议)和被配置为对根据第二编码协议(例如立体声AAC,HE AAC v1或HE)编码的音频数据进行解码的第二解码器 统一比特流可以包括可由第一解码器解码(并由第二解码器忽略)的可编码数据(例如,数据突发)和由第二解码器解码的编码数据(例如,其他数据突发) 并且被第一解码器忽略),实际上,当第一解码器对比特流进行解码时,第二编码格式被隐藏在统一比特流内,并且当比特流中第一编码格式被隐藏在统一比特流内时 令牌由第二解码器解码。 根据本发明生成的统一比特流的格式可以消除在整个媒体链和/或生态系统中对代码转换元素的需要。 本发明的其他方面是由本发明编码器的任何实施例执行的编码方法,由本发明解码器的任何实施例执行的解码方法,以及存储用于实现本发明的任何实施例的代码的计算机可读介质(例如,盘) 方法。

    Scene Change Detection Around a Set of Seed Points in Media Data
    47.
    发明申请
    Scene Change Detection Around a Set of Seed Points in Media Data 有权
    媒体数据中一组种子点的场景变化检测

    公开(公告)号:US20130287214A1

    公开(公告)日:2013-10-31

    申请号:US13997860

    申请日:2011-12-15

    IPC分类号: H04R29/00

    摘要: Techniques for scene change detection around seed points in media data are provided. Media features of many different types may be extracted from the media data. One or more statistical patterns of media features in a plurality of time-wise intervals around a plurality of seed time points of the media data may be determined using one or more types of features extractable from the media data. At least one of the one or more types of features comprises a type of features that captures structural properties, tonality including harmony and melody, timbre, rhythm, loudness, stereo mix, or a quantity of sound sources as related to the media data. A plurality of beginning scene change points and a plurality of ending scene change points in the media data may be detected, based on the one or more statistical patterns, for the plurality of seed time points in the media data.

    摘要翻译: 提供媒体数据中种子点周围场景变化检测技术。 可以从媒体数据中提取许多不同类型的媒体特征。 可以使用从媒体数据可提取的一种或多种类型的特征来确定围绕媒体数据的多个种子时间点的多个时间间隔中的媒体特征的一个或多个统计模式。 一种或多种类型的特征中的至少一种包括捕获与媒体数据相关的结构性质,包括和声和旋律的音调,音色,节奏,响度,立体声混合或数量的声源的特征的类型。 可以基于媒体数据中的多个种子时间点的一个或多个统计模式来检测媒体数据中的多个起始场景变化点和多个结束场景变化点。

    Repetition Detection in Media Data
    48.
    发明申请
    Repetition Detection in Media Data 审中-公开
    媒体数据中的重复检测

    公开(公告)号:US20130275421A1

    公开(公告)日:2013-10-17

    申请号:US13997847

    申请日:2011-12-15

    IPC分类号: G06F17/30

    摘要: Techniques for repetition detection in media data are provided. Media features of many different types may be extracted from the media data. Query sequences of fingerprints may be selected time intervals that begin at query times. Matched sequences of fingerprints may be determined. A set of offset values may be determined based on the matched sequences of fingerprints. This set of offset values may be further refined into a set of significant time points using a relatively targeted search and comparison method based on the media features of a second type extracted from the media data.

    摘要翻译: 提供了媒体数据中重复检测技术。 可以从媒体数据中提取许多不同类型的媒体特征。 指纹的查询序列可以是从查询时间开始的选择的时间间隔。 可以确定匹配的指纹序列。 可以基于匹配的指纹序列来确定一组偏移值。 可以使用基于从媒体数据提取的第二类型的媒体特征的相对有针对性的搜索和比较方法,将这组偏移值进一步细化为一组有效时间点。

    Multimode coding of speech-like and non-speech-like signals
    49.
    发明授权
    Multimode coding of speech-like and non-speech-like signals 有权
    语音和非语音信号的多模式编码

    公开(公告)号:US08392179B2

    公开(公告)日:2013-03-05

    申请号:US12921752

    申请日:2009-03-12

    IPC分类号: G10L11/06

    摘要: The invention relates to the coding of audio signals that may include both speech-like and non-speech-like signal components. It describes methods and apparatus for code excited linear prediction (CELP) audio encoding and decoding that employ linear predictive coding (LPC) synthesis filters controlled by LPC parameters, a plurality of codebooks each having codevectors, at least one codebook providing an excitation more appropriate for non-speech-like signals and at least one codebook providing an excitation more appropriate for speech-like signals, and a plurality of gain factors, each associated with a codebook. The encoding methods and apparatus select from the codebooks codevectors and/or associated gain factors by minimizing a measure of the difference between the audio signal and a reconstruction of the audio signal derived from the codebook excitations. The decoding methods and apparatus generate a reconstructed output signal from the LPC parameters, codevectors, and gain factors.

    摘要翻译: 本发明涉及可以包括语音类和非语音类信号分量的音频信号的编码。 它描述了采用由LPC参数控制的线性预测编码(LPC)合成滤波器的码激励线性预测(CELP)音频编码和解码的方法和装置,每个具有码矢量的多个码本,提供更适合于 非语音类信号和至少一个提供更适合于类似语音的信号的激励的码本,以及多个增益因子,每个与码本相关联。 编码方法和装置通过最小化音频信号与从码本激励导出的音频信号的重建之间的差异的度量来从码本代码矢量和/或相关联的增益因子中选择。 解码方法和装置从LPC参数,代码矢量和增益因子产生重构的输出信号。