Multi-channel hierarchical audio coding with compact side information
    41.
    Type: Patent application
    Status: In force

    Publication No.: US20060233380A1

    Publication date: 2006-10-19

    Application No.: US11314711

    Filing date: 2005-12-21

    IPC classification: H04R5/00

    Abstract: A parametric representation of a multi-channel audio signal describes the spatial properties of the audio signal well with compact side information when coherence information, describing the coherence between a first and a second channel, is derived within a hierarchical encoding process only for channel pairs that include a first channel having only information from the left side with respect to a listening position and a second channel having only information from the right side with respect to a listening position. Because the multiple audio channels of the audio signal are downmixed iteratively into monophonic channels within the hierarchical process, one can pick the relevant parameters from an encoding step involving only channel pairs carrying the information needed to describe the spatial properties of the multi-channel audio signal.
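
    The hierarchical downmix described in this abstract can be illustrated with a minimal Python sketch. It is not the patented encoder: the four-channel layout, the function names (`encode`, `coherence`, `level_difference_db`), the equal-weight downmix and the broadband (non-subband) processing are all assumptions made for illustration. The only point it shows is that a coherence value is computed solely at the step that pairs a left-only channel with a right-only channel.

```python
import math


def energy(x):
    """Sum of squared samples."""
    return sum(v * v for v in x)


def level_difference_db(a, b, eps=1e-12):
    """Energy ratio of two channels in dB (hypothetical level parameter)."""
    return 10.0 * math.log10((energy(a) + eps) / (energy(b) + eps))


def coherence(a, b, eps=1e-12):
    """Normalized cross-correlation at lag zero (hypothetical coherence measure)."""
    cross = sum(p * q for p, q in zip(a, b))
    return cross / math.sqrt((energy(a) + eps) * (energy(b) + eps))


def downmix(a, b):
    """Equal-weight mono downmix of a channel pair (assumed, not from the source)."""
    return [0.5 * (p + q) for p, q in zip(a, b)]


def encode(left_front, left_surround, right_front, right_surround):
    """Iteratively downmix four channels to mono, collecting side information."""
    params = {}
    # Step 1: both channels carry left-side information only,
    # so only a level parameter is kept for this pair.
    params["cld_left"] = level_difference_db(left_front, left_surround)
    left = downmix(left_front, left_surround)
    # Step 2: same for the right side.
    params["cld_right"] = level_difference_db(right_front, right_surround)
    right = downmix(right_front, right_surround)
    # Step 3: a left-only channel meets a right-only channel;
    # this is the only step where coherence is derived.
    params["cld_lr"] = level_difference_db(left, right)
    params["icc_lr"] = coherence(left, right)
    mono = downmix(left, right)
    return mono, params
```

    Calling `encode(lf, ls, rf, rs)` on four equal-length sample lists returns the mono downmix and a dictionary holding three level parameters and a single coherence parameter, which is what keeps the side information compact in this sketch.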

    Stereo compatible multi-channel audio coding
    42.
    Type: Patent application
    Status: In force

    Publication No.: US20060133618A1

    Publication date: 2006-06-22

    Application No.: US11286239

    Filing date: 2005-11-23

    IPC classification: H04R5/00

    CPC classification: G10L19/008

    Abstract: A parametric representation of a multi-channel audio signal, having parameters suited to be used together with a monophonic downmix signal to calculate a reconstruction of the multi-channel audio signal, can be derived efficiently in a stereo-backwards-compatible way when a parameter combiner is used to generate the parametric representation by combining one or more spatial parameters and a stereo parameter. The result is a parametric representation having a decoder-usable stereo parameter and information on the one or more spatial parameters that, together with the decoder-usable stereo parameter, represents the one or more spatial parameters.

    Scene change detection around a set of seed points in media data
    43.
    Type: Granted patent
    Status: In force

    Publication No.: US09317561B2

    Publication date: 2016-04-19

    Application No.: US13997860

    Filing date: 2011-12-15

    Abstract: Techniques for scene change detection around seed points in media data are provided. Media features of many different types may be extracted from the media data. One or more statistical patterns of media features in a plurality of time-wise intervals around a plurality of seed time points of the media data may be determined using one or more types of features extractable from the media data. At least one of the one or more types of features comprises a type of feature that captures structural properties, tonality including harmony and melody, timbre, rhythm, loudness, stereo mix, or a quantity of sound sources as related to the media data. A plurality of beginning scene change points and a plurality of ending scene change points in the media data may be detected, based on the one or more statistical patterns, for the plurality of seed time points in the media data.

    Advanced processing based on a complex-exponential-modulated filterbank and adaptive time signalling methods
    46.
    Type: Patent application
    Status: In force

    Publication No.: US20070121952A1

    Publication date: 2007-05-31

    Application No.: US11698611

    Filing date: 2007-01-26

    IPC classification: H04H5/00

    Abstract: A synthesizer for generating a decorrelation signal using an input signal is operative on a plurality of subband signals, wherein a subband signal includes a sequence of at least two subband samples, the sequence of the subband samples representing a bandwidth of the subband signal, which is smaller than a bandwidth of the input signal. The synthesizer includes a filter stage for filtering each subband signal using a reverberation filter to obtain a plurality of reverberated subband signals, wherein the plurality of reverberated subband signals together represent the decorrelation signal. This decorrelation signal is used for reconstructing a signal based on a parametrically encoded stereo signal consisting of a mono signal and a coherence measure.
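
    As a rough illustration of filtering each subband with a reverberation-like filter, the following Python sketch applies a Schroeder allpass section to every subband signal. The allpass section is a stand-in chosen for this example; the abstract does not specify the reverberation filter, and the names `allpass` and `decorrelate`, the per-band delays and the gain value are all hypothetical. The filterbank analysis that produces the subband samples is assumed to have been done elsewhere.

```python
def allpass(samples, delay, gain):
    """Schroeder allpass: y[n] = -g*x[n] + x[n-d] + g*y[n-d].

    Works on real or complex subband samples; values before the
    start of the sequence are treated as zero.
    """
    out = []
    for n, x in enumerate(samples):
        x_delayed = samples[n - delay] if n >= delay else 0.0
        y_delayed = out[n - delay] if n >= delay else 0.0
        out.append(-gain * x + x_delayed + gain * y_delayed)
    return out


def decorrelate(subbands, delays, gain=0.5):
    """Filter each subband signal with its own allpass section.

    `subbands` is a list of sample sequences, one per subband;
    `delays` gives one (assumed) delay in subband samples per band.
    The filtered subbands together stand for the decorrelation signal.
    """
    return [allpass(band, d, gain) for band, d in zip(subbands, delays)]
```

    Because an allpass section leaves the magnitude spectrum unchanged while smearing the phase, its output has a similar spectral envelope to the input but low correlation with it, which is the property a decoder needs when mixing the mono signal with a decorrelated version according to a coherence measure.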

    Efficient and scalable parametric stereo coding for low bitrate audio coding applications
    49.
    Type: Granted patent
    Status: In force

    Publication No.: US09218818B2

    Publication date: 2015-12-22

    Application No.: US13458492

    Filing date: 2012-04-27

    Abstract: The present invention provides improvements to prior-art audio codecs that generate a stereo illusion through post-processing of a received mono signal. These improvements are accomplished by extracting stereo-image-describing parameters at the encoder side, which are transmitted and subsequently used to control a stereo generator at the decoder side. Furthermore, the invention bridges the gap between simple pseudo-stereo methods and current methods of true stereo coding by using a new form of parametric stereo coding. A stereo-balance parameter is introduced, which enables more advanced stereo modes and in addition forms the basis of a new method of stereo coding of spectral envelopes, of particular use in systems where guided HFR (High Frequency Reconstruction) is employed. As a special case, the application of this stereo-coding scheme in scalable HFR-based codecs is described.
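
    The idea of extracting a stereo-balance parameter at the encoder and using it to steer a mono signal at the decoder can be sketched in Python as follows. This is a simplified assumption, not the scheme claimed in the patent: the balance is defined here as the left channel's share of the total energy of one frame, it is computed broadband rather than per spectral-envelope band, and the names `encode` and `decode` are hypothetical.

```python
import math


def energy(x):
    """Sum of squared samples."""
    return sum(v * v for v in x)


def encode(left, right, eps=1e-12):
    """Return a mono downmix and a balance value in [0, 1] for one frame.

    balance = E_left / (E_left + E_right); 0.5 means a centered image.
    """
    mono = [0.5 * (l + r) for l, r in zip(left, right)]
    e_l, e_r = energy(left), energy(right)
    balance = (e_l + eps) / (e_l + e_r + 2.0 * eps)
    return mono, balance


def decode(mono, balance):
    """Rebuild a stereo pair by panning the mono signal with the balance.

    The gains satisfy g_l**2 + g_r**2 == 2, so a centered balance
    reproduces the mono signal unchanged in both channels.
    """
    g_l = math.sqrt(2.0 * balance)
    g_r = math.sqrt(2.0 * (1.0 - balance))
    return [g_l * m for m in mono], [g_r * m for m in mono]
```

    In this sketch the decoder restores the left/right energy ratio of the original frame but not the waveform of each channel, which mirrors the trade-off of a parametric approach: a single transmitted value per frame (or per band, in a fuller design) instead of a second coded channel.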

    EFFICIENT AND SCALABLE PARAMETRIC STEREO CODING FOR LOW BITRATE AUDIO CODING APPLICATIONS
    50.
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20120213377A1

    Publication date: 2012-08-23

    Application No.: US13458492

    Filing date: 2012-04-27

    IPC classification: H04R5/00

    Abstract: The present invention provides improvements to prior-art audio codecs that generate a stereo illusion through post-processing of a received mono signal. These improvements are accomplished by extracting stereo-image-describing parameters at the encoder side, which are transmitted and subsequently used to control a stereo generator at the decoder side. Furthermore, the invention bridges the gap between simple pseudo-stereo methods and current methods of true stereo coding by using a new form of parametric stereo coding. A stereo-balance parameter is introduced, which enables more advanced stereo modes and in addition forms the basis of a new method of stereo coding of spectral envelopes, of particular use in systems where guided HFR (High Frequency Reconstruction) is employed. As a special case, the application of this stereo-coding scheme in scalable HFR-based codecs is described.
