Apparatus and method for extracting an ambient signal in an apparatus and method for obtaining weighting coefficients for extracting an ambient signal and computer program
    71.
    Granted invention patent
    Status: in force

    Publication No.: US08588427B2

    Publication date: 2013-11-19

    Application No.: US12055787

    Filing date: 2008-03-26

    IPC classification: H04R5/00

    CPC classification: H04R5/04

    Abstract: An apparatus for extracting an ambient signal from an input audio signal comprises a gain-value determinator configured to determine a sequence of time-varying ambient signal gain values for a given frequency band of the time-frequency distribution of the input audio signal in dependence on the input audio signal. The apparatus comprises a weighter configured to weight one of the sub-band signals representing the given frequency band of the time-frequency-domain representation with the time-varying gain values, to obtain a weighted sub-band signal. The gain-value determinator is configured to obtain one or more quantitative feature values describing one or more features of the input audio signal and to provide the gain values as a function of the one or more quantitative feature values such that the gain values are quantitatively dependent on the quantitative feature values. The gain-value determinator is configured to determine the gain values such that ambience components are emphasized over non-ambience components in the weighted sub-band signal.
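
The weighting step in this abstract can be illustrated with a minimal sketch. It assumes one complex coefficient and one feature value per time frame for a single frequency band. The names `weight_subband` and `example_gain`, and the clamping rule used as the feature-to-gain mapping, are hypothetical and not taken from the patent.

```python
from typing import Callable, Sequence


def weight_subband(
    subband: Sequence[complex],
    features: Sequence[float],
    gain_from_feature: Callable[[float], float],
) -> list[complex]:
    """Weight one sub-band signal with time-varying gain values.

    subband: one complex coefficient per time frame for a given band.
    features: one quantitative feature value per time frame (assumed input).
    gain_from_feature: hypothetical mapping from a feature value to a gain.
    """
    if len(subband) != len(features):
        raise ValueError("need one feature value per time frame")
    gains = [gain_from_feature(f) for f in features]
    return [g * x for g, x in zip(gains, subband)]


def example_gain(ambience_score: float) -> float:
    """Illustrative mapping only: clamp a score to [0, 1] and use it as gain."""
    return min(1.0, max(0.0, ambience_score))


if __name__ == "__main__":
    band = [1 + 0j, 2 + 0j, -1j, 2 + 0j]
    scores = [0.5, 0.0, 1.5, -0.2]
    print(weight_subband(band, scores, example_gain))
```

Frames with a high score pass through almost unchanged and frames with a low score are attenuated, which is one simple way to emphasize ambience over non-ambience components.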

    Audio encoder, method for providing output signal, bandwidth extension decoder, and method for providing bandwidth extended audio signal
    73.
    Granted invention patent
    Status: in force

    Publication No.: US08401862B2

    Publication date: 2013-03-19

    Application No.: US13158547

    Filing date: 2011-06-13

    IPC classification: G10L19/00

    Abstract: An audio encoder for providing an output signal using an input audio signal includes a patch generator, a comparator and an output interface. The patch generator generates at least one bandwidth extension high-frequency signal, wherein a bandwidth extension high-frequency signal includes a high-frequency band. The high-frequency band of the bandwidth extension high-frequency signal is based on a low-frequency band of the input audio signal. The comparator calculates a plurality of comparison parameters. A comparison parameter is calculated based on a comparison of the input audio signal and a generated bandwidth extension high-frequency signal. Each comparison parameter of the plurality of comparison parameters is calculated based on a different offset frequency between the input audio signal and a generated bandwidth extension high-frequency signal. Further, the comparator determines a comparison parameter from the plurality of comparison parameters, wherein the determined comparison parameter fulfils a predefined criterion.
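
The offset search described in this abstract can be sketched as follows. The sketch assumes magnitude spectra given as plain lists, a normalized correlation as the comparison parameter, and "largest correlation" as the predefined criterion. All three are assumptions made for illustration, and the function names are hypothetical.

```python
import math
from typing import Sequence


def normalized_correlation(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length sequences (0.0 if either is all zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0.0 else 0.0


def best_patch_offset(
    low_band: Sequence[float], high_band: Sequence[float]
) -> tuple[int, float]:
    """Return (offset, comparison parameter) of the best-matching patch.

    Each candidate patch is a slice of the low band starting at a different
    offset (in bins); it is compared against the original high band.
    """
    width = len(high_band)
    if width == 0 or width > len(low_band):
        raise ValueError("high band must be non-empty and no wider than the low band")
    candidates = []
    for offset in range(len(low_band) - width + 1):
        patch = low_band[offset:offset + width]
        candidates.append((offset, normalized_correlation(patch, high_band)))
    # Assumed criterion: keep the candidate with the largest correlation.
    return max(candidates, key=lambda c: c[1])


if __name__ == "__main__":
    low = [0.0, 1.0, 0.0, 3.0, 2.0, 1.0, 0.0, 0.0]
    high = [3.0, 2.0, 1.0]
    print(best_patch_offset(low, high))
```

In an encoder, the selected offset would be what the output interface transmits, so that a decoder can regenerate the same patch.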

    Audio decoder, audio object encoder, method for decoding a multi-audio-object signal, multi-audio-object encoding method, and non-transitory computer-readable medium therefor
    74.
    Granted invention patent
    Status: in force

    Publication No.: US08280744B2

    Publication date: 2012-10-02

    Application No.: US12253515

    Filing date: 2008-10-17

    IPC classification: G10L19/00

    Abstract: An audio decoder for decoding a multi-audio-object signal having an audio signal of a first type and an audio signal of a second type encoded therein is described, the multi-audio-object signal having a downmix signal and side information, the side information having level information of the audio signals of the first and second types in a first predetermined time/frequency resolution, and a residual signal specifying residual level values in a second predetermined time/frequency resolution, the audio decoder having a processor for computing prediction coefficients based on the level information; and an up-mixer for up-mixing the downmix signal based on the prediction coefficients and the residual signal to obtain a first up-mix audio signal approximating the audio signal of the first type and/or a second up-mix audio signal approximating the audio signal of the second type.

    MULTI-MODE AUDIO SIGNAL DECODER, MULTI-MODE AUDIO SIGNAL ENCODER, METHODS AND COMPUTER PROGRAM USING A LINEAR-PREDICTION-CODING BASED NOISE SHAPING
    75.
    Invention patent application
    Status: in force

    Publication No.: US20120245947A1

    Publication date: 2012-09-27

    Application No.: US13441469

    Filing date: 2012-04-06

    IPC classification: G10L19/00

    CPC classification: G10L19/20, G10L19/022

    Abstract: A multi-mode audio signal decoder has a spectral value determinator to obtain sets of decoded spectral coefficients for a plurality of portions of an audio content and a spectrum processor configured to apply a spectral shaping to a set of spectral coefficients in dependence on a set of linear-prediction-domain parameters for a portion of the audio content encoded in a linear-prediction mode, and in dependence on a set of scale factor parameters for a portion of the audio content encoded in a frequency-domain mode. The audio signal decoder has a frequency-domain-to-time-domain converter configured to obtain a time-domain audio representation on the basis of a spectrally-shaped set of decoded spectral coefficients for a portion of the audio content encoded in the linear-prediction mode and for a portion of the audio content encoded in the frequency-domain mode. An audio signal encoder is also described.

    Diffuse sound shaping for BCC schemes and the like
    76.
    Granted invention patent
    Status: in force

    Publication No.: US08238562B2

    Publication date: 2012-08-07

    Application No.: US12550519

    Filing date: 2009-08-31

    IPC classification: H04R5/00

    CPC classification: G10L19/008, H04S3/02

    Abstract: In one embodiment, C input audio channels are encoded to generate E transmitted audio channel(s), where one or more cue codes are generated for two or more of the C input channels, and the C input channels are downmixed to generate the E transmitted channel(s), where C > E ≥ 1. One or more of the C input channels and the E transmitted channel(s) are analyzed to generate a flag indicating whether or not a decoder of the E transmitted channel(s) should perform envelope shaping during decoding of the E transmitted channel(s). In one implementation, envelope shaping adjusts a temporal envelope of a decoded channel generated by the decoder to substantially match a temporal envelope of a corresponding transmitted channel.

    Diffuse sound shaping for BCC schemes and the like
    77.
    Granted invention patent
    Status: in force

    Publication No.: US08204261B2

    Publication date: 2012-06-19

    Application No.: US11006492

    Filing date: 2004-12-07

    IPC classification: H04R5/02

    CPC classification: G10L19/008, H04S3/02

    Abstract: An input audio signal having an input temporal envelope is converted into an output audio signal having an output temporal envelope. The input temporal envelope of the input audio signal is characterized. The input audio signal is processed to generate a processed audio signal, wherein the processing de-correlates the input audio signal. The processed audio signal is adjusted based on the characterized input temporal envelope to generate the output audio signal, wherein the output temporal envelope substantially matches the input temporal envelope.
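
The envelope adjustment in this abstract can be sketched under simple assumptions: the temporal envelope is characterized as a per-frame RMS value, and the processed (de-correlated) signal is supplied by some other component. The frame length and both function names are hypothetical choices for illustration.

```python
import math
from typing import Sequence


def frame_rms(signal: Sequence[float], frame_len: int) -> list[float]:
    """Characterize a temporal envelope as one RMS value per frame."""
    envelope = []
    for start in range(0, len(signal), frame_len):
        frame = signal[start:start + frame_len]
        envelope.append(math.sqrt(sum(x * x for x in frame) / len(frame)))
    return envelope


def shape_envelope(
    input_signal: Sequence[float], processed: Sequence[float], frame_len: int = 4
) -> list[float]:
    """Scale each frame of `processed` so its RMS matches the input's RMS."""
    if len(input_signal) != len(processed):
        raise ValueError("signals must have the same length")
    target = frame_rms(input_signal, frame_len)
    actual = frame_rms(processed, frame_len)
    out: list[float] = []
    for k, (t, a) in enumerate(zip(target, actual)):
        scale = t / a if a > 0.0 else 0.0  # silent frames stay silent
        start = k * frame_len
        out.extend(scale * x for x in processed[start:start + frame_len])
    return out


if __name__ == "__main__":
    dry = [1.0] * 4 + [0.1] * 4            # loud frame, then quiet frame
    wet = [0.5, -0.5] * 4                  # stand-in for a de-correlated signal
    print(frame_rms(shape_envelope(dry, wet), 4))
```

A practical system would likely smooth the scale factors across frame boundaries; the hard per-frame scaling here only shows the matching principle.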

    Generation of decorrelated signals
    78.
    Granted invention patent
    Status: in force

    Publication No.: US08145499B2

    Publication date: 2012-03-27

    Application No.: US12440940

    Filing date: 2008-04-14

    IPC classification: G10L19/00, H03G3/00

    Abstract: In a case of transient audio input signals, in a multi-channel audio reconstruction, uncorrelated output signals are generated from an audio input signal in that the audio input signal is mixed with a representation of the audio input signal delayed by a delay time such that, in a first time interval, a first output signal corresponds to the audio input signal, and a second output signal corresponds to the delayed representation of the audio input signal, wherein, in a second time interval, the first output signal corresponds to the delayed representation of the audio input signal, and the second output signal corresponds to the audio input signal.
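
The interval-wise swapping described in this abstract can be sketched as below. The sketch assumes a hard switch between the direct and the delayed signal at fixed-length intervals, with delay and interval length given in samples. A real system might cross-fade instead; the function name and parameters are hypothetical.

```python
from typing import Sequence


def swap_decorrelate(
    x: Sequence[float], delay: int, interval: int
) -> tuple[list[float], list[float]]:
    """Build two outputs by alternating the direct and the delayed signal.

    In even-numbered intervals output 1 carries the direct signal and
    output 2 the delayed one; in odd-numbered intervals the roles swap.
    """
    if delay < 0 or interval <= 0:
        raise ValueError("delay must be >= 0 and interval must be > 0")
    # Delayed copy, zero-padded at the start and truncated to the input length.
    delayed = ([0.0] * delay + list(x))[:len(x)]
    out1: list[float] = []
    out2: list[float] = []
    for n in range(len(x)):
        if (n // interval) % 2 == 0:
            out1.append(x[n])
            out2.append(delayed[n])
        else:
            out1.append(delayed[n])
            out2.append(x[n])
    return out1, out2


if __name__ == "__main__":
    a, b = swap_decorrelate([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], delay=1, interval=2)
    print(a)
    print(b)
```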

    Apparatus and Method for Synchronizing Additional Data and Base Data
    79.
    Invention patent application
    Status: under examination (published)

    Publication No.: US20110282471A1

    Publication date: 2011-11-17

    Application No.: US13190221

    Filing date: 2011-07-25

    IPC classification: G06F17/00

    Abstract: For adding additional data, such as multi-channel extension data, to base data, such as conventional stereo data, a test fingerprint of test data relating to a test time instant of the test data is provided. The test data equals the additional data or the base data or depends on the additional data or the base data in a parametric manner. Using the test fingerprint, reference time instant information is determined, which depends on a reference time instant in reference data, the reference data being the conventional stereo data. Finally, the additional data or the base data is manipulated, namely using the reference time instant information and the test time instant information, to obtain manipulated data, by which synchronous reproduction of the data information can be performed. Thus, a robust and flexible possibility for synchronous, especially late, extension of base data by additional data is obtained.
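
The fingerprint matching in this abstract can be sketched as follows. The abstract does not specify a fingerprint, so this sketch assumes a hypothetical one: one bit per block boundary that records whether block energy rises. The reference time instant is then taken as the position with the fewest differing bits. All names are illustrative.

```python
from typing import Sequence


def block_fingerprint(samples: Sequence[float], block_len: int) -> list[int]:
    """Hypothetical fingerprint: 1 where energy rises from one block to the next."""
    energies = [
        sum(s * s for s in samples[i:i + block_len])
        for i in range(0, len(samples) - block_len + 1, block_len)
    ]
    return [1 if b > a else 0 for a, b in zip(energies, energies[1:])]


def find_reference_instant(test_fp: Sequence[int], reference_fp: Sequence[int]) -> int:
    """Position in reference_fp where test_fp matches best (fewest differing bits)."""
    if not test_fp or len(test_fp) > len(reference_fp):
        raise ValueError("test fingerprint must be non-empty and fit in the reference")

    def distance(pos: int) -> int:
        window = reference_fp[pos:pos + len(test_fp)]
        return sum(t != r for t, r in zip(test_fp, window))

    return min(range(len(reference_fp) - len(test_fp) + 1), key=distance)


def sync_offset(test_instant: int, reference_instant: int) -> int:
    """Shift (in blocks) to apply to the test data so both streams line up."""
    return reference_instant - test_instant


if __name__ == "__main__":
    reference = [0, 1, 1, 0, 1, 0, 0, 1]
    test = [0, 1, 0, 0]
    ref_instant = find_reference_instant(test, reference)
    print(ref_instant, sync_offset(0, ref_instant))
```

The resulting offset is what a player would use to delay or advance the additional data relative to the base data.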

    Frequency-based coding of channels in parametric multi-channel coding systems
    80.
    Granted invention patent
    Status: in force

    Publication No.: US07805313B2

    Publication date: 2010-09-28

    Application No.: US10827900

    Filing date: 2004-04-20

    IPC classification: G10L19/00, H04R5/00

    Abstract: For a multi-channel audio signal, parametric coding is applied to different subsets of audio input channels for different frequency regions. For example, for a 5.1 surround sound signal having five regular channels and one low-frequency (LFE) channel, binaural cue coding (BCC) can be applied to all six audio channels for sub-bands at or below a specified cut-off frequency, but to only five audio channels (excluding the LFE channel) for sub-bands above the cut-off frequency. Such frequency-based coding of channels can reduce the encoding and decoding processing loads and/or size of the encoded audio bitstream relative to parametric coding techniques that are applied to all input channels over the entire frequency range.
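
The per-sub-band channel selection in this abstract can be sketched in a few lines. The channel labels and the 120 Hz cut-off used in the demo are assumptions for illustration; the abstract only states that a specified cut-off frequency exists.

```python
from typing import Sequence

# Assumed labels for a 5.1 layout: five regular channels plus the LFE channel.
CHANNELS = ("L", "R", "C", "Ls", "Rs", "LFE")


def channels_for_subband(
    center_hz: float,
    cutoff_hz: float,
    channels: Sequence[str] = CHANNELS,
    lfe: str = "LFE",
) -> tuple[str, ...]:
    """Select the channels that get parametric coding in one sub-band.

    At or below the cut-off all channels are coded; above it the LFE
    channel is excluded.
    """
    if center_hz <= cutoff_hz:
        return tuple(channels)
    return tuple(c for c in channels if c != lfe)


if __name__ == "__main__":
    for hz in (60.0, 120.0, 1000.0):
        print(hz, channels_for_subband(hz, cutoff_hz=120.0))
```

Skipping the LFE channel above the cut-off means fewer cue codes per sub-band, which is the source of the reduced processing load and bitstream size mentioned in the abstract.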