Multi-channel hierarchical audio coding with compact side information
    21.
    发明授权
    Multi-channel hierarchical audio coding with compact side information 有权
    具有紧凑侧面信息的多通道分层音频编码

    公开(公告)号:US07961890B2

    公开(公告)日:2011-06-14

    申请号:US11314711

    申请日:2005-12-21

    IPC分类号: H04R5/00

    摘要: A parametric representation of a multi-channel audio signal describes the spatial properties of the audio signal well with compact side information when a coherence information, describing the coherence between a first and a second channel, is derived within a hierarchical encoding process only for channel pairs including a first channel having only information of a left side with respect to a listening position and including a second channel having only information from a right side with respect to a listening position. As within the hierarchical process the multiple audio channels of the audio signal are downmixed iteratively into monophonic channels, one can pick the relevant parameters from an encoding step involving only channel pairs carrying the information needed to describe the spatial properties of the multi-channel audio signal.

    摘要翻译: 多通道音频信号的参数化表示利用紧凑的侧信息来描述音频信号的空间属性,当描述第一和第二信道之间的相干性的相干信息仅在信道对的分级编码处理中导出时 包括仅具有相对于收听位置的左侧的信息的第一频道,并且包括仅具有相对于收听位置的右侧的信息的第二频道。 如在分层处理中,音频信号的多个音频通道被迭代地下混合成单声道,可以从仅涉及携带描述多声道音频信号的空间属性所需的信息的信道对的编码步骤中选择相关参数 。

    Entropy coding with compact codebooks
    22.
    发明授权
    Entropy coding with compact codebooks 有权
    使用紧凑型码本进行熵编码

    公开(公告)号:US07788106B2

    公开(公告)日:2010-08-31

    申请号:US11251485

    申请日:2005-10-14

    IPC分类号: G10L21/04 H03M7/40 G06K9/46

    CPC分类号: H03M7/42

    摘要: The present invention is based on the finding that an efficient code for encoding information values can be derived, when two or more information values are grouped in a tuple in a tuple order and when an encoding rule is used, that assigns the same code word to tuples having identical information values in different orders and that does derive an order information, indicating the tuple order, and when the code word is output in association with the order information.

    摘要翻译: 本发明基于以下发现:当两个或多个信息值以元组顺序被分组成元组并且当使用编码规则时,可以导出用于编码信息值的有效代码,将相同的代码字分配给 元组具有不同顺序的相同信息值,并且确定导出指示元组顺序的订单信息,以及何时与订单信息相关联地输出代码字。

    Apparatus and method for synthesizing three output channels using two input channels
    23.
    发明授权
    Apparatus and method for synthesizing three output channels using two input channels 有权
    使用两个输入通道合成三个输出通道的装置和方法

    公开(公告)号:US07760886B2

    公开(公告)日:2010-07-20

    申请号:US11313180

    申请日:2005-12-20

    IPC分类号: H04R5/00

    CPC分类号: H04S5/00

    摘要: For synthesizing at least three output channels using two stereo input channels, the stereo input channels are analyzed to detect signal components occurring in both input channels. A signal generator is operative to introduce at least a part of the detected signal components into the second channel associated with a second speaker in an intended speaker scheme, which is positioned between a first and a third speaker in the speaker scheme. When, however, feeding of the complete detected signal components would result in a clipping situation, then only a part of the detected signal components is fed into the second channel as a real center channel and the remainder is located in the first and third channels as a phantom center channel.

    摘要翻译: 为了使用两个立体声输入通道合成至少三个输出通道,分析立体声输入通道以检测在两个输入通道中发生的信号分量。 信号发生器可操作以将所检测的信号分量的至少一部分引入与位于扬声器方案中的第一和第三扬声器之间的预期扬声器方案中与第二扬声器相关联的第二通道。 然而,当提供完整检测到的信号分量将导致削波情况时,则只有一部分检测到的信号分量被馈送到第二信道中作为实际中心信道,其余部分位于第一和第三信道中,如 幻影中心频道。

    Method and device for characterizing a signal and method and device for producing an indexed signal
    24.
    发明授权
    Method and device for characterizing a signal and method and device for producing an indexed signal 有权
    用于表征信号的方法和装置以及用于产生索引信号的方法和装置

    公开(公告)号:US07478045B2

    公开(公告)日:2009-01-13

    申请号:US10484513

    申请日:2002-07-15

    IPC分类号: G10L15/02 G10L19/02

    摘要: In a method for characterizing a signal representing an audio content a measure is determined for a tonality of the signal, whereupon a statement is made about the audio content of the signal on the basis of the measure for the tonality of the signal. The measure for the tonality is derived from a quotient whose numerator is the mean of the summed values of spectral components of the signal exponentiated with a first power and whose denominator is the mean of the summed values of spectral components exponentiated with a second power, the first and second powers differing from each other. The measure for the tonality of the signal for the content analysis is robust in relation to a signal distortion, due e.g. to MP3 coding, and has a high correlation with the content of the analyzed signal.

    摘要翻译: 在表征音频内容的信号的表征方法中,针对信号的音调确定了一个度量,然后根据该信号音调的度量,对该信号的音频内容做出声明。 音调的度量来自商,其分子是以第一功率取幂的信号的频谱分量的总和值的平均值,其分母是用第二功率指数的频谱分量的总和值的平均值, 第一和第二权力彼此不同。 用于内容分析的信号的音调的度量相对于信号失真是鲁棒的,例如。 到MP3编码,并且与分析的信号的内容具有高度的相关性。

    Process for coding and decoding stereophonic spectral values
    25.
    发明授权
    Process for coding and decoding stereophonic spectral values 有权
    立体声频谱值的编码和解码过程

    公开(公告)号:US06771777B1

    公开(公告)日:2004-08-03

    申请号:US09214656

    申请日:1999-05-28

    IPC分类号: H04H500

    CPC分类号: H04S1/007

    摘要: A method of coding stereo audio spectral values first carries out grouping of those values in scale factor bands, with which scale factors are associated. Sections are formed next, each comprising at least one scale factor band. The spectral values are coded within at least one section with a code book assigned to the section, out of a plurality of code books each with a code book number assigned to it, the number of the code book used being transmitted as side information to the coded stereo audio spectral values. At least one additional code book number is provided, which does not refer to a code book but shows information relevant to the section to which it is assigned. A method of decoding stereo audio spectral values which are partly coded by the intensity stereo process and which have side information uses the relevant information, showing the additional code book numbers, to cancel the existing coding of the stereo audio spectral values.

    摘要翻译: 对立体声音频频谱值进行编码的方法首先对与比例因子相关联的比例因子频带中的那些值进行分组。 接下来形成切片,每个部分包括至少一个比例因子带。 频谱值在至少一个部分内被编码,其中分配有代码簿的部分,在分配有代码簿编号的多个代码簿中,使用的代码簿的编号作为辅助信息被发送到 编码立体声音频频谱值。 提供至少一个附加的代码簿编号,其不涉及代码簿,但是显示与其被分配的部分相关的信息。 解码由强度立体声处理部分地编码并且具有侧面信息的立体声音频频谱值的方法使用显示附加码本号码的相关信息来取消立体声音频频谱值的现有编码。

    Method and a device for coding audio signals and a method and a device for decoding a bit stream
    26.
    发明授权
    Method and a device for coding audio signals and a method and a device for decoding a bit stream 有权
    用于编码音频信号的方法和装置以及用于解码比特流的方法和装置

    公开(公告)号:US06502069B1

    公开(公告)日:2002-12-31

    申请号:US09530001

    申请日:2000-04-20

    IPC分类号: G10L1912

    CPC分类号: H04B1/665 H04B14/046

    摘要: The present invention permits a combination of a scalable audio coder with the TNS technique. In a method for coding time signals sampled in a first sampling rate, second time signals are first generated whose sampling rate is smaller than the first sampling rate. The second time signals are then coded according to a first coding algorithm and written into a bit stream. The coded second time signals are, however, decoded again, and, like the first time signals, transformed into the frequency domain. From a spectral representation of the first time signals, TNS prediction coefficients are calculated. The transformed output signal of the coder/decoder with the first coding algorithm, like the spectral representation of the first time signal, undergoes a prediction over the frequency to obtain residual spectral values for both signals, though only the prediction coefficients calculated on the basis of the first time signals are used. These two signals are evaluated against each other. The evaluated residual spectral values are then coded by means of a second coding algorithm to obtain coded evaluated residual spectral values, which, together with the side information containing the calculated prediction coefficients, are written into the bit stream.

    摘要翻译: 本发明允许可扩展音频编码器与TNS技术的组合。 在对以第一采样率采样的时间信号进行编码的方法中,首先生成采样率小于第一采样率的第二时间信号。 然后根据第一编码算法对第二时间信号进行编码并写入比特流。 然而,编码的第二时间信号被再次解码,并且像第一次信号一样被转换成频域。 根据第一时间信号的频谱表示,计算TNS预测系数。 使用第一编码算法的编码器/解码器的变换输出信号,如第一时间信号的频谱表示,对频率进行预测,以获得两个信号的残差频谱值,尽管仅基于 第一次使用信号。 这两个信号被相互评估。 然后通过第二编码算法对所评估的残差频谱值进行编码,以获得编码的估计残差频谱值,其与包含计算的预测系数的边信息一起写入比特流。

    Method and device for detecting a transient in a discrete-time audiosignal
    27.
    发明授权
    Method and device for detecting a transient in a discrete-time audiosignal 有权
    用于检测离散时间音频信号中的瞬态的方法和装置

    公开(公告)号:US06453282B1

    公开(公告)日:2002-09-17

    申请号:US09424596

    申请日:1999-11-24

    IPC分类号: G10L1900

    CPC分类号: H04B1/665

    摘要: A method for detecting a transient in a discrete-time audio signal is performed completely in the time domain and includes the step of segmenting the discrete-time audio signal as to generate consecutive segments of the same length with unfiltered discrete-time audio signals. The discrete-time audio signal in a current segment is filtered. Either the energy of the filtered discrete-time audio signal in the current segment is compared with the energy of the filtered discrete-time audio signal in a preceding segment or a current relationship between the energy of the filtered discrete-time audio signal in the current segment and the energy of the unfiltered discrete-time audio signal in the current segment is formed and this current relationship compared with a preceding corresponding relationship. Whether a transient is present in the discrete-time audio signal is detected using one and/or the other of these comparisons.

    摘要翻译: 用于检测离散时间音频信号中的瞬态的方法在时域中完全执行,并且包括分段离散时间音频信号以生成具有未滤波离散时间音频信号的相同长度的连续片段的步骤。 当前片段中的离散时间音频信号被过滤。 将当前片段中滤波的离散时间音频信号的能量与先前片段中滤波的离散时间音频信号的能量或电流中滤波后的离散时间音频信号的能量之间的当前关系进行比较 形成当前段中未经滤波的离散时间音频信号的能量,并将该当前关系与先前的对应关系进行比较。 使用这些比较中的一个和/或另一个来检测离散时间音频信号中是否存在瞬态。

    Method for masking defects in a stream of audio data
    28.
    发明授权
    Method for masking defects in a stream of audio data 有权
    用于掩蔽音频数据流中的缺陷的方法

    公开(公告)号:US06421802B1

    公开(公告)日:2002-07-16

    申请号:US09331697

    申请日:1999-06-23

    IPC分类号: G10L1900

    摘要: In a method for concealing errors in an audio data stream the occurrence of an error is detected in the audio data stream, audio data prior to the occurrence of the fault being intact audio data. Thereafter a spectral energy of a subgroup of the intact audio data is calculated. After forming a pattern for substitute data on the basis of the spectral energy calculated for the subgroup of the intact audio data, substitute data for erroneous or missing audio data which correspond to the subgroup are created on the basis of the pattern.

    摘要翻译: 在用于隐藏音频数据流中的错误的方法中,在音频数据流中检测到错误的发生,在发生故障之前的音频数据是完整的音频数据。 此后,计算完整音频数据的子组的频谱能量。 基于为完整音频数据的子组计算的频谱能量形成用于替代数据的模式之后,基于该模式创建与该子组对应的错误或缺失音频数据的替代数据。