-
公开(公告)号:US20070239295A1
公开(公告)日:2007-10-11
申请号:US11710070
申请日:2007-02-23
申请人: Jeffrey Thompson , Robert Reams , Aaron Warner
发明人: Jeffrey Thompson , Robert Reams , Aaron Warner
IPC分类号: G06F17/00
CPC分类号: G10L19/265 , G10L19/008 , G10L19/02 , G10L19/032
摘要: An audio processing application is provided which utilizes an audio codec encode/decode simulation system and a psychoacoustic model to estimate audible quantization noise that may occur during lossy audio compression. Mask-to-noise ratio values are computed for a plurality of frequency bands and are used to intelligently process an audio signal specifically for a given audio codec. In one exemplary embodiment, the mask-to-noise ratio values are used to reduce the extent of perceived artifacts for lossy compression, such as by modifying the energy and/or coherence of frequency bands in which quantization noise is estimated to exceed the masking threshold.
摘要翻译: 提供了一种音频处理应用,其利用音频编解码器/解码模拟系统和心理声学模型来估计在有损音频压缩期间可能发生的可听量化噪声。 针对多个频带计算面对噪声比值,并且用于智能处理特定于给定音频编解码器的音频信号。 在一个示例性实施例中,掩模 - 噪声比值被用于减少用于有损压缩的感知伪像的程度,例如通过修改估计量化噪声超过掩蔽阈值的频带的能量和/或相干性 。
-
公开(公告)号:US20060106620A1
公开(公告)日:2006-05-18
申请号:US11261100
申请日:2005-10-28
申请人: Jeffrey Thompson , Robert Reams , Aaron Warner
发明人: Jeffrey Thompson , Robert Reams , Aaron Warner
IPC分类号: G10L21/00
CPC分类号: G10L19/008
摘要: An audio spatial environment engine is provided for converting from an N channel audio system to an M channel audio system, such as in a dynamic down-mixer where N and M are integers and N is greater than M. The dynamic down-mix methodology consists of a static down-mix system utilizing an intelligent analysis and correction loop. The original N-channel audio signals are provided to a static down-mix process which produces a down-mixed M-channel audio signal. That M-channel audio signal is provided to an up-mix process which generates a subsequent N-channel audio signal. Any spectral, temporal, or spatial inaccuracies between the original N-channel audio and the subsequent up-mixed N-channel audio are then identified and corrected in the down-mixed M-channel audio signal over a plurality of frequency bands generating the final down-mixed M-channel audio signal. The corrections performed on the down-mixed M-channel audio signal consist of modifications to the relevant inter-channel spatial cues such as inter-channel level difference (ICLD) and inter-channel coherence (ICC) per frequency band.
摘要翻译: 提供了一种音频空间环境引擎,用于从N通道音频系统转换为M通道音频系统,例如在N和M为整数且N大于M的动态下混频器中。动态缩混方法包括 使用智能分析和校正循环的静态下混系统。 原始N声道音频信号被提供给产生下混合M声道音频信号的静态缩混处理。 该M声道音频信号被提供给产生后续N声道音频信号的上混合处理。 然后,在原始N声道音频和后续上混合N声道音频之间的任何频谱,时间或空间不准确性在多个频带上产生最终下降的下混合M声道音频信号中被识别和校正 混合M声道音频信号。 对下混合的M声道音频信号进行的校正包括对频道间空白线索的修改,例如每个频带的信道间电平差(ICLD)和信道间相干性(ICC)。
-
公开(公告)号:US07853022B2
公开(公告)日:2010-12-14
申请号:US11262029
申请日:2005-10-28
CPC分类号: H04S3/006
摘要: An audio spatial environment engine for flexible and scalable up-mixing from an M channel audio system to an N channel audio system, where M and N are integers and N is greater than M, is provided. The input M channel audio is provided to an analysis filter bank which converts the time domain signals into frequency domain signals. Relevant inter-channel spatial cues are extracted from the frequency domain signals on a sub-band basis and are used as parameters to generate adaptive N channel filters which control the spatial placement of a frequency band element in the up-mixed sound field. The N channel filters are smoothed across both time and frequency to limit filter variability which could cause annoying fluctuation effects. The smoothed N channel filters are then applied to adaptive combinations of the frequency domain input signals and are provided to a synthesis filter bank which generates the N channel time domain output signals.
摘要翻译: 提供了一种音频空间环境引擎,用于从M通道音频系统到N通道音频系统的灵活和可扩展的上混合,其中M和N是整数,N大于M。 输入M声道音频被提供给分析滤波器组,其将时域信号转换成频域信号。 相关的信道间空间提示是以子带为基础从频域信号中提取的,并被用作生成自适应N信道滤波器的参数,该自适应N信道滤波器控制上混频声场中的频带元素的空间位置。 N通道滤波器在时间和频率上均匀平滑,以限制滤波器变化,这可能导致烦人的波动效应。 然后将经平滑的N沟道滤波器应用于频域输入信号的自适应组合,并被提供给产生N个信道时域输出信号的合成滤波器组。
-
公开(公告)号:US20050169482A1
公开(公告)日:2005-08-04
申请号:US10975841
申请日:2004-10-28
申请人: Robert Reams , Jeffrey Thompson , Aaron Warner
发明人: Robert Reams , Jeffrey Thompson , Aaron Warner
CPC分类号: H04S3/00 , H04S2400/01
摘要: An audio spatial environment engine for converting from an N channel audio system to an M channel audio system, where N is an integer greater than M, is provided. The audio spatial environment engine includes one or more correlators receiving two or more of the N channels of audio data and eliminating delays between the channels that are irrelevant to an average human listener. One or more Hilbert transform systems each perform a Hilbert transform on one or more of the correlated channels of audio data. One or more summers receive at least one of the correlated channels of audio data and at least one of the Hilbert transformed correlated channels of audio data and generate one of the M channels of audio data.
摘要翻译: 提供了一种用于从N声道音频系统转换成M通道音频系统的音频空间环境引擎,其中N是大于M的整数。 音频空间环境引擎包括一个或多个相关器,其接收N个音频数据中的两个或更多个信道,并消除与平均人类听者无关的频道之间的延迟。 一个或多个希尔伯特变换系统在音频数据的一个或多个相关通道上执行希尔伯特变换。 一个或多个夏季接收音频数据的相关信道和至少一个音频数据的希尔伯特变换的相关信道中的至少一个,并生成M个音频数据之一。
-
-
-