-
公开(公告)号:US20130182852A1
公开(公告)日:2013-07-18
申请号:US13612543
申请日:2012-09-12
申请人: Jeff Thompson , Brandon Smith , Aaron Warner , Zoran Fejzo , Jean-Mar Jot
发明人: Jeff Thompson , Brandon Smith , Aaron Warner , Zoran Fejzo , Jean-Mar Jot
IPC分类号: H04R5/04
CPC分类号: H04R5/04 , G10L19/008 , G10L21/0308 , G10L25/06 , H04S3/00
摘要: There is disclosed methods and apparatus for decomposing a signal having a plurality of channels into direct and diffuse components. The correlation coefficient between each pair of signals from the plurality of signals may be estimated. A linear system of equations relating the estimated correlation coefficients and direct energy fractions of each of the plurality of channels may be constructed. The linear system may be solved to estimate the direct energy fractions. A direct component output signal and a diffuse component output signal may be generated based in part on the direct energy fractions.
摘要翻译: 公开了用于将具有多个通道的信号分解成直接和漫射部件的方法和装置。 可以估计来自多个信号的每对信号之间的相关系数。 可以构建将估计的相关系数和多个通道中的每一个的直接能量分数相关联的方程式的线性系统。 可以解决线性系统以估计直接能量分数。 可以部分地基于直接能量分数来生成直接分量输出信号和漫反射分量输出信号。
-
公开(公告)号:US09253574B2
公开(公告)日:2016-02-02
申请号:US13612543
申请日:2012-09-12
申请人: Jeff Thompson , Brandon Smith , Aaron Warner , Zoran Fejzo , Jean-Marc Jot
发明人: Jeff Thompson , Brandon Smith , Aaron Warner , Zoran Fejzo , Jean-Marc Jot
IPC分类号: H04R5/04 , G10L19/008 , H04S3/00 , G10L25/06 , G10L21/0308
CPC分类号: H04R5/04 , G10L19/008 , G10L21/0308 , G10L25/06 , H04S3/00
摘要: There is disclosed methods and apparatus for decomposing a signal having a plurality of channels into direct and diffuse components. The correlation coefficient between each pair of signals from the plurality of signals may be estimated. A linear system of equations relating the estimated correlation coefficients and direct energy fractions of each of the plurality of channels may be constructed. The linear system may be solved to estimate the direct energy fractions. A direct component output signal and a diffuse component output signal may be generated based in part on the direct energy fractions.
摘要翻译: 公开了用于将具有多个通道的信号分解成直接和漫射部件的方法和装置。 可以估计来自多个信号的每对信号之间的相关系数。 可以构建将估计的相关系数和多个通道中的每一个的直接能量分数相关联的方程式的线性系统。 可以解决线性系统以估计直接能量分数。 可以部分地基于直接能量分数来生成直接分量输出信号和漫反射分量输出信号。
-
公开(公告)号:US20090157204A1
公开(公告)日:2009-06-18
申请号:US12001851
申请日:2007-12-13
申请人: Brandon Smith , Jeffrey K. Thompson , Aaron Warner
发明人: Brandon Smith , Jeffrey K. Thompson , Aaron Warner
IPC分类号: G06F17/00
CPC分类号: G11B20/00086 , G10L19/018 , G11B20/00181 , G11B20/00891 , G11B2020/00028 , G11B2020/00057
摘要: A system for identifying audio data is provided. The system includes a transform system receiving left channel audio data and right channel audio data and generating a plurality of frequency bins of left channel magnitude data, left channel phase data, right channel magnitude data and right channel phase data. A watermarking system receives watermarking data and modifies predetermined frequency bins of the left channel phase data and the right channel phase data to encode the watermarking data. A magnitude system receives the left channel magnitude data and the right channel magnitude data and increases the left channel magnitude data and the right channel magnitude data for one or more of the predetermined frequency bins to a threshold level if the left channel magnitude data and the right channel magnitude data for the corresponding frequency bin is less than the threshold level.
摘要翻译: 提供用于识别音频数据的系统。 该系统包括接收左声道音频数据和右声道音频数据并产生左声道幅度数据,左声道相位数据,右声道幅度数据和右声道相位数据的多个频率仓的变换系统。 水印系统接收水印数据并修改左声道相位数据和右声道相位数据的预定频率仓以对水印数据进行编码。 幅度系统接收左声道幅度数据和右声道幅度数据,并将左声道幅度数据和左声道幅度数据和右声道幅度数据和右声道幅度数据和右声道幅度数据和右声道幅度数据以及用于一个或多个预定频率仓的右声道幅度数据增加到阈值电平 相应频率仓的通道幅度数据小于阈值电平。
-
公开(公告)号:US08099285B2
公开(公告)日:2012-01-17
申请号:US12001851
申请日:2007-12-13
申请人: Brandon Smith , Jeffrey Thompson , Aaron Warner
发明人: Brandon Smith , Jeffrey Thompson , Aaron Warner
CPC分类号: G11B20/00086 , G10L19/018 , G11B20/00181 , G11B20/00891 , G11B2020/00028 , G11B2020/00057
摘要: A system for identifying audio data is provided. The system includes a transform system receiving left channel audio data and right channel audio data and generating a plurality of frequency bins of left channel magnitude data, left channel phase data, right channel magnitude data and right channel phase data. A watermarking system receives watermarking data and modifies predetermined frequency bins of the left channel phase data and the right channel phase data to encode the watermarking data. A magnitude system receives the left channel magnitude data and the right channel magnitude data and increases the left channel magnitude data and the right channel magnitude data for one or more of the predetermined frequency bins to a threshold level if the left channel magnitude data and the right channel magnitude data for the corresponding frequency bin is less than the threshold level.
摘要翻译: 提供用于识别音频数据的系统。 该系统包括接收左声道音频数据和右声道音频数据并产生左声道幅度数据,左声道相位数据,右声道幅度数据和右声道相位数据的多个频率仓的变换系统。 水印系统接收水印数据并修改左声道相位数据和右声道相位数据的预定频率仓以对水印数据进行编码。 幅度系统接收左声道幅度数据和右声道幅度数据,并且如果左声道幅度数据和右声道幅度数据,则将左声道幅度数据和用于一个或多个预定频率仓的右声道幅度数据增加到阈值电平 相应频率仓的通道幅度数据小于阈值电平。
-
公开(公告)号:US08385556B1
公开(公告)日:2013-02-26
申请号:US12192404
申请日:2008-08-15
申请人: Aaron Warner , Jeffrey Thompson , Robert Reams
发明人: Aaron Warner , Jeffrey Thompson , Robert Reams
CPC分类号: G10L19/008 , G10L19/173
摘要: A system for generating parametric stereo data from phase modulated stereo data is provided. A phase difference system receives left channel data and right channel data and determines a phase difference between the left channel data and the right channel data. A phase difference weighting system receives the phase difference data and generates weighting data to adjust left channel amplitude data and right channel amplitude data based on the phase difference data. A magnitude modification system adjusts the left channel amplitude data and the right channel amplitude data using the weighting data to eliminate phase data in the left channel data and the right channel data.
摘要翻译: 提供了一种从相位调制立体数据产生参数立体声数据的系统。 相位差系统接收左声道数据和右声道数据,并确定左声道数据和右声道数据之间的相位差。 相位差加权系统接收相位差数据,并产生加权数据,以根据相位差数据调整左声道幅度数据和右声道幅度数据。 幅度修正系统使用加权数据来调整左声道幅度数据和右声道幅度数据,以消除左声道数据和右声道数据中的相位数据。
-
公开(公告)号:US20070297519A1
公开(公告)日:2007-12-27
申请号:US11666512
申请日:2005-10-28
申请人: Jeffrey Thompson , Robert Reams , Aaron Warner
发明人: Jeffrey Thompson , Robert Reams , Aaron Warner
IPC分类号: G10L19/02
CPC分类号: G10L19/008
摘要: An audio spatial environment engine is provided for converting between different formats of audio data. The audio spatial environment engine (100) allows for flexible conversion between N-channel data and M-channel data and conversion from M-channel data back to N′-channel data, where N, M, and N′ are integers and where N is not necessarily equal to N′. For example, such systems could be used for the transmission or storage of surround sound data across a network or infrastructure designed for stereo sound data. The audio spatial environment engine provides improved and flexible conversions between different spatial environments due to an advanced dynamic down-mixing unit (102) and a high-resolution frequency band up-mixing unit (104). The dynamic down-mixing unit includes an intelligent: analysis and correction loop (108, 110) capable of correcting for spectral, temporal, and spatial inaccuracies common to many down-mixing methods. The up-mixing unit utilizes the extraction and analysis of important inter-channel spatial cues across high-resolution frequency bands to derive the spatial placement of different frequency elements. The down-mixing and up-mixing units, when used individually or as a system, provide improved sound quality and spatial distinction.
摘要翻译: 提供音频空间环境引擎,用于在不同格式的音频数据之间进行转换。 音频空间环境引擎(100)允许在N信道数据和M信道数据之间进行灵活转换,并且从M信道数据转换回N信道数据,其中N,M和N'是整数,并且其中N 不一定等于N'。 例如,这样的系统可以用于通过设计用于立体声声音数据的网络或基础设施的环绕声数据的传输或存储。 音频空间环境引擎由于先进的动态下混单元(102)和高分辨率频带向上混合单元(104)而在不同的空间环境之间提供改进和灵活的转换。 动态下混单元包括智能的分析和校正回路(108,110),其能够校正许多下混方法共同的频谱,时间和空间不准确。 上混单元利用跨高分辨率频带的重要信道间空间提示的提取和分析,得出不同频率元素的空间位置。 下混和上混单元当单独使用或作为系统使用时,可提供改善的声音质量和空间差异。
-
公开(公告)号:US20060093164A1
公开(公告)日:2006-05-04
申请号:US11262190
申请日:2005-10-28
申请人: Robert Reams , Jeffrey Thompson , Aaron Warner
发明人: Robert Reams , Jeffrey Thompson , Aaron Warner
CPC分类号: H04S3/02 , H04S2400/01 , H04S2420/07
摘要: An audio spatial environment engine is provided for converting between different formats of audio data. The audio spatial environment engine allows for flexible conversion between N-channel data and M-channel data and conversion from M-channel data back to N′-channel data, where N, M, and N′ are integers and where N is not necessarily equal to N′. For example, such systems could be used for the transmission or storage of surround sound data across a network or infrastructure designed for stereo sound data. The audio spatial environment engine provides improved and flexible conversions between different spatial environments due to an advanced dynamic down-mixing unit and a high-resolution frequency band up-mixing unit. The dynamic down-mixing unit includes an intelligent analysis and correction loop capable of correcting for spectral, temporal, and spatial inaccuracies common to many down-mixing methods. The up-mixing unit utilizes the extraction and analysis of important inter-channel spatial cues across high-resolution frequency bands to derive the spatial placement of different frequency elements. The down-mixing and up-mixing units, when used individually or as a system, provide improved sound quality and spatial distinction.
摘要翻译: 提供音频空间环境引擎,用于在不同格式的音频数据之间进行转换。 音频空间环境引擎允许在N信道数据和M信道数据之间进行灵活转换,并且从M信道数据转换回N信道数据,其中N,M和N'是整数,并且其中N不一定 等于N'。 例如,这样的系统可以用于通过设计用于立体声声音数据的网络或基础设施的环绕声数据的传输或存储。 音频空间环境引擎由于采用先进的动态下混单元和高分辨率频段向上混合单元,在不同的空间环境之间提供了改进和灵活的转换。 动态下混单元包括智能分析和校正回路,能够校正许多下混方法所共有的光谱,时间和空间不准确。 上混单元利用跨高分辨率频带的重要信道间空间提示的提取和分析,得出不同频率元素的空间位置。 下混和上混单元当单独使用或作为系统使用时,可提供改善的声音质量和空间差异。
-
公开(公告)号:US20060093152A1
公开(公告)日:2006-05-04
申请号:US11262029
申请日:2005-10-28
申请人: Jeffrey Thompson , Robert Reams , Aaron Warner
发明人: Jeffrey Thompson , Robert Reams , Aaron Warner
IPC分类号: H04R5/00
CPC分类号: H04S3/006
摘要: An audio spatial environment engine for flexible and scalable up-mixing from an M channel audio system to an N channel audio system, where M and N are integers and N is greater than M, is provided. The input M channel audio is provided to an analysis filter bank which converts the time domain signals into frequency domain signals. Relevant inter-channel spatial cues are extracted from the frequency domain signals on a sub-band basis and are used as parameters to generate adaptive N channel filters which control the spatial placement of a frequency band element in the up-mixed sound field. The N channel filters are smoothed across both time and frequency to limit filter variability which could cause annoying fluctuation effects. The smoothed N channel filters are then applied to adaptive combinations of the frequency domain input signals and are provided to a synthesis filter bank which generates the N channel time domain output signals.
摘要翻译: 提供了一种音频空间环境引擎,用于从M通道音频系统到N通道音频系统的灵活和可扩展的上混合,其中M和N是整数,N大于M。 输入M声道音频被提供给分析滤波器组,其将时域信号转换成频域信号。 相关的信道间空间提示是以子带为基础从频域信号中提取的,并被用作生成自适应N信道滤波器的参数,该自适应N信道滤波器控制上混频声场中的频带元素的空间位置。 N通道滤波器在时间和频率上均匀平滑,以限制滤波器变化,这可能导致烦人的波动效应。 然后将经平滑的N沟道滤波器应用于频域输入信号的自适应组合,并被提供给产生N个信道时域输出信号的合成滤波器组。
-
公开(公告)号:US07929708B2
公开(公告)日:2011-04-19
申请号:US10975841
申请日:2004-10-28
申请人: Robert Reams , Jeffrey K. Thompson , Aaron Warner
发明人: Robert Reams , Jeffrey K. Thompson , Aaron Warner
CPC分类号: H04S3/00 , H04S2400/01
摘要: An audio spatial environment engine for converting from an N channel audio system to an M channel audio system, where N is an integer greater than M, is provided. The audio spatial environment engine includes one or more correlators receiving two of the N channels of audio data and eliminating delays between the channels that are irrelevant to an average human listener. One or more Hilbert transform systems each perform a Hilbert transform on one or more of the correlated channels of audio data. One or more summers receive at least one of the correlated channels of audio data and at least one of the Hilbert transformed correlated channels of audio data and generate one of the M channels of audio data.
摘要翻译: 提供了一种用于从N声道音频系统转换成M通道音频系统的音频空间环境引擎,其中N是大于M的整数。 音频空间环境引擎包括一个或多个相关器,其接收N个音频数据中的两个,并消除与平均人类听者无关的频道之间的延迟。 一个或多个希尔伯特变换系统在音频数据的一个或多个相关通道上执行希尔伯特变换。 一个或多个夏季接收音频数据的相关信道和至少一个音频数据的希尔伯特变换的相关信道中的至少一个,并生成M个音频数据之一。
-
公开(公告)号:US20090060204A1
公开(公告)日:2009-03-05
申请号:US12245398
申请日:2008-10-03
申请人: Robert Reams , Jeffrey K. Thompson , Aaron Warner
发明人: Robert Reams , Jeffrey K. Thompson , Aaron Warner
CPC分类号: H04S3/00 , H04S2400/01
摘要: An audio spatial environment engine for converting from an N channel audio system to an M channel audio system, where N is an integer greater than M, is provided. The audio spatial environment engine includes one or more correlators receiving two or more of the N channels of audio data and eliminating delays between the channels that are irrelevant to an average human listener. One or more Hilbert transform systems each perform a Hilbert transform on one or more of the correlated channels of audio data. One or more summers receive at least one of the correlated channels of audio data and at least one of the Hilbert transformed correlated channels of audio data and generate one of the M channels of audio data.
摘要翻译: 提供了一种用于从N声道音频系统转换成M通道音频系统的音频空间环境引擎,其中N是大于M的整数。 音频空间环境引擎包括一个或多个相关器,其接收N个音频数据中的两个或更多个信道,并消除与平均人类听者无关的频道之间的延迟。 一个或多个希尔伯特变换系统在音频数据的一个或多个相关通道上执行希尔伯特变换。 一个或多个夏季接收音频数据的相关信道和至少一个音频数据的希尔伯特变换的相关信道中的至少一个,并生成M个音频数据之一。
-
-
-
-
-
-
-
-
-