-
Publication No.: US20200265849A1
Publication Date: 2020-08-20
Application No.: US16869477
Filing Date: 2020-05-07
IPC Classification: G10L19/02, G10L19/008, G10L25/21
Abstract: Example embodiments disclosed herein relate to signal processing. A method for decomposing a plurality of audio signals from at least two different channels is disclosed. The method comprises obtaining a set of weakly correlated components generated based on the plurality of audio signals. The method comprises extracting a feature from the set of components, and determining a set of gains associated with the set of components at least in part based on the extracted feature, each of the gains indicating a proportion of a diffuse part in the associated component. The method further comprises decomposing the plurality of audio signals by applying the set of gains to the set of components. A corresponding system and computer program product are also disclosed.
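The last step of this abstract, applying per-component gains to obtain the decomposition, can be illustrated with a minimal Python sketch. It assumes the gains have already been determined and lie in [0, 1]; the function name `split_direct_diffuse` and the complementary `1 - gain` split for the direct part are illustrative assumptions, not details taken from the patent.

```python
def split_direct_diffuse(components, diffuse_gains):
    """Split each component into a direct part and a diffuse part.

    components:    list of components, each a list of float samples.
    diffuse_gains: one gain per component, read as the proportion of the
                   component that is diffuse (assumed to lie in [0, 1]).
    The complementary (1 - gain) weighting for the direct part is an
    assumption made for this sketch.
    """
    if len(components) != len(diffuse_gains):
        raise ValueError("need exactly one gain per component")
    direct, diffuse = [], []
    for samples, gain in zip(components, diffuse_gains):
        if not 0.0 <= gain <= 1.0:
            raise ValueError("gain must lie in [0, 1]")
        diffuse.append([gain * x for x in samples])
        direct.append([(1.0 - gain) * x for x in samples])
    return direct, diffuse


if __name__ == "__main__":
    direct, diffuse = split_direct_diffuse([[1.0, -2.0], [4.0, 0.0]], [0.25, 1.0])
    print(direct)
    print(diffuse)
```

A gain of 1.0 sends the whole component to the diffuse output, while a gain of 0.0 keeps it entirely in the direct output.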
-
Publication No.: US20190052991A9
Publication Date: 2019-02-14
Application No.: US15538892
Filing Date: 2016-02-09
Inventors: Jun WANG, Lie LU, Lianwu CHEN, Mingqing HU
Abstract: Example embodiments disclosed herein relate to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal; generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel; extracting an audio object from the direct signal; estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. A corresponding system and computer program product are described as well.
-
Publication No.: US20170026017A1
Publication Date: 2017-01-26
Application No.: US15284953
Filing Date: 2016-10-04
Inventors: Jun WANG, Lie LU, Alan SEEFELDT
CPC Classification: H03G7/002, G10L21/0364, G10L25/30, G10L25/51, H03G3/3089, H03G3/32, H03G5/165, H03G7/007, H04M7/006, H04M2203/305
Abstract: A volume leveler controller and controlling method are disclosed. In one embodiment, a volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time, and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may be configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.
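The positive and negative correlation described in this abstract can be sketched in Python as follows. The content-type names ("speech", "noise"), the function name `leveler_gain_scale`, and the linear mapping onto [0, 1] are all hypothetical choices for illustration; the abstract does not specify how the adjustment is computed.

```python
def leveler_gain_scale(confidences, informative, interfering):
    """Map classifier confidences to a scale factor for the leveler's dynamic gain.

    confidences: dict mapping a content-type name to a confidence in [0, 1].
    informative: names of content types that should raise the dynamic gain.
    interfering: names of content types that should lower the dynamic gain.
    The linear mapping and the clamp to [0, 1] are assumptions of this sketch.
    """
    boost = sum(confidences.get(name, 0.0) for name in informative)
    cut = sum(confidences.get(name, 0.0) for name in interfering)
    scale = 0.5 + 0.5 * (boost - cut)
    return max(0.0, min(1.0, scale))


if __name__ == "__main__":
    # Hypothetical content types: mostly speech, then mostly noise.
    print(leveler_gain_scale({"speech": 0.9, "noise": 0.1}, ["speech"], ["noise"]))
    print(leveler_gain_scale({"speech": 0.1, "noise": 0.9}, ["speech"], ["noise"]))
```

Because the scale varies smoothly with the confidences rather than switching on a hard decision, the leveler is adjusted in a continuous manner, which is the behavior the abstract calls for.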
-
Publication No.: US20160267914A1
Publication Date: 2016-09-15
Application No.: US15031887
Filing Date: 2014-11-25
Inventors: Mingqing HU, Lie LU, Jun WANG
IPC Classification: G10L19/02, G10L19/038, H04S3/00, G10L19/008
CPC Classification: G10L19/02, G10L19/008, G10L19/038, H04S3/008, H04S2400/11
Abstract: Embodiments of the present invention relate to audio object extraction. A method for audio object extraction from audio content of a format based on a plurality of channels is disclosed. The method comprises applying audio object extraction on individual frames of the audio content at least partially based on frequency spectral similarities among the plurality of channels. The method further comprises performing audio object composition across the frames of the audio content, based on the audio object extraction on the individual frames, to generate a track of at least one audio object. A corresponding system and computer program product are also disclosed.
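One plausible way to quantify the "frequency spectral similarities among the plurality of channels" mentioned in this abstract is a cosine similarity between the magnitude spectra of two channels for a single frame. The Python sketch below uses that measure purely as an assumption; the abstract does not say which similarity measure the method uses, and the name `spectral_similarity` is hypothetical.

```python
import math


def spectral_similarity(mag_a, mag_b):
    """Cosine similarity between two magnitude spectra of equal length.

    Returns a value in [0, 1] for non-negative magnitudes; 0.0 is returned
    when either spectrum is all zeros (a convention chosen for this sketch).
    """
    if len(mag_a) != len(mag_b):
        raise ValueError("spectra must have the same number of bins")
    dot = sum(a * b for a, b in zip(mag_a, mag_b))
    norm_a = math.sqrt(sum(a * a for a in mag_a))
    norm_b = math.sqrt(sum(b * b for b in mag_b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Two channels carrying the same spectrum at different levels.
    print(spectral_similarity([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))
    # Two channels with energy in disjoint bins.
    print(spectral_similarity([1.0, 0.0], [0.0, 1.0]))
```

Under this assumption, channels that carry the same object at different levels score close to 1, while channels with energy in disjoint frequency bins score 0.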
-
Publication No.: US20160049915A1
Publication Date: 2016-02-18
Application No.: US14777271
Filing Date: 2014-03-17
Inventors: Jun WANG, Lie LU, Alan SEEFELDT
CPC Classification: H03G7/002, G10L21/0364, G10L25/30, G10L25/51, H03G3/3089, H03G3/32, H03G5/165, H03G7/007, H04M7/006, H04M2203/305
Abstract: A volume leveler controller and controlling method are disclosed. In one embodiment, a volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time, and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may be configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.
-
Publication No.: US20200327897A1
Publication Date: 2020-10-15
Application No.: US16090739
Filing Date: 2017-04-05
Inventors: Jun WANG
IPC Classification: G10L21/0308, H04S3/00, G10L19/08
Abstract: The present document describes a method (600) for estimating source parameters of audio sources (101) from mix audio signals (102). The mix audio signals (102) comprise a plurality of frames. The mix audio signals (102) are representable as a mix audio matrix in a frequency domain and the audio sources (101) are representable as a source matrix in the frequency domain. The method (600) comprises updating (601) an un-mixing matrix (221), which is configured to provide an estimate of the source matrix from the mix audio matrix, based on a mixing matrix (225), which is configured to provide an estimate of the mix audio matrix from the source matrix. Furthermore, the method (600) comprises updating (602) the mixing matrix (225) based on the un-mixing matrix (221) and based on the mix audio signals (102). In addition, the method (600) comprises iterating (603) the updating steps (601, 602) until an overall convergence criterion is met.
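The alternating structure in this abstract (update the un-mixing matrix from the mixing matrix, update the mixing matrix from the un-mixing matrix and the mix signals, repeat until convergence) can be illustrated with a deliberately simplified Python sketch. It assumes a single real-valued source, plain least-squares updates, and a convergence test on the change in the mixing vector; none of these choices comes from the patent, whose actual update rules are not given in the abstract. The name `estimate_rank1_mixing` is hypothetical.

```python
def estimate_rank1_mixing(mix, max_iters=200, tol=1e-9):
    """Alternating least-squares estimate of a one-source mixing vector.

    mix: list of channels, each a list of real-valued frame samples.
    Returns (mixing, unmixing) so that mix[c][t] ~= mixing[c] * s[t],
    with s[t] = sum(unmixing[c] * mix[c][t] for c).
    """
    n_ch, n_fr = len(mix), len(mix[0])
    mixing = [1.0] * n_ch  # arbitrary starting point (assumption)
    for _ in range(max_iters):
        # Step 1: un-mixing vector from the mixing vector (pseudo-inverse).
        norm_sq = sum(a * a for a in mixing)
        unmixing = [a / norm_sq for a in mixing]
        # Source estimate implied by the current un-mixing vector.
        source = [sum(unmixing[c] * mix[c][t] for c in range(n_ch))
                  for t in range(n_fr)]
        energy = sum(s * s for s in source)
        if energy == 0.0:
            break  # nothing to fit, e.g. an all-zero mix
        # Step 2: mixing vector from the un-mixing result and the mix signals.
        updated = [sum(mix[c][t] * source[t] for t in range(n_fr)) / energy
                   for c in range(n_ch)]
        change = max(abs(u - a) for u, a in zip(updated, mixing))
        mixing = updated
        # Step 3: stop once the mixing vector no longer changes.
        if change < tol:
            break
    norm_sq = sum(a * a for a in mixing)
    return mixing, [a / norm_sq for a in mixing]


if __name__ == "__main__":
    # Second channel is exactly twice the first, so the mixing ratio is 2.
    mixing, unmixing = estimate_rank1_mixing([[1.0, 2.0, 3.0], [2.0, 4.0, 6.0]])
    print(mixing[1] / mixing[0])
```

In this toy case the second channel is exactly twice the first, so the loop settles on a mixing vector whose entries have a ratio of 2 after a couple of iterations.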
-
Publication No.: US20180262856A1
Publication Date: 2018-09-13
Application No.: US15538892
Filing Date: 2016-02-09
Inventors: Jun WANG, Lie LU, Lianwu CHEN, Mingqing HU
Abstract: Example embodiments disclosed herein relate to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal; generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel; extracting an audio object from the direct signal; estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. A corresponding system and computer program product are described as well.
-
Publication No.: US20180240470A1
Publication Date: 2018-08-23
Application No.: US15549651
Filing Date: 2016-02-11
Inventors: Jun WANG
IPC Classification: G10L21/028, G06F3/0484, G06F3/16
CPC Classification: G10L21/028, G06F3/0484, G06F3/16, G06F3/165, G06F3/167, G10H2210/305, G10L19/008, G10L21/0232, G10L2021/02166
Abstract: Example embodiments disclosed herein relate to source separation in audio content. A method for separating sources from audio content is disclosed, the audio content being of a multi-channel format based on a plurality of channels. The method comprises performing a component analysis on the audio content for each of the plurality of channels to generate a plurality of components, each of the plurality of components comprising a plurality of time-frequency tiles in full frequency band; generating at least one dominant source with at least one of the time-frequency tiles from the plurality of the components; and separating the sources from the audio content by estimating spatial parameters and spectral parameters based on the dominant source. A corresponding system and computer program product are also disclosed.
-
Publication No.: US20170365273A1
Publication Date: 2017-12-21
Application No.: US15543938
Filing Date: 2016-02-12
Inventors: Jun WANG, David S. MC GRATH
IPC Classification: G10L21/0272, G10L25/21
CPC Classification: G10L21/0272, G10L25/21
Abstract: A method of audio source separation from audio content is disclosed. The method includes determining a spatial parameter of an audio source based on a linear combination characteristic of the audio source and an orthogonality characteristic of two or more audio sources to be separated in the audio content. The method also includes separating the audio source from the audio content based on the spatial parameter. A corresponding system and computer program product are also disclosed.
-
Publication No.: US20170230024A1
Publication Date: 2017-08-10
Application No.: US15433486
Filing Date: 2017-02-15
Inventors: Lie LU, Jun WANG, Alan J. SEEFELDT, Mingqing HU
Abstract: An equalizer controller and controlling method are disclosed. In one embodiment, an equalizer controller includes an audio classifier for identifying the audio type of an audio signal in real time, and an adjusting unit for adjusting an equalizer in a continuous manner based on the confidence value of the audio type as identified.
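A continuous, confidence-driven equalizer adjustment of the kind this abstract describes might look like the following Python sketch. The one-pole smoothing of the confidence, the linear scaling of a preset's band gains, and the names `smooth_confidence` and `blend_eq_gains` are all assumptions made for illustration, not details from the patent.

```python
def smooth_confidence(confidences, alpha=0.9):
    """One-pole smoothing of a per-frame confidence sequence.

    alpha close to 1.0 gives slower, smoother transitions. The smoother
    starts from 0.0 (an assumption of this sketch).
    """
    smoothed, state = [], 0.0
    for value in confidences:
        state = alpha * state + (1.0 - alpha) * value
        smoothed.append(state)
    return smoothed


def blend_eq_gains(preset_gains_db, confidence):
    """Scale a preset's per-band gains (in dB) by a confidence in [0, 1].

    A confidence of 0.0 yields a flat response; 1.0 applies the full preset.
    """
    return [confidence * gain for gain in preset_gains_db]


if __name__ == "__main__":
    # Hypothetical two-band preset and a classifier that becomes confident.
    for confidence in smooth_confidence([1.0, 1.0, 1.0], alpha=0.5):
        print(blend_eq_gains([6.0, -3.0], confidence))
```

Smoothing the confidence before scaling the gains avoids abrupt jumps in the equalizer response when the classifier output changes from one frame to the next.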