DECOMPOSING AUDIO SIGNALS
    1.
    发明申请

    公开(公告)号:US20200265849A1

    公开(公告)日:2020-08-20

    申请号:US16869477

    申请日:2020-05-07

    发明人: Jun WANG Lie LU

    摘要: Example embodiments disclosed herein relate to signal processing. A method for decomposing a plurality of audio signals from at least two different channels is disclosed. The method comprises obtaining a set of components that are weakly correlated, the set of components generated based on the plurality of audio signals. The method comprises extracting a feature from the set of components, and determining a set of gains associated with the set of components at least in part based on the extracted feature, each of the gains indicating a proportion of a diffuse part in the associated component. The method further comprises decomposing the plurality of audio signals by applying the set of gains to the set of components. Corresponding system and computer program product are also disclosed.

    UPMIXING OF AUDIO SIGNALS
    2.
    发明申请

    公开(公告)号:US20190052991A9

    公开(公告)日:2019-02-14

    申请号:US15538892

    申请日:2016-02-09

    IPC分类号: H04S5/00 H04R1/32 H04S7/00

    摘要: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.

    VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD
    3.
    发明申请
    VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD 有权
    体积调节器和控制方法

    公开(公告)号:US20170026017A1

    公开(公告)日:2017-01-26

    申请号:US15284953

    申请日:2016-10-04

    IPC分类号: H03G7/00 G10L25/30 G10L25/51

    摘要: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.

    摘要翻译: 公开了卷积矫直机控制器和控制方法。 在一个实施例中,音量调平器控制器包括用于实时地识别音频信号的内容类型的音频内容分类器; 以及调整单元,用于基于所识别的内容类型以连续的方式调整音量调节器。 调整单元可以被配置为使音量调平器的动态增益与音频信号的信息内容类型正相关,并且将音量调平器的动态增益与音频信号的干扰内容类型负相关。

    AUDIO OBJECT EXTRACTION
    4.
    发明申请
    AUDIO OBJECT EXTRACTION 有权
    音频对象提取

    公开(公告)号:US20160267914A1

    公开(公告)日:2016-09-15

    申请号:US15031887

    申请日:2014-11-25

    摘要: Embodiments of the present invention relate to audio object extraction. A method for audio object extraction from audio content of a format based on a plurality of channels is disclosed. The method comprises applying audio object extraction on individual frames of the audio content at least partially based on frequency spectral similarities among the plurality of channels. The method further comprises performing audio object composition across the frames of the audio content, based on the audio object extraction on the individual frames, to generate a track of at least one audio object. Corresponding system and computer program product are also disclosed.

    摘要翻译: 本发明的实施例涉及音频对象提取。 公开了一种基于多个频道的格式的音频内容提取音频对象的方法。 该方法包括至少部分地基于多个频道之间的频谱相似度来对音频内容的各个帧应用音频对象提取。 该方法还包括基于在各个帧上的​​音频对象提取来执行跨音频内容的帧的音频对象组合,以产生至少一个音频对象的轨道。 还公开了相应的系统和计算机程序产品。

    VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD
    5.
    发明申请
    VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD 有权
    体积调节器和控制方法

    公开(公告)号:US20160049915A1

    公开(公告)日:2016-02-18

    申请号:US14777271

    申请日:2014-03-17

    IPC分类号: H03G3/32 H03G5/16

    摘要: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.

    摘要翻译: 公开了卷积矫直机控制器和控制方法。 在一个实施例中,音量调平器控制器包括用于实时地识别音频信号的内容类型的音频内容分类器; 以及调整单元,用于基于所识别的内容类型以连续的方式调整音量调节器。 调整单元可以被配置为使音量调平器的动态增益与音频信号的信息内容类型正相关,并且将音量调平器的动态增益与音频信号的干扰内容类型负相关。

    AUDIO SOURCE PARAMETERIZATION
    6.
    发明申请

    公开(公告)号:US20200327897A1

    公开(公告)日:2020-10-15

    申请号:US16090739

    申请日:2017-04-05

    发明人: Jun WANG

    摘要: The present document describes a method (600) for estimating source parameters of audio sources (101) from mix audio signals (102), with. The mix audio signals (102) comprise a plurality of frames. The mix audio signals (102) are representable as a mix audio matrix in a frequency domain and the audio sources (101) are representable as a source matrix in the frequency domain. The method (600) comprises updating (601) an un-mixing matrix (221) which is configured to provide an estimate of the source matrix from the mix audio matrix, based on a mixing matrix (225) which is configured to provide an estimate of the mix audio matrix from the source matrix. Furthermore, the method (600) comprises updating (602) the mixing matrix (225) based on the un-mixing matrix (221) and based on the mix audio signals (102). In addition, the method (600) comprises iterating (603) the updating steps (601, 602) until an overall convergence criteria is met.

    UPMIXING OF AUDIO SIGNALS
    7.
    发明申请

    公开(公告)号:US20180262856A1

    公开(公告)日:2018-09-13

    申请号:US15538892

    申请日:2016-02-09

    IPC分类号: H04S5/00 H04R1/32 H04S7/00

    摘要: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.

    SEPARATING AUDIO SOURCES
    8.
    发明申请

    公开(公告)号:US20180240470A1

    公开(公告)日:2018-08-23

    申请号:US15549651

    申请日:2016-02-11

    发明人: Jun WANG

    摘要: Example embodiments disclosed herein relate to source separation in audio content. A method for separating sources from audio content is disclosed, the audio content being of a multi-channel format based on a plurality of channels. The method comprises performing a component analysis on the audio content for each of the plurality of channels to generate a plurality of components, each of the plurality of components comprising a plurality of time-frequency tiles in full frequency band; generating at least one dominant source with at least one of the time-frequency tiles from the plurality of the components and separating the sources from the audio content by estimating spatial parameters and spectral parameters based on the dominant source. Corresponding system and computer program product are also disclosed.

    AUDIO SOURCE SEPARATION
    9.
    发明申请

    公开(公告)号:US20170365273A1

    公开(公告)日:2017-12-21

    申请号:US15543938

    申请日:2016-02-12

    IPC分类号: G10L21/0272 G10L25/21

    CPC分类号: G10L21/0272 G10L25/21

    摘要: A method of audio source separation from audio content is disclosed. The method includes determining a spatial parameter of an audio source based on a linear combination characteristic of the audio source and an orthogonality characteristic of two or more audio sources to be separated in the audio content. The method also includes separating the audio source from the audio content based on the spatial parameter. Corresponding system and computer program product are also disclosed.