Binaural Dialogue Enhancement
    1.
    Invention application

    Publication No.: US20200329326A1

    Publication date: 2020-10-15

    Application No.: US16915670

    Filing date: 2020-06-29

    Abstract: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.
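
The signal flow in this abstract can be illustrated with a minimal sketch. It is not taken from the patent text: it assumes two-channel presentations held as plain lists of samples, a single 2x2 parameter matrix standing in for the "dialogue estimation parameters", and a scalar enhancement gain. The names `estimate_dialogue`, `enhance`, `params`, and `gain` are hypothetical.

```python
def estimate_dialogue(first, params):
    """Apply a 2x2 parameter matrix (assumed form) to a two-channel signal."""
    left, right = first
    (w00, w01), (w10, w11) = params
    d_left = [w00 * l + w01 * r for l, r in zip(left, right)]
    d_right = [w10 * l + w11 * r for l, r in zip(left, right)]
    return d_left, d_right


def enhance(second, dialogue, gain):
    """Add the scaled dialogue estimate to the second presentation."""
    return tuple(
        [s + gain * d for s, d in zip(s_ch, d_ch)]
        for s_ch, d_ch in zip(second, dialogue)
    )


# Toy data: `first` might be a stereo presentation and `second` a binaural one.
first = ([1.0, 2.0], [3.0, 4.0])
second = ([0.0, 1.0], [1.0, 0.0])
params = ((0.5, 0.5), (0.5, 0.5))  # hypothetical estimation parameters

dialogue = estimate_dialogue(first, params)
enhanced = enhance(second, dialogue, gain=2.0)
```

The sketch uses one broadband matrix and real-valued samples only to keep the example short; the abstract does not say how the parameters are structured.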

    AUDIO DECODER AND DECODING METHOD

    Publication No.: US20220399027A1

    Publication date: 2022-12-15

    Application No.: US17887429

    Filing date: 2022-08-13

    Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
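
A multi-tap convolution matrix, as named in this abstract, can be read as a matrix whose entries are short filters rather than single gains. The following is a minimal sketch under that reading, for one frequency band only and with real-valued samples. The function name `apply_convolution_matrix` and the `taps[k][i][j]` layout are assumptions for illustration, not the patent's notation.

```python
def apply_convolution_matrix(base, taps):
    """Transform base signals with a multi-tap convolution matrix.

    base: list of input channels, each an equal-length list of samples.
    taps: list of matrices; taps[k][i][j] weights input channel j,
          delayed by k samples, in output channel i (assumed layout).
    """
    n_samples = len(base[0])
    n_out = len(taps[0])
    out = [[0.0] * n_samples for _ in range(n_out)]
    for k, matrix in enumerate(taps):
        for i, row in enumerate(matrix):
            for j, weight in enumerate(row):
                # Samples before index k have no delayed input available.
                for n in range(k, n_samples):
                    out[i][n] += weight * base[j][n - k]
    return out


# Two base signals, two taps: an identity tap plus a delayed cross-feed tap.
base = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
taps = [
    [[1.0, 0.0], [0.0, 1.0]],  # tap 0: pass each channel through
    [[0.0, 0.5], [0.5, 0.0]],  # tap 1: half of the other channel, one sample late
]
second = apply_convolution_matrix(base, taps)
```

With a single tap this reduces to an ordinary matrix multiply per sample. The abstract requires parameters for at least two frequency bands, with the multi-tap form in at least one of them, so a fuller version would run this per band.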

    Audio Decoder and Decoding Method
    4.
    Invention application

    Publication No.: US20180233156A1

    Publication date: 2018-08-16

    Application No.: US15752699

    Filing date: 2016-08-23

    Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.

    Binaural Dialogue Enhancement
    5.
    Invention application

    Publication No.: US20220060838A1

    Publication date: 2022-02-24

    Application No.: US17465733

    Filing date: 2021-09-02

    Abstract: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.

    Binaural Dialogue Enhancement
    6.
    Invention application

    Publication No.: US20190356997A1

    Publication date: 2019-11-21

    Application No.: US16532143

    Filing date: 2019-08-05

    Abstract: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.

    Bitstream Syntax for Spatial Voice Coding
    7.
    Invention application
    Legal status: In force

    Publication No.: US20160155447A1

    Publication date: 2016-06-02

    Application No.: US14392287

    Filing date: 2014-06-26

    Abstract: An encoding system (100) encodes a first (E1) and further (E2, E3) audio signals as a layered bitstream (B), wherein a quantizer for each frequency band of each signal is selected using a rate allocation rule based on signal-specific rate allocation data, a spectral envelope of the signal and a reference level (EnvE1Max), which is determined based on the spectral envelope of the first signal and is not necessarily included in the bitstream. Further disclosed is a decoding system for reconstructing the audio signals based on the bitstream. In embodiments, the bitstream has a basic layer (BE1), which contains data that enable decoding of the first audio signal, and a spatial layer (Bspatial) facilitating decoding of the further audio signal(s). In embodiments, the encoding system prepares the bitstream subject to a basic-layer bitrate constraint and a total bitrate constraint.

    AUDIO CODING WITH GAIN PROFILE EXTRACTION AND TRANSMISSION FOR SPEECH ENHANCEMENT AT THE DECODER
    8.
    Invention application
    Legal status: In force

    Publication No.: US20150356978A1

    Publication date: 2015-12-10

    Application No.: US14427908

    Filing date: 2013-09-11

    Abstract: The invention provides a layered audio coding format with a monophonic layer and at least one sound field layer. A plurality of audio signals is decomposed, in accordance with decomposition parameters controlling the quantitative properties of an orthogonal energy-compacting transform, into rotated audio signals. Further, a time-variable gain profile specifying constructively how the rotated audio signals may be processed to attenuate undesired audio content is derived. The monophonic layer may comprise one of the rotated signals and the gain profile. The sound field layer may comprise the rotated signals and the decomposition parameters. In one embodiment, the gain profile comprises a cleaning gain profile with the main purpose of eliminating non-speech components and/or noise. The gain profile may also comprise mutually independent broadband gains. Because signals in the audio coding format can be mixed with a limited computational effort, the invention may advantageously be applied in a tele-conferencing application.