Apparatus for encoding and decoding of integrated speech and audio
    2.
    发明授权
    Apparatus for encoding and decoding of integrated speech and audio 有权
    用于编码和解码集成语音和音频的装置

    公开(公告)号:US08959015B2

    公开(公告)日:2015-02-17

    申请号:US13054377

    申请日:2009-07-14

    IPC分类号: G10L19/20 G10L19/02 G10L19/12

    摘要: Provided is an apparatus for integrally encoding and decoding a speech signal and an audio signal. An encoding apparatus for integrally encoding a speech signal and an audio signal, may include: a module selection unit to analyze a characteristic of an input signal and to select a first encoding module for encoding a first frame of the input signal; a speech encoding unit to encode the input signal according to a selection of the module selection unit and to generate a speech bitstream; an audio encoding unit to encode the input signal according to the selection of the module selection unit and to generate an audio bitstream; and a bitstream generation unit to generate an output bitstream from the speech encoding unit or the audio encoding unit according to the selection of the module selection unit.

    摘要翻译: 提供了一种用于对语音信号和音频信号进行整体编码和解码的装置。 用于对语音信号和音频信号进行整体编码的编码装置可以包括:模块选择单元,用于分析输入信号的特性,并选择用于编码输入信号的第一帧的第一编码模块; 语音编码单元,用于根据模块选择单元的选择对输入信号进行编码并生成语音比特流; 音频编码单元,用于根据模块选择单元的选择对输入信号进行编码并生成音频位流; 以及比特流生成单元,用于根据模块选择单元的选择从语音编码单元或音频编码单元生成输出比特流。

    APPARATUS FOR ENCODING AND DECODING OF INTEGRATED SPEECH AND AUDIO
    3.
    发明申请
    APPARATUS FOR ENCODING AND DECODING OF INTEGRATED SPEECH AND AUDIO 有权
    编码和解码集成语音和音频的设备

    公开(公告)号:US20110119054A1

    公开(公告)日:2011-05-19

    申请号:US13054377

    申请日:2009-07-14

    IPC分类号: G10L19/12 G10L19/00 G10L19/02

    摘要: Provided is an apparatus for integrally encoding and decoding a speech signal and an audio signal. An encoding apparatus for integrally encoding a speech signal and an audio signal, may include: a module selection unit to analyze a characteristic of an input signal and to select a first encoding module for encoding a first frame of the input signal; a speech encoding unit to encode the input signal according to a selection of the module selection unit and to generate a speech bitstream; an audio encoding unit to encode the input signal according to the selection of the module selection unit and to generate an audio bitstream; and a bitstream generation unit to generate an output bitstream from the speech encoding unit or the audio encoding unit according to the selection of the module selection unit.

    摘要翻译: 提供了一种用于对语音信号和音频信号进行整体编码和解码的装置。 用于对语音信号和音频信号进行整体编码的编码装置可以包括:模块选择单元,用于分析输入信号的特性,并选择用于编码输入信号的第一帧的第一编码模块; 语音编码单元,用于根据模块选择单元的选择对输入信号进行编码并生成语音比特流; 音频编码单元,用于根据模块选择单元的选择对输入信号进行编码并生成音频位流; 以及比特流生成单元,用于根据模块选择单元的选择从语音编码单元或音频编码单元生成输出比特流。

    AUDIO ENHANCEMENT THROUGH SUPERVISED LATENT VARIABLE REPRESENTATION OF TARGET SPEECH AND NOISE

    公开(公告)号:US20200349965A1

    公开(公告)日:2020-11-05

    申请号:US16865111

    申请日:2020-05-01

    摘要: Systems and methods for generating an enhanced audio signal comprise a trained neural network configured to receive an input audio signal and generate an enhanced target signal, the trained neural network comprising a pre-processing neural network configured to receive a segment of the input audio signal and output an audio classification, the pre-processing neural network including at least one hidden layer comprising an embedding vector, and a noise reduction neural network configured to receive the segment of the input audio signal, and the embedding vector and generate the enhanced target signal. The pre-processing neural network may comprise a target signal pre-processing neural network configured to output a target signal classification and comprising at least one hidden layer comprising a target embedding vector. The pre-processing neural network may comprise a noise pre-processing neural network configured output a noise classification and comprising at least one hidden layer comprising a noise embedding vector.