DEVICE AND METHOD FOR AUDIO FRAME PROCESSING
    5.
    Type: invention publication
    Status: under examination (published)

    Publication No.: EP3309777A1

    Publication date: 2018-04-18

    Application No.: EP16306350.6

    Filing date: 2016-10-13

    Applicant: Thomson Licensing

    IPC classification: G10L15/02 G10L25/03 G06F17/30

    Abstract: A device (200) and method for calculating scattering features for audio signal recognition. An interface (240) receives an audio signal that is processed (S610) by a processor (210) to obtain an audio frame. The processor (210) calculates (S620) first order scattering features from at least one audio frame and then calculates (S630) an estimation of whether the first order scattering features comprise sufficient information for accurate audio signal recognition. The processor (210) calculates (S650) second order scattering features from the first order scattering features only if the first order scattering features do not comprise sufficient information for accurate audio signal recognition. Because second order features are calculated only when deemed necessary, the device needs less processing power, which can in turn reduce its power consumption.
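
The conditional flow in the abstract (S620, S630, S650) can be sketched as follows. This is a minimal illustration, not the patented implementation: the three `toy_*` helpers and the 0.5 threshold are hypothetical stand-ins, since the abstract does not specify the scattering transform or the sufficiency estimate.

```python
from typing import Callable, List, Sequence

Features = List[float]


def extract_features(
    frame: Sequence[float],
    first_order: Callable[[Sequence[float]], Features],
    second_order: Callable[[Features], Features],
    is_sufficient: Callable[[Features], bool],
) -> Features:
    """Compute first order features; add second order ones only if needed."""
    s1 = first_order(frame)            # S620: first order features
    if is_sufficient(s1):              # S630: sufficiency estimate
        return s1                      # second order stage is skipped
    return s1 + second_order(s1)       # S650: computed only when required


# Hypothetical stand-ins for illustration; these are NOT scattering transforms.
def toy_first_order(frame: Sequence[float]) -> Features:
    half = len(frame) // 2
    return [
        sum(abs(x) for x in frame[:half]) / half,
        sum(abs(x) for x in frame[half:]) / (len(frame) - half),
    ]


def toy_second_order(s1: Features) -> Features:
    return [abs(s1[0] - s1[1])]


def toy_is_sufficient(s1: Features, threshold: float = 0.5) -> bool:
    # Assumed criterion: treat strong first order features as informative enough.
    return max(s1) >= threshold
```

Passing the stages in as callables keeps the gating logic separate from the feature extractors, so the expensive second stage runs only on frames that fail the check.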

    DYNAMICALLY ADAPTED PITCH CORRECTION BASED ON AUDIO INPUT
    6.
    Type: invention publication
    Status: under examination (published)

    Publication No.: EP3288022A1

    Publication date: 2018-02-28

    Application No.: EP17195678.2

    Filing date: 2013-12-18

    Abstract: Systems and methods for adjusting pitch of an audio signal include detecting input notes in the audio signal, mapping the input notes to corresponding output notes, each output note having an associated upper note boundary and lower note boundary, and modifying at least one of the upper note boundary and the lower note boundary of at least one output note in response to previously received input notes. Pitch of the input notes may be shifted to match an associated pitch of corresponding output notes. Delay of the pitch shifting process may be dynamically adjusted based on detected stability of the input notes.
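
One way to picture note boundaries that change in response to earlier input is a simple hysteresis rule, sketched below. The abstract does not say how the boundaries are modified, so everything here is an assumption: pitches are expressed as fractional MIDI note numbers, each note defaults to a ±0.5 semitone range, and the hypothetical `widen` parameter enlarges the range of the most recently selected note. The dynamic delay adjustment is not covered.

```python
import math
from typing import Optional, Tuple


def hz_to_midi(freq_hz: float) -> float:
    """Convert a frequency in Hz to a fractional MIDI note number (A4 = 69)."""
    return 69.0 + 12.0 * math.log2(freq_hz / 440.0)


class NoteMapper:
    """Maps input pitches to output notes with history-dependent boundaries."""

    def __init__(self, widen: float = 0.2) -> None:
        self.widen = widen                      # assumed extra range, in semitones
        self.last_note: Optional[int] = None    # most recently selected output note

    def boundaries(self, note: int) -> Tuple[float, float]:
        """Return (lower, upper) boundary; wider for the last selected note."""
        extra = self.widen if note == self.last_note else 0.0
        return note - 0.5 - extra, note + 0.5 + extra

    def map_pitch(self, pitch: float) -> int:
        """Map a fractional MIDI pitch to an output note number."""
        if self.last_note is not None:
            lower, upper = self.boundaries(self.last_note)
            if lower <= pitch < upper:
                return self.last_note           # stay on the previous note
        self.last_note = math.floor(pitch + 0.5)  # otherwise pick the nearest note
        return self.last_note
```

With these assumed defaults, a singer drifting from 60.1 to 60.6 stays on note 60, whereas the same 60.6 with no history maps to 61.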

    AUDIO CODING METHOD AND APPARATUS
    7.
    Type: invention publication
    Status: under examination (published)

    Publication No.: EP3144933A4

    Publication date: 2017-03-22

    Application No.: EP15811228

    Filing date: 2015-06-23

    Inventor: WANG ZHE

    IPC classification: G10L19/02 G10L19/20 G10L19/22

    Abstract: An audio encoding method and an apparatus are provided. The method includes: determining the sparseness of the distribution, on the spectra, of the energy of N input audio frames (101), where the N audio frames include a current audio frame and N is a positive integer; and determining, according to that sparseness, whether to use a first encoding method or a second encoding method to encode the current audio frame (102), where the first encoding method is based on time-frequency transform and transform coefficient quantization and is not based on linear prediction, and the second encoding method is a linear-prediction-based encoding method. Because the sparseness of the spectral energy distribution of the audio frame is considered when the frame is encoded, the method can reduce encoding complexity while keeping encoding accuracy relatively high.
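
The selection step can be illustrated with a small sketch. The abstract does not define the sparseness measure, so the one used here is an assumption: the minimum number of spectral bins that hold a given proportion of a frame's energy, averaged over the N frames. The thresholds `max_bins` and `proportion`, and the direction of the decision (sparse spectrum selects the transform coder), are likewise illustrative choices rather than values from the source.

```python
import cmath
import math
from typing import List, Sequence


def energy_spectrum(frame: Sequence[float]) -> List[float]:
    """Energy per bin of a naive DFT (bins 0 .. n/2) of a real frame."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def min_bins_for_energy(spectrum: Sequence[float], proportion: float = 0.9) -> int:
    """Smallest number of bins whose energies sum to `proportion` of the total."""
    total = sum(spectrum)
    if total == 0.0:
        return 0
    acc = 0.0
    for count, energy in enumerate(sorted(spectrum, reverse=True), start=1):
        acc += energy
        if acc >= proportion * total:
            return count
    return len(spectrum)


def choose_encoder(frames: Sequence[Sequence[float]],
                   max_bins: float = 4, proportion: float = 0.9) -> str:
    """Pick an encoder label from the average sparseness of the N frames."""
    avg = sum(min_bins_for_energy(energy_spectrum(f), proportion)
              for f in frames) / len(frames)
    # Assumed rule: energy concentrated in few bins suits transform coding.
    return "transform" if avg <= max_bins else "linear_prediction"
```

A pure tone concentrates its energy in one bin and selects the transform coder under these assumptions; an impulse spreads energy evenly across all bins and selects the linear-prediction coder.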

    ADAPTIVE SPEECH FILTER FOR ATTENUATION OF AMBIENT NOISE
    10.
    Type: invention publication
    Status: under examination (published)

    Publication No.: EP3032536A1

    Publication date: 2016-06-15

    Application No.: EP15198584.3

    Filing date: 2015-12-09

    IPC classification: G10L21/0264 G10L25/03

    CPC classification: G10L21/0264 G10L25/03

    Abstract: According to a preferred aspect of the instant invention, there is provided a system and method that allows the user to attenuate ambient noise in speech recordings in the audio part of a video recording. The user does not need to define particular sections, samples, or individual parameters. The system automatically analyzes the input signal and, in a plurality of individual steps, detects the ambient noise, determines an adaptive filter, implements the filter, and thereby attenuates the ambient noise.
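
The detect, determine, apply sequence can be sketched with a deliberately simple stand-in. The abstract does not describe the actual filter, so the code below substitutes a frame-wise gain (a basic noise gate) for it; the frame length, quantile, margin, and attenuation values are all hypothetical.

```python
from typing import List, Sequence


def frame_energies(samples: Sequence[float], frame_len: int) -> List[float]:
    """Mean energy of each complete, non-overlapping frame."""
    return [sum(x * x for x in samples[i:i + frame_len]) / frame_len
            for i in range(0, len(samples) - frame_len + 1, frame_len)]


def estimate_noise_floor(energies: Sequence[float], quantile: float = 0.1) -> float:
    """Step 1 (detect): take a low quantile of frame energy as the noise floor."""
    ordered = sorted(energies)
    return ordered[int(quantile * (len(ordered) - 1))]


def frame_gains(energies: Sequence[float], noise_floor: float,
                margin: float = 4.0, attenuation: float = 0.1) -> List[float]:
    """Step 2 (determine): low gain for frames close to the noise floor."""
    return [attenuation if e <= margin * noise_floor else 1.0 for e in energies]


def attenuate_noise(samples: Sequence[float], frame_len: int = 4) -> List[float]:
    """Step 3 (apply): scale each frame by its gain, with no user parameters."""
    energies = frame_energies(samples, frame_len)
    gains = frame_gains(energies, estimate_noise_floor(energies))
    out = list(samples)   # samples past the last complete frame stay unchanged
    for idx, gain in enumerate(gains):
        for i in range(idx * frame_len, (idx + 1) * frame_len):
            out[i] = samples[i] * gain
    return out
```

All thresholds are derived from the signal itself, which mirrors the abstract's point that the user does not have to mark noise sections or set parameters.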
