Feature compensation apparatus and method for speech recognition in noisy environment

    Publication number: US09799331B2

    Publication date: 2017-10-24

    Application number: US15074579

    Filing date: 2016-03-18

    CPC classification number: G10L15/20 G10L15/02

    Abstract: A feature compensation apparatus includes a feature extractor configured to extract corrupt speech features from a corrupt speech signal with additive noise that consists of two or more frames; a noise estimator configured to estimate noise features based on the extracted corrupt speech features and compensated speech features; a probability calculator configured to calculate a correlation between adjacent frames of the corrupt speech signal; and a speech feature compensator configured to generate compensated speech features by eliminating noise features of the extracted corrupt speech features while taking into consideration the correlation between adjacent frames of the corrupt speech signal and the estimated noise features, and to transmit the generated compensated speech features to the noise estimator.
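    The feedback loop in this abstract (compensated features sent back to the noise estimator, with neighboring frames influencing each other) can be illustrated with a toy sketch. This is not the patented method: it substitutes simple spectral subtraction with a gain smoothed across adjacent frames for the probability calculator. The function name `compensate_features`, the parameters `alpha`, `rho`, and `floor`, and the assumption that the first frame contains only noise are all hypothetical choices made for illustration.

```python
from typing import List


def compensate_features(
    corrupt: List[List[float]],
    alpha: float = 0.9,   # assumed smoothing factor for the noise estimate
    rho: float = 0.5,     # assumed weight given to the previous frame's gain
    floor: float = 0.01,  # assumed minimum gain, keeps features non-negative
) -> List[List[float]]:
    """Toy frame-by-frame compensation of power-spectrum-like features.

    Stand-in for the abstract's pipeline, not the patented algorithm:
    spectral subtraction plus temporal smoothing of the gain.
    """
    if not corrupt:
        return []

    # Assumption: the first frame contains noise only.
    noise = list(corrupt[0])
    prev_gain = None
    compensated = []

    for frame in corrupt:
        # Per-bin gain from the current noise estimate (spectral subtraction).
        raw_gain = [
            max(1.0 - n / y, floor) if y > 0.0 else floor
            for y, n in zip(frame, noise)
        ]
        # Blend with the previous frame's gain: a crude proxy for using
        # the dependence between adjacent frames.
        if prev_gain is None:
            gain = raw_gain
        else:
            gain = [rho * p + (1.0 - rho) * g for p, g in zip(prev_gain, raw_gain)]

        clean = [g * y for g, y in zip(gain, frame)]

        # Feedback step: the part of the corrupt features not kept as speech
        # is fed back to update the noise estimate.
        residual = [y - x for y, x in zip(frame, clean)]
        noise = [alpha * n + (1.0 - alpha) * r for n, r in zip(noise, residual)]

        compensated.append(clean)
        prev_gain = gain

    return compensated
```

    Calling `compensate_features([[1.0, 1.0], [5.0, 1.0], [1.0, 1.0]])` returns one compensated frame per input frame; the bin that rises well above the noise estimate in the second frame keeps much more of its energy than the bin that stays at the noise level.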

Mobile communication terminal and operating method thereof

    Type: granted invention patent (in force)

    Publication number: US09100492B2

    Publication date: 2015-08-04

    Application number: US14018068

    Filing date: 2013-09-04

    CPC classification number: H04M1/72519 G10L15/25 H04M2250/52 H04M2250/74

    Abstract: Provided is a mobile communication terminal including: a camera module which captures an image of a set area; a microphone module which, when a sound including a voice of a user is input, extracts a sound level corresponding to the sound and a sound generating position; and a control module which estimates a position of a lip of the user from the image, extracts a voice level from the sound level corresponding to the position of the lip of the user and a voice generating position from the sound generating position, and recognizes the voice of the user based on at least one of the voice level and the voice generating position.
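    The selection step in this abstract, matching a sound source to the estimated lip position, might look roughly like the sketch below. It assumes the microphone module reports each source as a level plus a 2-D position in the same coordinate frame as the camera image. The names `pick_voice_source` and `max_dist` and the nearest-source rule are hypothetical; the patent does not specify them here.

```python
import math
from typing import List, Optional, Tuple

# (sound level, (x, y) position) for one detected sound source; units assumed.
Source = Tuple[float, Tuple[float, float]]


def pick_voice_source(
    sources: List[Source],
    lip_pos: Tuple[float, float],
    max_dist: float = 50.0,  # assumed tolerance between source and lip position
) -> Optional[Source]:
    """Return the source closest to the lip position, or None if none is near."""
    best: Optional[Source] = None
    best_dist = max_dist
    for level, pos in sources:
        d = math.dist(pos, lip_pos)  # Euclidean distance (Python 3.8+)
        if d <= best_dist:
            best, best_dist = (level, pos), d
    return best
```

    The level and position of the returned source would then serve as the voice level and voice generating position passed on to recognition; a `None` result means no detected sound lines up with the user's lips.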

