Word-Level Correction of Speech Input
    16.
    Invention application
    Word-Level Correction of Speech Input (Granted)

    Publication No.: US20150294668A1

    Publication date: 2015-10-15

    Application No.: US14747306

    Filing date: 2015-06-23

    Applicant: Google Inc.

    Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
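
The correction flow in this abstract (present words, select one, offer alternates from the lattice, replace) can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes a linearized lattice with one list of scored candidates per word position (sometimes called a confusion network), whereas a real word lattice is a graph of paths. The names `Slot` and `Transcript` and all scores are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Slot:
    """One word position: candidate (word, score) pairs. Scores are made up."""
    hypotheses: List[Tuple[str, float]]

    def ranked(self) -> List[str]:
        # Candidate words ordered from highest to lowest score.
        return [w for w, _ in sorted(self.hypotheses, key=lambda h: h[1], reverse=True)]


@dataclass
class Transcript:
    """Holds the simplified lattice and the words currently presented."""
    lattice: List[Slot]
    words: List[str] = field(init=False)

    def __post_init__(self) -> None:
        # Initially present the top-scoring word at each position.
        self.words = [slot.ranked()[0] for slot in self.lattice]

    def alternates_for(self, index: int) -> List[str]:
        # Alternates are the other candidates at the selected position.
        return [w for w in self.lattice[index].ranked() if w != self.words[index]]

    def replace(self, index: int, alternate: str) -> None:
        # Only words that the lattice offers at this position are accepted.
        if alternate not in self.alternates_for(index):
            raise ValueError(f"{alternate!r} is not an alternate at position {index}")
        self.words[index] = alternate


# Hypothetical lattice, as if returned by a transcription system.
transcript = Transcript([
    Slot([("we're", 0.6), ("were", 0.4)]),
    Slot([("coming", 0.9)]),
    Slot([("about", 0.5), ("at", 0.3), ("around", 0.2)]),
])
choices = transcript.alternates_for(2)  # user selects the third word
transcript.replace(2, choices[0])       # user picks the first alternate
```

Restricting `replace` to candidates already in the lattice mirrors the abstract, where alternates come from the lattice itself rather than from free-form typing.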

    Estimating Speech in the Presence of Noise
    19.
    Invention application
    Estimating Speech in the Presence of Noise (Under examination, published)

    Publication No.: US20150287406A1

    Publication date: 2015-10-08

    Application No.: US13771419

    Filing date: 2013-02-20

    Applicant: Google Inc.

    CPC classification number: G10L15/20 G10L21/0232

    Abstract: A method for estimating speech signal in the presence of non-stationary noise includes determining a plurality of initial speech estimates by subtracting a plurality of noise spectra, respectively, from an observed spectrum. Each of the noise spectra is represented by a noise component vector obtained from a Gaussian mixture model. The method also includes determining a plurality of initial noise estimates by subtracting a plurality of speech spectra, respectively, from the observed spectrum. Each of the speech spectra is represented by a speech component vector obtained from another Gaussian mixture model. A plurality of scores is determined, each score corresponding to one of the plurality of initial speech estimates, and calculated from a joint distribution defined by a combination of one of the noise component vectors and one of the speech component vectors. A clean speech estimate is determined as a combination of a subset of the scores.
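
A rough Python sketch of the estimation steps in this abstract follows. The abstract does not give the scoring formula, so several choices here are assumptions rather than the patent's method: equal mixture weights, one shared isotropic variance `var`, a score for each speech estimate taken as the best joint log-likelihood over speech components, and a final estimate formed as the score-weighted average of the `top_k` best speech estimates. All function and parameter names are hypothetical.

```python
import math
from typing import List

Vector = List[float]


def subtract(a: Vector, b: Vector) -> Vector:
    """Element-wise spectral subtraction a - b."""
    return [x - y for x, y in zip(a, b)]


def log_gauss(x: Vector, mean: Vector, var: float) -> float:
    """Log density of an isotropic Gaussian (assumed shared variance)."""
    sq = sum((xi - mi) ** 2 for xi, mi in zip(x, mean))
    return -0.5 * (len(x) * math.log(2 * math.pi * var) + sq / var)


def estimate_clean_speech(observed: Vector, speech_means: List[Vector],
                          noise_means: List[Vector], var: float = 1.0,
                          top_k: int = 2) -> Vector:
    # Initial speech estimates: observed spectrum minus each noise component.
    speech_est = [subtract(observed, n) for n in noise_means]
    # Initial noise estimates: observed spectrum minus each speech component.
    noise_est = [subtract(observed, s) for s in speech_means]

    # Assumed score for speech estimate j: the best joint log-likelihood over
    # pairs (speech component i, noise component j).
    log_scores = []
    for j, n in enumerate(noise_means):
        log_scores.append(max(
            log_gauss(speech_est[j], s, var) + log_gauss(noise_est[i], n, var)
            for i, s in enumerate(speech_means)
        ))

    # Keep the top_k best-scoring estimates and normalize their weights.
    keep = sorted(range(len(log_scores)), key=lambda j: log_scores[j],
                  reverse=True)[:top_k]
    peak = max(log_scores[j] for j in keep)
    weights = {j: math.exp(log_scores[j] - peak) for j in keep}
    total = sum(weights.values())

    # Clean speech estimate: weighted average of the kept speech estimates.
    return [sum(weights[j] / total * speech_est[j][k] for j in keep)
            for k in range(len(observed))]


# Toy two-bin spectra; every number here is made up for illustration.
observed = [5.0, 3.0]
speech_means = [[4.0, 2.0], [1.0, 1.0]]
noise_means = [[1.0, 1.0], [3.0, 0.0]]
clean = estimate_clean_speech(observed, speech_means, noise_means)
print(clean)
```

Scores are kept in the log domain and shifted by their maximum before exponentiation, which avoids numerical underflow when a spectrum has many frequency bins.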
