-
Publication number: US09881608B2
Publication date: 2018-01-30
Application number: US15608110
Filing date: 2017-05-30
Applicant: Google Inc.
Inventor: Michael J. LeBeau, William J. Byrne, John Nicholas Jitkoff, Brandon M. Ballinger, Trausti T. Kristjansson
IPC: G10L15/00, G10L21/00, G10L25/00, G06F17/27, G06F17/21, G10L15/22, G10L15/30, G10L15/26, G06F17/24, G06F3/0484, G06F17/22, G10L15/01, G06F3/0482, G06F3/0488
CPC classification number: G10L15/22, G06F3/0482, G06F3/04842, G06F3/04886, G06F17/2241, G06F17/24, G06F17/273, G06F17/277, G10L15/01, G10L15/26, G10L15/265, G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
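The correction flow in this abstract (present the transcribed words, let the user select one, offer alternates from the lattice, replace the selection) can be illustrated with a minimal Python sketch. It is not taken from the patent: the lattice is simplified to one list of ranked candidates per word position, and the names `Slot`, `presented_words`, `alternates_for` and `replace_word` are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Slot:
    """One transcript position with its candidate words, best hypothesis first.

    Assumption: a real word lattice is a graph of scored hypotheses; a flat
    list of candidates per position is a simplification for illustration.
    """
    candidates: List[str]
    chosen: int = 0  # index of the candidate currently shown to the user

    @property
    def word(self) -> str:
        return self.candidates[self.chosen]


def presented_words(slots: List[Slot]) -> List[str]:
    """Words currently shown to the user, one per position."""
    return [slot.word for slot in slots]


def alternates_for(slots: List[Slot], position: int) -> List[str]:
    """Alternate words for the selected position, excluding the one shown."""
    slot = slots[position]
    return [w for i, w in enumerate(slot.candidates) if i != slot.chosen]


def replace_word(slots: List[Slot], position: int, alternate: str) -> None:
    """Swap the shown word for a user-selected alternate from the same slot."""
    slot = slots[position]
    # Raises ValueError if the alternate is not a candidate at this position.
    slot.chosen = slot.candidates.index(alternate)


lattice = [Slot(["recognize", "wreck a nice"]), Slot(["speech", "beach"])]
print(presented_words(lattice))      # transcript as first presented
print(alternates_for(lattice, 1))    # alternates offered for the second word
replace_word(lattice, 1, "beach")
print(presented_words(lattice))      # transcript after the correction
```

Keeping the candidates alongside the chosen word means a correction needs no further round trip to the transcription system, which is presumably why the method receives a lattice rather than a single best string.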
-
Publication number: US20170270926A1
Publication date: 2017-09-21
Application number: US15608110
Filing date: 2017-05-30
Applicant: Google Inc.
Inventor: Michael J. LeBeau, William J. Byrne, John Nicholas Jitkoff, Brandon M. Ballinger, Trausti T. Kristjansson
IPC: G10L15/22, G10L15/30, G10L15/26, G06F3/0482, G06F3/0488, G06F3/0484, G06F17/22, G10L15/01, G06F17/27, G06F17/24
CPC classification number: G10L15/22, G06F3/0482, G06F3/04842, G06F3/04886, G06F17/2241, G06F17/24, G06F17/273, G06F17/277, G10L15/01, G10L15/26, G10L15/265, G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
-
Publication number: US09570094B2
Publication date: 2017-02-14
Application number: US14753904
Filing date: 2015-06-29
Applicant: Google Inc.
Inventor: Dave Burke, Michael J. LeBeau, Konrad Gianno, Trausti T. Kristjansson, John Nicholas Jitkoff, Andrew W. Senior
IPC: G10L15/04, G10L25/78, G10L15/10, G06F3/0346, H04M1/725, H04R1/08, H04W4/02, G10L17/00, G06F3/16
CPC classification number: G10L25/78, G06F3/0346, G06F3/167, G10L15/10, G10L15/22, G10L15/265, G10L17/00, G10L25/21, H04M1/72569, H04M2250/12, H04M2250/74, H04R1/08, H04W4/026
Abstract: A computer-implemented method of multisensory speech detection is disclosed. The method comprises determining an orientation of a mobile device and determining an operating mode of the mobile device based on the orientation of the mobile device. The method further includes identifying speech detection parameters that specify when speech detection begins or ends based on the determined operating mode and detecting speech from a user of the mobile device based on the speech detection parameters.
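The chain described in this abstract (orientation, then operating mode, then speech detection parameters, then detection) can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented implementation: the mode names, the 45-degree pitch cut-off, the parameter values and the energy-based detector are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Sequence, Tuple


@dataclass(frozen=True)
class DetectionParams:
    energy_threshold: float  # frame energy at or above this counts as speech
    max_silent_frames: int   # consecutive quiet frames that end the utterance


# Hypothetical modes and values, chosen only to make the example concrete.
PARAMS_BY_MODE: Dict[str, DetectionParams] = {
    "held_to_ear": DetectionParams(energy_threshold=0.2, max_silent_frames=3),
    "held_in_front": DetectionParams(energy_threshold=0.5, max_silent_frames=2),
}


def operating_mode(pitch_degrees: float) -> str:
    """Map device pitch (0 = lying flat, 90 = upright) to an operating mode."""
    return "held_to_ear" if pitch_degrees >= 45.0 else "held_in_front"


def detect_speech(
    energies: Sequence[float], params: DetectionParams
) -> Optional[Tuple[int, int]]:
    """Return (start, end) frame indices of the first utterance, end exclusive."""
    start = None
    silent = 0
    for i, energy in enumerate(energies):
        if energy >= params.energy_threshold:
            if start is None:
                start = i          # speech detection begins here
            silent = 0
        elif start is not None:
            silent += 1
            if silent >= params.max_silent_frames:
                return (start, i - silent + 1)  # speech detection ends
    if start is None:
        return None
    return (start, len(energies) - silent)


frames = [0.0, 0.3, 0.6, 0.6, 0.1, 0.1, 0.1, 0.0]
for pitch in (70.0, 10.0):
    mode = operating_mode(pitch)
    print(mode, detect_speech(frames, PARAMS_BY_MODE[mode]))
```

The same audio frames produce different start and end points depending on the mode, which is the point of tying the detection parameters to the device orientation.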
-
Publication number: US09466287B2
Publication date: 2016-10-11
Application number: US14988201
Filing date: 2016-01-05
Applicant: Google Inc.
Inventor: Michael J. LeBeau, William J. Byrne, John Nicholas Jitkoff, Brandon M. Ballinger, Trausti T. Kristjansson
IPC: G10L21/00, G10L15/01, G06F17/27, G10L15/22, G10L15/30, G10L15/26, G06F17/24, G06F3/0484, G06F17/22
CPC classification number: G10L15/22, G06F3/0482, G06F3/04842, G06F3/04886, G06F17/2241, G06F17/24, G06F17/273, G06F17/277, G10L15/01, G10L15/26, G10L15/265, G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
-
Publication number: US20160163308A1
Publication date: 2016-06-09
Application number: US15045571
Filing date: 2016-02-17
Applicant: Google Inc.
Inventor: Michael J. LeBeau, William J. Byrne, John Nicholas Jitkoff, Brandon M. Ballinger, Trausti T. Kristjansson
IPC: G10L15/01, G10L15/30, G06F17/22, G10L15/26, G06F3/0484
CPC classification number: G10L15/22, G06F3/0482, G06F3/04842, G06F3/04886, G06F17/2241, G06F17/24, G06F17/273, G06F17/277, G10L15/01, G10L15/26, G10L15/265, G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
-
Publication number: US20150294668A1
Publication date: 2015-10-15
Application number: US14747306
Filing date: 2015-06-23
Applicant: Google Inc.
Inventor: Michael J. LeBeau, William J. Byrne, John Nicholas Jitkoff, Brandon M. Ballinger, Trausti T. Kristjansson
CPC classification number: G10L15/22, G06F3/0482, G06F3/04842, G06F3/04886, G06F17/2241, G06F17/24, G06F17/273, G06F17/277, G10L15/01, G10L15/26, G10L15/265, G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
-
Publication number: US10020009B1
Publication date: 2018-07-10
Application number: US15392448
Filing date: 2016-12-28
Applicant: Google Inc.
Inventor: Dave Burke, Michael J. LeBeau, Konrad Gianno, Trausti T. Kristjansson, John Nicholas Jitkoff, Andrew W. Senior
CPC classification number: G10L25/78, G06F3/0346, G06F3/167, G10L15/10, G10L15/22, G10L15/265, G10L17/00, G10L25/21, H04M1/72569, H04M2250/12, H04M2250/74, H04R1/08, H04W4/026
Abstract: A computer-implemented method of multisensory speech detection is disclosed. The method comprises determining an orientation of a mobile device and determining an operating mode of the mobile device based on the orientation of the mobile device. The method further includes identifying speech detection parameters that specify when speech detection begins or ends based on the determined operating mode and detecting speech from a user of the mobile device based on the speech detection parameters.
-
Publication number: US20150287423A1
Publication date: 2015-10-08
Application number: US14645802
Filing date: 2015-03-12
Applicant: Google Inc.
Inventor: Dave Burke, Michael J. LeBeau, Konrad Gianno, Trausti T. Kristjansson, John Nicholas Jitkoff, Andrew W. Senior
CPC classification number: G10L25/78, G06F3/0346, G06F3/167, G10L15/10, G10L15/22, G10L15/265, G10L17/00, G10L25/21, H04M1/72569, H04M2250/12, H04M2250/74, H04R1/08, H04W4/026
Abstract: A computer-implemented method of multisensory speech detection is disclosed. The method comprises determining an orientation of a mobile device and determining an operating mode of the mobile device based on the orientation of the mobile device. The method further includes identifying speech detection parameters that specify when speech detection begins or ends based on the determined operating mode and detecting speech from a user of the mobile device based on the speech detection parameters.
-
Publication number: US20150287406A1
Publication date: 2015-10-08
Application number: US13771419
Filing date: 2013-02-20
Applicant: Google Inc.
Inventor: Trausti T. Kristjansson, Mitchel Weintraub
IPC: G10L15/20
CPC classification number: G10L15/20, G10L21/0232
Abstract: A method for estimating speech signal in the presence of non-stationary noise includes determining a plurality of initial speech estimates by subtracting a plurality of noise spectra, respectively, from an observed spectrum. Each of the noise spectra is represented by a noise component vector obtained from a Gaussian mixture model. The method also includes determining a plurality of initial noise estimates by subtracting a plurality of speech spectra, respectively, from the observed spectrum. Each of the speech spectra is represented by a speech component vector obtained from another Gaussian mixture model. A plurality of scores is determined, each score corresponding to one of the plurality of initial speech estimates, and calculated from a joint distribution defined by a combination of one of the noise component vectors and one of the speech component vectors. A clean speech estimate is determined as a combination of a subset of the scores.
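One possible reading of this abstract can be sketched in plain Python. The sketch rests on several assumptions that the abstract does not state: spectra are short lists of non-negative magnitudes, each Gaussian mixture model is reduced to its component mean vectors with uniform priors and a single shared variance, subtraction is floored at a small positive value, and every speech/noise component pair contributes to the final combination (the abstract mentions only a subset). The function names are hypothetical.

```python
import math
from typing import List, Sequence


def log_gauss(x: Sequence[float], mean: Sequence[float], var: float) -> float:
    """Log density of an isotropic Gaussian with one shared variance."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * var) + (a - m) ** 2 / var)
        for a, m in zip(x, mean)
    )


def subtract(observed: Sequence[float], component: Sequence[float],
             floor: float = 1e-3) -> List[float]:
    """Spectral subtraction, floored so the estimate stays positive."""
    return [max(o - c, floor) for o, c in zip(observed, component)]


def estimate_clean_speech(observed: Sequence[float],
                          speech_means: Sequence[Sequence[float]],
                          noise_means: Sequence[Sequence[float]],
                          var: float = 1.0) -> List[float]:
    # Initial speech estimates: observed spectrum minus each noise component.
    speech_estimates = [subtract(observed, n) for n in noise_means]
    # Initial noise estimates: observed spectrum minus each speech component.
    noise_estimates = [subtract(observed, s) for s in speech_means]

    log_scores, noise_index = [], []
    for j, s_mean in enumerate(speech_means):
        for k, n_mean in enumerate(noise_means):
            # Score the pair under the joint (speech j, noise k) model.
            log_scores.append(
                log_gauss(speech_estimates[k], s_mean, var)
                + log_gauss(noise_estimates[j], n_mean, var)
            )
            noise_index.append(k)

    # Normalise scores into weights (shifted by the maximum for stability).
    top = max(log_scores)
    weights = [math.exp(s - top) for s in log_scores]
    total = sum(weights)

    # Clean speech estimate: score-weighted average of the initial estimates.
    return [
        sum(w * speech_estimates[k][d] for w, k in zip(weights, noise_index)) / total
        for d in range(len(observed))
    ]


observed = [5.0, 1.0]
speech_means = [[4.0, 0.5]]
noise_means = [[1.0, 0.5], [3.0, 0.9]]
print(estimate_clean_speech(observed, speech_means, noise_means))
```

In this toy input the first noise component explains the observation far better than the second, so the result stays close to the first initial speech estimate.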