-
Publication number: US20170185691A1
Publication date: 2017-06-29
Application number: US15460696
Application date: 2017-03-16
Applicant: Google Inc.
Inventor: John Nicholas Jitkoff , Michael J. LeBeau , William J. Byrne , David P. Singleton
CPC classification number: G06F16/9535 , G06F3/167 , G06F16/338 , G06F16/638 , G06F16/951 , G10L13/00 , G10L13/043 , G10L15/30 , G10L2015/225 , H04M1/6041 , H04M1/72569 , H04R29/004
Abstract: In general, the subject matter described in this specification can be embodied in methods, systems, and program products for receiving user input that defines a search query, and providing the search query to a server system. Information that a search engine system determined was responsive to the search query is received at a computing device. The computing device is identified as in a first state, and a first output mode for audibly outputting at least a portion of the information is selected. The first output mode is selected from a collection of the first output mode and a second output mode. The second output mode is selected in response to the computing device being in a second state and is for visually outputting at least the portion of the information and not audibly outputting the at least portion of the information. At least the portion of information is audibly output.
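A minimal sketch of the state-dependent output selection described in this abstract. The abstract does not say what the two device states are, so the `DeviceState` members, the `present` helper, and the `[speak]`/`[display]` tags are hypothetical names introduced only for illustration.

```python
from enum import Enum


class DeviceState(Enum):
    FIRST = "first"    # assumption: e.g. a hands-free context
    SECOND = "second"  # assumption: e.g. the user is looking at the screen


class OutputMode(Enum):
    AUDIBLE = "audible"  # first output mode: speak the information
    VISUAL = "visual"    # second output mode: display only, no audio


def select_output_mode(state: DeviceState) -> OutputMode:
    """Choose the audible mode in the first state and the visual mode otherwise."""
    if state is DeviceState.FIRST:
        return OutputMode.AUDIBLE
    return OutputMode.VISUAL


def present(result_text: str, state: DeviceState) -> str:
    """Return a tagged string standing in for actual speech or display output."""
    mode = select_output_mode(state)
    if mode is OutputMode.AUDIBLE:
        return f"[speak] {result_text}"
    return f"[display] {result_text}"
```

A real device would replace the tagged strings with calls to a text-to-speech engine or a UI layer; the sketch only shows that one mode is picked from the pair based on the identified state.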
-
Publication number: US20170147589A1
Publication date: 2017-05-25
Application number: US15427431
Application date: 2017-02-08
Applicant: Google Inc.
Inventor: Michael J. LeBeau , John Nicholas Jitkoff , William J. Byrne
CPC classification number: G06F16/9537 , G06F16/24578 , G06F16/951 , G06F16/953 , G06F16/957 , G10L15/25 , H04M1/72561 , H04M3/4931 , H04M3/4935 , H04M2201/40 , H04M2242/15
Abstract: In general, the subject matter described in this specification can be embodied in methods, systems, and program products for providing search results automatically to a user of a computing device. A spoken input provided by a user to a computing device is received. The spoken input is transmitted to a computer server system that is remote from the computing device. Search result information that is responsive to the spoken input is received by the computing device in response to the transmitted spoken input. An alert is provided to the user that the device will connect the user to a target of the search result information if the user does not intervene to stop the connecting of the user. The user is connected to the target of the search result information based on a determination that the user has not intervened to stop the connecting of the user.
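The alert-then-connect flow in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: `SearchResult`, `auto_connect`, and the three callbacks are hypothetical names, and how the device detects intervention (a timeout, a button, a spoken "cancel") is left to the caller.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class SearchResult:
    name: str    # human-readable label of the result
    target: str  # what to connect to, e.g. a phone number (assumed field)


def auto_connect(
    result: SearchResult,
    alert: Callable[[str], None],
    user_intervened: Callable[[], bool],
    connect: Callable[[str], None],
) -> bool:
    """Alert the user, then connect unless the user intervenes.

    Returns True if the connection was made, False if the user stopped it.
    """
    alert(f"Connecting you to {result.name} unless you cancel.")
    if user_intervened():
        return False
    connect(result.target)
    return True
```

Passing the alert, intervention check, and connection step as callbacks keeps the decision logic separate from any particular telephony or UI layer.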
-
Publication number: US20170069322A1
Publication date: 2017-03-09
Application number: US15350309
Application date: 2016-11-14
Applicant: Google Inc.
Inventor: Michael J. LeBeau , William J. Byrne , John Nicholas Jitkoff , Brandon M. Ballinger , Trausti T. Kristjansson
IPC: G10L15/22 , G06F3/0482 , G10L15/26 , G06F17/27 , G06F17/24 , G06F17/22 , G10L15/30 , G06F3/0488 , G10L15/01
CPC classification number: G10L15/22 , G06F3/0482 , G06F3/04842 , G06F3/04886 , G06F17/2241 , G06F17/24 , G06F17/273 , G06F17/277 , G10L15/01 , G10L15/26 , G10L15/265 , G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
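A hedged sketch of the correction loop in this abstract: present words, let the user select one, offer alternates, and swap in the chosen alternate. A real word lattice is a graph of competing hypotheses; here it is simplified, as an assumption, to a list of positions that each hold ranked candidate words. The `Transcript` class and its method names are hypothetical.

```python
from typing import List


class Transcript:
    """Simplified lattice: slots[i] lists candidate words for position i, best first."""

    def __init__(self, slots: List[List[str]]):
        self.slots = slots
        self.chosen = [0] * len(slots)  # index of the currently shown word per slot

    def words(self) -> List[str]:
        """The transcribed words currently presented to the user."""
        return [slot[i] for slot, i in zip(self.slots, self.chosen)]

    def alternates(self, position: int) -> List[str]:
        """Alternate words from the lattice for the selected position."""
        current = self.chosen[position]
        return [w for i, w in enumerate(self.slots[position]) if i != current]

    def replace(self, position: int, alternate: str) -> None:
        """Replace the selected word with the alternate the user picked."""
        self.chosen[position] = self.slots[position].index(alternate)
```

For example, with slots `[["we're"], ["coming", "calming"], ["about", "at"]]`, selecting position 1 offers `["calming"]`, and choosing it changes the presented words to `["we're", "calming", "about"]`.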
-
Publication number: US20160314786A1
Publication date: 2016-10-27
Application number: US15201955
Application date: 2016-07-05
Applicant: Google Inc.
Inventor: William J. Byrne , Alexander H. Gruenstein , Douglas H. Beeferman
CPC classification number: G10L15/22 , G06F3/04842 , G06F3/167 , G06F17/2795 , G10L15/02 , G10L15/063 , G10L15/1822 , G10L15/26 , G10L2015/0631 , G10L2015/0635 , G10L2015/0638 , G10L2015/221
Abstract: Predicting and learning users' intended actions on an electronic device based on free-form speech input. Users' actions can be monitored to develop a list of carrier phrases having one or more actions that correspond to the carrier phrases. A user can speak a command into a device to initiate an action. The spoken command can be parsed and compared to a list of carrier phrases. If the spoken command matches one of the known carrier phrases, the corresponding action(s) can be presented to the user for selection. If the spoken command does not match one of the known carrier phrases, search results (e.g., Internet search results) corresponding to the spoken command can be presented to the user. The actions of the user in response to the presented action(s) and/or the search results can be monitored to update the list of carrier phrases.
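The match-or-search behavior and the learning step in this abstract can be sketched as below. This is a minimal illustration with assumptions stated up front: matching is a simple prefix comparison on lowercased text, learning is a per-phrase action counter, and `CarrierPhraseTable`, `match`, and `record` are hypothetical names.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


class CarrierPhraseTable:
    """Maps known carrier phrases to the actions users have taken after them."""

    def __init__(self) -> None:
        self.counts: Dict[str, Dict[str, int]] = defaultdict(lambda: defaultdict(int))

    def record(self, phrase: str, action: str) -> None:
        """Update the table from an observed user action (the learning step)."""
        self.counts[phrase.lower()][action] += 1

    def match(self, command: str) -> Tuple[str, List[str]]:
        """Return (phrase, actions by frequency), or ("", []) if nothing matches."""
        text = command.lower().strip()
        # Prefer the longest known phrase that the command starts with.
        for phrase in sorted(self.counts, key=len, reverse=True):
            if text == phrase or text.startswith(phrase + " "):
                actions = self.counts[phrase]
                ranked = sorted(actions, key=lambda a: (-actions[a], a))
                return phrase, ranked
        return "", []


def handle(table: CarrierPhraseTable, command: str) -> Tuple[str, List[str]]:
    """Present matching actions, or fall back to search results for the command."""
    _, actions = table.match(command)
    if actions:
        return "actions", actions
    return "search", [command]
```

Whatever the user then picks, from the presented actions or from the search results, would be fed back through `record` so the list of carrier phrases keeps adapting.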
-
Publication number: US09460712B1
Publication date: 2016-10-04
Application number: US14454198
Application date: 2014-08-07
Applicant: Google Inc.
Inventor: Brian Strope , William J. Byrne , Francoise Beaufays
CPC classification number: G10L15/22 , G06F17/30241 , G06F17/30864 , G06F17/30867 , G06F17/3087 , G06Q30/02 , G10L15/18 , G10L15/197 , G10L15/26 , G10L15/30 , G10L2015/223 , G10L2015/228
Abstract: A method of operating a voice-enabled business directory search system includes receiving category-business pairs, each category-business pair including a business category and a specific business, and establishing a data structure having nodes based on the category-business pairs. Each node of the data structure is associated with one or more business categories and a speech recognition language model for recognizing specific businesses associated with the one or more business categories.
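A rough sketch of building the node structure from category-business pairs. The abstract does not specify the shape of the data structure or the form of the language model, so this assumes one flat node per category and uses word counts over business names as a stand-in for a real speech recognition language model. `Node` and `build_nodes` are hypothetical names.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import Dict, Iterable, Set, Tuple


@dataclass
class Node:
    categories: Set[str]
    # Stand-in for a language model: counts of words seen in business names.
    word_counts: Counter = field(default_factory=Counter)


def build_nodes(pairs: Iterable[Tuple[str, str]]) -> Dict[str, Node]:
    """Create one node per category and fold each business name into its model."""
    nodes: Dict[str, Node] = {}
    for category, business in pairs:
        node = nodes.setdefault(category, Node(categories={category}))
        node.word_counts.update(business.lower().split())
    return nodes
```

In a fuller design a node could cover several related categories, which the abstract allows for ("one or more business categories"); the sketch keeps a one-to-one mapping for brevity.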
-
Publication number: US20160154881A1
Publication date: 2016-06-02
Application number: US15016707
Application date: 2016-02-05
Applicant: Google Inc.
Inventor: John Nicholas Jitkoff , Michael J. LeBeau , William J. Byrne , David P. Singleton
Abstract: In general, the subject matter described in this specification can be embodied in methods, systems, and program products for receiving user input that defines a search query, and providing the search query to a server system. Information that a search engine system determined was responsive to the search query is received at a computing device. The computing device is identified as in a first state, and a first output mode for audibly outputting at least a portion of the information is selected. The first output mode is selected from a collection of the first output mode and a second output mode. The second output mode is selected in response to the computing device being in a second state and is for visually outputting at least the portion of the information and not audibly outputting the at least portion of the information. At least the portion of information is audibly output.
-
Publication number: US20160133258A1
Publication date: 2016-05-12
Application number: US14988201
Application date: 2016-01-05
Applicant: Google Inc.
Inventor: Michael J. LeBeau , William J. Byrne , John Nicholas Jitkoff , Brandon M. Ballinger , Trausti T. Kristjansson
CPC classification number: G10L15/22 , G06F3/0482 , G06F3/04842 , G06F3/04886 , G06F17/2241 , G06F17/24 , G06F17/273 , G06F17/277 , G10L15/01 , G10L15/26 , G10L15/265 , G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
-
Publication number: US09251791B2
Publication date: 2016-02-02
Application number: US14299837
Application date: 2014-06-09
Applicant: Google Inc.
Inventor: Brandon M. Ballinger , Johan Schalkwyk , Michael H. Cohen , William J. Byrne , Gudmundur Hafsteinsson , Michael J. LeBeau
IPC: G06F17/20 , G10L15/26 , G06F17/28 , G10L15/30 , G10L15/18 , G10L15/183 , G10L15/197
CPC classification number: G06F3/167 , G06F3/04886 , G06F17/277 , G06F17/289 , G10L15/005 , G10L15/18 , G10L15/183 , G10L15/197 , G10L15/22 , G10L15/26 , G10L15/265 , G10L15/30 , G10L2015/223 , G10L2015/228
Abstract: A computer-implemented input-method editor process includes receiving a request from a user for an application-independent input method editor having written and spoken input capabilities, identifying that the user is about to provide spoken input to the application-independent input method editor, and receiving a spoken input from the user. The spoken input corresponds to input to an application and is converted to text that represents the spoken input. The text is provided as input to the application.
-
Publication number: US09542932B2
Publication date: 2017-01-10
Application number: US15045571
Application date: 2016-02-17
Applicant: Google Inc.
Inventor: Michael J. LeBeau , William J. Byrne , John Nicholas Jitkoff , Brandon M. Ballinger , Trausti T. Kristjansson
IPC: G10L21/00 , G10L15/01 , G06F17/27 , G10L15/22 , G10L15/30 , G10L15/26 , G06F17/24 , G06F3/0484 , G06F17/22
CPC classification number: G10L15/22 , G06F3/0482 , G06F3/04842 , G06F3/04886 , G06F17/2241 , G06F17/24 , G06F17/273 , G06F17/277 , G10L15/01 , G10L15/26 , G10L15/265 , G10L15/30
Abstract: The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.
-
Publication number: US20140288929A1
Publication date: 2014-09-25
Application number: US14299837
Application date: 2014-06-09
Applicant: Google Inc.
Inventor: Brandon M. Ballinger , Johan Schalkwyk , Michael H. Cohen , William J. Byrne , Gudmundur Hafsteinsson
CPC classification number: G06F3/167 , G06F3/04886 , G06F17/277 , G06F17/289 , G10L15/005 , G10L15/18 , G10L15/183 , G10L15/197 , G10L15/22 , G10L15/26 , G10L15/265 , G10L15/30 , G10L2015/223 , G10L2015/228
Abstract: A computer-implemented input-method editor process includes receiving a request from a user for an application-independent input method editor having written and spoken input capabilities, identifying that the user is about to provide spoken input to the application-independent input method editor, and receiving a spoken input from the user. The spoken input corresponds to input to an application and is converted to text that represents the spoken input. The text is provided as input to the application.