-
公开(公告)号:US08571851B1
公开(公告)日:2013-10-29
申请号:US13731508
申请日:2012-12-31
Applicant: Google Inc.
Inventor: Simon Tickner , Richard Z. Cohen
IPC: G06F17/27
CPC classification number: G06F17/2785 , G10L15/1815 , G10L15/24
Abstract: Methods, and systems, including computer programs encoded on computer-readable storage mediums, including a method for performing semantic interpretation using gaze order. The method includes obtaining data identifying a sequence of gaze attention dwell positions, obtaining a semantic description of elements displayed on a visual display, obtaining a transcription of an utterance, correlating the gaze attention dwell positions with the semantic description of elements to generate a sequence of one or more of the elements, performing semantic interpretation of at least one term included in the transcription based at least on the sequence of the elements, and outputting a result of performing the semantic interpretation of the at least one term.
Abstract translation: 方法和系统,包括在计算机可读存储介质上编码的计算机程序,包括使用凝视顺序执行语义解释的方法。 该方法包括获得识别注视停留位置序列的数据,获得在视觉显示器上显示的元素的语义描述,获得话语的转录,将注视注意位置与元素的语义描述相关联以产生一系列 一个或多个元素,至少基于元素的序列执行包含在转录中的至少一个术语的语义解释,并输出执行至少一个术语的语义解释的结果。
-
公开(公告)号:US08731912B1
公开(公告)日:2014-05-20
申请号:US13828592
申请日:2013-03-14
Applicant: Google Inc.
Inventor: Simon Tickner , Peter J Hodgson , Richard Z. Cohen
IPC: G10L11/02
CPC classification number: G10L15/22 , G10L25/78 , G10L2015/223 , H04M19/04 , H04M2250/74
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for audible alert tones are disclosed. The methods, systems, and apparatus include actions of determining whether audio input data received after ceasing output of a first instance of an audible alert tone includes voice activity and determining whether to delay a successive instance of the audible alert tone based on determining whether the audio input data includes voice activity.
Abstract translation: 公开了包括在计算机存储介质上编码的计算机程序的方法,系统和装置,用于可听到的警报音。 方法,系统和装置包括确定在停止声音警报音的第一实例的输出之后接收到的音频输入数据是否包括语音活动并且基于确定音频是否延迟可听到的警报音的连续实例的动作 输入数据包括语音活动。
-
公开(公告)号:US08515751B2
公开(公告)日:2013-08-20
申请号:US13627744
申请日:2012-09-26
Applicant: Google Inc.
Inventor: Luca Zanolin , Marcus A. Foster , Richard Z. Cohen
IPC: G10L15/26
CPC classification number: G06F17/273
Abstract: This specification describes technologies relating to recognition of text in various media. In general, one aspect of the subject matter described in this specification can be embodied in methods that include receiving an input signal including data representing one or more words and passing the input signal to a text recognition system that generates a recognized text string based on the input signal. The methods may further include receiving the recognized text string from the text recognition system. The methods may further include presenting the recognized text string to a user and receiving a corrected text string based on input from the user. The methods may further include checking if an edit distance between the corrected text string and the recognized text string is below a threshold. If the edit distance is below the threshold, the corrected text string may be passed to the text recognition system for training purposes.
Abstract translation: 本说明书描述了在各种媒体中识别文本的技术。 通常,本说明书中描述的主题的一个方面可以体现在包括接收包括表示一个或多个字的数据的输入信号并将输入信号传递到文本识别系统的方法中,所述文本识别系统基于 输入信号。 所述方法还可以包括从文本识别系统接收所识别的文本串。 所述方法还可以包括将识别的文本串呈现给用户,并且基于来自用户的输入接收经校正的文本串。 所述方法还可以包括检查所述经修正的文本串与所识别的文本串之间的编辑距离是否低于阈值。 如果编辑距离低于阈值,则为了训练目的,校正的文本串可以被传递到文本识别系统。
-
公开(公告)号:US09378730B1
公开(公告)日:2016-06-28
申请号:US14077368
申请日:2013-11-12
Applicant: Google Inc.
Inventor: Simon Tickner , Richard Z. Cohen
CPC classification number: G10L15/1815 , G10L15/02 , G10L15/22 , G10L15/26
Abstract: Methods, computer program products, and systems are described for receiving, by a speech recognition engine, audio data that encodes an utterance and determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun. In addition, the methods, computer program products, and systems described herein pertain to transmitting a disambiguation request to an application, wherein the disambiguation request identifies the pronoun, receiving, by the speech recognition engine, a response to the disambiguation request, wherein the response references an item of content identified by the application, and generating, by the speech recognition engine, the command using the keywords and the response.
-
公开(公告)号:US09697829B1
公开(公告)日:2017-07-04
申请号:US15168355
申请日:2016-05-31
Applicant: Google Inc.
Inventor: Simon Tickner , Richard Z. Cohen
CPC classification number: G10L15/1815 , G10L15/02 , G10L15/22 , G10L15/26
Abstract: Methods, computer program products, and systems are described for receiving, by a speech recognition engine, audio data that encodes an utterance and determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun. In addition, the methods, computer program products, and systems described herein pertain to transmitting a disambiguation request to an application, wherein the disambiguation request identifies the pronoun, receiving, by the speech recognition engine, a response to the disambiguation request, wherein the response references an item of content identified by the application, and generating, by the speech recognition engine, the command using the keywords and the response.
-
公开(公告)号:US08606568B1
公开(公告)日:2013-12-10
申请号:US13658110
申请日:2012-10-23
Applicant: Google Inc.
Inventor: Simon Tickner , Richard Z. Cohen
IPC: G10L15/00
CPC classification number: G10L15/1815 , G10L15/02 , G10L15/22 , G10L15/26
Abstract: Methods, computer program products, and systems are described for receiving, by a speech recognition engine, audio data that encodes an utterance and determining, by the speech recognition engine, that a transcription of the utterance includes one or more keywords associated with a command, and a pronoun. In addition, the methods, computer program products, and systems described herein pertain to transmitting a disambiguation request to an application, wherein the disambiguation request identifies the pronoun, receiving, by the speech recognition engine, a response to the disambiguation request, wherein the response references an item of content identified by the application, and generating, by the speech recognition engine, the command using the keywords and the response.
Abstract translation: 描述了方法,计算机程序产品和系统,用于通过语音识别引擎接收编码语音的音频数据,并且由语音识别引擎确定话音的转录包括与命令相关联的一个或多个关键字, 和代词。 此外,本文描述的方法,计算机程序产品和系统涉及向应用发送消歧请求,其中消歧请求标识代词,由语音识别引擎接收对消歧请求的响应,其中响应 引用由应用标识的内容项,并且由语音识别引擎使用关键字和响应生成命令。
-
-
-
-
-