Patent search: ap:"Johan Schalkwyk", page 1

1.

Granted patent
Multi-modal input on an electronic device (in force)

Publication No.: US08751217B2

Publication date: 2014-06-10

Application No.: US13249172

Filing date: 2011-09-29

Applicants: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, William J. Byrne, Gudmundur Hafsteinsson, Michael J. LeBeau

Inventors: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, William J. Byrne, Gudmundur Hafsteinsson, Michael J. LeBeau

IPC classification: G06F17/20

CPC classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228

Abstract: A computer-implemented input-method editor process includes receiving a request from a user for an application-independent input method editor having written and spoken input capabilities, identifying that the user is about to provide spoken input to the application-independent input method editor, and receiving a spoken input from the user. The spoken input corresponds to input to an application and is converted to text that represents the spoken input. The text is provided as input to the application.

2.

Granted patent
Robust speech recognition (in force)

Publication No.: US08682661B1

Publication date: 2014-03-25

Application No.: US12872428

Filing date: 2010-08-31

Applicants: Johan Schalkwyk, Bjorn Bringert, David P. Singleton

Inventors: Johan Schalkwyk, Bjorn Bringert, David P. Singleton

IPC classification: G10L15/00

CPC classification: G10L15/1815, G10L15/197, G10L2015/223

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for recognizing speech input. In one aspect, a method includes receiving a user input and a grammar including annotations, the user input comprising audio data and the annotations providing syntax and semantics to the grammar, retrieving third-party statistical speech recognition information, the statistical speech recognition information being transmitted over a network, generating a statistical language model (SLM) based on the grammar and the statistical speech recognition information, the SLM preserving semantics of the grammar, processing the user input using the SLM to generate one or more results, comparing the one or more results to candidates provided in the grammar, identifying a particular candidate of the grammar based on the comparing, and providing the particular candidate for input to an application executed on a computing device.

3.

Granted patent
Determining hotword suitability (in force)

Publication No.: US09536528B2

Publication date: 2017-01-03

Application No.: US13567572

Filing date: 2012-08-06

Applicants: Andrew E. Rubin, Johan Schalkwyk, Maria Carolina Parada San Martin

Inventors: Andrew E. Rubin, Johan Schalkwyk, Maria Carolina Parada San Martin

IPC classification: G10L21/00, G10L25/00, G10L17/24, G10L25/51, G06F21/46, G06F21/32, G10L15/08, G10L15/22

CPC classification: G10L17/24, G06F21/32, G06F21/46, G10L15/06, G10L15/08, G10L15/22, G10L25/51, G10L2015/0638, G10L2015/088, G10L2015/225

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining hotword suitability. In one aspect, a method includes receiving speech data that encodes a candidate hotword spoken by a user, evaluating the speech data or a transcription of the candidate hotword, using one or more predetermined criteria, generating a hotword suitability score for the candidate hotword based on evaluating the speech data or a transcription of the candidate hotword, using one or more predetermined criteria, and providing a representation of the hotword suitability score for display to the user.

4.

Granted patent
Context based language model selection (in force)

Publication No.: US09047870B2

Publication date: 2015-06-02

Application No.: US13249181

Filing date: 2011-09-29

Applicants: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen, Michael D. Riley

Inventors: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen, Michael D. Riley

IPC classification: G10L15/00, G10L15/28, G10L15/18, G10L15/26, G10L15/30, G10L15/183, G10L15/197

CPC classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228

Abstract: Methods, computer program products and systems are described for speech-to-text conversion. A voice input is received from a user of an electronic device and contextual metadata is received that describes a context of the electronic device at a time when the voice input is received. Multiple base language models are identified, where each base language model corresponds to a distinct textual corpus of content. Using the contextual metadata, an interpolated language model is generated based on contributions from the base language models. The contributions are weighted according to a weighting for each of the base language models. The interpolated language model is used to convert the received voice input to a textual output. The voice input is received at a computer server system that is remote to the electronic device. The textual output is transmitted to the electronic device.
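
The weighted interpolation described in the abstract above can be illustrated with a minimal sketch. It is not the patented implementation: the unigram models, the toy corpora, the model names "sms" and "search", and the weights_for mapping from context to weights are all assumptions made for illustration.

```python
from collections import Counter


def unigram_model(corpus):
    """Maximum-likelihood unigram probabilities for a whitespace-tokenized corpus."""
    counts = Counter(corpus.split())
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}


def interpolate(base_models, weights):
    """Linear interpolation: P(w) = sum_i weight_i * P_i(w), weights normalized to sum to 1."""
    norm = sum(weights.values())
    vocab = set().union(*(model.keys() for model in base_models.values()))
    return {
        word: sum(
            weights[name] / norm * model.get(word, 0.0)
            for name, model in base_models.items()
        )
        for word in vocab
    }


# Hypothetical base models, each built from its own toy corpus.
base_models = {
    "sms": unigram_model("call me later call me"),
    "search": unigram_model("weather later today weather"),
}


def weights_for(context):
    """Hypothetical mapping from contextual metadata to per-model weights."""
    if context.get("app") == "messaging":
        return {"sms": 0.75, "search": 0.25}
    return {"sms": 0.25, "search": 0.75}


lm = interpolate(base_models, weights_for({"app": "messaging"}))
```

Here a messaging context shifts probability mass toward the "sms" model, while a word seen in both corpora ("later") receives a contribution from each.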

5.

Patent application
Speech Recognition Language Models (pending, published)

Publication No.: US20110161081A1

Publication date: 2011-06-30

Application No.: US12977017

Filing date: 2010-12-22

Applicants: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen

Inventors: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen

IPC classification: G10L15/06

CPC classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228

Abstract: Methods, computer program products and systems are described for forming a speech recognition language model. Multiple query-website relationships are determined by identifying websites that are determined to be relevant to queries using one or more search engines. Clusters are identified in the query-website relationships by connecting common queries and connecting common websites. A speech recognition language model is created for a particular website based on at least one of analyzing queries in a cluster that includes the website or analyzing webpage content of web pages in the cluster that includes the website.
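
One possible reading of the clustering step in the abstract above is to treat each query-website relationship as an edge in a bipartite graph and take connected components as clusters. The sketch below follows that reading only; the abstract does not specify the algorithm, and the function name, sample queries, and example domains are hypothetical.

```python
from collections import defaultdict


def cluster_relationships(pairs):
    """Group (query, website) pairs into connected components using union-find."""
    parent = {}

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    for query, site in pairs:
        # Tag nodes so a query and a site with the same string stay distinct.
        parent[find(("q", query))] = find(("s", site))

    groups = defaultdict(lambda: {"queries": set(), "sites": set()})
    for node in list(parent):
        kind, value = node
        groups[find(node)]["queries" if kind == "q" else "sites"].add(value)
    return list(groups.values())


# Hypothetical query-website relationships.
pairs = [
    ("pizza near me", "pizza.example"),
    ("best pizza", "pizza.example"),
    ("best pizza", "food.example"),
    ("flight status", "air.example"),
]
clusters = sorted(cluster_relationships(pairs), key=lambda c: len(c["queries"]))
```

Under this reading, the queries in the cluster containing a given website would form the text from which that website's language model is built.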

6.

Granted patent
Multi-modal input on an electronic device (in force)

Publication No.: US09031830B2

Publication date: 2015-05-12

Application No.: US12977003

Filing date: 2010-12-22

Applicants: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, William J. Byrne, Gudmundur Hafsteinsson, Michael J. LeBeau

Inventors: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, William J. Byrne, Gudmundur Hafsteinsson, Michael J. LeBeau

IPC classification: G06F17/20, G06F17/21, G10L15/00, G10L15/04, G10L15/18, G10L15/26, G10L15/30

CPC classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228

Abstract: A computer-implemented input-method editor process includes receiving a request from a user for an application-independent input method editor having written and spoken input capabilities, identifying that the user is about to provide spoken input to the application-independent input method editor, and receiving a spoken input from the user. The spoken input corresponds to input to an application and is converted to text that represents the spoken input. The text is provided as input to the application.

7.

Patent application
Virtual Participant-based Real-Time Translation and Transcription System for Audio and Video Teleconferences (in force)

Publication No.: US20130226557A1

Publication date: 2013-08-29

Application No.: US13459293

Filing date: 2012-04-30

Applicants: Jakob David Uszkoreit, Ashish Venugopal, Johan Schalkwyk, Joshua James Estelle

Inventors: Jakob David Uszkoreit, Ashish Venugopal, Johan Schalkwyk, Joshua James Estelle

IPC classification: G06F17/28, G01K15/00

CPC classification: G06F17/289, G10L15/005, H04M3/568, H04N7/155

Abstract: The present disclosure describes a teleconferencing system that may use a virtual participant processor to translate language content of the teleconference into each participant's spoken language without additional user inputs. The virtual participant processor may connect to the teleconference as do the other participants. The virtual participant processor may intercept all text or audio data that was previously exchanged between the participants. Upon obtaining a partial or complete language recognition result or making a language preference determination, the virtual participant processor may call a translation engine appropriate for each of the participants. The virtual participant processor may send the resulting translation to a teleconference management processor. The teleconference management processor may deliver the respective translated text or audio data to the appropriate participant.

8.

Patent application
Speech to Text Conversion (in force)

Publication No.: US20120022867A1

Publication date: 2012-01-26

Application No.: US13249181

Filing date: 2011-09-29

Applicants: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen, Michael D. Riley

Inventors: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen, Michael D. Riley

IPC classification: G10L15/26, G06F17/30

CPC classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228

Abstract: Methods, computer program products and systems are described for speech-to-text conversion. A voice input is received from a user of an electronic device and contextual metadata is received that describes a context of the electronic device at a time when the voice input is received. Multiple base language models are identified, where each base language model corresponds to a distinct textual corpus of content. Using the contextual metadata, an interpolated language model is generated based on contributions from the base language models. The contributions are weighted according to a weighting for each of the base language models. The interpolated language model is used to convert the received voice input to a textual output. The voice input is received at a computer server system that is remote to the electronic device. The textual output is transmitted to the electronic device.

9.

Patent application
Multi-Modal Input on an Electronic Device (in force)

Publication No.: US20120022853A1

Publication date: 2012-01-26

Application No.: US13249172

Filing date: 2011-09-29

Applicants: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, William J. Byrne, Gudmundur Hafsteinsson, Michael J. LeBeau

Inventors: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, William J. Byrne, Gudmundur Hafsteinsson, Michael J. LeBeau

IPC classification: G10L15/26, G06F17/20

CPC classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228

Abstract: A computer-implemented input-method editor process includes receiving a request from a user for an application-independent input method editor having written and spoken input capabilities, identifying that the user is about to provide spoken input to the application-independent input method editor, and receiving a spoken input from the user. The spoken input corresponds to input to an application and is converted to text that represents the spoken input. The text is provided as input to the application.

10.

Patent application
Mobile dictation correction user interface (pending, published)

Publication No.: US20060149551A1

Publication date: 2006-07-06

Application No.: US11316347

Filing date: 2005-12-22

Applicants: William Ganong, Johan Schalkwyk

Inventors: William Ganong, Johan Schalkwyk

IPC classification: G10L11/00

CPC classification: G10L15/22, G10L15/30

Abstract: A method of speech recognition is described for use with mobile user devices. A speech signal representative of input speech is forwarded from a mobile user device to a remote server. At the mobile user device, a speech recognition result representative of the speech signal is received from the remote server. The speech recognition result includes alternate recognition hypotheses associated with one or more portions of the speech recognition result. A user correction selection representing a portion of the speech recognition result is obtained from the user. The user is presented with selected alternate recognition hypotheses associated with the user correction selection. A user-chosen one of the selected alternate recognition hypotheses is substituted for the user correction selection to form a corrected speech recognition result.
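
The correction flow in the abstract above (select a portion, view its alternates, substitute the chosen one) can be sketched as follows. The Segment structure, the function names, and the sample sentence are assumptions for illustration; the abstract does not describe how the recognition result is actually represented.

```python
from dataclasses import dataclass, field


@dataclass
class Segment:
    """One portion of a recognition result plus its alternate hypotheses (hypothetical layout)."""
    text: str
    alternates: list = field(default_factory=list)


def alternates_for(result, index):
    """Alternates to present once the user selects the segment at `index`."""
    return result[index].alternates


def apply_correction(result, index, choice):
    """Return a new result with the user-chosen alternate substituted at `index`."""
    selected = result[index]
    if choice not in selected.alternates:
        raise ValueError(f"{choice!r} is not an alternate for {selected.text!r}")
    remaining = [selected.text] + [a for a in selected.alternates if a != choice]
    corrected = list(result)
    corrected[index] = Segment(choice, remaining)
    return corrected


def render(result):
    return " ".join(segment.text for segment in result)


# Stand-in for a result received from the remote server.
result = [Segment("meet"), Segment("hear", ["here", "her"]), Segment("tomorrow")]
corrected = apply_correction(result, 1, "here")
```

Keeping the displaced text among the alternates is a design choice of this sketch: it lets the user undo a correction through the same selection step.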
