-
Publication No.: US20120022867A1
Publication Date: 2012-01-26
Application No.: US13249181
Filing Date: 2011-09-29
Applicant: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen, Michael D. Riley
Inventor: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen, Michael D. Riley
CPC Classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228
Abstract: Methods, computer program products and systems are described for speech-to-text conversion. A voice input is received from a user of an electronic device, and contextual metadata is received that describes a context of the electronic device at the time the voice input is received. Multiple base language models are identified, where each base language model corresponds to a distinct textual corpus of content. Using the contextual metadata, an interpolated language model is generated based on contributions from the base language models. The contributions are weighted according to a weighting for each of the base language models. The interpolated language model is used to convert the received voice input to a textual output. The voice input is received at a computer server system that is remote from the electronic device. The textual output is transmitted to the electronic device.
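The weighting step described in this abstract can be illustrated with a minimal sketch. It assumes linear interpolation over toy unigram models, which is one common way to combine language models; the abstract itself only says that contributions are weighted. The model names, the `weights_for_context` mapping, and the `app` metadata key are all hypothetical.

```python
from typing import Dict

# Toy unigram language model: word -> probability (an illustrative simplification).
LanguageModel = Dict[str, float]


def interpolate(base_models: Dict[str, LanguageModel],
                weights: Dict[str, float]) -> LanguageModel:
    """Linearly interpolate base models: P(w) = sum_i lambda_i * P_i(w)."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    vocab = set()
    for model in base_models.values():
        vocab.update(model)
    # Normalise the weights so the result is still a probability distribution.
    return {
        word: sum(weights.get(name, 0.0) / total * model.get(word, 0.0)
                  for name, model in base_models.items())
        for word in vocab
    }


def weights_for_context(metadata: Dict[str, str]) -> Dict[str, float]:
    """Hypothetical mapping from contextual metadata to per-model weights."""
    if metadata.get("app") == "maps":
        return {"web_search": 0.2, "maps_queries": 0.8}
    return {"web_search": 0.7, "maps_queries": 0.3}


# Each base model stands in for a distinct textual corpus.
base_models = {
    "web_search": {"pizza": 0.5, "weather": 0.5},
    "maps_queries": {"pizza": 0.25, "directions": 0.75},
}

mixed = interpolate(base_models, weights_for_context({"app": "maps"}))
for word in sorted(mixed):
    print(f"{word}: {mixed[word]:.2f}")
```

With the maps context, the corpus of map queries dominates, so "directions" receives most of the probability mass in the interpolated model.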
-
Publication No.: US09047870B2
Publication Date: 2015-06-02
Application No.: US13249181
Filing Date: 2011-09-29
Applicant: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen, Michael D. Riley
Inventor: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen, Michael D. Riley
IPC Classification: G10L15/00, G10L15/28, G10L15/18, G10L15/26, G10L15/30, G10L15/183, G10L15/197
CPC Classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228
Abstract: Methods, computer program products and systems are described for speech-to-text conversion. A voice input is received from a user of an electronic device, and contextual metadata is received that describes a context of the electronic device at the time the voice input is received. Multiple base language models are identified, where each base language model corresponds to a distinct textual corpus of content. Using the contextual metadata, an interpolated language model is generated based on contributions from the base language models. The contributions are weighted according to a weighting for each of the base language models. The interpolated language model is used to convert the received voice input to a textual output. The voice input is received at a computer server system that is remote from the electronic device. The textual output is transmitted to the electronic device.
-
Publication No.: US20110161081A1
Publication Date: 2011-06-30
Application No.: US12977017
Filing Date: 2010-12-22
IPC Classification: G10L15/06
CPC Classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228
Abstract: Methods, computer program products and systems are described for forming a speech recognition language model. Multiple query-website relationships are determined by identifying websites that are determined to be relevant to queries using one or more search engines. Clusters are identified in the query-website relationships by connecting common queries and connecting common websites. A speech recognition language model is created for a particular website based on at least one of analyzing queries in a cluster that includes the website or analyzing the content of web pages in the cluster that includes the website.
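The clustering step in this abstract can be sketched as finding connected components in a graph that links each query to the websites judged relevant to it. This is only one plausible reading of "connecting common queries and connecting common websites"; the union-find approach, the function names, and the example data are assumptions for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set, Tuple


def cluster_relationships(pairs: Iterable[Tuple[str, str]]) -> List[Dict[str, Set[str]]]:
    """Group (query, website) pairs into clusters that share queries or websites."""
    parent: Dict[Tuple[str, str], Tuple[str, str]] = {}

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    # Tag nodes so a query and a website with the same text never collide.
    for query, site in pairs:
        union(("q", query), ("s", site))

    clusters = defaultdict(lambda: {"queries": set(), "sites": set()})
    for node in list(parent):
        kind, value = node
        clusters[find(node)]["queries" if kind == "q" else "sites"].add(value)
    return list(clusters.values())


def training_queries_for(site: str, clusters: List[Dict[str, Set[str]]]) -> List[str]:
    """Return the queries in the cluster containing the site (language-model training text)."""
    for cluster in clusters:
        if site in cluster["sites"]:
            return sorted(cluster["queries"])
    return []


# Hypothetical query-website relationships, as a search engine might report them.
pairs = [
    ("pizza near me", "pizzahut.example"),
    ("pizza delivery", "pizzahut.example"),
    ("pizza delivery", "dominos.example"),
    ("weather today", "weather.example"),
]
clusters = cluster_relationships(pairs)
print(training_queries_for("dominos.example", clusters))
```

Because "pizza delivery" links both pizza sites, the cluster for dominos.example also contains "pizza near me", even though that query was never directly associated with it.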
-
Publication No.: US20120022873A1
Publication Date: 2012-01-26
Application No.: US13249180
Filing Date: 2011-09-29
IPC Classification: G10L21/00
CPC Classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228
Abstract: Methods, computer program products and systems are described for forming a speech recognition language model. Multiple query-website relationships are determined by identifying websites that are determined to be relevant to queries using one or more search engines. Clusters are identified in the query-website relationships by connecting common queries and connecting common websites. A speech recognition language model is created for a particular website based on at least one of analyzing queries in a cluster that includes the website or analyzing the content of web pages in the cluster that includes the website.
-
Publication No.: US09495127B2
Publication Date: 2016-11-15
Application No.: US12976920
Filing Date: 2010-12-22
IPC Classification: G10L15/00, G06F3/16, G10L15/183, G10L15/26, G06F17/28, G10L15/30, G10L15/18, G06F3/0488, G06F17/27, G10L15/22, G10L15/197
CPC Classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228
Abstract: Methods, computer program products and systems are described for converting speech to text. Sound information is received at a computer server system from an electronic device, where the sound information is from a user of the electronic device. A context identifier indicates a context within which the user provided the sound information. The context identifier is used to select, from among multiple language models, a language model appropriate for the context. Speech in the sound information is converted to text using the selected language model. The text is provided for use by the electronic device.
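The selection step in this abstract can be sketched as a lookup keyed by the context identifier, followed by rescoring candidate transcripts with the chosen model. The abstract does not say how contexts map to models or how decoding works, so the registry, the fallback to a "general" model, the unigram scoring, and the probability floor below are all hypothetical.

```python
import math
from typing import Dict, List

# Toy unigram language model: word -> probability (an illustrative simplification).
LanguageModel = Dict[str, float]


def select_language_model(context_id: str,
                          models: Dict[str, LanguageModel],
                          default_id: str = "general") -> LanguageModel:
    """Pick the model registered for the context, falling back to a default model."""
    return models.get(context_id, models[default_id])


def pick_transcript(candidates: List[str],
                    model: LanguageModel,
                    floor: float = 1e-6) -> str:
    """Return the candidate with the highest log-probability under the model."""
    def score(text: str) -> float:
        # Unseen words get a small floor probability instead of zero.
        return sum(math.log(model.get(word, floor)) for word in text.split())
    return max(candidates, key=score)


# Hypothetical per-context models.
models = {
    "general": {"turn": 0.3, "right": 0.4, "write": 0.3},
    "maps": {"turn": 0.4, "right": 0.4, "left": 0.2},
    "email": {"write": 0.5, "reply": 0.3, "turn": 0.2},
}

# Two acoustically similar hypotheses for the same utterance.
candidates = ["turn right", "turn write"]

for context_id in ("maps", "email", "unknown-app"):
    model = select_language_model(context_id, models)
    print(context_id, "->", pick_transcript(candidates, model))
```

The same audio hypotheses resolve differently depending on the context identifier, which is the effect the abstract describes.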
-
Publication No.: US20120022866A1
Publication Date: 2012-01-26
Application No.: US13249175
Filing Date: 2011-09-29
IPC Classification: G10L15/26
CPC Classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228
Abstract: Methods, computer program products and systems are described for converting speech to text. Sound information is received at a computer server system from an electronic device, where the sound information is from a user of the electronic device. A context identifier indicates a context within which the user provided the sound information. The context identifier is used to select, from among multiple language models, a language model appropriate for the context. Speech in the sound information is converted to text using the selected language model. The text is provided for use by the electronic device.
-
Publication No.: US20110161080A1
Publication Date: 2011-06-30
Application No.: US12976972
Filing Date: 2010-12-22
Applicant: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen, Michael D. Riley
Inventor: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, Cyril Georges Luc Allauzen, Michael D. Riley
CPC Classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228
Abstract: Methods, computer program products and systems are described for speech-to-text conversion. A voice input is received from a user of an electronic device, and contextual metadata is received that describes a context of the electronic device at the time the voice input is received. Multiple base language models are identified, where each base language model corresponds to a distinct textual corpus of content. Using the contextual metadata, an interpolated language model is generated based on contributions from the base language models. The contributions are weighted according to a weighting for each of the base language models. The interpolated language model is used to convert the received voice input to a textual output. The voice input is received at a computer server system that is remote from the electronic device. The textual output is transmitted to the electronic device.
-
Publication No.: US20110153324A1
Publication Date: 2011-06-23
Application No.: US12976920
Filing Date: 2010-12-22
IPC Classification: G10L15/26
CPC Classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228
Abstract: Methods, computer program products and systems are described for converting speech to text. Sound information is received at a computer server system from an electronic device, where the sound information is from a user of the electronic device. A context identifier indicates a context within which the user provided the sound information. The context identifier is used to select, from among multiple language models, a language model appropriate for the context. Speech in the sound information is converted to text using the selected language model. The text is provided for use by the electronic device.
-
Publication No.: US08751217B2
Publication Date: 2014-06-10
Application No.: US13249172
Filing Date: 2011-09-29
Applicant: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, William J. Byrne, Gudmundur Hafsteinsson, Michael J. LeBeau
Inventor: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, William J. Byrne, Gudmundur Hafsteinsson, Michael J. LeBeau
IPC Classification: G06F17/20
CPC Classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228
Abstract: A computer-implemented input method editor process includes receiving a request from a user for an application-independent input method editor having written and spoken input capabilities, identifying that the user is about to provide spoken input to the application-independent input method editor, and receiving a spoken input from the user. The spoken input corresponds to input to an application and is converted to text that represents the spoken input. The text is provided as input to the application.
-
Publication No.: US09031830B2
Publication Date: 2015-05-12
Application No.: US12977003
Filing Date: 2010-12-22
Applicant: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, William J. Byrne, Gudmundur Hafsteinsson, Michael J. LeBeau
Inventor: Brandon M. Ballinger, Johan Schalkwyk, Michael H. Cohen, William J. Byrne, Gudmundur Hafsteinsson, Michael J. LeBeau
CPC Classification: G06F3/167, G06F3/04886, G06F17/277, G06F17/289, G10L15/005, G10L15/18, G10L15/183, G10L15/197, G10L15/22, G10L15/26, G10L15/265, G10L15/30, G10L2015/223, G10L2015/228
Abstract: A computer-implemented input method editor process includes receiving a request from a user for an application-independent input method editor having written and spoken input capabilities, identifying that the user is about to provide spoken input to the application-independent input method editor, and receiving a spoken input from the user. The spoken input corresponds to input to an application and is converted to text that represents the spoken input. The text is provided as input to the application.