专利检索 ap:("Shilei Zhang" OR "Shenghua Bao" OR "Wen Liu" OR "Yong Qin" OR "Zhiwei Shuang" OR "Jian Chen" OR "Zhong Su" OR "Qin Shi" OR "William F. Ganong, III") AND inv:"William F. Ganong, III" 第 1 页

1.

发明授权
Speaker and call characteristic sensitive open voice search 有权
标题翻译：扬声器和呼叫特性敏感开放语音搜索

公开(公告)号：US08630860B1

公开(公告)日：2014-01-14

申请号：US13039467

申请日：2011-03-03

申请人： Shilei Zhang , Shenghua Bao , Wen Liu , Yong Qin , Zhiwei Shuang , Jian Chen , Zhong Su , Qin Shi , William F. Ganong, III

发明人： Shilei Zhang , Shenghua Bao , Wen Liu , Yong Qin , Zhiwei Shuang , Jian Chen , Zhong Su , Qin Shi , William F. Ganong, III

IPC分类号： G06F17/27 , G10L15/00 , G10L15/26 , G10L17/00 , G10L21/00 , G10L25/00 , G10L15/04 , G06F7/00 , G06F17/30

CPC分类号： G10L15/26 , G06F17/30026 , G06F17/3053 , G06F17/30705 , G06F17/30867 , G10L15/18 , G10L15/1807 , G10L15/183 , G10L15/22

摘要： Techniques disclosed herein include systems and methods for open-domain voice-enabled searching that is speaker sensitive. Techniques include using speech information, speaker information, and information associated with a spoken query to enhance open voice search results. This includes integrating a textual index with a voice index to support the entire search cycle. Given a voice query, the system can execute two matching processes simultaneously. This can include a text matching process based on the output of speech recognition, as well as a voice matching process based on characteristics of a caller or user voicing a query. Characteristics of the caller can include output of voice feature extraction and metadata about the call. The system clusters callers according to these characteristics. The system can use specific voice and text clusters to modify speech recognition results, as well as modifying search results.

摘要翻译： 本文公开的技术包括用于开放式语音启用搜索的扬声器敏感的系统和方法。技术包括使用语音信息，说话者信息和与口语查询相关联的信息来增强开放语音搜索结果。这包括将文本索引与语音索引集成以支持整个搜索周期。给定语音查询，系统可以同时执行两个匹配过程。这可以包括基于语音识别输出的文本匹配过程，以及基于呼叫者或用户发出查询的特征的语音匹配过程。呼叫者的特征可以包括语音特征提取的输出和关于呼叫的元数据。根据这些特点，系统集群呼叫者。系统可以使用特定的语音和文本集群来修改语音识别结果，以及修改搜索结果。

2.

发明申请
SPEAKER AND CALL CHARACTERISTIC SENSITIVE OPEN VOICE SEARCH 有权
标题翻译：扬声器和呼叫特征敏感开放语音搜索

公开(公告)号：US20140129220A1

公开(公告)日：2014-05-08

申请号：US14152136

申请日：2014-01-10

申请人： Shilei Zhang , Shenghua Bao , Wen Liu , Yong Qin , Zhiwei Shuang , Jian Chen , Zhong Su , Qin Shi , William F. Ganong, III

发明人： Shilei Zhang , Shenghua Bao , Wen Liu , Yong Qin , Zhiwei Shuang , Jian Chen , Zhong Su , Qin Shi , William F. Ganong, III

IPC分类号： G10L15/26

CPC分类号： G10L15/26 , G06F17/30026 , G06F17/3053 , G06F17/30705 , G06F17/30867 , G10L15/18 , G10L15/1807 , G10L15/183 , G10L15/22

摘要： Techniques disclosed herein include systems and methods for open-domain voice-enabled searching that is speaker sensitive. Techniques include using speech information, speaker information, and information associated with a spoken query to enhance open voice search results. This includes integrating a textual index with a voice index to support the entire search cycle. Given a voice query, the system can execute two matching processes simultaneously. This can include a text matching process based on the output of speech recognition, as well as a voice matching process based on characteristics of a caller or user voicing a query. Characteristics of the caller can include output of voice feature extraction and metadata about the call. The system clusters callers according to these characteristics. The system can use specific voice and text clusters to modify speech recognition results, as well as modifying search results.

摘要翻译： 本文公开的技术包括用于开放式语音启用搜索的扬声器敏感的系统和方法。技术包括使用语音信息，说话者信息和与口语查询相关联的信息来增强开放语音搜索结果。这包括将文本索引与语音索引集成以支持整个搜索周期。给定语音查询，系统可以同时执行两个匹配过程。这可以包括基于语音识别输出的文本匹配过程，以及基于呼叫者或用户发出查询的特征的语音匹配过程。呼叫者的特征可以包括语音特征提取的输出和关于呼叫的元数据。根据这些特点，系统集群呼叫者。系统可以使用特定的语音和文本集群来修改语音识别结果，以及修改搜索结果。

3.

发明授权
Methods and apparatus for correcting recognition errors 有权

公开(公告)号：US10522133B2

公开(公告)日：2019-12-31

申请号：US13479010

申请日：2012-05-23

申请人： Martin Labsky , Jan Kleindienst , Tomas Macek , David Nahamoo , Jan Curin , William F. Ganong, III

发明人： Martin Labsky , Jan Kleindienst , Tomas Macek , David Nahamoo , Jan Curin , William F. Ganong, III

IPC分类号： G10L13/08 , G10L17/00 , G10L13/00 , G10L21/06 , G10L15/14 , G06F17/27 , G10L15/26 , G10L15/30 , G10L15/18 , G10L15/01 , G06F17/21 , G10L15/32 , G10L15/06 , G10L15/28 , G10L15/02

摘要： Techniques for error correction using a history list comprising at least one misrecognition and correction information associated with each of the at least one misrecognitions indicating how a user corrected the associated misrecognition. The techniques include converting data input from a user to generate a text segment, determining whether at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, if the at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, obtaining the correction information associated with the at least one misrecognition, and correcting the at least a portion of the text segment based, at least in part, on the correction information.

4.

发明授权
Method and apparatus for processing spoken search queries 有权
标题翻译：用于处理口语搜索查询的方法和装置

公开(公告)号：US08666963B2

公开(公告)日：2014-03-04

申请号：US13527500

申请日：2012-06-19

申请人： Vladimir Sejnoha , William F. Ganong, III , Paul J. Vozila , Nathan M. Bodenstab , Yik-Cheung Tam

发明人： Vladimir Sejnoha , William F. Ganong, III , Paul J. Vozila , Nathan M. Bodenstab , Yik-Cheung Tam

IPC分类号： G06F7/00 , G06F17/30

CPC分类号： G06F17/30976

摘要： Some embodiments relate to a method of performing a search for content on the Internet, in which a user may speak a search query and speech recognition may be performed on the spoken query to generate a text search query to be provided to a plurality of search engines. This enables a user to speak the search query rather than having to type it, and also allows the user to provide the search query only once, rather than having to provide it separately to multiple different search engines.

摘要翻译： 一些实施例涉及对互联网上的内容执行搜索的方法，其中用户可以在其中说出搜索查询，并且可以在口头查询上执行语音识别以生成要提供给多个搜索引擎的文本搜索查询。这使得用户能够说出搜索查询而不必输入搜索查询，并且还允许用户仅提供一次搜索查询，而不必单独提供给多个不同的搜索引擎。

5.

发明申请
SPEAKER VERIFICATION METHODS AND APPARATUS 有权
标题翻译：扬声器验证方法和设备

公开(公告)号：US20120239398A1

公开(公告)日：2012-09-20

申请号：US13442170

申请日：2012-04-09

申请人： Kevin R. Farrell , David A. James , William F. Ganong, III , Jerry K. Carter

发明人： Kevin R. Farrell , David A. James , William F. Ganong, III , Jerry K. Carter

IPC分类号： G10L17/00

CPC分类号： G10L17/24 , G10L17/04 , G10L17/20

摘要： In one aspect, a method for determining a validity of an identity asserted by a speaker using a voice print is provided. The method comprises acts of performing a first verification stage comprising comparing a first voice signal from the speaker uttering at least one first challenge utterance-with at least a portion of the voice print and performing a second verification stage if it is concluded in the first verification stage that the first voice signal was obtained from an utterance by the user. The second verification stage comprises adapting at least one parameter of the voice print based, at least in part, on the first voice signal to obtain an adapted voice print, and comparing a second voice signal from the speaker uttering at least one second challenge utterance with at least a portion of the adapted voice print.

摘要翻译： 在一方面，提供了一种用于确定由使用语音打印的扬声器所确定的身份的有效性的方法。该方法包括执行第一验证阶段的动作，包括将来自扬声器的第一语音信号与至少一个第一挑战话语 - 与语音打印的至少一部分进行比较，并且如果在第一验证中得出结论，则执行第二验证阶段第一语音信号是由用户的话语获得的。第二验证阶段包括至少部分地基于第一语音信号来调整语音印刷的至少一个参数以获得适应的语音印刷，并且将来自扬声器的第二语音信号与至少一个第二挑战话语与至少一部分适应的语音打印。

6.

发明授权
Method and apparatus for processing spoken search queries 有权
标题翻译：用于处理口语搜索查询的方法和装置

公开(公告)号：US08239366B2

公开(公告)日：2012-08-07

申请号：US12877549

申请日：2010-09-08

申请人： Vladimir Sejnoha , William F. Ganong, III , Paul J. Vozila , Nathan M. Bodenstab , Yik-Cheung Tam

发明人： Vladimir Sejnoha , William F. Ganong, III , Paul J. Vozila , Nathan M. Bodenstab , Yik-Cheung Tam

IPC分类号： G06F7/00 , G06F17/30

CPC分类号： G06F17/30976

摘要： Some embodiments relate to a method of performing a search for content on the Internet, in which a user may speak a search query and speech recognition may be performed on the spoken query to generate a text search query to be provided to a plurality of search engines. This enables a user to speak the search query rather than having to type it, and also allows the user to provide the search query only once, rather than having to provide it separately to multiple different search engines.

摘要翻译： 一些实施例涉及对互联网上的内容执行搜索的方法，其中用户可以在其中说出搜索查询，并且可以在口头查询上执行语音识别以生成要提供给多个搜索引擎的文本搜索查询。这使得用户能够说出搜索查询而不必输入搜索查询，并且还允许用户仅提供一次搜索查询，而不必单独提供给多个不同的搜索引擎。

7.

发明授权
System and method for modeless large vocabulary speech recognition 有权
标题翻译：无模式大词汇语音识别的系统和方法

公开(公告)号：US06292779B1

公开(公告)日：2001-09-18

申请号：US09267925

申请日：1999-03-09

申请人： Brian Wilson , Manfred Grabherr , Ramesh Sarukkai , William F. Ganong, III

发明人： Brian Wilson , Manfred Grabherr , Ramesh Sarukkai , William F. Ganong, III

IPC分类号： G10L1514

CPC分类号： G10L15/26 , G10L15/193 , G10L2015/223 , G10L2015/228

摘要： A modeless large vocabulary continuous speech recognition system is provided that represents an input utterance as a sequence of input vectors. The system includes a common library of acoustic model states for arrangement in sequences that form acoustic models. Each acoustic model is composed of a sequence of segment models and each segment model is composed of a sequence of model states. An input processor compares each vector in a sequence of input vectors to a set of model states in the common library to produce a match score for each model state in the set, reflecting the likelihood that a state is represented by a vector. The system also includes a plurality of recognition modules and associated recognition grammars. The recognition modules operate in parallel and use the match scores with the acoustic models to determine at least one recognition result in each of the recognition modules. The recognition modules includes a dictation module for producing at least one probable dictation recognition result, a select module for recognizing a portion of visually displayed text for processing with a command, and a command module for producing at least one probable command recognition result. An arbitrator uses an arbitration algorithm and a score ordered queue of recognition results, together with their associated recognition modules, to compare the recognition results of the recognition modules to select at least one system recognition result.

摘要翻译： 提供了一种无噪声大词汇连续语音识别系统，其表示输入语音作为输入向量的序列。该系统包括用于形成声学模型的序列中布置的声学模型状态的公共库。每个声学模型由段模型序列组成，每个分段模型由一系列模型状态组成。输入处理器将输入向量序列中的每个向量与公共库中的一组模型状态进行比较，以产生该集合中每个模型状态的匹配分数，反映状态由向量表示的可能性。该系统还包括多个识别模块和相关联的识别语法。识别模块并行运行，并使用声学模型的匹配分数来确定每个识别模块中的至少一个识别结果。识别模块包括用于产生至少一个可能的听写识别结果的听写模块，用于识别用于用命令处理的视觉显示文本的一部分的选择模块，以及用于产生至少一个可能的命令识别结果的命令模块。仲裁员使用仲裁算法和识别结果的得分排序队列及其相关联的识别模块来比较识别模块的识别结果以选择至少一个系统识别结果。

8.

发明授权
Detecting potential significant errors in speech recognition results 有权
标题翻译：检测语音识别结果中潜在的重大错误

公开(公告)号：US09064493B2

公开(公告)日：2015-06-23

申请号：US13544279

申请日：2012-07-09

申请人： William F. Ganong, III , Raghu Vemula , Robert Fleming

发明人： William F. Ganong, III , Raghu Vemula , Robert Fleming

IPC分类号： G10L15/22 , G10L15/18 , G10L15/08

CPC分类号： G10L15/01 , G10L15/08 , G10L15/1815 , G10L2015/085

摘要： In some embodiments, the recognition results produced by a speech processing system (which may include a top recognition result and one or more alternative recognition results) based on an analysis of a speech input, are evaluated for indications of potential significant errors. In some embodiments, the recognition results may be evaluated to determine whether a meaning of any of the alternative recognition results differs from a meaning of the top recognition result in a manner that is significant for the domain. In some embodiments, one or more of the recognition results may be evaluated to determine whether the result(s) include one or more words or phrases that, when included in a result, would change a meaning of the result in a manner that would be significant for the domain.

摘要翻译： 在一些实施例中，基于语音输入的分析，由语音处理系统（其可以包括顶部识别结果和一个或多个替代识别结果）产生的识别结果被评估用于潜在重大错误的指示。在一些实施例中，可以评估识别结果以确定任何替代识别结果的含义以对于域有意义的方式与顶部识别结果的含义不同。在一些实施例中，可以评估一个或多个识别结果以确定结果是否包括一个或多个单词或短语，当被包括在结果中时，将以以下方式改变结果的含义：对领域有重要意义

9.

发明授权
Systems and methods for improving the accuracy of a transcription using auxiliary data such as personal data 有权
标题翻译：使用辅助数据（如个人数据）提高转录准确性的系统和方法

公开(公告)号：US09009041B2

公开(公告)日：2015-04-14

申请号：US13190749

申请日：2011-07-26

申请人： George Zavaliagkos , William F. Ganong, III , Uwe H. Jost , Shreedhar Madhavapeddi , Gary B. Clayton

发明人： George Zavaliagkos , William F. Ganong, III , Uwe H. Jost , Shreedhar Madhavapeddi , Gary B. Clayton

IPC分类号： G10L15/00 , G10L15/26 , G10L15/24 , G10L15/22 , G10L15/08 , G10L15/30

CPC分类号： G10L15/26 , G10L15/065 , G10L15/08 , G10L15/24 , G10L15/30 , G10L2015/227

摘要： A method is described for improving the accuracy of a transcription generated by an automatic speech recognition (ASR) engine. A personal vocabulary is maintained that includes replacement words. The replacement words in the personal vocabulary are obtained from personal data associated with a user. A transcription is received of an audio recording. The transcription is generated by an ASR engine using an ASR vocabulary and includes a transcribed word that represents a spoken word in the audio recording. Data is received that is associated with the transcribed word. A replacement word from the personal vocabulary is identified, which is used to re-score the transcription and replace the transcribed word.

摘要翻译： 描述了一种用于提高由自动语音识别（ASR）引擎产生的转录的准确性的方法。维护包含替换词的个人词汇。个人词汇中的替换词是从与用户相关联的个人数据获得的。接收到录音的录音。转录由使用ASR词汇的ASR引擎生成，并且包括表示音频记录中的口语单词的转录词。收到与转录词相关联的数据。识别出个人词汇中的替代词，用于重新得分转录并替换转录词。

10.

发明申请
DETECTING POTENTIAL SIGNIFICANT ERRORS IN SPEECH RECOGNITION RESULTS 有权
标题翻译：检测语音识别结果中的潜在重要错误

公开(公告)号：US20140012580A1

公开(公告)日：2014-01-09

申请号：US13544279

申请日：2012-07-09

申请人： William F. Ganong, III , Raghu Vemula , Robert Fleming

发明人： William F. Ganong, III , Raghu Vemula , Robert Fleming

IPC分类号： G10L15/18

CPC分类号： G10L15/01 , G10L15/08 , G10L15/1815 , G10L2015/085

摘要： In some embodiments, the recognition results produced by a speech processing system (which may include a top recognition result and one or more alternative recognition results) based on an analysis of a speech input, are evaluated for indications of potential significant errors. In some embodiments, the recognition results may be evaluated to determine whether a meaning of any of the alternative recognition results differs from a meaning of the top recognition result in a manner that is significant for the domain. In some embodiments, one or more of the recognition results may be evaluated to determine whether the result(s) include one or more words or phrases that, when included in a result, would change a meaning of the result in a manner that would be significant for the domain.

摘要翻译： 在一些实施例中，基于语音输入的分析，由语音处理系统（其可以包括顶部识别结果和一个或多个替代识别结果）产生的识别结果被评估用于潜在重大错误的指示。在一些实施例中，可以评估识别结果以确定任何替代识别结果的含义以对于域有意义的方式与顶部识别结果的含义不同。在一些实施例中，可以评估一个或多个识别结果以确定结果是否包括一个或多个单词或短语，当被包括在结果中时，将以以下方式改变结果的含义：对领域有重要意义

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类