-
Publication No.: US07647225B2
Publication date: 2010-01-12
Application No.: US11561568
Filing date: 2006-11-20
CPC classification: G10L17/22, G06F17/289, G06F17/3043, G10L15/005, G10L15/142, G10L15/18, G10L15/183, G10L15/22, G10L15/30, Y10S707/99935
Abstract: A real-time speech recognition system includes distributed processing across a client and server for recognizing a spoken query by a user. Both the client and server can dedicate a variable number of processing resources for performing speech recognition functions. The partitioning of responsibility for speech recognition operations can be done on a client-by-client or connection-by-connection basis.
-
Publication No.: US07139714B2
Publication date: 2006-11-21
Application No.: US11030864
Filing date: 2005-01-07
CPC classification: G10L17/22, G06F17/289, G06F17/3043, G10L15/005, G10L15/142, G10L15/18, G10L15/183, G10L15/22, G10L15/30, Y10S707/99935
Abstract: A real-time speech recognition system includes distributed processing across a client and server for recognizing a spoken query by a user. Both the client and server can dedicate a variable number of processing resources for performing speech recognition functions. In some implementations, the partitioning of responsibility for speech recognition operations can be done on a client-by-client or query-by-query basis.
-
Publication No.: US06615172B1
Publication date: 2003-09-02
Application No.: US09439060
Filing date: 1999-11-12
IPC classification: G10L15/18
CPC classification: G06F17/2775, G06F17/3043, G06F17/30737, G10L15/18, G10L15/30
Abstract: An intelligent query system for processing voice-based queries is disclosed. This distributed client-server system, typically implemented on an intranet or over the Internet, accepts a user's queries at his/her computer, PDA or workstation using a speech input interface. After converting the user's query from speech to text, a two-step algorithm employing a natural language engine, a database processor and a full-text SQL database is implemented to find a single answer that best matches the user's query. The system, as implemented, accepts environmental variables selected by the user and is scalable to provide answers to a variety and quantity of user-initiated queries.
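Illustrative sketch (not taken from the patent): the two-step lookup described in this abstract can be pictured as a broad SQL search that returns a set of candidate stored questions, followed by a narrowing step that picks a single answer. Everything below is an assumption made for illustration: the `qa` table layout, the `LIKE`-based keyword search standing in for a full-text SQL query, and the Jaccard word-overlap score standing in for the natural language engine.

```python
import re
import sqlite3

# Hypothetical stopword list; the abstract does not specify one.
STOPWORDS = {"a", "an", "the", "is", "of", "what", "how", "do", "i", "to", "in"}


def tokenize(text):
    """Return lowercase word tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z0-9]+", text.lower()) if w not in STOPWORDS]


def build_db(pairs):
    """Create an in-memory table of (stored question, answer) pairs."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE qa (question TEXT, answer TEXT)")
    conn.executemany("INSERT INTO qa VALUES (?, ?)", pairs)
    return conn


def candidate_recordset(conn, query):
    """Step 1: fetch every stored question sharing at least one query term."""
    terms = tokenize(query)
    if not terms:
        return []
    where = " OR ".join("lower(question) LIKE ?" for _ in terms)
    params = [f"%{t}%" for t in terms]
    sql = f"SELECT question, answer FROM qa WHERE {where}"
    return conn.execute(sql, params).fetchall()


def best_answer(conn, query):
    """Step 2: narrow the candidates to one answer by word overlap."""
    q = set(tokenize(query))

    def score(row):
        s = set(tokenize(row[0]))
        union = q | s
        return len(q & s) / len(union) if union else 0.0

    rows = candidate_recordset(conn, query)
    if not rows:
        return None
    return max(rows, key=score)[1]
```

With a few stored pairs loaded through `build_db`, calling `best_answer(conn, "how can I reset the password")` returns the answer paired with the closest stored question, or `None` when no stored question shares a term with the query.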
-
Publication No.: US09190063B2
Publication date: 2015-11-17
Application No.: US11932250
Filing date: 2007-10-31
IPC classification: G10L15/18, G10L15/22, G10L15/30, G10L17/22, G06F17/30, G10L15/00, G10L15/183, G06F17/28, G10L15/14
CPC classification: G10L17/22, G06F17/289, G06F17/3043, G10L15/005, G10L15/142, G10L15/18, G10L15/183, G10L15/22, G10L15/30, Y10S707/99935
Abstract: A speech recognition system includes distributed processing across a client and server for recognizing a spoken query by a user. A number of different speech models for different natural languages are used to support and detect a natural language spoken by a user. In some implementations, an interactive electronic agent responds in the user's native language to facilitate a real-time, human-like dialog.
-
Publication No.: US07277854B2
Publication date: 2007-10-02
Application No.: US11031633
Filing date: 2005-01-07
CPC classification: G10L17/22, G06F17/289, G06F17/3043, G10L15/005, G10L15/142, G10L15/18, G10L15/183, G10L15/22, G10L15/30, Y10S707/99935
Abstract: A speech recognition system includes distributed processing across a client and server for recognizing a spoken query by a user. A number of different speech models for different languages are used to support and detect a language spoken by a user. In some implementations, an interactive electronic agent responds in the user's language to facilitate a real-time, human-like dialogue.
-
Publication No.: US06665640B1
Publication date: 2003-12-16
Application No.: US09439173
Filing date: 1999-11-12
IPC classification: G10L15/18
CPC classification: G06F17/3043, G09B5/04, G09B7/00, G10L15/22, Y10S707/99934
Abstract: A real-time speech-based learning/training system distributed between client and server, and incorporating speech recognition and linguistic processing for recognizing a spoken question and providing an answer to the student in a learning or training environment implemented on an intranet or over the Internet, is disclosed. The system accepts the student's question in the form of speech at his or her computer, PDA or workstation, where minimal processing extracts a sufficient number of acoustic speech vectors representing the utterance. The system as implemented accepts environmental variables such as course, chapter and section, as selected by the user, so that the search time, accuracy and response time for the question can be optimized. A minimum set of acoustic vectors extracted at the client is then sent via a communications channel to the server, where additional acoustic vectors are derived. Using Hidden Markov Models (HMMs), and appropriate grammars and dictionaries conditioned by the course, chapter and section selections made by the student, the speech representing the user's query is fully decoded to text at the server. This text corresponding to the user's query is then simultaneously sent to a natural language engine and a database processor, where an optimized SQL statement is constructed for a full-text search from a SQL database for a recordset of several stored questions that best matches the user's query. Further processing in the natural language engine narrows the search down to a single stored question. The answer that is paired to this single stored question is then retrieved from the file path and sent to the student computer in compressed form. At the student's computer, the answer is articulated using a text-to-speech engine in his or her native natural language. The system requires no training and can operate in several natural languages.
-
Publication No.: US06633846B1
Publication date: 2003-10-14
Application No.: US09439145
Filing date: 1999-11-12
IPC classification: G10L15/02
CPC classification: G06F17/3043, G10L15/142, G10L15/1815, G10L15/183, G10L15/285, G10L15/30, H04M2250/74
Abstract: A real-time system incorporating speech recognition and linguistic processing for recognizing a spoken query by a user, and distributed between client and server, is disclosed. The system accepts a user's queries in the form of speech at the client, where minimal processing extracts a sufficient number of acoustic speech vectors representing the utterance. These vectors are sent via a communications channel to the server, where additional acoustic vectors are derived. Using Hidden Markov Models (HMMs), and appropriate grammars and dictionaries conditioned by the selections made by the user, the speech representing the user's query is fully decoded into text (or some other suitable form) at the server. This text corresponding to the user's query is then simultaneously sent to a natural language engine and a database processor, where optimized SQL statements are constructed for a full-text search from a database for a recordset of several stored questions that best matches the user's query. Further processing in the natural language engine narrows the search to a single stored question. The answer corresponding to this single stored question is next retrieved from the file path and sent to the client in compressed form. At the client, the answer to the user's query is articulated to the user using a text-to-speech engine in his or her native natural language. The system requires no training and can operate in several natural languages.
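Illustrative sketch (not taken from the patent): the client/server split described in this abstract, where the client extracts a small set of acoustic vectors and the server derives additional ones, might look like the following. The specific features are assumptions chosen only to keep the example small: one log-energy value per frame on the client, and a delta (frame-to-frame difference) appended on the server. The function names and the frame length are hypothetical.

```python
import math


def client_extract(samples, frame_len=4):
    """Client side: compute one log-energy value per non-overlapping frame.

    Trailing samples that do not fill a whole frame are dropped.
    """
    frames = [
        samples[i:i + frame_len]
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]
    # The small constant avoids log(0) on silent frames.
    return [math.log(sum(x * x for x in frame) + 1e-10) for frame in frames]


def server_derive(static):
    """Server side: pair each received value with its delta from the previous frame.

    The first frame has no predecessor, so its delta is set to 0.0.
    """
    deltas = [0.0] + [static[i] - static[i - 1] for i in range(1, len(static))]
    return list(zip(static, deltas))
```

In this arrangement only the output of `client_extract` would cross the communications channel; `server_derive` rebuilds the richer feature vectors before decoding, which keeps the client-side computation and the transmitted payload small.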
-
Publication No.: US08762152B2
Publication date: 2014-06-24
Application No.: US11865592
Filing date: 2007-10-01
CPC classification: G10L17/22, G06F17/289, G06F17/3043, G10L15/005, G10L15/142, G10L15/18, G10L15/183, G10L15/22, G10L15/30, Y10S707/99935
Abstract: Methods and systems for performing speech recognition using an electronic interactive agent are disclosed. In embodiments of the invention, an electronic agent is presented in a form perceptible to a user. The electronic agent is used to solicit speech input from a user and to respond to the user's recognized speech, and mimics the behavior of a human agent in a natural language query session with the user. The electronic agent may be implemented in a distributed speech recognition system in which speech recognition tasks are divided between client and server.
-
Publication No.: US07225125B2
Publication date: 2007-05-29
Application No.: US11031207
Filing date: 2005-01-07
CPC classification: G10L17/22, G06F17/289, G06F17/3043, G10L15/005, G10L15/142, G10L15/18, G10L15/183, G10L15/22, G10L15/30, Y10S707/99935
Abstract: A speech recognition system uses speech recognition models that are specifically trained and optimized for users residing in a particular geographic area or region. The speech models are trained with samples of word variants expected to be used in a natural language by representative members of a population associated with the geographic region or community of users. The speech recognition system is configured to have a real-time response that imitates a dialogue with a human operator.
-
Publication No.: US09076448B2
Publication date: 2015-07-07
Application No.: US10684357
Filing date: 2003-10-10
IPC classification: G10L15/02, G10L15/22, G10L17/22, G06F17/30, G10L15/00, G10L15/183, G10L15/30, G06F17/28, G10L15/18, G10L15/14
CPC classification: G10L17/22, G06F17/289, G06F17/3043, G10L15/005, G10L15/142, G10L15/18, G10L15/183, G10L15/22, G10L15/30, Y10S707/99935
Abstract: A real-time system incorporating speech recognition and linguistic processing for recognizing a spoken query by a user, and distributed between client and server, is disclosed. The system accepts a user's queries in the form of speech at the client, where minimal processing extracts a sufficient number of acoustic speech vectors representing the utterance. These vectors are sent via a communications channel to the server, where additional acoustic vectors are derived. Using Hidden Markov Models (HMMs), and appropriate grammars and dictionaries conditioned by the selections made by the user, the speech representing the user's query is fully decoded into text (or some other suitable form) at the server. This text corresponding to the user's query is then simultaneously sent to a natural language engine and a database processor, where optimized SQL statements are constructed for a full-text search from a database for a recordset of several stored questions that best matches the user's query. Further processing in the natural language engine narrows the search to a single stored question. The answer corresponding to this single stored question is next retrieved from the file path and sent to the client in compressed form. At the client, the answer to the user's query is articulated to the user using a text-to-speech engine in his or her native natural language. The system requires no training and can operate in several natural languages.