Web-based speech recognition with scripting and semantic objects

    公开(公告)号:US08510412B2

    公开(公告)日:2013-08-13

    申请号:US13225791

    申请日:2011-09-06

    IPC分类号: G06F15/16

    摘要: The present invention is a system and method for creating and implementing transactional speech applications (SAs) using Web technologies, without reliance on server-side standard or custom services. A transactional speech application may be any application that requires interpretation of speech in conjunction with a speech recognition (SR) system, such as, for example, consumer survey systems. A speech application in accordance with the present invention is represented within a Web page, as an application script that interprets semantic objects according to a context. Any commonly known scripting language can be used to write the application script, such as JavaScript (or ECMAScript), PerlScript, and VBscript. The present invention is “Web-based” to the extent that it implements Web technologies, but it need not include or access the World Wide Web.

    WEB-BASED SPEECH RECOGNITION WITH SCRIPTING AND SEMANTIC OBJECTS
    2.
    发明申请
    WEB-BASED SPEECH RECOGNITION WITH SCRIPTING AND SEMANTIC OBJECTS 有权
    基于WEB的语音识别与脚本和语义对象

    公开(公告)号:US20110320188A1

    公开(公告)日:2011-12-29

    申请号:US13225791

    申请日:2011-09-06

    IPC分类号: G06F17/27

    摘要: The present invention is a system and method for creating and implementing transactional speech applications (SAs) using Web technologies, without reliance on server-side standard or custom services. A transactional speech application may be any application that requires interpretation of speech in conjunction with a speech recognition (SR) system, such as, for example, consumer survey systems. A speech application in accordance with the present invention is represented within a Web page, as an application script that interprets semantic objects according to a context. Any commonly known scripting language can be used to write the application script, such as JavaScript (or ECMAScript), PerlScript, and VBscript. The present invention is “Web-based” to the extent that it implements Web technologies, but it need not include or access the World Wide Web.

    摘要翻译: 本发明是一种使用Web技术创建和实现事务语音应用(SA)的系统和方法,而不依赖于服务器端标准或定制服务。 交易语音应用可以是需要与语音识别(SR)系统(例如,消费者调查系统)结合语音的解释的任何应用。 根据本发明的语音应用程序在网页内被表示为根据上下文解释语义对象的应用脚本。 任何常见的脚本语言都可用于编写应用程序脚本,如JavaScript(或ECMAScript),PerlScript和VBscript。 本发明是基于“基于Web”的,它实现了Web技术,但不需要包括或访问万维网。

    Web-based speech recognition with scripting and semantic objects
    3.
    发明授权
    Web-based speech recognition with scripting and semantic objects 有权
    基于Web的语音识别与脚本和语义对象

    公开(公告)号:US08024422B2

    公开(公告)日:2011-09-20

    申请号:US12062144

    申请日:2008-04-03

    IPC分类号: G06F15/16

    摘要: The present invention is a system and method for creating and implementing transactional speech applications (SAs) using Web technologies, without reliance on server-side standard or custom services. A transactional speech application may be any application that requires interpretation of speech in conjunction with a speech recognition (SR) system, such as, for example, consumer survey systems. A speech application in accordance with the present invention is represented within a Web page, as an application script that interprets semantic objects according to a context. Any commonly known scripting language can be used to write the application script, such as JavaScript (or ECMAScript), PerlScript, and VBscript. The present invention is “Web-based” to the extent that it implements Web technologies, but it need not include or access the World Wide Web.

    摘要翻译: 本发明是一种使用Web技术创建和实现事务语音应用(SA)的系统和方法,而不依赖于服务器端标准或定制服务。 交易语音应用可以是需要与语音识别(SR)系统(例如,消费者调查系统)结合语音的解释的任何应用。 根据本发明的语音应用程序在网页内被表示为根据上下文解释语义对象的应用脚本。 任何常见的脚本语言都可用于编写应用程序脚本,如JavaScript(或ECMAScript),PerlScript和VBscript。 本发明是基于“基于Web”的,它实现了Web技术,但不需要包括或访问万维网。

    Web-Based Speech Recognition With Scripting and Semantic Objects
    4.
    发明申请
    Web-Based Speech Recognition With Scripting and Semantic Objects 有权
    基于Web的语音识别与脚本和语义对象

    公开(公告)号:US20080183469A1

    公开(公告)日:2008-07-31

    申请号:US12062144

    申请日:2008-04-03

    IPC分类号: G10L15/00

    摘要: The present invention is a system and method for creating and implementing transactional speech applications (SAs) using Web technologies, without reliance on server-side standard or custom services. A transactional speech application may be any application that requires interpretation of speech in conjunction with a speech recognition (SR) system, such as, for example, consumer survey systems. A speech application in accordance with the present invention is represented within a Web page, as an application script that interprets semantic objects according to a context. Any commonly known scripting language can be used to write the application script, such as JavaScript (or ECMAScript), PerlScript, and VBscript. The present invention is “Web-based” to the extent that it implements Web technologies, but it need not include or access the World Wide Web.

    摘要翻译: 本发明是一种使用Web技术创建和实现事务语音应用(SA)的系统和方法,而不依赖于服务器端标准或定制服务。 交易语音应用可以是需要与语音识别(SR)系统(例如,消费者调查系统)结合语音的解释的任何应用。 根据本发明的语音应用程序在网页内被表示为根据上下文解释语义对象的应用脚本。 任何常见的脚本语言都可用于编写应用程序脚本,如JavaScript(或ECMAScript),PerlScript和VBscript。 本发明是基于“基于Web”的,它实现了Web技术,但不需要包括或访问万维网。

    Web-based speech recognition with scripting and semantic objects
    5.
    发明授权
    Web-based speech recognition with scripting and semantic objects 有权
    基于Web的语音识别与脚本和语义对象

    公开(公告)号:US07366766B2

    公开(公告)日:2008-04-29

    申请号:US09815726

    申请日:2001-03-23

    IPC分类号: G06F15/16

    摘要: The present invention is a system and method for creating and implementing transactional speech applications (SAs) using Web technologies, without reliance on server-side standard or custom services. A transactional speech application may be any application that requires interpretation of speech in conjunction with a speech recognition (SR) system, such as, for example, consumer survey systems. A speech application in accordance with the present invention is represented within a Web page, as an application script that interprets semantic objects according to a context. Any commonly known scripting language can be used to write the application script, such as Jscript, PerlScript, and VBscript. The present invention is “Web-based” to the extent that it implements Web technologies, but it need not include or access the World Wide Web.

    摘要翻译: 本发明是一种使用Web技术创建和实现事务语音应用(SA)的系统和方法,而不依赖于服务器端标准或定制服务。 交易语音应用可以是需要与语音识别(SR)系统(例如,消费者调查系统)结合语音的解释的任何应用。 根据本发明的语音应用程序在网页内被表示为根据上下文解释语义对象的应用脚本。 任何常见的脚本语言都可用于编写应用程序脚本,如Jscript,PerlScript和VBscript。 本发明是基于“基于Web”的,它实现了Web技术,但不需要包括或访问万维网。

    Method of and system for improving accuracy in a speech recognition system
    7.
    发明授权
    Method of and system for improving accuracy in a speech recognition system 有权
    提高语音识别系统精度的方法和系统

    公开(公告)号:US07624010B1

    公开(公告)日:2009-11-24

    申请号:US09918733

    申请日:2001-07-31

    IPC分类号: G10L15/26 G10L21/00

    CPC分类号: G10L15/22

    摘要: A method for transcribing an audio response includes: A. constructing an application including a plurality of queries and a set of expected responses for each query, the set including a plurality of expected responses to each query in a textual form; B. posing each of the queries to a respondent with a querying device; C. receiving an audio response to each query from the respondent; D. performing a speech recognition function on each audio response with an automatic speech recognition device to transcribe each audio response to a textual response to each query; E. recording each audio response with a recording device; and F. comparing, with the automatic speech recognition device, each textual response to the set of expected responses for each corresponding query to determine if each textual response corresponds to any of the expected responses in the set of expected responses for the corresponding query. The method includes flagging each audio response corresponding to a textual response that does not correspond to one of the expected responses in the set of expected responses to the corresponding query, reviewing each flagged audio response to determine if a corresponding expected response is included in the set of expected responses the query associated with each audio response, and entering a text response if no such match exists.

    摘要翻译: 用于转录音频响应的方法包括:A.构建包括多个查询的应用和针对每个查询的一组预期响应,该集合包括以文本形式的每个查询的多个预期响应; B.使用查询设备将每个查询构成回答者; C.从受访者接收每个查询的音频响应; D.使用自动语音识别装置对每个音频响应执行语音识别功能,以将每个音频响应转录为对每个查询的文本响应; E.用记录装置记录每个音频响应; 将自动语音识别装置与每个对应查询的预期响应集合进行每个文本响应,以确定每个文本响应是否对应于相应查询的预期响应集合中的任何预期响应。 该方法包括标记对应于文本响应的每个音频响应,该文本响应与对应查询的预期响应集合中的预期响应之一不对应,检查每个标记的音频响应以确定相应的预期响应是否包括在该组中 预期响应与每个音频响应相关联的查询,并且如果不存在这样的匹配则输入文本响应。

    Method of and system for improving accuracy in a speech recognition system
    9.
    发明授权
    Method of and system for improving accuracy in a speech recognition system 有权
    提高语音识别系统精度的方法和系统

    公开(公告)号:US08812314B2

    公开(公告)日:2014-08-19

    申请号:US12616874

    申请日:2009-11-12

    IPC分类号: G10L15/22

    CPC分类号: G10L15/22

    摘要: A method for transcribing an audio response includes: A. constructing an application including a plurality of queries and a set of expected responses for each query, the set including a plurality of expected responses to each query in a textual form; B. posing each of the queries to a respondent with a querying device; C. receiving an audio response to each query from the respondent; D. performing a speech recognition function on each audio response with an automatic speech recognition device to transcribe each audio response to a textual response to each query; E. recording each audio response with a recording device; and F. comparing, with the automatic speech recognition device, each textual response to the set of expected responses for each corresponding query to determine if each textual response corresponds to any of the expected responses in the set of expected responses for the corresponding query.

    摘要翻译: 用于转录音频响应的方法包括:A.构建包括多个查询的应用和针对每个查询的一组预期响应,该集合包括以文本形式的每个查询的多个预期响应; B.使用查询设备将每个查询构成回答者; C.从受访者接收每个查询的音频响应; D.使用自动语音识别装置对每个音频响应执行语音识别功能,以将每个音频响应转录为对每个查询的文本响应; E.用记录装置记录每个音频响应; 将自动语音识别装置与每个对应查询的预期响应集合进行每个文本响应,以确定每个文本响应是否对应于相应查询的预期响应集合中的任何预期响应。

    Phonetic data processing system and method
    10.
    发明授权
    Phonetic data processing system and method 有权
    语音数据处理系统及方法

    公开(公告)号:US06895377B2

    公开(公告)日:2005-05-17

    申请号:US09815769

    申请日:2001-03-23

    摘要: A phonetic data processing system processes phonetic stream data to produce a set of semantic data, using a context-free rich semantic grammar database (RSG DB) that includes a grammar tree, comprised of sub-trees, representing words and phrases. A phonetic searcher accepts the phonetic estimates and searches the RSG DB to produce a best word list, which is processed by a semantic parser, using the RSG DB, to produce a semantic tree instance, including all valid interpretations of the phonetic stream. An application accesses a semantic tree evaluator to interpret the semantic tree instance according to a context to produce a final linguistic interpretation of the phonetic stream, which is returned to the application.

    摘要翻译: 语音数据处理系统使用无上下文的富语义语法语法数据库(RSG DB)来处理语音流数据以产生一组语义数据,该数据库包括表示单词和短语的子树的语法树。 语音搜索器接受语音估计并搜索RSG数据库以产生一个最好的单词列表,该列表由语义解析器使用RSG DB处理,以产生语义树实例,包括语音流的所有有效解释。 应用程序访问语义树评估器,以根据上下文解释语义树实例,以产生语法流的最终语言解释,并返回给应用程序。