Automatic Speech and Concept Recognition
    4.
    发明申请
    Automatic Speech and Concept Recognition 失效
    自动语音和概念识别

    公开(公告)号:US20130046539A1

    公开(公告)日:2013-02-21

    申请号:US13210471

    申请日:2011-08-16

    IPC分类号: G10L15/22

    CPC分类号: G10L15/197 G10L15/193

    摘要: A method, an apparatus and an article of manufacture for automatic speech recognition. The method includes obtaining at least one language model word and at least one rule-based grammar word, determining an acoustic similarity of at least one pair of language model word and rule-based grammar word, and increasing a transition cost to the at least one language model word based on the acoustic similarity of the at least one language model word with the at least one rule-based grammar word to generate a modified language model for automatic speech recognition.

    摘要翻译: 一种用于自动语音识别的方法,装置和制品。 该方法包括获得至少一个语言模型词和至少一个基于规则的语法词,确定至少一对语言模型词和基于规则的语法单词的声学相似度,以及增加至少一个 基于所述至少一个语言模型词与所述至少一个基于规则的语法词的声学相似性来生成用于自动语音识别的修改语言模型的语言模型词。

    Automatic speech and concept recognition
    5.
    发明授权
    Automatic speech and concept recognition 失效
    自动语音和概念识别

    公开(公告)号:US08676580B2

    公开(公告)日:2014-03-18

    申请号:US13210471

    申请日:2011-08-16

    CPC分类号: G10L15/197 G10L15/193

    摘要: A method, an apparatus and an article of manufacture for automatic speech recognition. The method includes obtaining at least one language model word and at least one rule-based grammar word, determining an acoustic similarity of at least one pair of language model word and rule-based grammar word, and increasing a transition cost to the at least one language model word based on the acoustic similarity of the at least one language model word with the at least one rule-based grammar word to generate a modified language model for automatic speech recognition.

    摘要翻译: 一种用于自动语音识别的方法,装置和制品。 该方法包括获得至少一个语言模型词和至少一个基于规则的语法词,确定至少一对语言模型词和基于规则的语法单词的声学相似度,以及增加至少一个 基于所述至少一个语言模型词与所述至少一个基于规则的语法词的声学相似性来生成用于自动语音识别的修改语言模型的语言模型词。

    Automatic evaluation of spoken fluency
    6.
    发明授权
    Automatic evaluation of spoken fluency 有权
    自动评价口语流利

    公开(公告)号:US08457967B2

    公开(公告)日:2013-06-04

    申请号:US12541927

    申请日:2009-08-15

    CPC分类号: G10L15/26 G09B19/04

    摘要: A procedure to automatically evaluate the spoken fluency of a speaker by prompting the speaker to talk on a given topic, recording the speaker's speech to get a recorded sample of speech, and then analyzing the patterns of disfluencies in the speech to compute a numerical score to quantify the spoken fluency skills of the speakers. The numerical fluency score accounts for various prosodic and lexical features, including formant-based filled-pause detection, closely-occurring exact and inexact repeat N-grams, normalized average distance between consecutive occurrences of N-grams. The lexical features and prosodic features are combined to classify the speaker with a C-class classification and develop a rating for the speaker.

    摘要翻译: 一个程序,通过提示说话者在给定的主题上进行谈话,记录讲话者的语音以获得记录的语音样本,然后分析语音中的不清楚的模式以计算数字得分,自动评估讲话者的口语流畅性 量化演讲者的口语流利能力。 数值流利度分数考虑到各种韵律和词汇特征,包括基于共振峰的填充暂停检测,紧密发生的精确和不精确的重复N克,连续出现的N克之间的归一化平均距离。 词汇特征和韵律特征相结合,将扬声器分类为C级分类,并为扬声器开发评级。

    Evaluating spoken skills
    7.
    发明授权
    Evaluating spoken skills 失效
    评价口语技能

    公开(公告)号:US08775184B2

    公开(公告)日:2014-07-08

    申请号:US12354849

    申请日:2009-01-16

    IPC分类号: G10L15/00 G10L15/04

    摘要: Techniques for evaluating one or more spoken language skills of a speaker are provided. The techniques include identifying one or more temporal locations of interest in a speech passage spoken by a speaker, computing one or more acoustic parameters, wherein the one or more acoustic parameters capture one or more properties of one or more acoustic-phonetic features of the one or more locations of interest, and combining the one or more acoustic parameters with an output of an automatic speech recognizer to modify an output of a spoken language skill evaluation.

    摘要翻译: 提供了用于评估扬声器的一种或多种口语技能的技术。 所述技术包括识别由扬声器说出的语音通道中的一个或多个感兴趣的时间位置,计算一个或多个声学参数,其中所述一个或多个声学参数捕获一个或多个声学特征的一个或多个属性 或更多的感兴趣的位置,并且将一个或多个声学参数与自动语音识别器的输出组合以修改口语技能评估的输出。

    Intent Discovery in Audio or Text-Based Conversation
    8.
    发明申请
    Intent Discovery in Audio or Text-Based Conversation 有权
    在音频或基于文本的对话中的意图发现

    公开(公告)号:US20130339021A1

    公开(公告)日:2013-12-19

    申请号:US13526637

    申请日:2012-06-19

    IPC分类号: G10L15/18

    摘要: Techniques, an apparatus and an article of manufacture identifying one or more utterances that are likely to carry the intent of a speaker, from a conversation between two or more parties. A method includes obtaining an input of a set of utterances in chronological order from a conversation between two or more parties, computing an intent confidence value of each utterance by summing intent confidence scores from each of the constituent words of the utterance, wherein intent confidence scores capture each word's influence on the subsequent utterances in the conversation based on (i) the uniqueness of the word in the conversation and (ii) the number of times the word subsequently occurs in the conversation, and generating a ranked order of the utterances from highest to lowest intent confidence value, wherein the highest intent value corresponds to the utterance which is most likely to carry intent of the speaker.

    摘要翻译: 从两个或多个方之间的对话中识别出可能携带说话人意图的一个或多个话语的技术,装置和制品。 一种方法包括从两个或更多方之间的会话按时间顺序获得一组话语的输入,通过将来自每个话语的组成词的意图置信度得分相加来计算每个话语的意图置信度值,其中意图置信度得分 基于(i)会话中的单词的唯一性和(ii)单词随后在会话中发生的次数,并且从最高级别生成排序的话语顺序,从而捕获每个单词对对话中后续话语的影响 到最低意图置信度值,其中最高意图值对应于最有可能携带说话者意图的话语。

    Automatic Evaluation of Spoken Fluency
    9.
    发明申请
    Automatic Evaluation of Spoken Fluency 有权
    自动评价口语流利

    公开(公告)号:US20110040554A1

    公开(公告)日:2011-02-17

    申请号:US12541927

    申请日:2009-08-15

    IPC分类号: G06F17/27 G10L15/26 G10L13/08

    CPC分类号: G10L15/26 G09B19/04

    摘要: A procedure to automatically evaluate the spoken fluency of a speaker by prompting the speaker to talk on a given topic, recording the speaker's speech to get a recorded sample of speech, and then analyzing the patterns of disfluencies in the speech to compute a numerical score to quantify the spoken fluency skills of the speakers. The numerical fluency score accounts for various prosodic and lexical features, including formant-based filled-pause detection, closely-occurring exact and inexact repeat N-grams, normalized average distance between consecutive occurrences of N-grams. The lexical features and prosodic features are combined to classify the speaker with a C-class classification and develop a rating for the speaker.

    摘要翻译: 一个程序,通过提示说话者在给定的主题上进行谈话,记录讲话者的语音以获得记录的语音样本,然后分析语音中的不清楚的模式以计算数字得分,自动评估讲话者的口语流畅性 量化演讲者的口语流利能力。 数值流利度分数考虑到各种韵律和词汇特征,包括基于共振峰的填充暂停检测,紧密发生的精确和不精确的重复N克,连续出现的N克之间的归一化平均距离。 词汇特征和韵律特征相结合,将扬声器分类为C级分类,并为扬声器开发评级。

    ENABLING ACCESS TO INFORMATION ON A WEB PAGE
    10.
    发明申请
    ENABLING ACCESS TO INFORMATION ON A WEB PAGE 审中-公开
    启用对网页上的信息的访问

    公开(公告)号:US20100185648A1

    公开(公告)日:2010-07-22

    申请号:US12353669

    申请日:2009-01-14

    IPC分类号: G06F7/06 G06F17/30

    摘要: Techniques for enabling voice access to information residing on the World Wide Web are provided. The techniques include receiving a query from a user, wherein the query comprises a voice-based request to access information residing on the World Wide Web, identifying one or more websites corresponding to the query, fetching the information from a website, wherein fetching the information comprises executing a hypertext transfer protocol (HTTP) request, organizing the information into a voice-based response and delivering the response to the user.

    摘要翻译: 提供了能够对驻留在万维网上的信息进行语音访问的技术。 这些技术包括从用户接收查询,其中查询包括访问驻留在万维网上的信息的基于语音的请求,识别与查询相对应的一个或多个网站,从网站获取信息,其中获取信息 包括执行超文本传输​​协议(HTTP)请求,将信息组织成基于语音的响应并将响应传递给用户。