System and a method for generating semantically similar sentences for building a robust SLM
    11.
    Granted patent
    System and a method for generating semantically similar sentences for building a robust SLM (status: in force)

    Publication (announcement) No.: US09135237B2

    Publication (announcement) date: 2015-09-15

    Application No.: US13181923

    Filing date: 2011-07-13

    IPC classification: G06F17/27 G10L15/26 G06F17/28

    Abstract: A system and method are described for generating semantically similar sentences for a statistical language model. A semantic class generator determines for each word in an input utterance a set of corresponding semantically similar words. A sentence generator computes a set of candidate sentences each containing at most one member from each set of semantically similar words. A sentence verifier grammatically tests each candidate sentence to determine a set of grammatically correct sentences semantically similar to the input utterance. The generated semantically similar sentences are not restricted to selection from an existing sentence database.
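
The three stages named in this abstract (semantic class generator, sentence generator, sentence verifier) can be sketched roughly as follows. This is a minimal illustration, not the patented method: the `SIMILAR` word table, the function names, and the bigram whitelist used as a stand-in grammar check are all assumptions, and the sketch picks exactly one member per class, which is a special case of "at most one".

```python
from itertools import product

# Hypothetical similarity table; the abstract does not say how classes are built.
SIMILAR = {
    "book": ["book", "reserve"],
    "flight": ["flight", "ticket"],
}


def semantic_classes(words, similar=SIMILAR):
    """Return, for each input word, its class of similar words (itself if unknown)."""
    return [similar.get(w, [w]) for w in words]


def candidate_sentences(classes):
    """Build every sentence that takes one member from each class, in word order."""
    return [" ".join(choice) for choice in product(*classes)]


def is_grammatical(sentence, allowed_bigrams):
    """Stand-in verifier (assumption): accept only sentences whose bigrams are all whitelisted."""
    tokens = sentence.split()
    return all(pair in allowed_bigrams for pair in zip(tokens, tokens[1:]))


def similar_sentences(utterance, allowed_bigrams, similar=SIMILAR):
    """Generate candidates for the utterance and keep those that pass the verifier."""
    classes = semantic_classes(utterance.split(), similar)
    return [s for s in candidate_sentences(classes) if is_grammatical(s, allowed_bigrams)]
```

Calling `similar_sentences("book a flight", allowed_bigrams)` with a whitelist that lacks `("reserve", "a")` keeps only the candidates that start with "book". A real verifier would presumably use a parser or grammar rather than a bigram list.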

    Automatic speech and concept recognition
    12.
    Granted patent
    Automatic speech and concept recognition (status: lapsed)

    Publication (announcement) No.: US08676580B2

    Publication (announcement) date: 2014-03-18

    Application No.: US13210471

    Filing date: 2011-08-16

    CPC classification: G10L15/197 G10L15/193

    Abstract: A method, an apparatus and an article of manufacture for automatic speech recognition. The method includes obtaining at least one language model word and at least one rule-based grammar word, determining an acoustic similarity of at least one pair of language model word and rule-based grammar word, and increasing a transition cost to the at least one language model word based on the acoustic similarity of the at least one language model word with the at least one rule-based grammar word to generate a modified language model for automatic speech recognition.
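
The cost adjustment described in this abstract might look like the sketch below. It rests on several assumptions: costs are treated as additive penalties (for example negative log probabilities), acoustic similarity is approximated by sequence similarity over phoneme lists using `difflib.SequenceMatcher`, and the `weight` and `threshold` parameters are hypothetical. The abstract itself does not specify any of these.

```python
from difflib import SequenceMatcher


def acoustic_similarity(phones_a, phones_b):
    """Proxy for acoustic similarity (assumption): similarity ratio of two phoneme sequences."""
    return SequenceMatcher(None, phones_a, phones_b).ratio()


def penalize_language_model(lm_costs, lm_phones, grammar_phones, weight=2.0, threshold=0.5):
    """Raise the transition cost of each language model word that sounds like a grammar word.

    lm_costs:       {word: transition cost}
    lm_phones:      {word: list of phonemes} for language model words
    grammar_phones: {word: list of phonemes} for rule-based grammar words
    """
    modified = {}
    for word, cost in lm_costs.items():
        # Highest similarity between this word and any grammar word.
        best = max(
            (acoustic_similarity(lm_phones[word], g) for g in grammar_phones.values()),
            default=0.0,
        )
        modified[word] = cost + weight * best if best >= threshold else cost
    return modified
```

With a language model word "fore" and a grammar word "four" that share the phonemes `["F", "AO", "R"]`, the similarity is 1.0, so the cost of "fore" rises by `weight`, while an unrelated word keeps its original cost.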

    Evaluating spoken skills
    14.
    Granted patent
    Evaluating spoken skills (status: lapsed)

    Publication (announcement) No.: US08775184B2

    Publication (announcement) date: 2014-07-08

    Application No.: US12354849

    Filing date: 2009-01-16

    IPC classification: G10L15/00 G10L15/04

    Abstract: Techniques for evaluating one or more spoken language skills of a speaker are provided. The techniques include identifying one or more temporal locations of interest in a speech passage spoken by a speaker, computing one or more acoustic parameters, wherein the one or more acoustic parameters capture one or more properties of one or more acoustic-phonetic features of the one or more locations of interest, and combining the one or more acoustic parameters with an output of an automatic speech recognizer to modify an output of a spoken language skill evaluation.

    Intent Discovery in Audio or Text-Based Conversation
    15.
    Patent application
    Intent Discovery in Audio or Text-Based Conversation (status: in force)

    Publication (announcement) No.: US20130339021A1

    Publication (announcement) date: 2013-12-19

    Application No.: US13526637

    Filing date: 2012-06-19

    IPC classification: G10L15/18

    Abstract: Techniques, an apparatus and an article of manufacture for identifying one or more utterances that are likely to carry the intent of a speaker, from a conversation between two or more parties. A method includes obtaining an input of a set of utterances in chronological order from a conversation between two or more parties, computing an intent confidence value of each utterance by summing intent confidence scores from each of the constituent words of the utterance, wherein intent confidence scores capture each word's influence on the subsequent utterances in the conversation based on (i) the uniqueness of the word in the conversation and (ii) the number of times the word subsequently occurs in the conversation, and generating a ranked order of the utterances from highest to lowest intent confidence value, wherein the highest intent value corresponds to the utterance which is most likely to carry intent of the speaker.
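
One way to read the scoring in this abstract is sketched below. The abstract names the two factors but not how they are measured or combined, so the formula is an assumption: uniqueness is taken as an IDF-style weight `log(N / df)` computed within the conversation, the count of subsequent occurrences is taken over all later utterances, and the two are multiplied. The function name `rank_by_intent` is hypothetical.

```python
import math
from collections import Counter


def rank_by_intent(utterances):
    """Rank utterances (given in chronological order) by summed per-word intent scores.

    Returns a list of (score, original_index, utterance), highest score first.
    """
    tokenized = [u.lower().split() for u in utterances]
    n = len(tokenized)
    # Number of utterances each word appears in, used for the uniqueness weight.
    doc_freq = Counter(w for toks in tokenized for w in set(toks))
    scored = []
    for i, toks in enumerate(tokenized):
        # How often each word occurs in the utterances that follow this one.
        later = Counter(w for later_toks in tokenized[i + 1:] for w in later_toks)
        score = sum(later[w] * math.log(n / doc_freq[w]) for w in toks)
        scored.append((score, i, utterances[i]))
    # Highest score first; ties keep chronological order.
    return sorted(scored, key=lambda item: (-item[0], item[1]))
```

In a short exchange such as "hello how are you", "i want to cancel my order", "which order do you want to cancel", "the order from monday", this scoring puts the second utterance first, because its words recur most in what follows.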

    Automatic Speech and Concept Recognition
    16.
    Patent application
    Automatic Speech and Concept Recognition (status: lapsed)

    Publication (announcement) No.: US20130046539A1

    Publication (announcement) date: 2013-02-21

    Application No.: US13210471

    Filing date: 2011-08-16

    IPC classification: G10L15/22

    CPC classification: G10L15/197 G10L15/193

    Abstract: A method, an apparatus and an article of manufacture for automatic speech recognition. The method includes obtaining at least one language model word and at least one rule-based grammar word, determining an acoustic similarity of at least one pair of language model word and rule-based grammar word, and increasing a transition cost to the at least one language model word based on the acoustic similarity of the at least one language model word with the at least one rule-based grammar word to generate a modified language model for automatic speech recognition.

    BACK OFFICE PROCESS MONITORING AND ANALYSIS
    17.
    Patent application
    BACK OFFICE PROCESS MONITORING AND ANALYSIS (status: under examination, published)

    Publication (announcement) No.: US20110218841A1

    Publication (announcement) date: 2011-09-08

    Application No.: US12718335

    Filing date: 2010-03-05

    IPC classification: G06Q10/00 G06F17/30

    CPC classification: G06Q10/06393

    Abstract: According to one illustrative embodiment, a method is provided for monitoring and analyzing an office process. Information relating to desktop interaction activities of an agent and non-desktop activities of the agent is collected to form collected information. The collected information is used to infer delimiters relating to the desktop interaction activities and the non-desktop activities of the agent, at least one transaction performed by the agent, at least one application used by the agent, and the office process. Metrics and key performance indicators relating to a behavior of the agent, a behavior of the at least one transaction, a behavior of the at least one application and a behavior of the office process are computed to form computed information, and the collected information and the computed information are stored into a data store.

    Automatic Evaluation of Spoken Fluency
    18.
    Patent application
    Automatic Evaluation of Spoken Fluency (status: in force)

    Publication (announcement) No.: US20110040554A1

    Publication (announcement) date: 2011-02-17

    Application No.: US12541927

    Filing date: 2009-08-15

    IPC classification: G06F17/27 G10L15/26 G10L13/08

    CPC classification: G10L15/26 G09B19/04

    Abstract: A procedure to automatically evaluate the spoken fluency of a speaker by prompting the speaker to talk on a given topic, recording the speaker's speech to get a recorded sample of speech, and then analyzing the patterns of disfluencies in the speech to compute a numerical score to quantify the spoken fluency skills of the speaker. The numerical fluency score accounts for various prosodic and lexical features, including formant-based filled-pause detection, closely-occurring exact and inexact repeat N-grams, and normalized average distance between consecutive occurrences of N-grams. The lexical features and prosodic features are combined to classify the speaker with a C-class classification and develop a rating for the speaker.
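
Two of the lexical features listed in this abstract, closely occurring repeat N-grams and the normalized average distance between consecutive occurrences, could be computed along the lines below. This is only a sketch under assumptions: it handles exact repeats only, measures distance in token positions, normalizes by transcript length, and uses a hypothetical `window` parameter to decide what counts as "closely occurring". Filled-pause detection and the final classifier are not covered.

```python
from collections import defaultdict


def repeat_ngram_features(tokens, n=2, window=5):
    """Return (close_repeats, normalized_average_gap) for exact repeated n-grams.

    close_repeats:          number of consecutive repeats at most `window` tokens apart
    normalized_average_gap: mean gap between consecutive repeats divided by the
                            transcript length, or None if nothing repeats
    """
    positions = defaultdict(list)
    for i in range(len(tokens) - n + 1):
        positions[tuple(tokens[i:i + n])].append(i)
    # Gap between each occurrence of an n-gram and its next occurrence.
    gaps = [b - a for pos in positions.values() for a, b in zip(pos, pos[1:])]
    close_repeats = sum(1 for g in gaps if g <= window)
    normalized_average_gap = (sum(gaps) / len(gaps)) / len(tokens) if gaps else None
    return close_repeats, normalized_average_gap
```

For the transcript "i went i went to the the store", the bigram "i went" repeats two positions later, giving one close repeat and a normalized average gap of 2/8.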

    ENABLING ACCESS TO INFORMATION ON A WEB PAGE
    19.
    Patent application
    ENABLING ACCESS TO INFORMATION ON A WEB PAGE (status: under examination, published)

    Publication (announcement) No.: US20100185648A1

    Publication (announcement) date: 2010-07-22

    Application No.: US12353669

    Filing date: 2009-01-14

    IPC classification: G06F7/06 G06F17/30

    Abstract: Techniques for enabling voice access to information residing on the World Wide Web are provided. The techniques include receiving a query from a user, wherein the query comprises a voice-based request to access information residing on the World Wide Web, identifying one or more websites corresponding to the query, fetching the information from a website, wherein fetching the information comprises executing a hypertext transfer protocol (HTTP) request, organizing the information into a voice-based response, and delivering the response to the user.
