User Driven Audio Content Navigation
    1.
    Invention application
    User Driven Audio Content Navigation (status: in force)

    Publication No.: US20110320950A1

    Publication date: 2011-12-29

    Application No.: US12822802

    Application date: 2010-06-24

    IPC classification: G06F3/16

    Abstract: Systems and associated methods configured to provide user-driven audio content navigation for the spoken web are described. Embodiments allow users to skim audio for content that seems relevant to them, similar to visual skimming of standard web pages, and to mark points of interest within the audio. Embodiments provide techniques for navigating audio content while interacting with information systems in a client-server environment, where the client device can be a simple, standard telephone.


    User Driven Audio Content Navigation
    3.
    Invention application
    User Driven Audio Content Navigation (status: in force)

    Publication No.: US20120324356A1

    Publication date: 2012-12-20

    Application No.: US13596313

    Application date: 2012-08-28

    IPC classification: G06F3/16

    Abstract: Systems and associated methods configured to provide user-driven audio content navigation for the spoken web are described. Embodiments allow users to skim audio for content that seems relevant to them, similar to visual skimming of standard web pages, and to mark points of interest within the audio. Embodiments provide techniques for navigating audio content while interacting with information systems in a client-server environment, where the client device can be a simple, standard telephone.


    Automatic evaluation of spoken fluency
    5.
    Granted patent
    Automatic evaluation of spoken fluency (status: in force)

    Publication No.: US08457967B2

    Publication date: 2013-06-04

    Application No.: US12541927

    Application date: 2009-08-15

    CPC classification: G10L15/26 G09B19/04

    Abstract: A procedure to automatically evaluate the spoken fluency of a speaker by prompting the speaker to talk on a given topic, recording the speaker's speech to obtain a speech sample, and then analyzing the patterns of disfluencies in the speech to compute a numerical score that quantifies the speaker's spoken fluency. The numerical fluency score accounts for various prosodic and lexical features, including formant-based filled-pause detection, closely occurring exact and inexact repeat N-grams, and the normalized average distance between consecutive occurrences of N-grams. The lexical and prosodic features are combined to classify the speaker with a C-class classification and develop a rating for the speaker.

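The two lexical features named in the abstract (closely occurring exact repeat N-grams and the normalized average distance between consecutive N-gram occurrences) might be computed roughly as follows. This is only an illustrative sketch, not the patented procedure: the function names, the `window` parameter, and the choice to normalize by transcript length are all assumptions, and inexact repeats and formant-based filled-pause detection are not covered.

```python
from collections import defaultdict


def ngram_positions(tokens, n):
    """Map each N-gram (as a tuple) to the list of positions where it starts."""
    positions = defaultdict(list)
    for i in range(len(tokens) - n + 1):
        positions[tuple(tokens[i:i + n])].append(i)
    return positions


def close_repeat_count(tokens, n, window):
    """Count exact N-gram repeats whose consecutive occurrences start
    at most `window` tokens apart (hypothetical closeness criterion)."""
    count = 0
    for starts in ngram_positions(tokens, n).values():
        for a, b in zip(starts, starts[1:]):
            if b - a <= window:
                count += 1
    return count


def normalized_avg_repeat_distance(tokens, n):
    """Average gap between consecutive occurrences of repeated N-grams,
    divided by the transcript length (assumed normalization).
    Returns None when no N-gram repeats."""
    gaps = [
        b - a
        for starts in ngram_positions(tokens, n).values()
        for a, b in zip(starts, starts[1:])
    ]
    if not gaps:
        return None
    return sum(gaps) / len(gaps) / len(tokens)


tokens = "i i think that i think we should go".split()
print(close_repeat_count(tokens, n=1, window=2))
print(normalized_avg_repeat_distance(tokens, n=2))
```

In a full system these values would be only two inputs among the prosodic and lexical features that feed the classifier the abstract describes.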

    Intent discovery in audio or text-based conversation
    7.
    Granted patent
    Intent discovery in audio or text-based conversation (status: in force)

    Publication No.: US08983840B2

    Publication date: 2015-03-17

    Application No.: US13526637

    Application date: 2012-06-19

    IPC classification: G10L15/18 G06F17/27

    Abstract: Techniques, an apparatus and an article of manufacture for identifying one or more utterances that are likely to carry the intent of a speaker, from a conversation between two or more parties. A method includes obtaining an input of a set of utterances in chronological order from a conversation between two or more parties; computing an intent confidence value of each utterance by summing the intent confidence scores of the utterance's constituent words, wherein intent confidence scores capture each word's influence on the subsequent utterances in the conversation based on (i) the uniqueness of the word in the conversation and (ii) the number of times the word subsequently occurs in the conversation; and generating a ranked order of the utterances from highest to lowest intent confidence value, wherein the highest intent confidence value corresponds to the utterance that is most likely to carry the intent of the speaker.

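The ranking step can be sketched as follows. The abstract does not give the scoring formula, so this example assumes one: a word's uniqueness is approximated with an inverse-document-frequency weight, multiplied by the number of times the word occurs in later utterances. The function name `rank_by_intent`, the whitespace tokenization, and the weighting are hypothetical choices, not the patented method.

```python
import math
from collections import Counter


def rank_by_intent(utterances):
    """Rank utterances (given in chronological order) from highest to
    lowest intent confidence value. Returns (value, index, text) tuples."""
    tokenized = [u.lower().split() for u in utterances]
    n = len(tokenized)

    # Number of utterances containing each word; rarer words count as
    # more "unique" (assumed IDF-style weight).
    doc_freq = Counter(w for toks in tokenized for w in set(toks))

    # later[i][w] = occurrences of w in utterances strictly after i.
    later = [Counter() for _ in range(n)]
    running = Counter()
    for i in range(n - 1, -1, -1):
        later[i] = running.copy()
        running.update(tokenized[i])

    scored = []
    for i, toks in enumerate(tokenized):
        # Sum the per-word scores to get the utterance's confidence value.
        value = sum(math.log(n / doc_freq[w]) * later[i][w] for w in toks)
        scored.append((value, i, utterances[i]))

    # Highest value first; ties keep chronological order.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return scored


conversation = [
    "hello thanks for calling",
    "i want to cancel my subscription",
    "sure i can cancel the subscription",
    "the subscription is cancelled now",
]
for value, index, text in rank_by_intent(conversation):
    print(f"{value:.3f}  [{index}] {text}")
```

With this assumed weighting, the second utterance ranks first because its words ("cancel", "subscription") recur in the utterances that follow it.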

    System and a Method for Generating Semantically Similar Sentences for Building a Robust SLM
    9.
    Invention application
    System and a Method for Generating Semantically Similar Sentences for Building a Robust SLM (status: in force)

    Publication No.: US20130018649A1

    Publication date: 2013-01-17

    Application No.: US13181923

    Application date: 2011-07-13

    IPC classification: G06F17/27

    Abstract: A system and method are described for generating semantically similar sentences for a statistical language model. A semantic class generator determines, for each word in an input utterance, a set of corresponding semantically similar words. A sentence generator computes a set of candidate sentences, each containing at most one member from each set of semantically similar words. A sentence verifier grammatically tests each candidate sentence to determine a set of grammatically correct sentences semantically similar to the input utterance. The generated semantically similar sentences are not restricted to those selected from an existing sentence database.

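The three-stage pipeline in the abstract (semantic class generator, sentence generator, sentence verifier) can be illustrated with a minimal sketch. Everything concrete here is an assumption: the `similar_words` dictionary stands in for the semantic class generator, and `is_grammatical` is a placeholder predicate where a real system would apply a grammar check.

```python
from itertools import product


def generate_candidates(utterance, similar_words):
    """Build candidate sentences by choosing, for each word position,
    either the original word or one of its semantically similar words."""
    tokens = utterance.split()
    choice_sets = [
        [w] + [s for s in similar_words.get(w, []) if s != w]
        for w in tokens
    ]
    return [" ".join(choice) for choice in product(*choice_sets)]


def verify(candidates, is_grammatical):
    """Keep only candidates accepted by the (hypothetical) grammar test."""
    return [c for c in candidates if is_grammatical(c)]


# Hypothetical semantic classes for two words of the input utterance.
similar_words = {"book": ["reserve"], "flight": ["ticket", "plane"]}
candidates = generate_candidates("book a flight", similar_words)

# Placeholder verifier: rejects sentences ending in "plane".
accepted = verify(candidates, lambda s: not s.endswith("plane"))
print(accepted)
```

Because candidates are formed as a Cartesian product of per-word choices, their number grows multiplicatively with the size of each similar-word set, which is one reason a verification stage is useful before the sentences are added to language-model training data.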

    EVALUATING SPOKEN SKILLS
    10.
    Invention application
    EVALUATING SPOKEN SKILLS (status: lapsed)

    Publication No.: US20100185435A1

    Publication date: 2010-07-22

    Application No.: US12354849

    Application date: 2009-01-16

    IPC classification: G06F17/20 G10L15/04

    Abstract: Techniques for evaluating one or more spoken language skills of a speaker are provided. The techniques include identifying one or more temporal locations of interest in a speech passage spoken by a speaker, computing one or more acoustic parameters, wherein the one or more acoustic parameters capture one or more properties of one or more acoustic-phonetic features of the one or more locations of interest, and combining the one or more acoustic parameters with an output of an automatic speech recognizer to modify an output of a spoken language skill evaluation.
