Disambiguation in speech recognition
    1.
    发明授权
    Disambiguation in speech recognition 有权
    消歧语音识别

    公开(公告)号:US09484021B1

    公开(公告)日:2016-11-01

    申请号:US14673089

    申请日:2015-03-30

    CPC classification number: G10L15/08 G10L15/00 G10L15/02 G10L15/22

    Abstract: Automatic speech recognition (ASR) processing including a two-stage configuration. After ASR processing of an incoming utterance where the ASR outputs an N-best list including multiple hypotheses, a first stage determines whether to execute a command associated with one of the hypotheses or whether to output some of the hypotheses of the N-best list for disambiguation. A second stage determines what hypotheses should be included in the disambiguation choices. A first machine learning model is used at the first stage and a second machine learning model is used at the second stage. The multi-stage configuration allows for reduced speech processing errors as well as a reduced number of utterances sent for disambiguation, which thus improves the user experience.

    Abstract translation: 自动语音识别(ASR)处理包括两级配置。 在ASR处理ASR输出包括多个假设的N最佳列表的传入话语的ASR处理之后,第一阶段确定是否执行与假设之一相关联的命令,或者是否输出N最佳列表的一些假设 消歧 第二阶段确定在消歧选择中应包含哪些假设。 在第一阶段使用第一机器学习模型,在第二阶段使用第二机器学习模型。 多级配置允许减少语音处理错误以及为消除歧义而发送的减少的话语数量,从而改善用户体验。

    Confidence scoring for selecting tones and text of voice browsing conversations

    公开(公告)号:US11978445B1

    公开(公告)日:2024-05-07

    申请号:US17217994

    申请日:2021-03-30

    CPC classification number: G10L15/22

    Abstract: Dialog acts (e.g., questions) are selected for voice browsing by a model trained to identify a dialog act that is most likely to lead to a desired outcome. Upon receiving an invocation to begin a conversation, a score indicative of a level of confidence that the conversation will have a successful outcome is determined, and a dialog act is selected based on the score. Subsequently, at each turn of the conversation, the score is updated or a new score is calculated, and a dialog act is selected based on the updated or new score. Confidence scores are calculated based on input features that are determined based on the user who uttered the invocation or responses to dialog acts, as well as a context of the conversation, and provided to a linear model or a machine learning model as inputs.

    Disambiguation in speech recognition
    3.
    发明授权
    Disambiguation in speech recognition 有权
    消歧语音识别

    公开(公告)号:US09558740B1

    公开(公告)日:2017-01-31

    申请号:US14673343

    申请日:2015-03-30

    CPC classification number: G10L15/08 G10L15/22 G10L15/30

    Abstract: Automatic speech recognition (ASR) processing including a feedback configuration to allow for improved disambiguation between ASR hypotheses. After ASR processing of an incoming utterance where the ASR outputs an N-best list including multiple hypotheses, the multiple hypotheses are passed downstream for further processing. The downstream further processing may include natural language understanding (NLU) or other processing to determine a command result for each hypothesis. The command results are compared to determine if any hypotheses of the N-best list would yield similar command results. If so, the hypothesis(es) with similar results are removed from the N-best list so that only one hypothesis of the similar results remains in the N-best list. The remaining non-similar hypotheses are sent for disambiguation, or, if only one hypothesis remains, it is sent for execution.

    Abstract translation: 自动语音识别(ASR)处理包括反馈配置,以便改善ASR假设之间的歧义。 在ASR处理ASR输出包含多个假设的N最佳列表的传入语句之后,多个假设被传递到下游进行进一步处理。 下游进一步处理可以包括自然语言理解(NLU)或其他处理以确定每个假设的命令结果。 比较命令结果以确定N最佳列表的任何假设是否会产生类似的命令结果。 如果是这样,具有类似结果的假设从N最佳列表中删除,因此只有一个类似结果的假设保留在N最佳列表中。 发送剩余的非类似假设以消除歧义,或者如果只剩下一个假设,则将其发送执行。

    Disambiguation in speech recognition

    公开(公告)号:US10283111B1

    公开(公告)日:2019-05-07

    申请号:US15382874

    申请日:2016-12-19

    Abstract: Automatic speech recognition (ASR) processing including a feedback configuration to allow for improved disambiguation between ASR hypotheses. After ASR processing of an incoming utterance where the ASR outputs an N-best list including multiple hypotheses, the multiple hypotheses are passed downstream for further processing. The downstream further processing may include natural language understanding (NLU) or other processing to determine a command result for each hypothesis. The command results are compared to determine if any hypotheses of the N-best list would yield similar command results. If so, the hypothesis(es) with similar results are removed from the N-best list so that only one hypothesis of the similar results remains in the N-best list. The remaining non-similar hypotheses are sent for disambiguation, or, if only one hypothesis remains, it is sent for execution.

Patent Agency Ranking