SEQUENTIAL SPEECH RECOGNITION WITH TWO UNEQUAL ASR SYSTEMS
    4.
    发明申请
    SEQUENTIAL SPEECH RECOGNITION WITH TWO UNEQUAL ASR SYSTEMS 有权
    具有两个非正常ASR系统的顺序语音识别

    公开(公告)号:US20100082343A1

    公开(公告)日:2010-04-01

    申请号:US12240143

    申请日:2008-09-29

    IPC分类号: G10L15/18

    CPC分类号: G10L15/32 G10L2015/0631

    摘要: Sequential speech recognition using two unequal automatic speech recognition (ASR) systems may be provided. The system may provide two sets of vocabulary data. A determination may be made as to whether entries in one set of vocabulary data are likely to be confused with entries in the other set of vocabulary data. If confusion is likely, a decoy entry from one set of the vocabulary data may be placed in the other set of vocabulary data to ensure more efficient and accurate speech recognition processing may take place.

    摘要翻译: 可以提供使用两个不等的自动语音识别(ASR)系统的顺序语音识别。 系统可以提供两组词汇数据。 可以确定一组词汇数据中的条目是否可能与另一组词汇数据中的条目混淆。 如果混淆很可能,则一组词汇数据中的诱饵条目可以被放置在另一组词汇数据中,以确保可能发生更有效和准确的语音识别处理。

    Sequential speech recognition with two unequal ASR systems
    6.
    发明授权
    Sequential speech recognition with two unequal ASR systems 有权
    具有两个不等的ASR系统的顺序语音识别

    公开(公告)号:US08180641B2

    公开(公告)日:2012-05-15

    申请号:US12240143

    申请日:2008-09-29

    IPC分类号: G10L15/28

    CPC分类号: G10L15/32 G10L2015/0631

    摘要: Sequential speech recognition using two unequal automatic speech recognition (ASR) systems may be provided. The system may provide two sets of vocabulary data. A determination may be made as to whether entries in one set of vocabulary data are likely to be confused with entries in the other set of vocabulary data. If confusion is likely, a decoy entry from one set of the vocabulary data may be placed in the other set of vocabulary data to ensure more efficient and accurate speech recognition processing may take place.

    摘要翻译: 可以提供使用两个不等的自动语音识别(ASR)系统的顺序语音识别。 系统可以提供两组词汇数据。 可以确定一组词汇数据中的条目是否可能与另一组词汇数据中的条目混淆。 如果混淆很可能,则一组词汇数据中的诱饵条目可以被放置在另一组词汇数据中,以确保可能发生更有效和准确的语音识别处理。

    Leveraging interaction context to improve recognition confidence scores
    7.
    发明授权
    Leveraging interaction context to improve recognition confidence scores 有权
    利用互动环境提高识别信心分数

    公开(公告)号:US08886532B2

    公开(公告)日:2014-11-11

    申请号:US12912938

    申请日:2010-10-27

    IPC分类号: G10L15/00 G10L15/08 G10L15/22

    CPC分类号: G10L15/01 G10L15/08 G10L15/22

    摘要: On a computing device a speech utterance is received from a user. The speech utterance is a section of a speech dialog that includes a plurality of speech utterances. One or more features from the speech utterance are identified. Each identified feature from the speech utterance is a specific characteristic of the speech utterance. One or more features from the speech dialog are identified. Each identified feature from the speech dialog is associated with one or more events in the speech dialog. The one or more events occur prior to the speech utterance. One or more identified features from the speech utterance and one or more identified features from the speech dialog are used to calculate a confidence score for the speech utterance.

    摘要翻译: 在计算设备上,从用户接收到语音说话。 讲话话语是包括多个语音话语的语音对话框的一部分。 识别来自语音话语的一个或多个特征。 讲话话语中的每一个特征都是言语发音的一个特定特征。 识别来自语音对话的一个或多个特征。 来自语音对话的每个识别的特征与语音对话中的一个或多个事件相关联。 一个或多个事件发生在语音发音之前。 来自语音话语的一个或多个识别的特征和来自语音对话的一个或多个识别的特征被用于计算语音话语的置信度。

    LEVERAGING INTERACTION CONTEXT TO IMPROVE RECOGNITION CONFIDENCE SCORES
    8.
    发明申请
    LEVERAGING INTERACTION CONTEXT TO IMPROVE RECOGNITION CONFIDENCE SCORES 有权
    利用互动语境提高认知信心

    公开(公告)号:US20150046163A1

    公开(公告)日:2015-02-12

    申请号:US14521990

    申请日:2014-10-23

    IPC分类号: G10L15/01

    CPC分类号: G10L15/01 G10L15/08 G10L15/22

    摘要: On a computing device a speech utterance is received from a user. The speech utterance is a section of a speech dialog that includes a plurality of speech utterances. One or more features from the speech utterance are identified. Each identified feature from the speech utterance is a specific characteristic of the speech utterance. One or more features from the speech dialog are identified. Each identified feature from the speech dialog is associated with one or more events in the speech dialog. The one or more events occur prior to the speech utterance. One or more identified features from the speech utterance and one or more identified features from the speech dialog are used to calculate a confidence score for the speech utterance.

    摘要翻译: 在计算设备上,从用户接收到语音说话。 讲话话语是包括多个语音话语的语音对话框的一部分。 识别来自语音话语的一个或多个特征。 讲话话语中的每一个特征都是言语发音的一个特定特征。 识别来自语音对话的一个或多个特征。 来自语音对话的每个识别的特征与语音对话中的一个或多个事件相关联。 一个或多个事件发生在语音发音之前。 来自语音话语的一个或多个识别的特征和来自语音对话的一个或多个识别的特征被用于计算语音话语的置信度。

    Leveraging Interaction Context to Improve Recognition Confidence Scores
    9.
    发明申请
    Leveraging Interaction Context to Improve Recognition Confidence Scores 有权
    利用互动背景来提升识别信心

    公开(公告)号:US20120109652A1

    公开(公告)日:2012-05-03

    申请号:US12912938

    申请日:2010-10-27

    IPC分类号: G10L15/00

    CPC分类号: G10L15/01 G10L15/08 G10L15/22

    摘要: On a computing device a speech utterance is received from a user. The speech utterance is a section of a speech dialog that includes a plurality of speech utterances. One or more features from the speech utterance are identified. Each identified feature from the speech utterance is a specific characteristic of the speech utterance. One or more features from the speech dialog are identified. Each identified feature from the speech dialog is associated with one or more events in the speech dialog. The one or more events occur prior to the speech utterance. One or more identified features from the speech utterance and one or more identified features from the speech dialog are used to calculate a confidence score for the speech utterance.

    摘要翻译: 在计算设备上,从用户接收到语音说话。 讲话话语是包括多个语音话语的语音对话框的一部分。 识别来自语音话语的一个或多个特征。 讲话话语中的每一个特征都是言语发音的一个特定特征。 识别来自语音对话的一个或多个特征。 来自语音对话的每个识别的特征与语音对话中的一个或多个事件相关联。 一个或多个事件发生在语音发音之前。 来自语音话语的一个或多个识别的特征和来自语音对话的一个或多个识别的特征被用于计算语音话语的置信度。

    RECOGNITION USING RE-RECOGNITION AND STATISTICAL CLASSIFICATION
    10.
    发明申请
    RECOGNITION USING RE-RECOGNITION AND STATISTICAL CLASSIFICATION 有权
    使用重新识别和统计分类的识别

    公开(公告)号:US20100312546A1

    公开(公告)日:2010-12-09

    申请号:US12477918

    申请日:2009-06-04

    IPC分类号: G10L15/18

    摘要: Architecture that employs an overall grammar as a set of context-specific grammars for recognition of an input, each responsible for a specific context, such as subtask category, geographic region, etc. The grammars together cover the entire domain. Moreover, multiple recognitions can be run in parallel against the same input, where each recognition uses one or more of the context-specific grammars. The multiple intermediate recognition results from the different recognizer-grammars are reconciled by running re-recognition using a dynamically composed grammar based on the multiple recognition results and potentially other domain knowledge, or selecting the winner using a statistical classifier operating on classification features extracted from the multiple recognition results and other domain knowledge.

    摘要翻译: 使用整体语法作为一组上下文特定语法的体系结构,用于识别输入,每个语言负责特定上下文,例如子任务类别,地理区域等。语法一起涵盖整个域。 此外,可以针对相同的输入并行运行多个识别,其中每个识别使用一个或多个上下文特定语法。 通过使用基于多个识别结果和潜在的其他领域知识的动态组合语法运行重新识别来协调来自不同识别器语法的多个中间识别结果,或者使用对从分类特征提取的分类特征进行运算的统计分类器来选择胜者 多重识别结果等领域知识。