Timing of speech recognition over lossy transmission systems
    65.
    发明授权
    Timing of speech recognition over lossy transmission systems 有权
    有损传输系统语音识别的时序

    公开(公告)号:US07496503B1

    公开(公告)日:2009-02-24

    申请号:US11611983

    申请日:2006-12-18

    IPC分类号: G10L19/00

    CPC分类号: G10L15/02 G10L15/20

    摘要: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. After waiting for a predetermined time, speech vectors are generated and potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

    摘要翻译: 识别通过有损通信链路作为语音向量接收的语音流包括:通过有损分组化传输链路从分组接收的分组来构建语音识别器的一系列语音向量,其中与每个语音向量相关联的一些分组丢失或损坏 传输。 每个构造的语音向量是多维的并且包括相关联的特征。 在等待预定的时间之后,产生语音向量,并且在存在时将语音向量内潜在的损坏的特征指示给语音识别器。 当存在损坏的特征时,语音识别器在语音向量上尝试语音识别。 该识别可以仅基于每个语音向量内的某些或有效特征。 当指示步骤指示损坏的值以及尝试的识别步骤失败时,请求重新发送丢失或损坏的数据包。

    Spoken language understanding that incorporates prior knowledge into boosting
    66.
    发明授权
    Spoken language understanding that incorporates prior knowledge into boosting 有权
    将先前知识纳入提升的口语理解

    公开(公告)号:US07328146B1

    公开(公告)日:2008-02-05

    申请号:US11484120

    申请日:2006-07-11

    IPC分类号: G06F17/20

    CPC分类号: G06F17/28

    摘要: A system for understanding entries, such as speech, develops a classifier by employing prior knowledge with which a given corpus of training entries is enlarged threefold. A rule is created for each of the labels employed in the classifier, and the created rules are applied to the given corpus to create a corpus of attachments by appending a weight of ηp(x), or 1−ηp(x), to labels of entries that meet, or fail to meet, respectively, conditions of the labels' rules, and to also create a corpus of non-attachments by appending a weight of 1−ηp(x), or ηp(x), to labels of entries that meet, or fail to meet conditions of the labels' rules.

    摘要翻译: 用于理解诸如言语之类的条目的系统通过采用将给定的训练条目语料库放大三倍的先验知识来开发分类器。 为分类器中使用的每个标签创建规则,并将创建的规则应用于给定的语料库,以通过将一个权重为etap(x)或1-etap(x)附加到标签来创建附件语料库 分别符合或未能满足标签规则的条件的条目,并通过将1-etap(x)或etap(x)的权重附加到标签的标签上来创建非附件语料库 满足或不符合标签规则条件的条目。

    Speech recognition over lossy networks with rejection threshold
    67.
    发明授权
    Speech recognition over lossy networks with rejection threshold 有权
    具有拒绝门槛的有损网络的语音识别

    公开(公告)号:US07171359B1

    公开(公告)日:2007-01-30

    申请号:US10902304

    申请日:2004-07-29

    IPC分类号: G10L15/06

    CPC分类号: G10L15/02 G10L15/20

    摘要: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. Potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

    摘要翻译: 识别通过有损通信链路作为语音向量接收的语音流包括:通过有损分组化传输链路从分组接收的分组来构建语音识别器的一系列语音向量,其中与每个语音向量相关联的一些分组丢失或损坏 传输。 每个构造的语音向量是多维的并且包括相关联的特征。 语音向量中的潜在损坏的特征在存在时被指示给语音识别器。 当存在损坏的特征时,语音识别器在语音向量上尝试语音识别。 该识别可以仅基于每个语音向量内的某些或有效特征。 当指示步骤指示损坏的值以及尝试的识别步骤失败时,请求重新发送丢失或损坏的数据包。

    Method and system for performing speech recognition
    68.
    发明授权
    Method and system for performing speech recognition 失效
    执行语音识别的方法和系统

    公开(公告)号:US5806022A

    公开(公告)日:1998-09-08

    申请号:US575378

    申请日:1995-12-20

    摘要: Speech recognition processing is compensated for improving robustness of speech recognition in the presence of enhanced speech signals. The compensation overcomes the adverse effects that speech signal enhancement may have on speech recognition performance, where speech signal enhancement causes acoustical mismatches between recognition models trained using unenhanced speech signals and feature data extracted from enhanced speech signals. Compensation is provided at the front end of an automatic speech recognition system by combining linear predictive coding and mel-based cepstral parameter analysis for computing cepstral features of transmitted speech signals used for speech recognition processing by selectively weighting mel-filter banks when processing frequency domain representations of the enhanced speech signals.

    摘要翻译: 在增强语音信号的存在下,语音识别处理被补偿以提高语音识别的鲁棒性。 补偿克服语音信号增强可能对语音识别性能的不利影响,其中语音信号增强导致使用未增强语音信号训练的识别模型和从增强语音信号提取的特征数据之间的声学​​失配。 在自动语音识别系统的前端通过组合线性预测编码和基于梅尔的倒谱参数分析来提供用于计算用于语音识别处理的传输语音信号的倒谱特征的补偿,其中当处理频域表示时,通过选择性地加权梅尔滤波器组 的增强语音信号。

    Discriminative utterance verification for connected digits recognition
    69.
    发明授权
    Discriminative utterance verification for connected digits recognition 失效
    连接数字识别的歧视性话语验证

    公开(公告)号:US5737489A

    公开(公告)日:1998-04-07

    申请号:US528902

    申请日:1995-09-15

    摘要: In a speech recognition system, a recognition processor receives an unknown utterance signal as input. The recognition processor in response to the unknown utterance signal input accesses a recognition database and scores the utterance signal against recognition models in the recognition database to classify the unknown utterance and to generate a hypothesis speech signal. A verification processor receives the hypothesis speech signal as input to be verified. The verification processor accesses a verification database to test the hypothesis speech signal against verification models reflecting a preselected type of training stored in the verification database. Based on the verification test, the verification processor generates a confidence measure signal. The confidence measure signal can be compared against a verification threshold to determine the accuracy of the recognition decision made by the recognition processor.

    摘要翻译: 在语音识别系统中,识别处理器接收未知的话音信号作为输入。 响应于未知话语信号输入的识别处理器访问识别数据库,并根据识别数据库中的识别模型对话音信号进行评分,以对未知话语进行分类并生成假设语音信号。 验证处理器接收假设语音信号作为待验证的输入。 验证处理器访问验证数据库以针对反映存储在验证数据库中的预选类型的训练的验证模型来测试假设语音信号。 基于验证测试,验证处理器产生置信度测量信号。 可以将置信度信号与验证阈值进行比较,以确定由识别处理器进行的识别决策的准确性。