Timing of speech recognition over lossy transmission systems
    1.
    发明授权
    Timing of speech recognition over lossy transmission systems 有权
    有损传输系统语音识别的时序

    公开(公告)号:US07752036B2

    公开(公告)日:2010-07-06

    申请号:US12344815

    申请日:2008-12-29

    IPC分类号: G10L19/00

    CPC分类号: G10L15/02 G10L15/20

    摘要: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. After waiting for a predetermined time, speech vectors are generated and potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

    摘要翻译: 识别通过有损通信链路作为语音向量接收的语音流包括:通过有损分组化传输链路从分组接收的分组来构建语音识别器的一系列语音向量,其中与每个语音向量相关联的一些分组丢失或损坏 传输。 每个构造的语音向量是多维的并且包括相关联的特征。 在等待预定的时间之后,产生语音向量,并且在存在时将语音向量内潜在的损坏的特征指示给语音识别器。 当存在损坏的特征时,语音识别器在语音向量上尝试语音识别。 该识别可以仅基于每个语音向量内的某些或有效特征。 当指示步骤指示损坏的值以及尝试的识别步骤失败时,请求重新发送丢失或损坏的数据包。

    TIMING OF SPEECH RECOGNITION OVER LOSSY TRANSMISSION SYSTEMS
    2.
    发明申请
    TIMING OF SPEECH RECOGNITION OVER LOSSY TRANSMISSION SYSTEMS 有权
    语音识别的时序在损失传输系统中

    公开(公告)号:US20090112585A1

    公开(公告)日:2009-04-30

    申请号:US12344815

    申请日:2008-12-29

    IPC分类号: G10L15/00

    CPC分类号: G10L15/02 G10L15/20

    摘要: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. After waiting for a predetermined time, speech vectors are generated and potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

    摘要翻译: 识别通过有损通信链路作为语音向量接收的语音流包括:通过有损分组化传输链路接收的分组来构建语音识别器的一系列语音向量,其中与每个语音向量相关联的一些分组丢失或损坏 传输。 每个构造的语音向量是多维的并且包括相关联的特征。 在等待预定的时间之后,产生语音向量,并且在存在时将语音向量内潜在的损坏的特征指示给语音识别器。 当存在损坏的特征时,语音识别器在语音向量上尝试语音识别。 该识别可以仅基于每个语音向量内的某些或有效特征。 当指示步骤指示损坏的值以及尝试的识别步骤失败时,请求重新发送丢失或损坏的数据包。

    Speech recognition over lossy transmission systems
    3.
    发明授权
    Speech recognition over lossy transmission systems 失效
    有损传输系统的语音识别

    公开(公告)号:US06775652B1

    公开(公告)日:2004-08-10

    申请号:US09107784

    申请日:1998-06-30

    IPC分类号: G10L1528

    CPC分类号: G10L15/02 G10L15/20

    摘要: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. Potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

    摘要翻译: 识别通过有损通信链路作为语音向量接收的语音流包括:通过有损分组化传输链路从分组接收的分组来构建语音识别器的一系列语音向量,其中与每个语音向量相关联的一些分组丢失或损坏 传输。 每个构造的语音向量是多维的并且包括相关联的特征。 语音向量中的潜在损坏的特征在存在时被指示给语音识别器。 当存在损坏的特征时,语音识别器在语音向量上尝试语音识别。 该识别可以仅基于每个语音向量内的某些或有效特征。 当指示步骤指示损坏的值以及尝试的识别步骤失败时,请求重新发送丢失或损坏的数据包。

    Timing of speech recognition over lossy transmission systems
    4.
    发明授权
    Timing of speech recognition over lossy transmission systems 有权
    有损传输系统语音识别的时序

    公开(公告)号:US07496503B1

    公开(公告)日:2009-02-24

    申请号:US11611983

    申请日:2006-12-18

    IPC分类号: G10L19/00

    CPC分类号: G10L15/02 G10L15/20

    摘要: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. After waiting for a predetermined time, speech vectors are generated and potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

    摘要翻译: 识别通过有损通信链路作为语音向量接收的语音流包括:通过有损分组化传输链路从分组接收的分组来构建语音识别器的一系列语音向量,其中与每个语音向量相关联的一些分组丢失或损坏 传输。 每个构造的语音向量是多维的并且包括相关联的特征。 在等待预定的时间之后,产生语音向量,并且在存在时将语音向量内潜在的损坏的特征指示给语音识别器。 当存在损坏的特征时,语音识别器在语音向量上尝试语音识别。 该识别可以仅基于每个语音向量内的某些或有效特征。 当指示步骤指示损坏的值以及尝试的识别步骤失败时,请求重新发送丢失或损坏的数据包。

    Speech recognition over lossy networks with rejection threshold
    5.
    发明授权
    Speech recognition over lossy networks with rejection threshold 有权
    具有拒绝门槛的有损网络的语音识别

    公开(公告)号:US07171359B1

    公开(公告)日:2007-01-30

    申请号:US10902304

    申请日:2004-07-29

    IPC分类号: G10L15/06

    CPC分类号: G10L15/02 G10L15/20

    摘要: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. Potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

    摘要翻译: 识别通过有损通信链路作为语音向量接收的语音流包括:通过有损分组化传输链路从分组接收的分组来构建语音识别器的一系列语音向量,其中与每个语音向量相关联的一些分组丢失或损坏 传输。 每个构造的语音向量是多维的并且包括相关联的特征。 语音向量中的潜在损坏的特征在存在时被指示给语音识别器。 当存在损坏的特征时,语音识别器在语音向量上尝试语音识别。 该识别可以仅基于每个语音向量内的某些或有效特征。 当指示步骤指示损坏的值以及尝试的识别步骤失败时,请求重新发送丢失或损坏的数据包。

    Natural language knowledge servers as network resources
    6.
    发明授权
    Natural language knowledge servers as network resources 有权
    自然语言知识服务器作为网络资源

    公开(公告)号:US06192338B1

    公开(公告)日:2001-02-20

    申请号:US09334916

    申请日:1999-06-17

    IPC分类号: G10L1518

    CPC分类号: G10L15/30 G10L2015/228

    摘要: A network resource system includes a first server which can communicate with a client computer. The first server produces a speech signal representing speech from a user at the client computer, and context information which indicates the semantic context of the user's speech and a predefined format in which data are returned to the first server. A network knowledge server is in communication with and separated from the first server. The network knowledge server returns to the first server a text structure having one or more fields corresponding to the predefined format. The first server uses data from the one or more fields to determine a response to the user's speech.

    摘要翻译: 网络资源系统包括可与客户端计算机进行通信的第一服务器。 第一服务器产生表示来自客户端计算机的用户的语音的语音信号,以及指示用户语音的语义上下文的上下文信息和数据返回到第一服务器的预定格式。 网络知识服务器与第一服务器通信并与第一服务器分离。 网络知识服务器向第一服务器返回具有与预定义格式相对应的一个或多个字段的文本结构。 第一台服务器使用一个或多个字段的数据来确定对用户演讲的响应。

    CONCISE DYNAMIC GRAMMARS USING N-BEST SELECTION
    7.
    发明申请
    CONCISE DYNAMIC GRAMMARS USING N-BEST SELECTION 有权
    使用N-BEST选择的概念动态灰度

    公开(公告)号:US20110202343A1

    公开(公告)日:2011-08-18

    申请号:US13096431

    申请日:2011-04-28

    IPC分类号: G10L15/14 G10L15/18

    摘要: A method and apparatus derive a dynamic grammar composed of a subset of a plurality of data elements that are each associated with one of a plurality of reference identifiers. The present invention generates a set of selection identifiers on the basis of a user-provided first input identifier and determines which of these selection identifiers are present in a set of pre-stored reference identifiers. The present invention creates a dynamic grammar that includes those data elements that are associated with those reference identifiers that are matched to any of the selection identifiers. Based on a user-provided second identifier and on the data elements of the dynamic grammar, the present invention selects one of the reference identifiers in the dynamic grammar.

    摘要翻译: 方法和装置导出由多个数据元素的子集组成的动态语法,每个数据元素与多个参考标识符之一相关联。 本发明基于用户提供的第一输入标识符生成一组选择标识符,并且确定这些选择标识符中的哪一个存在于一组预先存储的参考标识符中。 本发明创建动态语法,其包括与与任何选择标识符匹配的那些参考标识符相关联的那些数据元素。 基于用户提供的第二标识符和动态语法的数据元素,本发明在动态语法中选择参考标识符之一。

    Statistical database correction of alphanumeric account numbers for
speech recognition and touch-tone recognition
    8.
    发明授权
    Statistical database correction of alphanumeric account numbers for speech recognition and touch-tone recognition 失效
    用于语音识别和触摸音识别的字母数字帐号的统计数据库校正

    公开(公告)号:US6137863A

    公开(公告)日:2000-10-24

    申请号:US763382

    申请日:1996-12-13

    IPC分类号: G10L15/18 H04M11/06 H04M1/64

    摘要: A method and apparatus for recognizing an identifier entered by a user. A caller enters a predetermined identifier through a voice input device or a touch-tone keypad of a telephone handset. A signal representing the entered identifier is transmitted to a remote recognizer, which responds to the identifier signal by producing a recognized output intended to match the entered identifier. The present invention compares this recognized identifier with a list of valid reference identifiers to determine which one of these reference identifiers most likely matches the entered identifier. In performing this determination, the present invention employs a confusion matrix, which is an arrangement of probabilities that indicate the likelihood that a given character in a particular character position of the reference identifier would be recognized by the recognizer as a character in the corresponding character position of the recognized identifier. This determination yields an identifier recognition probability for every reference identifier, and the present invention selects the reference identifier with the highest identifier recognition probability as most likely corresponding to the entered identifier.

    摘要翻译: 一种用于识别由用户输入的标识符的方法和装置。 呼叫者通过电话听筒的语音输入装置或按键式键盘输入预定标识符。 表示输入的标识符的信号被发送到远程识别器,远程识别器通过产生用于匹配输入的标识符的识别的输出来响应标识符信号。 本发明将该识别的标识符与有效参考标识符的列表进行比较,以确定这些参考标识符中的哪一个最可能与输入的标识符匹配。 在执行该确定时,本发明采用混淆矩阵,其是概率排列,其指示参考标识符的特定字符位置中的给定字符将被识别器识别为对应字符位置中的字符的可能性 的识别标识符。 该确定对于每个参考标识符产生标识符识别概率,并且本发明以最可能对应于输入的标识符的方式选择具有最高标识符识别概率的参考标识符。

    Concise dynamic grammars using N-best selection
    9.
    发明授权
    Concise dynamic grammars using N-best selection 失效
    使用N最佳选择的简明动态语法

    公开(公告)号:US07937260B1

    公开(公告)日:2011-05-03

    申请号:US09097787

    申请日:1998-06-15

    IPC分类号: G10L17/20

    摘要: A method and apparatus derive a dynamic grammar composed of a subset of a plurality of data elements that are each associated with one of a plurality of reference identifiers. The present invention generates a set of selection identifiers on the basis of a user-provided first input identifier and determines which of these selection identifiers are present in a set of pre-stored reference identifiers. The present invention creates a dynamic grammar that includes those data elements that are associated with those reference identifiers that are matched to any of the selection identifiers. Based on a user-provided second identifier and on the data elements of the dynamic grammar, the present invention selects one of the reference identifiers in the dynamic grammar.

    摘要翻译: 方法和装置导出由多个数据元素的子集组成的动态语法,每个数据元素与多个参考标识符之一相关联。 本发明基于用户提供的第一输入标识符生成一组选择标识符,并且确定这些选择标识符中的哪一个存在于一组预先存储的参考标识符中。 本发明创建动态语法,其包括与与任何选择标识符匹配的那些参考标识符相关联的那些数据元素。 基于用户提供的第二标识符和动态语法的数据元素,本发明在动态语法中选择参考标识符之一。

    Statistical database correction of alphanumeric identifiers for speech recognition and touch-tone recognition
    10.
    发明授权
    Statistical database correction of alphanumeric identifiers for speech recognition and touch-tone recognition 失效
    用于语音识别和触摸音识别的字母数字标识符的统计数据库校正

    公开(公告)号:US06400805B1

    公开(公告)日:2002-06-04

    申请号:US09097769

    申请日:1998-06-15

    IPC分类号: H04M164

    CPC分类号: H04M1/271

    摘要: A method and apparatus recognize an identifier entered by a user. A caller enters a predetermined identifier through a voice input device or a touch-tone keypad of a telephone handset. A signal representing the entered identifier is transmitted to a remote recognizer, which responds to the identifier signal by producing a recognized output intended to match the entered identifier. The present invention compares this recognized identifier with a list of valid reference identifiers to determine which one of these reference identifiers most likely matches the entered identifier. In performing this determination, the present invention compares each character of the recognized identifier with a character in a corresponding character position of each reference identifier in light of a plurality of confusion sets. On the basis of this comparison, the set of reference identifiers is reduced to a candidate set of reference identifiers, from which a reference identifier that matches the input identifier provided by the user.

    摘要翻译: 方法和装置识别用户输入的标识符。 呼叫者通过电话听筒的语音输入装置或按键式键盘输入预定标识符。 表示输入的标识符的信号被发送到远程识别器,远程识别器通过产生用于匹配输入的标识符的识别的输出来响应标识符信号。 本发明将该识别的标识符与有效参考标识符的列表进行比较,以确定这些参考标识符中的哪一个最可能与输入的标识符匹配。 在执行该确定时,本发明根据多个混淆集合将识别的标识符的每个字符与每个参考标识符的对应字符位置中的字符进行比较。 基于该比较,参考标识符的集合被减少到参考标识符的候选集合,参考标识符与用户提供的输入标识符匹配。