Method for building a natural language understanding model for a spoken dialog system
    41.
    发明授权
    Method for building a natural language understanding model for a spoken dialog system 有权
    建立语言对话系统的自然语言理解模型的方法

    公开(公告)号:US07933766B2

    公开(公告)日:2011-04-26

    申请号:US12582062

    申请日:2009-10-20

    IPC分类号: G06F17/27

    摘要: A method of generating a natural language model for use in a spoken dialog system is disclosed. The method comprises using sample utterances and creating a number of hand crafted rules for each call-type defined in a labeling guide. A first NLU model is generated and tested using the hand crafted rules and sample utterances. A second NLU model is built using the sample utterances as new training data and using the hand crafted rules. The second NLU model is tested for performance using a first batch of labeled data. A series of NLU models are built by adding a previous batch of labeled data to training data and using a new batch of labeling data as test data to generate the series of NLU models with training data that increases constantly. If not all the labeling data is received, the method comprises repeating the step of building a series of NLU models until all labeling data is received. After all the training data is received, at least once, the method comprises building a third NLU model using all the labeling data, wherein the third NLU model is used in generating the spoken dialog service.

    摘要翻译: 公开了一种生成在口头对话系统中使用的自然语言模型的方法。 该方法包括对标签指南中定义的每个呼叫类型使用样本话语和创建许多手工制作规则。 使用手工制作的规则和样品说话来生成和测试第一个NLU模型。 使用示例语句作为新的训练数据并使用手工制作规则构建了第二个NLU模型。 使用第一批标签数据对第二个NLU模型进行性能测试。 通过将前一批标签数据添加到训练数据并使用新批签名数据作为测试数据来生成一系列NLU模型,训练数据不断增加,构建了一系列NLU模型。 如果不是全部接收到标签数据,则该方法包括重复建立一系列NLU模型的步骤,直到接收到所有标记数据为止。 在接收到所有训练数据之后,至少一次,该方法包括使用所有标签数据构建第三NLU模型,其中第三NLU模型用于生成口语对话服务。

    Timing of speech recognition over lossy transmission systems
    42.
    发明授权
    Timing of speech recognition over lossy transmission systems 有权
    有损传输系统语音识别的时序

    公开(公告)号:US07752036B2

    公开(公告)日:2010-07-06

    申请号:US12344815

    申请日:2008-12-29

    IPC分类号: G10L19/00

    CPC分类号: G10L15/02 G10L15/20

    摘要: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. After waiting for a predetermined time, speech vectors are generated and potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

    摘要翻译: 识别通过有损通信链路作为语音向量接收的语音流包括:通过有损分组化传输链路从分组接收的分组来构建语音识别器的一系列语音向量,其中与每个语音向量相关联的一些分组丢失或损坏 传输。 每个构造的语音向量是多维的并且包括相关联的特征。 在等待预定的时间之后,产生语音向量,并且在存在时将语音向量内潜在的损坏的特征指示给语音识别器。 当存在损坏的特征时,语音识别器在语音向量上尝试语音识别。 该识别可以仅基于每个语音向量内的某些或有效特征。 当指示步骤指示损坏的值以及尝试的识别步骤失败时,请求重新发送丢失或损坏的数据包。

    Recognizing the numeric language in natural spoken dialogue
    43.
    发明授权
    Recognizing the numeric language in natural spoken dialogue 有权
    认识到自然语言对话中的数字语言

    公开(公告)号:US07624015B1

    公开(公告)日:2009-11-24

    申请号:US11276502

    申请日:2006-03-02

    IPC分类号: G10L15/14 G10L15/18

    CPC分类号: G10L15/142

    摘要: A system and a method are provided. A speech recognition processor receives unconstrained input speech and outputs a string of words. The speech recognition processor is based on a numeric language that represents a subset of a vocabulary. The subset includes a set of words identified as being for interpreting and understanding number strings. A numeric understanding processor contains classes of rules for converting the string of words into a sequence of digits. The speech recognition processor utilizes an acoustic model database. A validation database stores a set of valid sequences of digits. A string validation processor outputs validity information based on a comparison of a sequence of digits output by the numeric understanding processor with valid sequences of digits in the validation database.

    摘要翻译: 提供了一种系统和方法。 语音识别处理器接收无约束输入语音并输出一串字。 语音识别处理器基于代表词汇子集的数字语言。 该子集包括被识别为用于解释和理解数字串的一组单词。 数字理解处理器包含用于将字符串转换为数字序列的规则类型。 语音识别处理器使用声学模型数据库。 验证数据库存储一组有效的数字序列。 字符串验证处理器基于数字理解处理器输出的数字序列与验证数据库中的有效数字序列的比较来输出有效性信息。

    TIMING OF SPEECH RECOGNITION OVER LOSSY TRANSMISSION SYSTEMS
    44.
    发明申请
    TIMING OF SPEECH RECOGNITION OVER LOSSY TRANSMISSION SYSTEMS 有权
    语音识别的时序在损失传输系统中

    公开(公告)号:US20090112585A1

    公开(公告)日:2009-04-30

    申请号:US12344815

    申请日:2008-12-29

    IPC分类号: G10L15/00

    CPC分类号: G10L15/02 G10L15/20

    摘要: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. After waiting for a predetermined time, speech vectors are generated and potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

    摘要翻译: 识别通过有损通信链路作为语音向量接收的语音流包括:通过有损分组化传输链路接收的分组来构建语音识别器的一系列语音向量,其中与每个语音向量相关联的一些分组丢失或损坏 传输。 每个构造的语音向量是多维的并且包括相关联的特征。 在等待预定的时间之后,产生语音向量,并且在存在时将语音向量内潜在的损坏的特征指示给语音识别器。 当存在损坏的特征时,语音识别器在语音向量上尝试语音识别。 该识别可以仅基于每个语音向量内的某些或有效特征。 当指示步骤指示损坏的值以及尝试的识别步骤失败时,请求重新发送丢失或损坏的数据包。

    Method for building a natural language understanding model for a spoken dialog system
    46.
    发明授权
    Method for building a natural language understanding model for a spoken dialog system 有权
    建立语言对话系统的自然语言理解模型的方法

    公开(公告)号:US07295981B1

    公开(公告)日:2007-11-13

    申请号:US10755014

    申请日:2004-01-09

    IPC分类号: G10L15/18

    摘要: A method of generating a natural language model for use in a spoken dialog system is disclosed. The method comprises using sample utterances and creating a number of hand crafted rules for each call-type defined in a labeling guide. A first NLU model is generated and tested using the hand crafted rules and sample utterances. A second NLU model is built using the sample utterances as new training data and using the hand crafted rules. The second NLU model is tested for performance using a first batch of labeled data. A series of NLU models are built by adding a previous batch of labeled data to training data and using a new batch of labeling data as test data to generate the series of NLU models with training data that increases constantly. If not all the labeling data is received, the method comprises repeating the step of building a series of NLU models until all labeling data is received. After all the training data is received, at least once, the method comprises building a third NLU model using all the labeling data, wherein the third NLU model is used in generating the spoken dialog service.

    摘要翻译: 公开了一种生成在口头对话系统中使用的自然语言模型的方法。 该方法包括对标签指南中定义的每个呼叫类型使用样本话语和创建许多手工制作规则。 使用手工制作的规则和样品说话来生成和测试第一个NLU模型。 使用示例语句作为新的训练数据并使用手工制作规则构建了第二个NLU模型。 使用第一批标签数据对第二个NLU模型进行性能测试。 通过将前一批标签数据添加到训练数据并使用新批签名数据作为测试数据来生成一系列NLU模型,训练数据不断增加,构建了一系列NLU模型。 如果不是全部接收到标签数据,则该方法包括重复建立一系列NLU模型的步骤,直到接收到所有标记数据为止。 在接收到所有训练数据之后,至少一次,该方法包括使用所有标签数据构建第三NLU模型,其中第三NLU模型用于生成口语对话服务。

    Active labeling for spoken language understanding
    47.
    发明授权
    Active labeling for spoken language understanding 有权
    积极标注口语理解

    公开(公告)号:US07292982B1

    公开(公告)日:2007-11-06

    申请号:US10447889

    申请日:2003-05-29

    IPC分类号: G06F17/21 G06F17/27 G10L15/08

    CPC分类号: G10L15/1822

    摘要: An active labeling process is provided that aims to minimize the number of utterances to be checked again by automatically selecting the ones that are likely to be erroneous or inconsistent with the previously labeled examples. In one embodiment, the errors and inconsistencies are identified based on the confidences obtained from a previously trained classifier model. In a second embodiment, the errors and inconsistencies are identified based on an unsupervised learning process. In both embodiments, the active labeling process is not dependent upon the particular classifier model.

    摘要翻译: 提供了一种主动标注过程,其目的是通过自动选择可能是错误的或与先前标记的示例不一致的那些来最小化要再次检查的话语的数量。 在一个实施例中,基于从先前训练的分类器模型获得的信心来识别误差和不一致性。 在第二实施例中,基于无监督的学习过程来识别错误和不一致。 在两个实施方案中,活性标记过程不依赖于特定的分类器模型。

    Speech recognition over lossy transmission systems
    49.
    发明授权
    Speech recognition over lossy transmission systems 失效
    有损传输系统的语音识别

    公开(公告)号:US06775652B1

    公开(公告)日:2004-08-10

    申请号:US09107784

    申请日:1998-06-30

    IPC分类号: G10L1528

    CPC分类号: G10L15/02 G10L15/20

    摘要: Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. Potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.

    摘要翻译: 识别通过有损通信链路作为语音向量接收的语音流包括:通过有损分组化传输链路从分组接收的分组来构建语音识别器的一系列语音向量,其中与每个语音向量相关联的一些分组丢失或损坏 传输。 每个构造的语音向量是多维的并且包括相关联的特征。 语音向量中的潜在损坏的特征在存在时被指示给语音识别器。 当存在损坏的特征时,语音识别器在语音向量上尝试语音识别。 该识别可以仅基于每个语音向量内的某些或有效特征。 当指示步骤指示损坏的值以及尝试的识别步骤失败时,请求重新发送丢失或损坏的数据包。

    Signal conditioned minimum error rate training for continuous speech
recognition
    50.
    发明授权
    Signal conditioned minimum error rate training for continuous speech recognition 失效
    用于连续语音识别的信号条件最小误差率训练

    公开(公告)号:US5806029A

    公开(公告)日:1998-09-08

    申请号:US528821

    申请日:1995-09-15

    摘要: Hierarchical signal bias removal (HSBR) signal conditioning uses a codebook constructed from the set of recognition models and is updated as the recognition models are modified during recognition model training. As a result, HSBR signal conditioning and recognition model training are based on the same set of recognition model parameters, which provides significant reduction in recognition error rate for the speech recognition system.

    摘要翻译: 分级信号偏移去除(HSBR)信号调理使用由该组识别模型构建的码本,并且在识别模型训练期间识别模型被修改时被更新。 因此,HSBR信号调理和识别模型训练基于相同的识别模型参数集,这显着降低了语音识别系统的识别误码率。