SYSTEM AND METHOD FOR TUNING AND TESTING IN A SPEECH RECOGNITION SYSTEM
    1.
    发明申请
    SYSTEM AND METHOD FOR TUNING AND TESTING IN A SPEECH RECOGNITION SYSTEM 有权
    用于在语音识别系统中进行调谐和测试的系统和方法

    公开(公告)号:US20090043576A1

    公开(公告)日:2009-02-12

    申请号:US12255564

    申请日:2008-10-21

    IPC分类号: G10L15/06

    摘要: Systems and methods for improving the performance of a speech recognition system. In some embodiments a tuner module and/or a tester module are configured to cooperate with a speech recognition system. The tester and tuner modules can be configured to cooperate with each other. In one embodiment, the tuner module may include a module for playing back a selected portion of a digital data audio file, a module for creating and/or editing a transcript of the selected portion, and/or a module for displaying information associated with a decoding of the selected portion, the decoding generated by a speech recognition engine. In other embodiments, the tester module can include an editor for creating and/or modifying a grammar, a module for receiving a selected portion of a digital audio file and its corresponding transcript, and a scoring module for producing scoring statistics of the decoding based at least in part on the transcript.

    摘要翻译: 用于提高语音识别系统性能的系统和方法。 在一些实施例中,调谐器模块和/或测试器模块被配置为与语音识别系统协作。 测试器和调谐器模块可以配置为相互配合。 在一个实施例中,调谐器模块可以包括用于回放数字数据音频文件的所选部分的模块,用于创建和/或编辑所选部分的抄本的模块,和/或用于显示与所选择的部分相关联的信息的模块 解码所选择的部分,由语音识别引擎产生的解码。 在其他实施例中,测试器模块可以包括用于创建和/或修改语法的编辑器,用于接收数字音频文件的选定部分的模块及其对应的抄本,以及用于产生基于以下内容的解码的评分统计的评分模块: 至少部分在抄本上。

    System and method for tuning and testing in a speech recognition system
    2.
    发明授权
    System and method for tuning and testing in a speech recognition system 有权
    用于语音识别系统中调谐和测试的系统和方法

    公开(公告)号:US07440895B1

    公开(公告)日:2008-10-21

    申请号:US10725281

    申请日:2003-12-01

    IPC分类号: G10L15/06

    摘要: Systems and methods for improving the performance of a speech recognition system. In some embodiments a tuner module and/or a tester module are configured to cooperate with a speech recognition system. The tester and tuner modules can be configured to cooperate with each other. In one embodiment, the tuner module may include a module for playing back a selected portion of a digital data audio file, a module for creating and/or editing a transcript of the selected portion, and/or a module for displaying information associated with a decoding of the selected portion, the decoding generated by a speech recognition engine. In other embodiments, the tester module can include an editor for creating and/or modifying a grammar, a module for receiving a selected portion of a digital audio file and its corresponding transcript, and a scoring module for producing scoring statistics of the decoding based at least in part on the transcript.

    摘要翻译: 用于提高语音识别系统性能的系统和方法。 在一些实施例中,调谐器模块和/或测试器模块被配置为与语音识别系统协作。 测试器和调谐器模块可以配置为相互配合。 在一个实施例中,调谐器模块可以包括用于回放数字数据音频文件的所选部分的模块,用于创建和/或编辑所选部分的抄本的模块,和/或用于显示与所选择的部分相关联的信息的模块 解码所选择的部分,由语音识别引擎产生的解码。 在其他实施例中,测试器模块可以包括用于创建和/或修改语法的编辑器,用于接收数字音频文件的选定部分的模块及其对应的抄本,以及用于产生基于以下内容的解码的评分统计的评分模块: 至少部分在抄本上。

    System and method for tuning and testing in a speech recognition system
    3.
    发明授权
    System and method for tuning and testing in a speech recognition system 有权
    用于语音识别系统中调谐和测试的系统和方法

    公开(公告)号:US07962331B2

    公开(公告)日:2011-06-14

    申请号:US12255564

    申请日:2008-10-21

    IPC分类号: G10L11/06

    摘要: Systems and methods for improving the performance of a speech recognition system. In some embodiments a tuner module and/or a tester module are configured to cooperate with a speech recognition system. The tester and tuner modules can be configured to cooperate with each other. In one embodiment, the tuner module may include a module for playing back a selected portion of a digital data audio file, a module for creating and/or editing a transcript of the selected portion, and/or a module for displaying information associated with a decoding of the selected portion, the decoding generated by a speech recognition engine. In other embodiments, the tester module can include an editor for creating and/or modifying a grammar, a module for receiving a selected portion of a digital audio file and its corresponding transcript, and a scoring module for producing scoring statistics of the decoding based at least in part on the transcript.

    摘要翻译: 用于提高语音识别系统性能的系统和方法。 在一些实施例中,调谐器模块和/或测试器模块被配置为与语音识别系统协作。 测试器和调谐器模块可以配置为相互配合。 在一个实施例中,调谐器模块可以包括用于回放数字数据音频文件的所选部分的模块,用于创建和/或编辑所选部分的抄本的模块,和/或用于显示与所选择的部分相关联的信息的模块 解码所选择的部分,由语音识别引擎产生的解码。 在其他实施例中,测试器模块可以包括用于创建和/或修改语法的编辑器,用于接收数字音频文件的选定部分的模块及其对应的抄本,以及用于产生基于以下内容的解码的评分统计的评分模块: 至少部分在抄本上。

    Speech recognition concept confidence measurement
    4.
    发明授权
    Speech recognition concept confidence measurement 有权
    语音识别概念置信度测量

    公开(公告)号:US07324940B1

    公开(公告)日:2008-01-29

    申请号:US10789389

    申请日:2004-02-27

    IPC分类号: G10L15/00

    CPC分类号: G10L15/10

    摘要: Systems and methods for determining a confidence score associated with a decoding output of a speech recognition engine. In one embodiment, a method of determining the confidence score comprises arranging time frame and acoustic score data into an array, determining a phoneme sequence in the array that yields the highest sum of acoustic scores under certain constraints, e.g., minimum number of time frames and order of phonemes in a phoneme string. A relative score is derived by applying a functional relationship between the acoustic score and different sums comprising acoustic scores from the array. The confidence score, in some embodiments, depends at least in part on the relative score and a measure of ambiguity associated with similar sounding phrases being included in different concepts of a specified grammar.

    摘要翻译: 用于确定与语音识别引擎的解码输出相关联的置信度得分的系统和方法。 在一个实施例中,确定置信度分数的方法包括将时间帧和声学分数数据布置到阵列中,确定阵列中的音素序列,其在某些约束下产生最高的声分数和,例如最小时间帧数, 音素字串中的音素顺序。 通过在声分数和包括来自阵列的声分数的不同和之间应用功能关系来导出相对得分。 在一些实施例中,置信度得分至少部分地依赖于相对分数和与包括在指定语法的不同概念中的类似的声音短语相关联的模糊度的度量。

    Application of user-specified transformations to automatic speech recognition results
    5.
    发明授权
    Application of user-specified transformations to automatic speech recognition results 有权
    用户指定的转换应用于自动语音识别结果

    公开(公告)号:US08775183B2

    公开(公告)日:2014-07-08

    申请号:US12483919

    申请日:2009-06-12

    IPC分类号: G10L15/00

    CPC分类号: G10L15/19 G10L15/20

    摘要: Textual transcription of speech is generated and formatted according to user-specified transformation and behavior requirements for a speech recognition system having input grammars and transformations. An apparatus may include a speech recognition platform configured to receive a user-specified transformation requirement, recognize speech in speech data into recognized speech according to a set of recognition grammars; and apply transformations to the recognized speech according to the user-specified transformation requirement. The apparatus may further be configured to receive a user-specified behavior requirement and transform the recognized speech according to the behavior requirement. Other embodiments are described and claimed.

    摘要翻译: 根据用户指定的具有输入语法和变换的语音识别系统的变换和行为要求,生成和格式化语音的文本转录。 一种装置可以包括配置成接收用户指定的变换要求的语音识别平台,根据一组识别语法将语音数据中的语音识别为已识别的语音; 并根据用户指定的转换要求对识别的语音进行转换。 该装置还可以被配置为接收用户指定的行为要求,并根据行为要求转换所识别的语音。 描述和要求保护其他实施例。

    APPLICATION-DEPENDENT INFORMATION FOR RECOGNITION PROCESSING
    6.
    发明申请
    APPLICATION-DEPENDENT INFORMATION FOR RECOGNITION PROCESSING 有权
    用于识别处理的应用相关信息

    公开(公告)号:US20100318359A1

    公开(公告)日:2010-12-16

    申请号:US12481612

    申请日:2009-06-10

    IPC分类号: G10L15/18

    CPC分类号: G10L15/197

    摘要: Architecture for integrating application-dependent information into a constraints component at deployment time or when available. In terms of a general grammar, the constraints component can include or be a general grammar that comprises application-independent information and is structured in such a way that application-dependent information can be integrated into the general grammar without loss of fidelity. The general grammar includes a probability space and reserves a section of the probability space for the integration of application-dependent information. An integration component integrates the application-dependent information into the reserved section of the probability space for recognition processing. The application-dependent information is integrated into the reserved section of the probability space at deployment time or when available. The general grammar is structured to support the integration and improve the overall system.

    摘要翻译: 用于将应用程序相关信息集成到部署时间或可用时的约束组件的体系结构。 在一般语法方面,约束组件可以包括或作为包含应用程序无关信息的通用语法,并且以这样的方式构造,使得依赖于应用程序的信息可以被集成到通用语法中而不失去保真度。 一般语法包括概率空间,并且保留用于集成应用依赖信息的概率空间的一部分。 集成部件将应用相关信息集成到用于识别处理的概率空间的保留部分中。 应用相关信息在部署时间或可用时被集成到概率空间的保留部分中。 一般语法的结构是支持整合和改进整体系统。

    Application-dependent information for recognition processing
    7.
    发明授权
    Application-dependent information for recognition processing 有权
    用于识别处理的应用依赖信息

    公开(公告)号:US08442826B2

    公开(公告)日:2013-05-14

    申请号:US12481612

    申请日:2009-06-10

    IPC分类号: G10L15/00

    CPC分类号: G10L15/197

    摘要: Architecture for integrating application-dependent information into a constraints component at deployment time or when available. In terms of a general grammar, the constraints component can include or be a general grammar that comprises application-independent information and is structured in such a way that application-dependent information can be integrated into the general grammar without loss of fidelity. The general grammar includes a probability space and reserves a section of the probability space for the integration of application-dependent information. An integration component integrates the application-dependent information into the reserved section of the probability space for recognition processing. The application-dependent information is integrated into the reserved section of the probability space at deployment time or when available. The general grammar is structured to support the integration and improve the overall system.

    摘要翻译: 用于将应用程序相关信息集成到部署时间或可用时的约束组件的体系结构。 在一般语法方面,约束组件可以包括或作为包含应用程序无关信息的通用语法,并且以这样的方式构造,使得依赖于应用程序的信息可以被集成到通用语法中而不会失去保真度。 一般语法包括概率空间,并且保留用于集成应用依赖信息的概率空间的一部分。 集成部件将应用相关信息集成到用于识别处理的概率空间的保留部分中。 应用相关信息在部署时或可用时被集成到概率空间的保留部分中。 一般语法的结构是支持整合和改进整体系统。

    APPLICATION OF USER-SPECIFIED TRANSFORMATIONS TO AUTOMATIC SPEECH RECOGNITION RESULTS
    8.
    发明申请
    APPLICATION OF USER-SPECIFIED TRANSFORMATIONS TO AUTOMATIC SPEECH RECOGNITION RESULTS 有权
    用户指定的变换应用于自动语音识别结果

    公开(公告)号:US20100318356A1

    公开(公告)日:2010-12-16

    申请号:US12483919

    申请日:2009-06-12

    IPC分类号: G10L15/04

    CPC分类号: G10L15/19 G10L15/20

    摘要: Textual transcription of speech is generated and formatted according to user-specified transformation and behavior requirements for a speech recognition system having input grammars and transformations. An apparatus may include a speech recognition platform configured to receive a user-specified transformation requirement, recognize speech in speech data into recognized speech according to a set of recognition grammars; and apply transformations to the recognized speech according to the user-specified transformation requirement. The apparatus may further be configured to receive a user-specified behavior requirement and transform the recognized speech according to the behavior requirement. Other embodiments are described and claimed.

    摘要翻译: 根据用户指定的具有输入语法和变换的语音识别系统的变换和行为要求,生成和格式化语音的文本转录。 一种装置可以包括配置成接收用户指定的变换要求的语音识别平台,根据一组识别语法将语音数据中的语音识别为已识别的语音; 并根据用户指定的转换要求对识别的语音进行转换。 该装置还可以被配置为接收用户指定的行为要求,并根据行为要求转换所识别的语音。 描述和要求保护其他实施例。