Reducing time for annotating speech data to develop a dialog application

Granted invention patent

US07412383B1 Reducing time for annotating speech data to develop a dialog application (in force)



Patent title: Reducing time for annotating speech data to develop a dialog application
Application number: US10407965

Filing date: 2003-04-04
Publication number: US07412383B1

Publication date: 2008-08-12
Inventors: Tirso M. Alonso, Ilana Bromberg, Dilek Z. Hakkani-Tur, Barbara B. Hollister, Mazin G. Rahim, Giuseppe Riccardi, Lawrence Lyon Rose, Daniel Leon Stern, Gokhan Tur, James M. Wilson
Applicants: Tirso M. Alonso, Ilana Bromberg, Dilek Z. Hakkani-Tur, Barbara B. Hollister, Mazin G. Rahim, Giuseppe Riccardi, Lawrence Lyon Rose, Daniel Leon Stern, Gokhan Tur, James M. Wilson
Applicant address: US NY New York
Assignee: AT&T Corp
Current assignee: AT&T Corp
Current assignee address: US NY New York
Main classification: G10L15/06
IPC classifications: G10L15/06; G10L15/22


Abstract:

Systems and methods for annotating speech data. The present invention reduces the time required to annotate speech data by selecting utterances for annotation that will be of greatest benefit. A selection module uses speech models, including speech recognition models and spoken language understanding models, to identify utterances that should be annotated based on criteria such as confidence scores generated by the models. These utterances are placed in an annotation list along with a type of annotation to be performed for the utterances and an order in which the annotation should proceed. The utterances in the annotation list can be annotated for speech recognition purposes, spoken language understanding purposes, labeling purposes, etc. The selection module can also select utterances for annotation based on previously annotated speech data and deficiencies in the various models.
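
The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names, the fixed thresholds, the rule that a low recognition score leads to a transcription task and a low understanding score to a labeling task, and the lowest-confidence-first ordering are all assumptions made for illustration. The abstract only states that utterances are selected using criteria such as model confidence scores and placed in an annotation list with an annotation type and an order.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Utterance:
    utt_id: str
    asr_confidence: float  # assumed: speech recognition confidence in [0, 1]
    slu_confidence: float  # assumed: spoken language understanding confidence in [0, 1]


@dataclass
class AnnotationTask:
    utt_id: str
    annotation_type: str  # "transcription" or "labeling" (hypothetical type names)
    priority: float       # the confidence score; lower values are annotated first


def build_annotation_list(
    utterances: List[Utterance],
    asr_threshold: float = 0.6,   # hypothetical cutoff, not taken from the patent
    slu_threshold: float = 0.6,   # hypothetical cutoff, not taken from the patent
    budget: Optional[int] = None,
) -> List[AnnotationTask]:
    """Select low-confidence utterances and order them for annotation."""
    tasks: List[AnnotationTask] = []
    for utt in utterances:
        if utt.asr_confidence < asr_threshold:
            # The recognizer is unsure of the words: request a transcription.
            tasks.append(AnnotationTask(utt.utt_id, "transcription", utt.asr_confidence))
        elif utt.slu_confidence < slu_threshold:
            # The words look reliable but the meaning does not: request a label.
            tasks.append(AnnotationTask(utt.utt_id, "labeling", utt.slu_confidence))
        # Utterances that both models handle confidently are skipped.
    tasks.sort(key=lambda task: task.priority)
    return tasks if budget is None else tasks[:budget]


if __name__ == "__main__":
    pool = [
        Utterance("u1", asr_confidence=0.90, slu_confidence=0.95),
        Utterance("u2", asr_confidence=0.30, slu_confidence=0.80),
        Utterance("u3", asr_confidence=0.85, slu_confidence=0.40),
    ]
    for task in build_annotation_list(pool):
        print(task.utt_id, task.annotation_type, task.priority)
```

Skipping utterances that both models score confidently is what would save annotator time in this sketch; the optional `budget` argument caps the list when only a fixed number of utterances can be annotated.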



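IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/06	.Creation of reference templates; training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence)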