发明授权
US07412383B1 Reducing time for annotating speech data to develop a dialog application
有权
减少注释语音数据开发对话应用程序的时间
- 专利标题: Reducing time for annotating speech data to develop a dialog application
- 专利标题(中): 减少注释语音数据开发对话应用程序的时间
-
申请号: US10407965申请日: 2003-04-04
-
公开(公告)号: US07412383B1公开(公告)日: 2008-08-12
- 发明人: Tirso M. Alonso , Ilana Bromberg , Dilek Z. Hakkani-Tur , Barbara B. Hollister , Mazin G. Rahim , Giuseppe Riccardi , Lawrence Lyon Rose , Daniel Leon Stern , Gokhan Tur , James M. Wilson
- 申请人: Tirso M. Alonso , Ilana Bromberg , Dilek Z. Hakkani-Tur , Barbara B. Hollister , Mazin G. Rahim , Giuseppe Riccardi , Lawrence Lyon Rose , Daniel Leon Stern , Gokhan Tur , James M. Wilson
- 申请人地址: US NY New York
- 专利权人: AT&T Corp
- 当前专利权人: AT&T Corp
- 当前专利权人地址: US NY New York
- 主分类号: G10L15/06
- IPC分类号: G10L15/06 ; G10L15/22
摘要:
Systems and methods for annotating speech data. The present invention reduces the time required to annotate speech data by selecting utterances for annotation that will be of greatest benefit. A selection module uses speech models, including speech recognition models and spoken language understanding models, to identify utterances that should be annotated based on criteria such as confidence scores generated by the models. These utterances are placed in an annotation list along with a type of annotation to be performed for the utterances and an order in which the annotation should proceed. The utterances in the annotation list can be annotated for speech recognition purposes, spoken language understanding purposes, labeling purposes, etc. The selection module can also select utterances for annotation based on previously annotated speech data and deficiencies in the various models.
信息查询