System and method for a multiclass approach for confidence modeling in automatic speech recognition systems

Invention Grant

US11195514B2 System and method for a multiclass approach for confidence modeling in automatic speech recognition systems 有权

Please log in to see more content

Patent Title: System and method for a multiclass approach for confidence modeling in automatic speech recognition systems
Application No.: US16414885

Application Date: 2019-05-17
Publication No.: US11195514B2

Publication Date: 2021-12-07
Inventor: Ramasubramanian Sundaram , Aravind Ganapathiraju , Yingyi Tan
Applicant: Genesys Telecommunications Laboratories, Inc.
Applicant Address: US CA Daly City
Assignee: Genesys Telecommunications Laboratories, Inc.
Current Assignee: Genesys Telecommunications Laboratories, Inc.
Current Assignee Address: US CA Daly City
Main IPC: G10L15/00
IPC: G10L15/00 ; G10L15/10 ; G10L15/06 ; G06N20/00 ; G10L15/26 ; G10L15/14

System and method for a multiclass approach for confidence modeling in automatic speech recognition systems

Abstract:

A system and method are presented for a multiclass approach for confidence modeling in automatic speech recognition systems. A confidence model may be trained offline using supervised learning. A decoding module is utilized within the system that generates features for audio files in audio data. The features are used to generate a hypothesized segment of speech which is compared to a known segment of speech using edit distances. Comparisons are labeled from one of a plurality of output classes. The labels correspond to the degree to which speech is converted to text correctly or not. The trained confidence models can be applied in a variety of systems, including interactive voice response systems, keyword spotters, and open-ended dialog systems.

Public/Granted literature

US20190355348A1 SYSTEM AND METHOD FOR A MULTICLASS APPROACH FOR CONFIDENCE MODELING IN AUTOMATIC SPEECH RECOGNITION SYSTEMS Public/Granted day:2019-11-21

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）