Invention Grant
- Patent Title: System and method for improving robustness of speech recognition using vocal tract length normalization codebooks
- Patent Title (中): 使用声道长度归一化码本提高语音识别鲁棒性的系统和方法
-
Application No.: US12869039Application Date: 2010-08-26
-
Publication No.: US08160875B2Publication Date: 2012-04-17
- Inventor: Mazin Gilbert
- Applicant: Mazin Gilbert
- Applicant Address: US GA Atlanta
- Assignee: AT&T Intellectual Property II, L.P.
- Current Assignee: AT&T Intellectual Property II, L.P.
- Current Assignee Address: US GA Atlanta
- Main IPC: G10L15/06
- IPC: G10L15/06

Abstract:
Disclosed are systems, methods, and computer readable media for performing speech recognition. The method embodiment comprises selecting a codebook from a plurality of codebooks with a minimal acoustic distance to a received speech sample, the plurality of codebooks generated by a process of (a) computing a vocal tract length for a each of a plurality of speakers, (b) for each of the plurality of speakers, clustering speech vectors, and (c) creating a codebook for each speaker, the codebook containing entries for the respective speaker's vocal tract length, speech vectors, and an optional vector weight for each speech vector, (2) applying the respective vocal tract length associated with the selected codebook to normalize the received speech sample for use in speech recognition, and (3) recognizing the received speech sample based on the respective vocal tract length associated with the selected codebook.
Public/Granted literature
Information query