Invention Grant
US5655058A Segmentation of audio data for indexing of conversational speech for
real-time or postprocessing applications
失效
对用于实时或后处理应用程序的会话语音索引的音频数据进行分段
- Patent Title: Segmentation of audio data for indexing of conversational speech for real-time or postprocessing applications
- Patent Title (中): 对用于实时或后处理应用程序的会话语音索引的音频数据进行分段
-
Application No.: US226519Application Date: 1994-04-12
-
Publication No.: US5655058APublication Date: 1997-08-05
- Inventor: Vijay Balasubramanian , Francine R. Chen , Philip A. Chou , Donald G. Kimber , Alex D. Poon , Karon A. Weber , Lynn D. Wilcox
- Applicant: Vijay Balasubramanian , Francine R. Chen , Philip A. Chou , Donald G. Kimber , Alex D. Poon , Karon A. Weber , Lynn D. Wilcox
- Applicant Address: CT Stamford
- Assignee: Xerox Corporation
- Current Assignee: Xerox Corporation
- Current Assignee Address: CT Stamford
- Main IPC: G10L15/04
- IPC: G10L15/04 ; G10L15/10 ; G10L15/14 ; G10L17/00 ; H04R3/00 ; G10L5/06 ; G10L9/00
Abstract:
A method for segmenting audio data, comprising speech from a plurality of individual speakers, according to speaker is provided. The method comprises providing individual HMMs for each individual speaker, each individual HMM including at least one state, and constructing a speaker network HMM by connecting the individual HMMs in parallel. The audio data is then divided into segments by determining a most likely sequence of states through the speaker network HMM, each of the segments being associated with one of the individual HMMs. Afterward, the speaker of each of the segments is identified. The segmented data may be used to form an index into the audio data according to speaker.
Information query