Invention Grant
US07315819B2 Apparatus for performing speaker identification and speaker searching in speech or sound image data, and method thereof
In force
- Patent Title: Apparatus for performing speaker identification and speaker searching in speech or sound image data, and method thereof
- Application No.: US10201069
- Application Date: 2002-07-23
- Publication No.: US07315819B2
- Publication Date: 2008-01-01
- Inventor: Yasuhiro Toguri, Masayuki Nishiguchi
- Applicant: Yasuhiro Toguri, Masayuki Nishiguchi
- Applicant Address: JP Tokyo
- Assignee: Sony Corporation
- Current Assignee: Sony Corporation
- Current Assignee Address: JP Tokyo
- Agency: Rockey, Depke & Lyons, LLC.
- Agent: Robert J. Depke
- Priority: JP2001-225051 (2001-07-25)
- Main IPC: G10L17/00
- IPC: G10L17/00

Abstract:
A process of identifying a speaker in coded speech data and a process of searching for the speaker are efficiently performed with fewer computations and with a smaller storage capacity. In an information search apparatus, an LSP decoding section extracts and decodes only LSP information from coded speech data which is read for each block. An LPC conversion section converts the LSP information into LPC information. A Cepstrum conversion section converts the obtained LPC information into an LPC Cepstrum which represents features of speech. A vector quantization section performs vector quantization on the LPC Cepstrum. A speaker identification section identifies a speaker on the basis of the result of the vector quantization. Furthermore, the identified speaker is compared with a search condition in a condition comparison section, and based on the result, the search result is output.
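The processing chain in the abstract (LSP to LPC, LPC to LPC cepstrum, vector quantization, speaker identification, condition comparison) can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the function names, the codebook layout, the use of squared Euclidean distortion, and the minimum-average-distortion decision rule are all assumptions, and decoding the LSP parameters from an actual coded bitstream is omitted. The LSP values are assumed to be angular frequencies in radians, sorted in ascending order, with an even filter order.

```python
import numpy as np

def lsp_to_lpc(lsp):
    """Convert sorted LSP frequencies (radians, even count) to LPC
    coefficients a_1..a_p of A(z) = 1 + sum_k a_k z^-k."""
    lsp = np.asarray(lsp, dtype=float)
    if len(lsp) % 2 != 0:
        raise ValueError("this sketch assumes an even LPC order")
    p_poly = np.array([1.0, 1.0])    # symmetric polynomial, root at z = -1
    q_poly = np.array([1.0, -1.0])   # antisymmetric polynomial, root at z = +1
    for i, w in enumerate(lsp):
        factor = np.array([1.0, -2.0 * np.cos(w), 1.0])
        if i % 2 == 0:               # 1st, 3rd, ... frequencies belong to P(z)
            p_poly = np.convolve(p_poly, factor)
        else:                        # 2nd, 4th, ... frequencies belong to Q(z)
            q_poly = np.convolve(q_poly, factor)
    a_full = 0.5 * (p_poly + q_poly)  # A(z) = (P(z) + Q(z)) / 2
    return a_full[1:-1]               # drop a_0 = 1 and the cancelled last term

def lpc_to_cepstrum(a, n_ceps):
    """LPC cepstrum c_1..c_n of 1/A(z) via the standard recursion."""
    a = np.asarray(a, dtype=float)
    p = len(a)
    c = np.zeros(n_ceps)
    for n in range(1, n_ceps + 1):
        acc = 0.0
        for k in range(max(1, n - p), n):
            acc += k * c[k - 1] * a[n - k - 1]
        a_n = a[n - 1] if n <= p else 0.0
        c[n - 1] = -a_n - acc / n
    return c

def average_distortion(features, codebook):
    """Mean squared distance from each feature vector to its nearest codeword."""
    diffs = features[:, None, :] - codebook[None, :, :]
    dists = np.sum(diffs ** 2, axis=2)
    return float(np.mean(np.min(dists, axis=1)))

def identify_speaker(features, codebooks):
    """Pick the speaker whose codebook gives the smallest average distortion
    (an assumed decision rule). `codebooks` maps a name to a (K, D) array."""
    scores = {name: average_distortion(features, cb)
              for name, cb in codebooks.items()}
    return min(scores, key=scores.get), scores

def block_matches(lsp_frames, codebooks, wanted_speaker, n_ceps=12):
    """Process one block of LSP frames and compare the identified speaker
    with a search condition (here simply a hypothetical speaker name)."""
    feats = np.array([lpc_to_cepstrum(lsp_to_lpc(f), n_ceps) for f in lsp_frames])
    speaker, _ = identify_speaker(feats, codebooks)
    return speaker == wanted_speaker
```

Working from the LSP parameters alone is what would let such a pipeline avoid fully decoding the speech waveform, which is the source of the computation and storage savings the abstract claims.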
Public/Granted literature
- US20030036905A1 Information detection apparatus and method, and information search apparatus and method; Public/Granted day: 2003-02-20