Invention Grant
- Patent Title: Automatic collection of speaker name pronunciations
- Patent Title (中): 自动收集扬声器名称发音
-
Application No.: US13970850Application Date: 2013-08-20
-
Publication No.: US09240181B2Publication Date: 2016-01-19
- Inventor: Aparna Khare , Neha Agrawal , Sachin S. Kajarekar , Matthias Paulik
- Applicant: Cisco Technology, Inc.
- Applicant Address: US CA San Jose
- Assignee: Cisco Technology, Inc.
- Current Assignee: Cisco Technology, Inc.
- Current Assignee Address: US CA San Jose
- Agency: Edell, Shapiro & Finnan, LLC
- Main IPC: G10L15/00
- IPC: G10L15/00 ; G10L15/26 ; G10L15/06 ; G10L15/187 ; G10L15/04 ; G10L17/00 ; G10L15/02

Abstract:
An audio stream is segmented into a plurality of time segments using speaker segmentation and recognition (SSR), with each time segment corresponding to the speaker's name, producing an SSR transcript. The audio stream is transcribed into a plurality of word regions using automatic speech recognition (ASR), with each of the word regions having a measure of the confidence in the accuracy of the translation, producing an ASR transcript. Word regions with a relatively low confidence in the accuracy of the translation are identified. The low confidence regions are filtered using named entity recognition (NER) rules to identify low confidence regions that a likely names. The NER rules associate a region that is identified as a likely name with the name of the speaker corresponding to the current, the previous, or the next time segment. All of the likely name regions associated with that speaker's name are selected.
Public/Granted literature
- US20150058005A1 Automatic Collection of Speaker Name Pronunciations Public/Granted day:2015-02-26
Information query