Invention Grant
US09495964B2 Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment
有权
系统,计算机实现的方法和用于转录对准的有形计算机可读存储介质
- Patent Title: Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment
- Patent Title (中): 系统,计算机实现的方法和用于转录对准的有形计算机可读存储介质
-
Application No.: US15071644Application Date: 2016-03-16
-
Publication No.: US09495964B2Publication Date: 2016-11-15
- Inventor: Yeon-Jun Kim , David C. Gibbon , Horst J. Schroeter
- Applicant: AT&T Intellectual Property I, L.P.
- Applicant Address: US GA Atlanta
- Assignee: AT&T Intellectual Property I, L.P.
- Current Assignee: AT&T Intellectual Property I, L.P.
- Current Assignee Address: US GA Atlanta
- Main IPC: G10L15/00
- IPC: G10L15/00 ; G10L15/26 ; H04N21/488 ; G10L21/055 ; G10L13/08 ; H04N21/44 ; G11B27/10

Abstract:
Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (ASR) output from a media presentation and a transcription of the media presentation. The method includes selecting via a processor a pair of anchor words in the media presentation based on the ASR output and transcription and generating captions by aligning the transcription with the ASR output between the selected pair of anchor words. The transcription can be human-generated. Selecting pairs of anchor words can be based on a similarity threshold between the ASR output and the transcription. In one variation, commonly used words on a stop list are ineligible as anchor words. The method includes outputting the media presentation with the generated captions. The presentation can be a recording of a live event.
Public/Granted literature
Information query