Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment

Invention Grant

US09305552B2 Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment 有权

Please log in to see more content

Patent Title: Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment
Application No.: US14492616

Application Date: 2014-09-22
Publication No.: US09305552B2

Publication Date: 2016-04-05
Inventor: Yeon-Jun Kim , David C. Gibbon , Horst J. Schroeter
Applicant: AT&T Intellectual Property I, L.P.
Applicant Address: US GA Atlanta
Assignee: AT&T Intellectual Property I, L.P.
Current Assignee: AT&T Intellectual Property I, L.P.
Current Assignee Address: US GA Atlanta
Main IPC: G10L15/00
IPC: G10L15/00 ; G10L15/26 ; G11B27/10 ; G10L21/06

Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment

Abstract:

Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (ASR) output from a media presentation and a transcription of the media presentation. The method includes selecting via a processor a pair of anchor words in the media presentation based on the ASR output and transcription and generating captions by aligning the transcription with the ASR output between the selected pair of anchor words. The transcription can be human-generated. Selecting pairs of anchor words can be based on a similarity threshold between the ASR output and the transcription. In one variation, commonly used words on a stop list are ineligible as anchor words. The method includes outputting the media presentation with the generated captions. The presentation can be a recording of a live event.

Public/Granted literature

US20150046160A1 Systems, Computer-Implemented Methods, and Tangible Computer-Readable Storage Media For Transcription Alighnment Public/Granted day:2015-02-12

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）