-
公开(公告)号:US11582527B2
公开(公告)日:2023-02-14
申请号:US16975696
申请日:2018-02-26
Applicant: GOOGLE LLC
Inventor: Terrence Paul McCartney, Jr. , Brian Colonna , Michael Nechyba
IPC: H04N7/10 , H04N21/488 , G06F40/58 , G06F40/30 , H04N21/43
Abstract: A method for aligning a translation of original caption data with an audio portion of a video is provided. The method includes identifying, by a processing device, original caption data for a video that includes a plurality of caption character strings. The processing device identifies speech recognition data that includes a plurality of generated character strings and associated timing information for each generated character string. The processing device maps the plurality of caption character strings to the plurality of generated character strings using assigned values indicative of semantic similarities between character strings. The processing device assigns timing information to the individual caption character strings based on timing information of mapped individual generated character strings. The processing device aligns a translation of the original caption data with the audio portion of the video using assigned timing information of the individual caption character strings.