-
公开(公告)号:US12114048B2
公开(公告)日:2024-10-08
申请号:US18109243
申请日:2023-02-13
Applicant: Google LLC
Inventor: Terrance Paul McCartney, Jr. , Brian Colonna , Michael Nechyba
IPC: H04N7/10 , G06F40/30 , G06F40/58 , H04N21/43 , H04N21/488
CPC classification number: H04N21/4884 , G06F40/30 , G06F40/58 , H04N21/43074
Abstract: A method for aligning a translation of original caption data with an audio portion of a video is provided. The method involves identifying original caption data for the video that includes caption character strings, identifying translated language caption data for the video that includes translated character strings associated with audio portion of the video, and mapping caption sentence fragments generated from the caption character strings to corresponding translated sentence fragments generated from the translated character strings based on timing associated with the original caption data and the translated language caption data. The method further involves estimating time intervals for individual caption sentence fragments using timing information corresponding to individual caption character strings, assigning time intervals to individual translated sentence fragments based on estimated time intervals of the individual caption sentence fragments, generating a set of translated sentences using consecutive translated sentence fragments, and aligning the set of translated sentences with the audio portion of the video using assigned time intervals of individual translated sentence fragments from corresponding translated sentences.