GENERATING DUBBED AUDIO FROM A VIDEO-BASED SOURCE

Invention Publication

US20240087557A1 GENERATING DUBBED AUDIO FROM A VIDEO-BASED SOURCE Under examination (published)

Patent Title: GENERATING DUBBED AUDIO FROM A VIDEO-BASED SOURCE
Application No.: US17931026

Application Date: 2022-09-09
Publication No.: US20240087557A1

Publication Date: 2024-03-14
Inventor: Andrew R. Levine, Buddhika Kottahachchi, Christopher Davie, Kulumani Sriram, Richard James Potts, Sasakthi S. Abeysinghe
Applicant: GOOGLE LLC
Applicant Address: US CA Mountain View
Assignee: GOOGLE LLC
Current Assignee: GOOGLE LLC
Current Assignee Address: US CA Mountain View
Main IPC: G10L13/02
IPC: G10L13/02; G06F40/58; G10L13/08

Abstract:

The present disclosure relates to generating and adjusting translated audio from a video-based source. The method includes receiving video data and corresponding audio data in a first language; generating a translated preliminary transcript in a second language; aligning timing windows of portions of the translated preliminary transcript with corresponding segments of the audio data; determining portions of the translated aligned transcript in the second language that exceed a timing window range of the corresponding segments of the audio data in the first language to generate flagged transcript portions; transmitting the original transcript, the translated aligned transcript, and the first speech dub to a first device, the generated flagged transcript portions included in the original transcript and the translated aligned transcript; receiving, from the first device, a modified original transcript; and generating, based on the modified original transcript, a second speech dub in the second language.
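The flagging step in the abstract can be illustrated with a minimal sketch. It is not the method claimed in the publication. It assumes each aligned segment carries start and end times in seconds, and that the spoken length of the translated text can be roughly estimated from a fixed speaking rate. The names `Segment`, `estimate_duration`, `flag_overlong`, `words_per_second`, and `tolerance` are all hypothetical and do not appear in the source.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    """One aligned transcript portion (hypothetical structure)."""
    start: float       # start of the first-language audio segment, in seconds
    end: float         # end of the first-language audio segment, in seconds
    original: str      # transcript text in the first language
    translated: str    # aligned transcript text in the second language
    flagged: bool = False


def estimate_duration(text: str, words_per_second: float = 2.5) -> float:
    """Rough spoken-duration estimate; the fixed rate is an assumption."""
    return len(text.split()) / words_per_second


def flag_overlong(segments: List[Segment],
                  tolerance: float = 0.1,
                  words_per_second: float = 2.5) -> List[Segment]:
    """Mark and return segments whose translated text is estimated to run
    longer than the source timing window plus a tolerance margin."""
    flagged = []
    for seg in segments:
        window = seg.end - seg.start
        needed = estimate_duration(seg.translated, words_per_second)
        seg.flagged = needed > window * (1 + tolerance)
        if seg.flagged:
            flagged.append(seg)
    return flagged
```

In a workflow like the one the abstract describes, the flagged segments would be shown to an editor alongside both transcripts so that the original text can be revised and a second dub generated. A production system would more plausibly measure the duration of the synthesized speech than rely on a word-count estimate.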

Public/Granted literature

US12236935B2 Generating dubbed audio from a video-based source Public/Granted day: 2025-02-25

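IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/02	. Methods for producing synthetic speech; speech synthesisers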