SYSTEMS AND METHODS FOR GENERATING MULTI-LANGUAGE MEDIA CONTENT WITH AUTOMATIC SELECTION OF MATCHING VOICES

Invention Application

US20210279427A1 SYSTEMS AND METHODS FOR GENERATING MULTI-LANGUAGE MEDIA CONTENT WITH AUTOMATIC SELECTION OF MATCHING VOICES (Status: In force)

Patent Title: SYSTEMS AND METHODS FOR GENERATING MULTI-LANGUAGE MEDIA CONTENT WITH AUTOMATIC SELECTION OF MATCHING VOICES
Application No.: US17196285

Application Date: 2021-03-09
Publication No.: US20210279427A1

Publication Date: 2021-09-09
Inventor: Aansh Malik, Ha Thanh Nguyen
Applicant: Warner Bros. Entertainment Inc.
Applicant Address: US CA Burbank
Assignee: Warner Bros. Entertainment Inc.
Current Assignee: Warner Bros. Entertainment Inc.
Current Assignee Address: US CA Burbank
Main IPC: G06F40/47
IPC: G06F40/47 ; G06F40/58 ; G10L15/16 ; G10L15/00

Abstract:

A method and system for automated voice casting compares candidate voice samples from candidate speakers in a target language with a primary voice sample from a primary speaker in a primary language. Utterances in the audio samples of the candidate speakers and the primary speaker are identified and typed, and voice samples are generated that meet applicable utterance type criteria. A neural network is used to generate an embedding for each voice sample. A voice sample can include groups of different utterance types, with an embedding generated for each utterance group in the voice sample and the group embeddings then combined in weighted form so that the resulting embedding emphasizes selected utterance types. Similarities between the embeddings of the candidate voice samples and the embedding of the primary voice sample are evaluated and used to select a candidate speaker that is a vocal match.
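
The weighting and matching steps described in the abstract can be illustrated with a minimal sketch. It is not the claimed implementation: the abstract does not specify how embeddings are combined or compared, so the normalized weighted average and the cosine similarity used here are assumptions, as are the function names and the utterance type labels ("dialogue", "shout"). The embeddings themselves are taken as given, standing in for the output of the neural network.

```python
import math


def l2_normalize(vector):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vector))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vector]


def combine_embeddings(group_embeddings, weights):
    """Combine per-utterance-type embeddings into one voice embedding.

    Assumption: a weighted average of unit-length group embeddings,
    so that types with larger weights dominate the result.
    """
    used = [t for t in group_embeddings if weights.get(t, 0.0) > 0.0]
    if not used:
        raise ValueError("no utterance type has a positive weight")
    total = sum(weights[t] for t in used)
    dim = len(group_embeddings[used[0]])
    combined = [0.0] * dim
    for utterance_type in used:
        share = weights[utterance_type] / total
        unit = l2_normalize(group_embeddings[utterance_type])
        for i, value in enumerate(unit):
            combined[i] += share * value
    return l2_normalize(combined)


def cosine_similarity(a, b):
    """Cosine of the angle between two embeddings (assumed metric)."""
    return sum(x * y for x, y in zip(l2_normalize(a), l2_normalize(b)))


def best_match(primary_embedding, candidate_embeddings):
    """Return the candidate speaker whose embedding is most similar."""
    return max(
        candidate_embeddings,
        key=lambda name: cosine_similarity(
            primary_embedding, candidate_embeddings[name]
        ),
    )


if __name__ == "__main__":
    # Toy two-dimensional embeddings; real ones would come from the network.
    weights = {"dialogue": 3.0, "shout": 1.0}  # hypothetical emphasis
    primary = combine_embeddings(
        {"dialogue": [1.0, 0.0], "shout": [0.0, 1.0]}, weights
    )
    candidates = {
        "speaker_a": combine_embeddings(
            {"dialogue": [0.9, 0.1], "shout": [0.2, 0.8]}, weights
        ),
        "speaker_b": combine_embeddings(
            {"dialogue": [0.1, 0.9], "shout": [0.9, 0.1]}, weights
        ),
    }
    print(best_match(primary, candidates))
```

Normalizing each group embedding before averaging keeps a single high-magnitude group from outweighing its assigned weight; other combination rules would fit the abstract's wording equally well.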

Published/Granted Documents

US12086558B2 Systems and methods for generating multi-language media content with automatic selection of matching voices Publication/Grant date: 2024-09-10

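IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models, see G06N)
G06F40/00	Handling natural language data (speech analysis or synthesis, speech recognition, see G10L)
G06F40/40	.Processing or translation of natural language (natural language analysis, see G06F40/20; semantic analysis, see G06F40/30)
G06F40/42	..Data-driven translation
G06F40/47	...Machine-assisted translation, e.g. using translation memory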