Invention Grant
- Patent Title: Systems and methods for generating multi-language media content with automatic selection of matching voices
- Application No.: US17196285
- Application Date: 2021-03-09
- Publication No.: US12086558B2
- Publication Date: 2024-09-10
- Inventor: Aansh Malik, Ha Thanh Nguyen
- Applicant: Warner Bros. Entertainment Inc.
- Applicant Address: US CA Burbank
- Assignee: Warner Bros. Entertainment Inc.
- Current Assignee: Warner Bros. Entertainment Inc.
- Current Assignee Address: US CA Burbank
- Agency: Bookoff McAndrews, PLLC
- Main IPC: G06F40/47
- IPC: G06F40/47 ; G06F40/58 ; G10L15/00 ; G10L15/16

Abstract:
A method and system for automated voice casting compares candidate voice samples from candidate speakers in a target language with a primary voice sample from a primary speaker in a primary language. Utterances in the audio samples of the candidate speakers and the primary speaker are identified and typed, and voice samples are generated that meet applicable utterance type criteria. A neural network is used to generate an embedding for each voice sample. A voice sample can include groups of different utterance types; an embedding is generated for each utterance group in the voice sample, and the group embeddings are then combined in a weighted form so that the resulting embedding emphasizes selected utterance types. Similarities between the embeddings of the candidate voice samples and the embedding of the primary voice sample are evaluated and used to select a candidate speaker that is a vocal match.
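
The weighted combination and similarity-based selection described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say how embeddings are combined or compared, so the normalized weighted average and the cosine similarity used here are assumptions, and the function names (`combine_embeddings`, `cosine_similarity`, `select_best_match`) and utterance type labels are hypothetical. The neural network that produces the embeddings is out of scope; short hand-written vectors stand in for its output.

```python
import math
from typing import Dict, List

Vector = List[float]


def combine_embeddings(group_embeddings: Dict[str, Vector],
                       weights: Dict[str, float]) -> Vector:
    """Combine per-utterance-type embeddings into one vector.

    Assumption: a weighted average whose weights are normalized over the
    utterance types actually present in the sample.
    """
    total = sum(weights[utype] for utype in group_embeddings)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    dim = len(next(iter(group_embeddings.values())))
    combined = [0.0] * dim
    for utype, embedding in group_embeddings.items():
        w = weights[utype] / total
        for i, value in enumerate(embedding):
            combined[i] += w * value
    return combined


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity (assumed metric); returns 0.0 for a zero vector."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_best_match(primary: Vector, candidates: Dict[str, Vector]) -> str:
    """Return the candidate whose embedding is most similar to the primary."""
    return max(candidates,
               key=lambda name: cosine_similarity(primary, candidates[name]))


# Hypothetical utterance types and weights; speech is emphasized here.
weights = {"speech": 0.75, "laugh": 0.25}

# Toy 2-D vectors standing in for neural network embeddings.
primary = combine_embeddings(
    {"speech": [1.0, 0.0], "laugh": [0.0, 1.0]}, weights)
candidates = {
    "candidate_a": combine_embeddings(
        {"speech": [0.9, 0.1], "laugh": [0.1, 0.9]}, weights),
    "candidate_b": combine_embeddings(
        {"speech": [0.0, 1.0], "laugh": [1.0, 0.0]}, weights),
}
best = select_best_match(primary, candidates)
```

Normalizing the weights over the utterance types that are present keeps the combined embedding on a comparable scale when a sample lacks one of the types; this is a design choice of the sketch, not something the abstract states.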