Invention Application
- Patent Title: SYSTEMS AND METHODS FOR GENERATING MULTI-LANGUAGE MEDIA CONTENT WITH AUTOMATIC SELECTION OF MATCHING VOICES
-
Application No.: US17196285Application Date: 2021-03-09
-
Publication No.: US20210279427A1Publication Date: 2021-09-09
- Inventor: Aansh Malik , Ha Thanh Nguyen
- Applicant: Warner Bros. Entertainment Inc.
- Applicant Address: US CA Burbank
- Assignee: Warner Bros. Entertainment Inc.
- Current Assignee: Warner Bros. Entertainment Inc.
- Current Assignee Address: US CA Burbank
- Main IPC: G06F40/47
- IPC: G06F40/47 ; G06F40/58 ; G10L15/16 ; G10L15/00

Abstract:
A method and system for automated voice casting compares candidate voices samples from candidate speakers in a target language with a primary voice sample from a primary speaker in a primary language. Utterances in the audio samples of the candidates speakers and the primary speaker are identified and typed and voice samples generated that meet applicable utterance type criteria. A neural network is used to generate an embedding for the voice samples. A voice sample can include groups of different utterance types and embeddings generated for each utterance group in the voice sample and then combined in a weighted form wherein the resulting embedding emphasizes selected utterance types. Similarities between embeddings for the candidate voice samples relative to the primary voice sample are evaluated and used to select a candidate speaker that is a vocal match.
Public/Granted literature
- US12086558B2 Systems and methods for generating multi-language media content with automatic selection of matching voices Public/Granted day:2024-09-10
Information query