Translating a media asset with vocal characteristics of a speaker

发明授权

US11997344B2 Translating a media asset with vocal characteristics of a speaker 有权

请登陆查看更多内容

专利标题： Translating a media asset with vocal characteristics of a speaker
申请号： US17509401

申请日： 2021-10-25
公开(公告)号： US11997344B2

公开(公告)日： 2024-05-28
发明人: Vijay Kumar , Rajendran Pichaimurthy , Madhusudhan Seetharam
申请人： Rovi Guides, Inc.
申请人地址： US CA San Jose
专利权人： Rovi Guides, Inc.
当前专利权人： Rovi Guides, Inc.
当前专利权人地址： US CA San Jose
代理机构： HALEY GUILIANO LLP
主分类号： G06F40/40
IPC分类号： G06F40/40 ; G10L13/027 ; G10L13/033 ; G10L15/07 ; G10L15/19 ; G10L25/63 ; H04N21/43 ; H04N21/81

Translating a media asset with vocal characteristics of a speaker

摘要：

Systems and methods are described herein for generating alternate audio for a media stream. The media system receives media that is requested by the user. The media comprises a video and audio. The audio includes words spoken in a first language. The media system stores the received media in a buffer as it is received. The media system separates the audio from the buffered media and determines an emotional state expressed by spoken words of the first language. The media system translates the words spoken in the first language into words spoken in a second language. Using the translated words of the second language, the media system synthesizes speech having the emotional state previously determined. The media system then retrieves the video of the received media from the buffer and synchronizes the synthesized speech with the video to generate the media content in a second language.

信息查询

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F40/00	处理自然语言数据（语音分析或综合，语音识别G10L）
G06F40/40	.自然语言的处理或翻译(自然语言分析入G06F40/20；语义分析入G06F40/30)