- 专利标题: Translating a media asset with vocal characteristics of a speaker
-
申请号: US17509401申请日: 2021-10-25
-
公开(公告)号: US11997344B2公开(公告)日: 2024-05-28
- 发明人: Vijay Kumar , Rajendran Pichaimurthy , Madhusudhan Seetharam
- 申请人: Rovi Guides, Inc.
- 申请人地址: US CA San Jose
- 专利权人: Rovi Guides, Inc.
- 当前专利权人: Rovi Guides, Inc.
- 当前专利权人地址: US CA San Jose
- 代理机构: HALEY GUILIANO LLP
- 主分类号: G06F40/40
- IPC分类号: G06F40/40 ; G10L13/027 ; G10L13/033 ; G10L15/07 ; G10L15/19 ; G10L25/63 ; H04N21/43 ; H04N21/81
摘要:
Systems and methods are described herein for generating alternate audio for a media stream. The media system receives media that is requested by the user. The media comprises a video and audio. The audio includes words spoken in a first language. The media system stores the received media in a buffer as it is received. The media system separates the audio from the buffered media and determines an emotional state expressed by spoken words of the first language. The media system translates the words spoken in the first language into words spoken in a second language. Using the translated words of the second language, the media system synthesizes speech having the emotional state previously determined. The media system then retrieves the video of the received media from the buffer and synchronizes the synthesized speech with the video to generate the media content in a second language.
信息查询