TEXT-TO-SPEECH FROM MEDIA CONTENT ITEM SNIPPETS

Patent application

US20200211531A1 TEXT-TO-SPEECH FROM MEDIA CONTENT ITEM SNIPPETS (status: under examination, published)

Title: TEXT-TO-SPEECH FROM MEDIA CONTENT ITEM SNIPPETS
Application number: US16235776
Filing date: 2018-12-28
Publication number: US20200211531A1
Publication date: 2020-07-02
Inventors: Rohit Kumar, Henrik Lindström, Henriette Cramer, Sarah Mennicken, Sravana Reddy, Jennifer Thom-Santelli
Applicants: Rohit Kumar, Henrik Lindström, Henriette Cramer, Sarah Mennicken, Sravana Reddy, Jennifer Thom-Santelli
Main classification: G10L13/04
IPC classification: G10L13/04; G06F16/683

Abstract:

A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
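
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the class and function names (`AlignedWord`, `Track`, `Snippet`, `Speech`, `find_phrase`, `plan_output`), the `min_words` threshold, and the sample lyric and timings are all hypothetical. A greedy longest-match scan stands in for the partitioning of the input text into text sets, and the forced alignment data is assumed to be a list of per-word start and end times. The sketch only plans the output (which spans come from a track and which are synthesized); cutting and concatenating real audio is left out.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple, Union


@dataclass
class AlignedWord:
    """One sung word with its forced-alignment times, in seconds (assumed format)."""
    word: str
    start: float
    end: float


@dataclass
class Track:
    track_id: str
    alignment: List[AlignedWord]  # words in the order they are sung


@dataclass
class Snippet:
    """A span of a track's audio to be cut out and played."""
    track_id: str
    start: float
    end: float


@dataclass
class Speech:
    """Text that still has to be rendered by a speech synthesizer."""
    text: str


Segment = Union[Snippet, Speech]


def normalize(word: str) -> str:
    """Lowercase a word and strip surrounding punctuation for matching."""
    return word.lower().strip(".,!?;:\"'")


def find_phrase(track: Track, phrase: List[str]) -> Optional[Tuple[float, float]]:
    """Return (start, end) of the first occurrence of `phrase` in the lyrics."""
    lyric = [normalize(w.word) for w in track.alignment]
    n = len(phrase)
    for i in range(len(lyric) - n + 1):
        if lyric[i:i + n] == phrase:
            return track.alignment[i].start, track.alignment[i + n - 1].end
    return None


def plan_output(text: str, tracks: List[Track], min_words: int = 3) -> List[Segment]:
    """Split `text` into track snippets and leftover text to synthesize."""
    words = text.split()
    norm = [normalize(w) for w in words]
    segments: List[Segment] = []
    pending: List[str] = []  # words waiting to be synthesized
    i = 0
    while i < len(words):
        match = None
        # Try the longest candidate phrase first, down to min_words.
        for n in range(len(words) - i, min_words - 1, -1):
            for track in tracks:
                span = find_phrase(track, norm[i:i + n])
                if span is not None:
                    match = (n, Snippet(track.track_id, span[0], span[1]))
                    break
            if match is not None:
                break
        if match is not None:
            if pending:
                segments.append(Speech(" ".join(pending)))
                pending = []
            segments.append(match[1])
            i += match[0]
        else:
            pending.append(words[i])
            i += 1
    if pending:
        segments.append(Speech(" ".join(pending)))
    return segments


if __name__ == "__main__":
    # Hypothetical track and alignment data, for illustration only.
    demo = Track("track-1", [
        AlignedWord("walking", 10.0, 10.5),
        AlignedWord("in", 10.5, 10.7),
        AlignedWord("the", 10.7, 10.9),
        AlignedWord("summer", 10.9, 11.4),
        AlignedWord("rain", 11.4, 12.0),
    ])
    for segment in plan_output("Today feels like walking in the summer rain, honestly", [demo]):
        print(segment)
```

In this sketch the resulting list alternates between `Speech` items, which a synthesizer would render, and `Snippet` items, which identify the time range to cut from the track; concatenating the rendered pieces in order would give the combined audio output.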

Published/granted documents

US11114085B2 Text-to-speech from media content item snippets (publication/grant date: 2021-09-07)

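IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/02	. Methods for producing synthetic speech; speech synthesisers
G10L13/04	.. Details of speech synthesis systems, e.g. synthesiser structure or memory management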