- 专利标题: Text-to-speech from media content item snippets
-
申请号: US16235776申请日: 2018-12-28
-
公开(公告)号: US11114085B2公开(公告)日: 2021-09-07
- 发明人: Rohit Kumar , Henrik Lindström , Henriette Cramer , Sarah Mennicken , Sravana Reddy , Jennifer Thom-Santelli
- 申请人: Spotify AB
- 申请人地址: SE Stockholm
- 专利权人: Spotify AB
- 当前专利权人: Spotify AB
- 当前专利权人地址: SE Stockholm
- 代理机构: Merchant & Gould P.C.
- 主分类号: G10L13/00
- IPC分类号: G10L13/00 ; G06F16/683 ; G10L13/04
摘要:
A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
公开/授权文献
- US20200211531A1 TEXT-TO-SPEECH FROM MEDIA CONTENT ITEM SNIPPETS 公开/授权日:2020-07-02
信息查询