-
Publication No.: US10902847B2
Publication Date: 2021-01-26
Application No.: US16124697
Filing Date: 2018-09-07
Applicant: Spotify AB
Inventor: Aaron Springer, Henriette Cramer, Sravana Reddy
IPC: G10L15/187, G10L15/18, G10L15/22, G06F40/295
Abstract: Methods, systems, and related products that provide detection of media content items that are under-locatable by machine voice-driven retrieval of uttered requests for retrieval of the media items. For a given media item, a resolvability value and/or an utterance resolve frequency is calculated by a number of playbacks of the media item by a speech retrieval modality to a total number of playbacks of the media item regardless of retrieval modality. In some examples, the methods, systems and related products also provide for improvement in the locatability of an under-locatable media item by collecting and/or generating one or more pronunciation aliases for the under-locatable item.
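Illustrative note: the abstract above describes a measure formed from the number of playbacks reached through the speech retrieval modality relative to the total number of playbacks across all modalities. The following is only a minimal sketch of that ratio, not the patented method. The names `PlaybackCounts`, `utterance_resolve_frequency`, and `is_under_locatable`, and the threshold value, are assumptions made for illustration and do not appear in the record.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PlaybackCounts:
    """Hypothetical per-item playback counts (names are assumptions)."""
    voice_playbacks: int  # playbacks started through the speech retrieval modality
    total_playbacks: int  # playbacks started through any retrieval modality


def utterance_resolve_frequency(counts: PlaybackCounts) -> Optional[float]:
    """Return voice playbacks divided by total playbacks, or None if there are none."""
    if counts.total_playbacks <= 0:
        return None  # no playbacks at all, so the ratio is undefined
    return counts.voice_playbacks / counts.total_playbacks


def is_under_locatable(counts: PlaybackCounts, threshold: float = 0.05) -> bool:
    """Flag an item whose ratio falls below a threshold.

    The default threshold is an arbitrary illustrative value, not taken
    from the patent record.
    """
    freq = utterance_resolve_frequency(counts)
    return freq is not None and freq < threshold
```

Under these assumptions, an item with 1 voice playback out of 100 total would be flagged, while an item with 25 out of 100 would not. An item flagged this way would then be a candidate for the pronunciation aliases mentioned in the abstract.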
-
Publication No.: US12057114B2
Publication Date: 2024-08-06
Application No.: US16568835
Filing Date: 2019-09-12
Applicant: Spotify AB
Inventor: Bryan Roy, Philip Edmonds, Matthew Joseph Kane, Jennifer Thom-Santelli, Neha Kothari, Sarah Mennicken, Karl Humphreys, Ruth Brillman, Sravana Reddy, Henriette Cramer, Robert L. Williams, Rohit Kumar
IPC: G10L15/22, G06F3/16, G06F16/635, G06F16/638, G06F16/68, G06F40/211, G10L15/26
CPC classification number: G10L15/22, G06F3/165, G06F16/635, G06F16/639, G06F16/686, G06F40/211, G10L15/26, G10L2015/223
Abstract: A media content steering solution is provided to identify a user query to steer playback of media content that is currently playing or has been played. The user steering query can include a voice request for playing media content that is relatively different from the media content being currently played or having been played. The media content steering solution analyzes the utterance of the user query and uses it to identify such different content that satisfies the user intent contained in the user query.
-
Publication No.: US11710474B2
Publication Date: 2023-07-25
Application No.: US17146804
Filing Date: 2021-01-12
Applicant: Spotify AB
Inventor: Rohit Kumar, Henrik Lindström, Henriette Cramer, Sarah Mennicken, Sravana Reddy, Jennifer Thom-Santelli
IPC: G10L13/00, G06F16/683, G10L13/04
CPC classification number: G10L13/00, G06F16/685, G10L13/04
Abstract: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
-
Publication No.: US20230267912A1
Publication Date: 2023-08-24
Application No.: US18310136
Filing Date: 2023-05-01
Applicant: Spotify AB
Inventor: Rohit Kumar, Henrik Lindström, Henriette Cramer, Sarah Mennicken, Sravana Reddy, Jennifer Thom-Santelli
IPC: G10L13/00, G06F16/683, G10L13/04
CPC classification number: G10L13/00, G06F16/685, G10L13/04
Abstract: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
-
Publication No.: US11114085B2
Publication Date: 2021-09-07
Application No.: US16235776
Filing Date: 2018-12-28
Applicant: Spotify AB
Inventor: Rohit Kumar, Henrik Lindström, Henriette Cramer, Sarah Mennicken, Sravana Reddy, Jennifer Thom-Santelli
IPC: G10L13/00, G06F16/683, G10L13/04
Abstract: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
-
Publication No.: US11657809B2
Publication Date: 2023-05-23
Application No.: US17136836
Filing Date: 2020-12-29
Applicant: Spotify AB
Inventor: Aaron Springer, Henriette Cramer, Sravana Reddy
IPC: G10L15/187, G10L15/18, G10L15/22, G06F40/295
CPC classification number: G10L15/187, G06F40/295, G10L15/1815, G10L15/22, G10L2015/223
Abstract: Methods, systems, and related products that provide detection of media content items that are under-locatable by machine voice-driven retrieval of uttered requests for retrieval of the media items. For a given media item, a resolvability value and/or an utterance resolve frequency is calculated by a number of playbacks of the media item by a speech retrieval modality to a total number of playbacks of the media item regardless of retrieval modality. In some examples, the methods, systems and related products also provide for improvement in the locatability of an under-locatable media item by collecting and/or generating one or more pronunciation aliases for the under-locatable item.
-
Publication No.: US20210241753A1
Publication Date: 2021-08-05
Application No.: US17146804
Filing Date: 2021-01-12
Applicant: Spotify AB
Inventor: Rohit Kumar, Henrik Lindström, Henriette Cramer, Sarah Mennicken, Sravana Reddy, Jennifer Thom-Santelli
IPC: G10L13/00, G06F16/683, G10L13/04
Abstract: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.