-
Publication No.: US20210074289A1
Publication date: 2021-03-11
Application No.: US17017542
Filing date: 2020-09-10
Applicant: Spotify AB
Inventor: Daniel Bromand, Richard Mitic, Horia Jurcut, Jennifer Thom-Santelli, Henriette Cramer, Karl Humphreys, Robert Williams, Kurt Jacobson, Henrik Lindström
Abstract: A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the set of audio tracks upon receiving the instruction.
-
Publication No.: US10902847B2
Publication date: 2021-01-26
Application No.: US16124697
Filing date: 2018-09-07
Applicant: Spotify AB
Inventor: Aaron Springer, Henriette Cramer, Sravana Reddy
IPC: G10L15/187, G10L15/18, G10L15/22, G06F40/295
Abstract: Methods, systems, and related products that provide detection of media content items that are under-locatable by machine voice-driven retrieval of uttered requests for retrieval of the media items. For a given media item, a resolvability value and/or an utterance resolve frequency is calculated by a number of playbacks of the media item by a speech retrieval modality to a total number of playbacks of the media item regardless of retrieval modality. In some examples, the methods, systems and related products also provide for improvement in the locatability of an under-locatable media item by collecting and/or generating one or more pronunciation aliases for the under-locatable item.
-
Publication No.: US20190341038A1
Publication date: 2019-11-07
Application No.: US15973240
Filing date: 2018-05-07
Applicant: Spotify AB
Inventor: Daniel Bromand, Richard Mitic, Horia Jurcut, Jennifer Thom-Santelli, Henriette Cramer, Karl Humphreys, Bo Williams, Kurt Jacobson, Henrik Lindström
Abstract: A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the set of audio tracks upon receiving the instruction.
-
Publication No.: US11657809B2
Publication date: 2023-05-23
Application No.: US17136836
Filing date: 2020-12-29
Applicant: Spotify AB
Inventor: Aaron Springer, Henriette Cramer, Sravana Reddy
IPC: G10L15/187, G10L15/18, G10L15/22, G06F40/295
CPC classification number: G10L15/187, G06F40/295, G10L15/1815, G10L15/22, G10L2015/223
Abstract: Methods, systems, and related products that provide detection of media content items that are under-locatable by machine voice-driven retrieval of uttered requests for retrieval of the media items. For a given media item, a resolvability value and/or an utterance resolve frequency is calculated by a number of playbacks of the media item by a speech retrieval modality to a total number of playbacks of the media item regardless of retrieval modality. In some examples, the methods, systems and related products also provide for improvement in the locatability of an under-locatable media item by collecting and/or generating one or more pronunciation aliases for the under-locatable item.
-
Publication No.: US20210241753A1
Publication date: 2021-08-05
Application No.: US17146804
Filing date: 2021-01-12
Applicant: Spotify AB
Inventor: Rohit Kumar, Henrik Lindström, Henriette Cramer, Sarah Mennicken, Sravana Reddy, Jennifer Thom-Santelli
IPC: G10L13/00, G06F16/683, G10L13/04
Abstract: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
-
Publication No.: US10803864B2
Publication date: 2020-10-13
Application No.: US15973215
Filing date: 2018-05-07
Applicant: Spotify AB
Inventor: Daniel Bromand, Richard Mitic, Horia Jurcut, Jennifer Thom-Santelli, Henriette Cramer, Karl Humphreys, Robert Williams, Kurt Jacobson, Henrik Lindström
Abstract: A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the set of audio tracks upon receiving the instruction.
-
Publication No.: US11935526B2
Publication date: 2024-03-19
Application No.: US17017542
Filing date: 2020-09-10
Applicant: Spotify AB
Inventor: Daniel Bromand, Richard Mitic, Horia Jurcut, Jennifer Thom-Santelli, Henriette Cramer, Karl Humphreys, Robert Williams, Kurt Jacobson, Henrik Lindström
CPC classification number: G10L15/22, G10L15/26, G10L2015/223
Abstract: A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the set of audio tracks upon receiving the instruction.
-
Publication No.: US11886486B2
Publication date: 2024-01-30
Application No.: US16552287
Filing date: 2019-08-27
Applicant: Spotify AB
Inventor: Sarah Mennicken, Morteza Behrooz, Henriette Cramer, Rohit Kumar
CPC classification number: G06F16/4387, G06F16/22, G06F16/24, G06F16/41, G06F16/43, G06F16/48, G06F40/56
Abstract: Apparatus, systems and methods for augmenting a group of media content items by forming a graph including a plurality of nodes and a plurality of edges, where each node represents a segue option at a position in the graph and each edge represents a connection between a first node in the graph at a first position and a second node in the graph at a second position and finding a path in the graph.
-
Publication No.: US20230267912A1
Publication date: 2023-08-24
Application No.: US18310136
Filing date: 2023-05-01
Applicant: Spotify AB
Inventor: Rohit Kumar, Henrik Lindström, Henriette Cramer, Sarah Mennicken, Sravana Reddy, Jennifer Thom-Santelli
IPC: G10L13/00, G06F16/683, G10L13/04
CPC classification number: G10L13/00, G06F16/685, G10L13/04
Abstract: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
-
Publication No.: US11114085B2
Publication date: 2021-09-07
Application No.: US16235776
Filing date: 2018-12-28
Applicant: Spotify AB
Inventor: Rohit Kumar, Henrik Lindström, Henriette Cramer, Sarah Mennicken, Sravana Reddy, Jennifer Thom-Santelli
IPC: G10L13/00, G06F16/683, G10L13/04
Abstract: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.