-
公开(公告)号:US11769532B2
公开(公告)日:2023-09-26
申请号:US16573191
申请日:2019-09-17
Applicant: Spotify AB
Inventor: Henriette Susanne Martine Cramer , Sarah Mennicken , Kurt Jacobson , Rohit Kumar , Henrik Lindström , Karl Humphreys , Jennifer Thom-Santelli , Robert L. Williams
IPC: G06F17/00 , G11B27/034 , G06F16/68 , H04N21/2387 , G11B27/10 , G06F3/16
CPC classification number: G11B27/034 , G06F3/165 , G06F3/167 , G06F16/68 , G11B27/105 , H04N21/2387
Abstract: A system for generating and distributing a digital mixtape. In one example, the system can receive a user command to generate a digital mixtape including a user-defined compilation of music. The user command identifies a recipient of the digital mixtape and identifies one or more media content items to be included in the music compilation for the recipient. The digital mixtape can also include audio recordings from the user to be added to the digital mixtape.
-
公开(公告)号:US20210241753A1
公开(公告)日:2021-08-05
申请号:US17146804
申请日:2021-01-12
Applicant: Spotify AB
Inventor: Rohit Kumar , Henrik Lindström , Henriette Cramer , Sarah Mennicken , Sravana Reddy , Jennifer Thom-Santelli
IPC: G10L13/00 , G06F16/683 , G10L13/04
Abstract: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
-
公开(公告)号:US11886486B2
公开(公告)日:2024-01-30
申请号:US16552287
申请日:2019-08-27
Applicant: Spotify AB
Inventor: Sarah Mennicken , Morteza Behrooz , Henriette Cramer , Rohit Kumar
CPC classification number: G06F16/4387 , G06F16/22 , G06F16/24 , G06F16/41 , G06F16/43 , G06F16/48 , G06F40/56
Abstract: Apparatus, systems and methods for augmenting a group of media content items by forming a graph including a plurality of nodes and a plurality of edges, where each node represents a segue option at a position in the graph and each edge represents a connection between a first node in the graph at a first position and a second node in the graph at a second position and finding a path in the graph.
-
公开(公告)号:US20230267912A1
公开(公告)日:2023-08-24
申请号:US18310136
申请日:2023-05-01
Applicant: Spotify AB
Inventor: Rohit Kumar , Henrik Lindström , Henriette Cramer , Sarah Mennicken , Sravana Reddy , Jennifer Thom-Santelli
IPC: G10L13/00 , G06F16/683 , G10L13/04
CPC classification number: G10L13/00 , G06F16/685 , G10L13/04
Abstract: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
-
公开(公告)号:US11114085B2
公开(公告)日:2021-09-07
申请号:US16235776
申请日:2018-12-28
Applicant: Spotify AB
Inventor: Rohit Kumar , Henrik Lindström , Henriette Cramer , Sarah Mennicken , Sravana Reddy , Jennifer Thom-Santelli
IPC: G10L13/00 , G06F16/683 , G10L13/04
Abstract: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
-
公开(公告)号:US12057114B2
公开(公告)日:2024-08-06
申请号:US16568835
申请日:2019-09-12
Applicant: Spotify AB
Inventor: Bryan Roy , Philip Edmonds , Matthew Joseph Kane , Jennifer Thom-Santelli , Neha Kothari , Sarah Mennicken , Karl Humphreys , Ruth Brillman , Sravana Reddy , Henriette Cramer , Robert L. Williams , Rohit Kumar
IPC: G10L15/22 , G06F3/16 , G06F16/635 , G06F16/638 , G06F16/68 , G06F40/211 , G10L15/26
CPC classification number: G10L15/22 , G06F3/165 , G06F16/635 , G06F16/639 , G06F16/686 , G06F40/211 , G10L15/26 , G10L2015/223
Abstract: A media content steering solution is provided to identify a user query to steer playback of media content that is currently playing or has been played. The user steering query can include a voice request for playing media content that is relatively different from the media content being currently played or having been played. The media content steering solution analyzes the utterance of the user query and uses it to identify such different content that satisfies the user intent contained in the user query.
-
公开(公告)号:US20200159761A1
公开(公告)日:2020-05-21
申请号:US16552287
申请日:2019-08-27
Applicant: Spotify AB
Inventor: Sarah Mennicken , Morteza Behrooz , Henriette Cramer , Rohit Kumar
IPC: G06F16/438 , G06F40/56 , G06F16/48
Abstract: Apparatus, systems and methods for augmenting a group of media content items by forming a graph including a plurality of nodes and a plurality of edges, where each node represents a segue option at a position in the graph and each edge represents a connection between a first node in the graph at a first position and a second node in the graph at a second position and finding a path in the graph.
-
公开(公告)号:US11710474B2
公开(公告)日:2023-07-25
申请号:US17146804
申请日:2021-01-12
Applicant: Spotify AB
Inventor: Rohit Kumar , Henrik Lindström , Henriette Cramer , Sarah Mennicken , Sravana Reddy , Jennifer Thom-Santelli
IPC: G10L13/00 , G06F16/683 , G10L13/04
CPC classification number: G10L13/00 , G06F16/685 , G10L13/04
Abstract: A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.
-
-
-
-
-
-
-