-
公开(公告)号:US11605393B2
公开(公告)日:2023-03-14
申请号:US17158312
申请日:2021-01-26
Applicant: Spotify AB
Inventor: Richard Mitic , Robert Swain , Daniel Bromand , Waqar Sheikh , James Robert Stansfield
IPC: G10L21/0232 , G10L25/51 , H04R3/00 , G10L15/20 , G10L15/22
Abstract: An audio cancellation system includes a voice enabled computing system that is connected to an audio output device using a wired or wireless communication network. The voice enabled computing device can provide media content to a user and receive a voice command from the user. The connection between the voice enabled computing system and the audio output device introduces a time delay between the media content being generated at the voice enabled computing device and the media content being reproduced at the audio output device. The system operates to determine a calibration value adapted for the voice enabled computing system and the audio output device. The system uses the calibration value to filter the user's voice command from a recording of ambient sound including the media content, without requiring significant use of memory and computing resources.
-
公开(公告)号:US11170787B2
公开(公告)日:2021-11-09
申请号:US16278305
申请日:2019-02-18
Applicant: Spotify AB
Inventor: Daniel Bromand
Abstract: Voice-based authentication can include methods, systems, devices, and computer program products for providing user-specific services or access based at least in part on an utterance. In one method, an audio clip having an utterance is obtained. The utterance has an activation trigger portion and a command portion. A first distance between a vector representation of the activation trigger portion and a registered activation trigger vector is determined; and a second distance between a vector representation of the command portion and a registered command vector is determined. Responsive to the first distance satisfying a first distance threshold, and the second distance satisfying a second distance threshold, access is provided to a service associated with a registered user.
-
公开(公告)号:US11099806B2
公开(公告)日:2021-08-24
申请号:US16396497
申请日:2019-04-26
Applicant: Spotify AB
Inventor: Daniel Bromand , Richard Mitic , Johan Oskarsson
IPC: G06F3/048 , G06F3/16 , G06F3/0362
Abstract: A system is provided for streaming media content in a vehicle. The system includes a personal media streaming appliance system configured to connect to a media delivery system and receive media content from the media delivery system at least via a cellular network. The personal media streaming appliance system operates to transmit a media signal representative to the received media content to a vehicle media playback system so that the vehicle media playback system operates to play the media content in the vehicle. Various types of rotations of a knob part of the personal media streaming applicant system result in different media playback actions.
-
公开(公告)号:US20210200509A1
公开(公告)日:2021-07-01
申请号:US17142831
申请日:2021-01-06
Applicant: Spotify AB
Inventor: Emma-Camelia Gosu , Daniel Bromand , Karl Humphreys
Abstract: A system and method for confirming a voice command of a media playback device is disclosed. The method includes receiving an instruction of a voice command and producing an audio confirmation of the command. A confirmation may be playing a media context item associated with the command, playing a verbal confirmation phrase, or playing a non-verbal audio cue.
-
公开(公告)号:US20210072951A1
公开(公告)日:2021-03-11
申请号:US17100529
申请日:2020-11-20
Applicant: Spotify AB
Inventor: Emma-Camelia Gosu , Johan Oskarsson , Daniel Bromand
Abstract: A system is provided for streaming media content in a vehicle. The system includes a personal media streaming appliance system configured to connect to a media delivery system and receive media content from the media delivery system at least via a cellular network. The personal media streaming appliance system operates to transmit a media signal representative to the received media content to a vehicle media playback system so that the vehicle media playback system operates to play the media content in the vehicle. Customized voice communications are generated based on receiving input, such as a user query and/or a media track change indication.
-
公开(公告)号:US20210065696A1
公开(公告)日:2021-03-04
申请号:US16557734
申请日:2019-08-30
Applicant: Spotify AB
Inventor: Daniel Bromand , Richard Mitic , Björn Erik Roth
Abstract: While a media content item is emitted by a second electronic device that is remote from the first electronic device, the first electronic device receives data that includes: timing information, offset information that indicates a difference between an initial position of the media content item and a current playback position of the media content item, and an audio stream that corresponds to the media content item. The first electronic device detects ambient sound that includes sound corresponding to the media content item emitted by the second electronic device. The first electronic device generates a cleaned version of the ambient sound by using the timing information and the offset information to align the audio stream with the ambient sound and performing a subtraction operation to substantially subtract the audio stream from the ambient sound.
-
公开(公告)号:US20240203401A1
公开(公告)日:2024-06-20
申请号:US18530702
申请日:2023-12-06
Applicant: Spotify AB
Inventor: Daniel Bromand
CPC classification number: G10L15/063 , G06F7/582 , G10L13/02 , G10L15/07 , G10L2015/0635
Abstract: Systems, methods, and devices for training and testing utterance based frameworks are disclosed. The training and testing can be conducting using synthetic utterance samples in addition to natural utterance samples. The synthetic utterance samples can be generated based on a vector space representation of natural utterances. In one method, a synthetic weight vector associated with a vector space is generated. An average representation of the vector space is added to the synthetic weight vector to form a synthetic feature vector. The synthetic feature vector is used to generate a synthetic voice sample. The synthetic voice sample is provided to the utterance-based framework as at least one of a testing or training sample.
-
公开(公告)号:US11935534B2
公开(公告)日:2024-03-19
申请号:US17694756
申请日:2022-03-15
Applicant: Spotify AB
Inventor: Daniel Bromand , Richard Mitic , Horia Jurcut , Jennifer Thom-Santelli , Henriette Cramer , Karl Humphreys , Robert Williams , Kurt Jacobson , Henrik Lindström
CPC classification number: G10L15/22 , G06F3/165 , G10L15/26 , G10L2015/223
Abstract: A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the set of audio tracks upon receiving the instruction.
-
公开(公告)号:US11755283B2
公开(公告)日:2023-09-12
申请号:US17720486
申请日:2022-04-14
Applicant: Spotify AB
Inventor: Daniel Bromand , Richard Mitic , Horia-Dragos Jurcut , Henriette Susanne Martine Cramer , Ruth Brillman
IPC: G06F16/638 , G06F3/16 , G10L15/22 , G10L15/26
CPC classification number: G06F3/167 , G10L15/22 , G10L15/26 , G06F16/639 , G10L2015/223
Abstract: Systems, methods, and devices for human-machine interfaces for utterance-based playlist selection are disclosed. In one method, a list of playlists is traversed and a portion of each is audibly output until a playlist command is received. Based on the playlist command, the traversing is stopped and a playlist is selected for playback. In examples, the list of playlists is modified based on a modification input.
-
公开(公告)号:US11682411B2
公开(公告)日:2023-06-20
申请号:US17462660
申请日:2021-08-31
Applicant: Spotify AB
Inventor: Daniel Bromand , Mauricio Greene
IPC: G10L21/0232 , H04R1/40 , H04R3/00 , G10L21/0216
CPC classification number: G10L21/0232 , H04R1/406 , H04R3/005 , G10L2021/02166 , H04R2410/07
Abstract: Apparatus, methods and computer-readable medium are provided for processing wind noise. Audio input is processed by receiving an audio input. A wind noise level representative of a wind noise at the microphone array is measured using the audio input and a determination is made, based on the wind noise level, whether to perform either (i) a wind noise suppression process on the audio input on-device, or (ii) the wind noise suppression process on the audio input on-device and an audio reconstruction process in-cloud.
-
-
-
-
-
-
-
-
-