-
公开(公告)号:US12080316B2
公开(公告)日:2024-09-03
申请号:US18168840
申请日:2023-02-14
Applicant: Spotify AB
Inventor: Daniel Bromand , Mauricio Greene
IPC: G10L21/0232 , G10L21/0216 , H04R1/40 , H04R3/00
CPC classification number: G10L21/0232 , H04R1/406 , H04R3/005 , G10L2021/02166 , H04R2410/07
Abstract: Apparatus, methods and computer-readable medium are provided for processing wind noise. Audio input is processed by receiving an audio input. A wind noise level representative of a wind noise at the microphone array is measured using the audio input and a determination is made, based on the wind noise level, whether to perform either (i) a wind noise suppression process on the audio input on-device, or (ii) the wind noise suppression process on the audio input on-device and an audio reconstruction process in-cloud.
-
公开(公告)号:US11887582B2
公开(公告)日:2024-01-30
申请号:US17173659
申请日:2021-02-11
Applicant: Spotify AB
Inventor: Daniel Bromand
CPC classification number: G10L15/063 , G06F7/582 , G10L13/02 , G10L15/07 , G10L2015/0635
Abstract: Systems, methods, and devices for training and testing utterance based frameworks are disclosed. The training and testing can be conducting using synthetic utterance samples in addition to natural utterance samples. The synthetic utterance samples can be generated based on a vector space representation of natural utterances. In one method, a synthetic weight vector associated with a vector space is generated. An average representation of the vector space is added to the synthetic weight vector to form a synthetic feature vector. The synthetic feature vector is used to generate a synthetic voice sample. The synthetic voice sample is provided to the utterance-based framework as at least one of a testing or training sample.
-
43.
公开(公告)号:US11810564B2
公开(公告)日:2023-11-07
申请号:US17705233
申请日:2022-03-25
Applicant: Spotify AB
Inventor: Daniel Bromand , Joseph Cauteruccio , Sven Erland Fredrik Lewin
IPC: G10L15/22 , G10L15/08 , G10L15/30 , G10L21/0232 , H04R1/40 , H04R3/00 , H04R5/027 , G10L21/0208 , G10L21/0216
CPC classification number: G10L15/22 , G10L15/08 , G10L15/30 , G10L21/0232 , H04R1/406 , H04R3/005 , H04R5/027 , G10L2015/088 , G10L2015/223 , G10L2021/02082 , G10L2021/02166
Abstract: Systems and methods are provided for detecting wake words. An electronic device detects, from a microphone array, an audio signal in an environment proximate to the audio front end system. The electronic device processes the audio signal using a plurality of wake word detection engines, including dynamically adjusting how many wake word detection engines are available for processing the audio signal. The electronic device independently adjusts respective wake word detection thresholds for the plurality of wake word detection engines used to process the audio signal.
-
公开(公告)号:US11748058B2
公开(公告)日:2023-09-05
申请号:US17142831
申请日:2021-01-06
Applicant: Spotify AB
Inventor: Emma-Camelia Gosu , Daniel Bromand , Karl Humphreys
CPC classification number: G06F3/167 , G06F3/165 , H04L65/60 , G10H2210/076
Abstract: A system and method for confirming a voice command of a media playback device is disclosed. The method includes receiving an instruction of a voice command and producing an audio confirmation of the command. A confirmation may be playing a media context item associated with the command, playing a verbal confirmation phrase, or playing a non-verbal audio cue.
-
公开(公告)号:US20230237991A1
公开(公告)日:2023-07-27
申请号:US17584512
申请日:2022-01-26
Applicant: Spotify AB
Inventor: Daniel Bromand , Björn Erik Roth
CPC classification number: G10L15/08 , G10L15/22 , G10L15/30 , G10L2015/088
Abstract: A wake word detector, at a server of a content delivery network (CDN) that provides audio (or other) content to a device, such as a voice-enabled device, detects false wake words in the audio content. The CDN wake word detector analyzes the audio stream to determine if the audio stream contains any audio that sounds like the wake word. If so, the CDN wake word detector can generate metadata that describes the time period, within the audio content, in which the false wake word was encountered. The metadata can include time offsets, from the start of the audio content, which can instruct a voice-enabled device to deactivate during the time period. This metadata is stored and then sent to the media-playback device requests the media content. The media-playback device can then instruct or inform the voice-enabled device of the presence of the false wake word. In this way, the wake word detector, at the voice-enabled device, is not activated to receive the false wake word.
-
公开(公告)号:US11601486B2
公开(公告)日:2023-03-07
申请号:US17401850
申请日:2021-08-13
Applicant: Spotify AB
Inventor: Richard Mitic , Horia Jurcut , Daniel Bromand , David Gustafsson
IPC: G06F15/16 , H04L65/60 , H04L67/1097
Abstract: A system is provided for streaming media content in a vehicle. The system includes a personal media streaming appliance system configured to connect to a media delivery system and receive media content from the media delivery system at least via a cellular network. The personal media streaming appliance system includes one or more preset buttons for playing media content associated with the preset buttons. Data about the preset buttons and the media content associated with the preset buttons can be stored in the media delivery system.
-
公开(公告)号:US11501764B2
公开(公告)日:2022-11-15
申请号:US16408887
申请日:2019-05-10
Applicant: Spotify AB
Inventor: Daniel Bromand
Abstract: Methods, systems, and related products for voice-enabled computer systems are described. A machine-learning model is trained to produce pronunciation output based on text input. The trained machine-learning model is used to produce pronunciation data for text input even where the text input includes numbers, punctuation, emoji, or other non-letter characters. The machine-learning model is further trained based on real-world data from users to improve pronunciation output.
-
公开(公告)号:US20220277743A1
公开(公告)日:2022-09-01
申请号:US17694756
申请日:2022-03-15
Applicant: Spotify AB
Inventor: Daniel Bromand , Richard Mitic , Horia Jurcut , Jennifer Thom-Santelli , Henriette Cramer , Karl Humphreys , Bo Williams , Kurt Jacobson , Henrik Lindström
Abstract: A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the set of audio tracks upon receiving the instruction.
-
公开(公告)号:US20220210242A1
公开(公告)日:2022-06-30
申请号:US17694766
申请日:2022-03-15
Applicant: Spotify AB
Inventor: Daniel Bromand , Horia Jurcut , Carl-Johan Larsson , Fredrik Håkansson
IPC: H04L67/568 , H04W4/029 , G06N3/04 , G06N3/08 , H04L65/61
Abstract: Systems, devices, apparatuses, components, methods, and techniques for predicting user and media-playback device states are provided. Systems, devices, apparatuses, components, methods, and techniques for media content item caching on a media-playback device are also provided. Systems, devices, apparatuses, components, methods, and techniques for predicting a destination are also provided.
-
公开(公告)号:US11334315B2
公开(公告)日:2022-05-17
申请号:US16504892
申请日:2019-07-08
Applicant: Spotify AB
Inventor: Daniel Bromand , Richard Mitic , Horia-Dragos Jurcut , Henriette Susanne Martine Cramer , Ruth Brillman
IPC: G06F16/638 , G06F3/16 , G10L15/22 , G10L15/26
Abstract: Systems, methods, and devices for human-machine interfaces for utterance-based playlist selection are disclosed. In one method, a list of playlists is traversed and a portion of each is audibly output until a playlist command is received. Based on the playlist command, the traversing is stopped and a playlist is selected for playback. In examples, the list of playlists is modified based on a modification input.
-
-
-
-
-
-
-
-
-