Audio cancellation for voice recognition

    Publication No.: US11605393B2

    Publication date: 2023-03-14

    Application No.: US17158312

    Application date: 2021-01-26

    Applicant: Spotify AB

    Abstract: An audio cancellation system includes a voice enabled computing system that is connected to an audio output device using a wired or wireless communication network. The voice enabled computing device can provide media content to a user and receive a voice command from the user. The connection between the voice enabled computing system and the audio output device introduces a time delay between the media content being generated at the voice enabled computing device and the media content being reproduced at the audio output device. The system operates to determine a calibration value adapted for the voice enabled computing system and the audio output device. The system uses the calibration value to filter the user's voice command from a recording of ambient sound including the media content, without requiring significant use of memory and computing resources.
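
The calibration idea in this abstract can be illustrated with a toy example. The following is a minimal sketch, not the patented method: it assumes the calibration value is a delay in samples, estimates it by brute-force cross-correlation, and then subtracts the delayed media signal from the recording. The function names `estimate_delay` and `cancel_media` and the toy signals are hypothetical.

```python
def estimate_delay(reference, recording, max_lag):
    """Return the lag (in samples) that maximizes the cross-correlation
    between the known media signal and the microphone recording."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(max_lag + 1):
        score = sum(
            reference[i] * recording[i + lag]
            for i in range(len(reference))
            if i + lag < len(recording)
        )
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def cancel_media(reference, recording, delay):
    """Subtract the media signal, shifted by the calibrated delay,
    from the recording so that mostly the voice remains."""
    cleaned = list(recording)
    for i, sample in enumerate(reference):
        j = i + delay
        if j < len(cleaned):
            cleaned[j] -= sample
    return cleaned


# Toy signals (assumed): the media reaches the microphone 3 samples late,
# and a "voice" impulse of amplitude 4.0 is mixed in at index 5.
media = [1.0, -2.0, 3.0, -1.0, 0.5]
recording = [0.0] * 10
for i, s in enumerate(media):
    recording[i + 3] += s
recording[5] += 4.0

delay = estimate_delay(media, recording, max_lag=5)
cleaned = cancel_media(media, recording, delay)
```

Computing the delay once and reusing it is what keeps the per-recording work down to a single subtraction pass in this sketch.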

    Voice-based authentication
    Granted patent

    Publication No.: US11170787B2

    Publication date: 2021-11-09

    Application No.: US16278305

    Application date: 2019-02-18

    Applicant: Spotify AB

    Inventor: Daniel Bromand

    Abstract: Voice-based authentication can include methods, systems, devices, and computer program products for providing user-specific services or access based at least in part on an utterance. In one method, an audio clip having an utterance is obtained. The utterance has an activation trigger portion and a command portion. A first distance between a vector representation of the activation trigger portion and a registered activation trigger vector is determined; and a second distance between a vector representation of the command portion and a registered command vector is determined. Responsive to the first distance satisfying a first distance threshold, and the second distance satisfying a second distance threshold, access is provided to a service associated with a registered user.
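
The two-threshold check described in this abstract can be sketched as follows. This is an illustration under stated assumptions: the distance is taken to be Euclidean, "satisfying" a threshold is taken to mean "less than or equal to", and the function name `authenticate` and all vectors are hypothetical.

```python
import math


def authenticate(trigger_vec, command_vec,
                 registered_trigger, registered_command,
                 trigger_threshold, command_threshold):
    """Grant access only if both portions of the utterance are close
    enough to the registered user's vectors."""
    # First distance: activation trigger portion vs. registered trigger.
    first = math.dist(trigger_vec, registered_trigger)
    # Second distance: command portion vs. registered command.
    second = math.dist(command_vec, registered_command)
    # Assumption: a distance "satisfies" its threshold when it is <= it.
    return first <= trigger_threshold and second <= command_threshold


registered_trigger = [0.0, 0.0]
registered_command = [1.0, 1.0]

# Close on both portions: access is granted.
accepted = authenticate([0.3, 0.4], [1.0, 1.0],
                        registered_trigger, registered_command, 0.6, 0.6)
# Trigger portion is far from the registered vector: access is denied.
rejected = authenticate([3.0, 4.0], [1.0, 1.0],
                        registered_trigger, registered_command, 0.6, 0.6)
```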

    Media playback actions based on knob rotation

    Publication No.: US11099806B2

    Publication date: 2021-08-24

    Application No.: US16396497

    Application date: 2019-04-26

    Applicant: Spotify AB

    Abstract: A system is provided for streaming media content in a vehicle. The system includes a personal media streaming appliance system configured to connect to a media delivery system and receive media content from the media delivery system at least via a cellular network. The personal media streaming appliance system operates to transmit a media signal representative of the received media content to a vehicle media playback system so that the vehicle media playback system operates to play the media content in the vehicle. Various types of rotations of a knob part of the personal media streaming appliance system result in different media playback actions.

    COMMAND CONFIRMATION FOR A MEDIA PLAYBACK DEVICE

    Publication No.: US20210200509A1

    Publication date: 2021-07-01

    Application No.: US17142831

    Application date: 2021-01-06

    Applicant: Spotify AB

    Abstract: A system and method for confirming a voice command of a media playback device is disclosed. The method includes receiving an instruction of a voice command and producing an audio confirmation of the command. A confirmation may be playing a media context item associated with the command, playing a verbal confirmation phrase, or playing a non-verbal audio cue.

    ADAPTIVE VOICE COMMUNICATION
    Patent application

    Publication No.: US20210072951A1

    Publication date: 2021-03-11

    Application No.: US17100529

    Application date: 2020-11-20

    Applicant: Spotify AB

    Abstract: A system is provided for streaming media content in a vehicle. The system includes a personal media streaming appliance system configured to connect to a media delivery system and receive media content from the media delivery system at least via a cellular network. The personal media streaming appliance system operates to transmit a media signal representative of the received media content to a vehicle media playback system so that the vehicle media playback system operates to play the media content in the vehicle. Customized voice communications are generated based on receiving input, such as a user query and/or a media track change indication.

    Systems and Methods for Generating a Cleaned Version of Ambient Sound

    Publication No.: US20210065696A1

    Publication date: 2021-03-04

    Application No.: US16557734

    Application date: 2019-08-30

    Applicant: Spotify AB

    Abstract: While a media content item is emitted by a second electronic device that is remote from the first electronic device, the first electronic device receives data that includes: timing information, offset information that indicates a difference between an initial position of the media content item and a current playback position of the media content item, and an audio stream that corresponds to the media content item. The first electronic device detects ambient sound that includes sound corresponding to the media content item emitted by the second electronic device. The first electronic device generates a cleaned version of the ambient sound by using the timing information and the offset information to align the audio stream with the ambient sound and performing a subtraction operation to substantially subtract the audio stream from the ambient sound.
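
The align-then-subtract step in this abstract can be sketched as below. This is only one possible reading: it assumes the timing information is a timestamp at which the offset was valid, that all times are in seconds, and that playback runs at normal speed. The function name `clean_ambient` and its parameters are hypothetical.

```python
def clean_ambient(ambient, stream, offset_s, sent_at_s,
                  capture_start_s, sample_rate):
    """Align the known audio stream with the captured ambient sound,
    then subtract it sample by sample."""
    # Playback position at the moment capture started (assumed model):
    # the reported offset plus the time elapsed since it was reported.
    position_s = offset_s + (capture_start_s - sent_at_s)
    start = round(position_s * sample_rate)

    cleaned = []
    for i, sample in enumerate(ambient):
        j = start + i
        # Outside the stream there is nothing to subtract.
        media = stream[j] if 0 <= j < len(stream) else 0.0
        cleaned.append(sample - media)
    return cleaned


# Toy data (assumed): 4 samples per second, playback was at 0.5 s when
# the data was sent at t = 10.0 s, and capture started at t = 10.25 s.
stream = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0]
ambient = [3.5, 4.0, 4.75]  # stream[3:6] plus a small "speech" residue

residue = clean_ambient(ambient, stream, offset_s=0.5, sent_at_s=10.0,
                        capture_start_s=10.25, sample_rate=4)
```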

    TRAINING AND TESTING UTTERANCE-BASED FRAMEWORKS

    Publication No.: US20240203401A1

    Publication date: 2024-06-20

    Application No.: US18530702

    Application date: 2023-12-06

    Applicant: Spotify AB

    Inventor: Daniel Bromand

    Abstract: Systems, methods, and devices for training and testing utterance-based frameworks are disclosed. The training and testing can be conducted using synthetic utterance samples in addition to natural utterance samples. The synthetic utterance samples can be generated based on a vector space representation of natural utterances. In one method, a synthetic weight vector associated with a vector space is generated. An average representation of the vector space is added to the synthetic weight vector to form a synthetic feature vector. The synthetic feature vector is used to generate a synthetic voice sample. The synthetic voice sample is provided to the utterance-based framework as at least one of a testing or training sample.
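
The vector arithmetic in this abstract (average representation plus synthetic weight vector) can be sketched as follows. The abstract does not say how the weight vector is generated, so the Gaussian sampling here is an assumption, as are the function names and the toy feature vectors. Turning the resulting feature vector into an actual voice sample is out of scope for this sketch.

```python
import random


def average_representation(vectors):
    """Component-wise mean of the natural-utterance feature vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def generate_weight_vector(dim, rng, scale=1.0):
    """Assumption: draw each synthetic weight from a zero-mean Gaussian."""
    return [rng.gauss(0.0, scale) for _ in range(dim)]


def form_synthetic_feature_vector(weights, mean):
    """Add the average representation to the synthetic weight vector."""
    return [w + m for w, m in zip(weights, mean)]


# Toy natural feature vectors (assumed values).
natural = [[1.0, 2.0], [3.0, 4.0]]
mean = average_representation(natural)

rng = random.Random(0)  # seeded so the sketch is reproducible
weights = generate_weight_vector(len(mean), rng)
feature = form_synthetic_feature_vector(weights, mean)
```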

    Wind noise suppressor
    Granted patent

    Publication No.: US11682411B2

    Publication date: 2023-06-20

    Application No.: US17462660

    Application date: 2021-08-31

    Applicant: Spotify AB

    Abstract: Apparatus, methods and computer-readable medium are provided for processing wind noise. Audio input is processed by receiving an audio input. A wind noise level representative of a wind noise at the microphone array is measured using the audio input and a determination is made, based on the wind noise level, whether to perform either (i) a wind noise suppression process on the audio input on-device, or (ii) the wind noise suppression process on the audio input on-device and an audio reconstruction process in-cloud.
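
The routing decision in this abstract can be sketched as below. The abstract does not say how the wind noise level is measured or which levels trigger which path, so everything here is an assumption: the level is approximated by how uncorrelated two microphone channels are, and a level above a hypothetical threshold adds the in-cloud reconstruction step. The function names and step labels are illustrative only.

```python
import math


def wind_noise_level(mic_a, mic_b):
    """Assumed proxy: 1 - |normalized correlation| between two channels.
    Wind noise tends to be weakly correlated across microphones."""
    dot = sum(a * b for a, b in zip(mic_a, mic_b))
    norm = math.sqrt(sum(a * a for a in mic_a) * sum(b * b for b in mic_b))
    if norm == 0.0:
        return 0.0  # silence: treat as no wind
    return 1.0 - abs(dot) / norm


def choose_pipeline(level, threshold=0.5):
    """Return the processing steps to run for the measured level."""
    if level > threshold:
        # Assumption: strong wind also needs in-cloud reconstruction.
        return ["suppress_on_device", "reconstruct_in_cloud"]
    return ["suppress_on_device"]


# Identical channels: fully correlated, so the proxy reports no wind.
calm = choose_pipeline(wind_noise_level([1.0, -1.0, 1.0, -1.0],
                                        [1.0, -1.0, 1.0, -1.0]))
# Orthogonal channels: uncorrelated, so the proxy reports strong wind.
windy = choose_pipeline(wind_noise_level([1.0, 1.0, 1.0, 1.0],
                                         [1.0, -1.0, 1.0, -1.0]))
```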
