Method and System for Selecting a Voice Assistant

    公开(公告)号:US20240221754A1

    公开(公告)日:2024-07-04

    申请号:US18090081

    申请日:2022-12-28

    申请人: Spotify AB

    摘要: A method for processing voice input is disclosed. The method may be performed by a device including a voice assistant manager and a plurality of voice assistants. In some embodiments, the method includes receiving an utterance from a user, detecting a category of the utterance, and communicating the utterance to a selected voice assistant of the plurality of voice assistants. The selected voice assistant may be associated with the detected category. In some embodiments, the selected voice assistant may generate a response to utterance, and the response may be output to the user.

    Voice Assistant Controller
    2.
    发明公开

    公开(公告)号:US20240221755A1

    公开(公告)日:2024-07-04

    申请号:US18090105

    申请日:2022-12-28

    申请人: Spotify AB

    IPC分类号: G10L15/32 G10L15/22

    摘要: A method for managing voice assistants is disclosed. The method may be performed by a voice assistant controller communicatively coupled to a plurality of voice assistants. The voice assistant controller may determine a first order of the plurality of voice assistants. Based at least in part on the first order, the voice assistant controller may activate one or more voice assistants. Furthermore, the voice assistant controller may determine a second order of the plurality of voice assistants. Based at least in part on the second order, the voice assistant controller may suspend an active assistant, activate a suspended assistant, or perform both operations.

    SYSTEMS AND METHODS FOR PLAYING MEDIA CONTENT ON A TARGET DEVICE

    公开(公告)号:US20210084113A1

    公开(公告)日:2021-03-18

    申请号:US17033326

    申请日:2020-09-25

    申请人: Spotify AB

    摘要: A first device receives a voice command from a first user of a second device. The first device determines, from content in the voice command, one or more characteristics of a target device and media content to be played on the target device. The first device identifies, using the characteristics of the target device, a third device. In response to identifying the third device: the first device modifies account information for the third device to associate the third device with the first user and transmits instructions to the third device to play the media content.

    TEXT-TO-SPEECH AND SPEECH RECOGNITION FOR NOISY ENVIRONMENTS

    公开(公告)号:US20220208174A1

    公开(公告)日:2022-06-30

    申请号:US17565826

    申请日:2021-12-30

    申请人: Spotify AB

    摘要: The present disclosure relates generally to speech processing. Humans change their speech patterns in noisy environments. The systems and devices described herein can compensate for noisy environments to be more human-like. Thus, the configurations and implementations herein can determine a sound profile for the sound environment where the user is listening. Based on the sound profile, the devices can determine a transform to apply to output speech from the device. This transform is applied to the wake word, speech recognition, and to the output speech to compensate for the noise level of the environment by mimicking the Lombard effect.