Altering audio to improve automatic speech recognition

    公开(公告)号:US11488591B1

    公开(公告)日:2022-11-01

    申请号:US16510060

    申请日:2019-07-12

    摘要: Techniques for altering audio being output by a voice-controlled device, or another device, to enable more accurate automatic speech recognition (ASR) by the voice-controlled device. For instance, a voice-controlled device may output audio within an environment using a speaker of the device. While outputting the audio, a microphone of the device may capture sound within the environment and may generate an audio signal based on the captured sound. The device may then analyze the audio signal to identify speech of a user within the signal, with the speech indicating that the user is going to provide a subsequent command to the device. Thereafter, the device may alter the output of the audio (e.g., attenuate the audio, pause the audio, switch from stereo to mono, etc.) to facilitate speech recognition of the user's subsequent command.

    PROCESSING SPOKEN COMMANDS TO CONTROL DISTRIBUTED AUDIO OUTPUTS

    公开(公告)号:US20210174802A1

    公开(公告)日:2021-06-10

    申请号:US17128982

    申请日:2020-12-21

    摘要: A system that is capable of controlling multiple entertainment systems and/or speakers using voice commands. The system receives voice commands and may determine audio sources and speakers indicated by the voice commands. The system may generate audio data from the audio sources and may send the audio data to the speakers using multiple interfaces. For example, the system may send the audio data directly to the speakers using a network address, may send the audio data to the speakers via a voice-enabled device or may send the audio data to the speakers via a speaker controller. The system may generate output zones including multiple speakers and may associate input devices with speakers within the output zones. For example, the system may receive a voice command from an input device in an output zone and may reduce output audio generated by speakers in the output zone.

    Enabling voice control of telephone device

    公开(公告)号:US10326869B2

    公开(公告)日:2019-06-18

    申请号:US15392329

    申请日:2016-12-28

    摘要: A system capable of connecting a device to a Public Switched Telephone Network (PSTN) using an adapter. The device may send audio data via a data network to a server and the server can determine a voice command included in the audio data. Based on the voice command, the server may send an instruction to an adapter via the data network, the instruction causing the adapter to initiate a telephone call over the PSTN. During the telephone call, the adapter and the server may forward audio data between the device and the PSTN, enabling the device to communicate over the PSTN. The system may enable the device to receive an incoming call from the PSTN and may provide additional functionality, such as determining call statistics during the telephone call, determining if another telephone receives audio data during the telephone call and detecting an alarm signal sent via the PSTN.

    MESSAGE PLAYBACK USING A SHARED DEVICE
    7.
    发明申请

    公开(公告)号:US20190156830A1

    公开(公告)日:2019-05-23

    申请号:US16251901

    申请日:2019-01-18

    摘要: Methods and systems for providing message playback using a shared electronic device is described herein. In response to receiving a request to output messages, a speech-processing system may determine a group account associated with a requesting device, and may determine messages stored by a message data store for the group account. Speaker identification processing may also be performed to determine a speaker of the request. A user account associated with the speaker, and messages stored for the user account, may be determined. A summary response indicating the user account's messages and the group account's message may then be generated such that the user account messages are identified prior to the group account's messages. The messages may then be analyzed to determine an appropriate voice user interface for the requester such that the playback of the messages using a shared electronic device is more natural and conversational.

    Message playback using a shared device

    公开(公告)号:US10186267B1

    公开(公告)日:2019-01-22

    申请号:US15392844

    申请日:2016-12-28

    摘要: Methods and systems for prioritizing messages for playback are described herein. In some embodiments, a request for messages to be output may be received by a speech-processing system. The speech-processing system may include a message database that includes messages received for a speaker of the request's user account and/or a group account associated with a shared electronic device that the request was received from. One or more prioritization rules may be applied to the messages to order the messages for playback in order to provide an optimal voice user interface for the requesting individual. For instance, messages received for the user account may be prioritized over messages received for the group account, messages received from a similar sender or a high priority sender may be prioritized over other messages, and messages that are indicating as being urgent may be prioritized over messages that are indicated as being non-urgent.