CONTINGENT DEVICE ACTIONS DURING LOSS OF NETWORK CONNECTIVITY

    公开(公告)号:US20200168239A1

    公开(公告)日:2020-05-28

    申请号:US16775228

    申请日:2020-01-28

    IPC分类号: G10L21/06

    摘要: A speech-based system includes a local device in a user premises and a network-based control service that directs the local device to perform actions for a user. The control service may specify a first action that is to be performed upon detection by the local device of a stimulus. In some cases, performing the first action may rely on the availability of network communications with the control service or with another service. In these cases, the control service also specifies a second, fallback action that does not rely upon network communications. Upon detecting the stimulus, the local device performs the first action if network communications are available. If network communications are not available, the local device performs the second, fallback action.

    Associating identifiers with audio signals

    公开(公告)号:US10438582B1

    公开(公告)日:2019-10-08

    申请号:US14573943

    申请日:2014-12-17

    IPC分类号: G10L17/22 G10L15/18

    摘要: A voice-controlled device may receive a voice command uttered by a user, where the voice command may request that the voice-controlled device perform an operation. The voice-controlled device and/or one or more remote computing resources may process an audio signal associated with the voice command to determine text corresponding to the voice command. The resulting user utterance may be associated with a unique identifier, which may be provided to a third party and/or third party application that is to provide information responsive to the user request. The information provided by the third party/third party application may be output to the user based at least partly on the unique identifier, without disclosing user data associated with the user.

    Architecture for multi-domain natural language processing

    公开(公告)号:US10283119B2

    公开(公告)日:2019-05-07

    申请号:US15966400

    申请日:2018-04-30

    摘要: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.

    Load-balanced, persistent connection techniques

    公开(公告)号:US09712625B2

    公开(公告)日:2017-07-18

    申请号:US13858753

    申请日:2013-04-08

    摘要: Techniques for creating a persistent connection between client devices and one or more remote computing resources, which may form a portion of a network-accessible computing platform. This connection may be considered “permanent” or “nearly permanent” to allow the client device to both send data to and receive data from the remote resources at nearly any time. In addition, both the client device and the remote resources may establish virtual channels over this single connection. If no data is exchanged between the client device and the remote computing resources for a threshold amount of time, then the connection may be severed and the client device may attempt to establish a new connection with the remote computing resources.

    ARCHITECTURE FOR MULTI-DOMAIN NATURAL LANGUAGE PROCESSING

    公开(公告)号:US20220148590A1

    公开(公告)日:2022-05-12

    申请号:US17454716

    申请日:2021-11-12

    摘要: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.

    Audio output control
    10.
    发明授权

    公开(公告)号:US10853031B2

    公开(公告)日:2020-12-01

    申请号:US16222751

    申请日:2018-12-17

    摘要: Systems and methods for audio output control are disclosed. Audio may be output via a speaker of a communal device associated with a first portion of an environment. A user may provide a user utterance indicating an intent to add another device in a second portion of the environment to the audio-output session, and/or an intent to move the audio-output session from the first device to the second device, and/or an intent to remove a device from an audio-output session. Based on this determined intent, audio-session queues may be associated and dissociated from devices and device states may be altered to effectuate the intent of the user utterance.