-
Publication number: US20240296829A1
Publication date: 2024-09-05
Application number: US18663831
Application date: 2024-05-14
Applicant: Amazon Technologies, Inc.
Inventor: Travis Grizzel
CPC classification number: G10L15/01, G06F3/017, G10L13/00, G10L15/18, G10L15/187, G10L15/24, G10L2015/088
Abstract: A system and method for associating motion data with utterance audio data for use with a speech processing system. A device, such as a wearable device, may be capable of capturing utterance audio data and sending it to a remote server for speech processing, for example for execution of a command represented in the utterance. The device may also capture motion data using motion sensors of the device. The motion data may correspond to gestures, such as head gestures, that may be interpreted by the speech processing system to determine and execute commands. The device may associate the motion data with the audio data so the remote server knows what motion data corresponds to what portion of audio data for purposes of interpreting and executing commands. Metadata sent with the audio data and/or motion data may include association data such as timestamps, session identifiers, message identifiers, etc.
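The association step described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the record types `AudioChunk` and `MotionSample`, their field names, and the rule of matching a gesture to the audio chunk whose time window contains it are all assumptions made for illustration. The abstract only states that metadata such as timestamps, session identifiers, and message identifiers may be used.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class AudioChunk:
    # Hypothetical metadata attached to one message of utterance audio.
    session_id: str
    message_id: int
    start_ms: int
    end_ms: int


@dataclass
class MotionSample:
    # Hypothetical metadata attached to one detected gesture.
    session_id: str
    timestamp_ms: int
    gesture: str


def associate(chunks: List[AudioChunk],
              samples: List[MotionSample]) -> Dict[int, List[str]]:
    """Map each audio message_id to the gestures captured during it.

    A gesture is matched to a chunk when both share a session_id and the
    gesture timestamp falls in the half-open window [start_ms, end_ms).
    """
    result: Dict[int, List[str]] = {c.message_id: [] for c in chunks}
    for sample in samples:
        for chunk in chunks:
            same_session = chunk.session_id == sample.session_id
            in_window = chunk.start_ms <= sample.timestamp_ms < chunk.end_ms
            if same_session and in_window:
                result[chunk.message_id].append(sample.gesture)
                break  # each gesture belongs to at most one chunk
    return result
```

Under these assumptions, a server receiving both streams could look up which gestures accompany a given audio message, while gestures from other sessions are ignored.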
-
Publication number: US20210035552A1
Publication date: 2021-02-04
Application number: US16902992
Application date: 2020-06-16
Applicant: Amazon Technologies, Inc.
Inventor: Travis Grizzel
Abstract: A system and method for associating motion data with utterance audio data for use with a speech processing system. A device, such as a wearable device, may be capable of capturing utterance audio data and sending it to a remote server for speech processing, for example for execution of a command represented in the utterance. The device may also capture motion data using motion sensors of the device. The motion data may correspond to gestures, such as head gestures, that may be interpreted by the speech processing system to determine and execute commands. The device may associate the motion data with the audio data so the remote server knows what motion data corresponds to what portion of audio data for purposes of interpreting and executing commands. Metadata sent with the audio data and/or motion data may include association data such as timestamps, session identifiers, message identifiers, etc.
-
Publication number: US11990120B2
Publication date: 2024-05-21
Application number: US16902992
Application date: 2020-06-16
Applicant: Amazon Technologies, Inc.
Inventor: Travis Grizzel
CPC classification number: G10L15/01, G06F3/017, G10L13/00, G10L15/18, G10L15/187, G10L15/24, G10L2015/088
Abstract: A system and method for associating motion data with utterance audio data for use with a speech processing system. A device, such as a wearable device, may be capable of capturing utterance audio data and sending it to a remote server for speech processing, for example for execution of a command represented in the utterance. The device may also capture motion data using motion sensors of the device. The motion data may correspond to gestures, such as head gestures, that may be interpreted by the speech processing system to determine and execute commands. The device may associate the motion data with the audio data so the remote server knows what motion data corresponds to what portion of audio data for purposes of interpreting and executing commands. Metadata sent with the audio data and/or motion data may include association data such as timestamps, session identifiers, message identifiers, etc.
-
Publication number: US10692485B1
Publication date: 2020-06-23
Application number: US15389742
Application date: 2016-12-23
Applicant: Amazon Technologies, Inc.
Inventor: Travis Grizzel
Abstract: A system and method for associating motion data with utterance audio data for use with a speech processing system. A device, such as a wearable device, may be capable of capturing utterance audio data and sending it to a remote server for speech processing, for example for execution of a command represented in the utterance. The device may also capture motion data using motion sensors of the device. The motion data may correspond to gestures, such as head gestures, that may be interpreted by the speech processing system to determine and execute commands. The device may associate the motion data with the audio data so the remote server knows what motion data corresponds to what portion of audio data for purposes of interpreting and executing commands. Metadata sent with the audio data and/or motion data may include association data such as timestamps, session identifiers, message identifiers, etc.
-
Publication number: US10692489B1
Publication date: 2020-06-23
Application number: US15389623
Application date: 2016-12-23
Applicant: Amazon Technologies, Inc.
Inventor: Travis Grizzel
Abstract: A system and method for incorporating motion into a speech processing system. A wearable device that is capable of both capturing spoken utterances and capturing motion data may be used to interact with a speech processing system. In certain circumstances, such as when voice communications are unreliable (due to noise) or when controlling the system by motion is desired, motion of a device may be used to provide input to a speech processing system. For example, sensor data or gesture data resulting from movement of a device may be processed and input into a natural language system as representative of a spoken command portion or other input. The motion information may be interpreted to provide prompts to the system (e.g., “yes,” “no,” etc.), to perform certain commands (skip, forward, back, cancel) or to otherwise control the system.
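One way to picture gesture data standing in for a spoken input is a lookup from a recognized gesture label to the text the natural language system would otherwise receive. The sketch below is illustrative only: the gesture labels (`head_nod`, `head_shake`, `tilt_right`, `tilt_left`) and the particular pairings are hypothetical, and the abstract does not specify which gesture maps to which prompt or command.

```python
from typing import Optional

# Hypothetical gesture labels mapped to the prompts and commands
# named in the abstract ("yes", "no", skip, back).
GESTURE_TO_INPUT = {
    "head_nod": "yes",
    "head_shake": "no",
    "tilt_right": "skip",
    "tilt_left": "back",
}


def gesture_to_text(gesture: str) -> Optional[str]:
    """Return the text input a gesture stands in for, or None if unmapped."""
    return GESTURE_TO_INPUT.get(gesture)
```

Returning `None` for an unrecognized label lets the caller fall back to ordinary speech input rather than guessing at a command.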
-
Publication number: US10515623B1
Publication date: 2019-12-24
Application number: US15389574
Application date: 2016-12-23
Applicant: Amazon Technologies, Inc.
Inventor: Travis Grizzel
Abstract: A system and method for a wearable device capable of detecting a wake gesture for purposes of capturing and forwarding audio data corresponding to a spoken utterance. The device may wake for purposes of capturing utterance audio data in response to a combination of a wake gesture and wakeword. The wake gesture may enable a wakeword detector. The device may also attempt to detect a wakeword utterance in a noisy environment. In response to determining the noisy environment, the device may receive motion data from a motion sensor, determine that the motion data corresponds to a wake gesture, and send the audio data corresponding to an utterance to a remote device for processing. The device may also wake based on a combined confidence of wakeword and wake gesture detection.
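The "combined confidence" idea in the last sentence can be sketched as a weighted average of two detector scores compared against a threshold. This is only one plausible reading: the abstract does not say how the confidences are combined, and the function name `should_wake`, the weight of 0.6, and the threshold of 0.7 are assumptions chosen for illustration.

```python
def should_wake(wakeword_conf: float,
                gesture_conf: float,
                wakeword_weight: float = 0.6,
                threshold: float = 0.7) -> bool:
    """Decide whether to wake from two detector confidences in [0, 1].

    The combination rule (a weighted average) and the default weight and
    threshold are hypothetical; they are not taken from the patent.
    """
    combined = (wakeword_weight * wakeword_conf
                + (1.0 - wakeword_weight) * gesture_conf)
    return combined >= threshold
```

With these defaults, a wakeword score of 0.6 that would be too weak alone can still wake the device when accompanied by a confident wake gesture, which matches the noisy-environment scenario the abstract describes.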