NON-SPEECH INPUT TO SPEECH PROCESSING SYSTEM

    公开(公告)号:US20240296829A1

    公开(公告)日:2024-09-05

    申请号:US18663831

    申请日:2024-05-14

    Inventor: Travis Grizzel

    Abstract: A system and method for associating motion data with utterance audio data for use with a speech processing system. A device, such as a wearable device, may be capable of capturing utterance audio data and sending it to a remote server for speech processing, for example for execution of a command represented in the utterance. The device may also capture motion data using motion sensors of the device. The motion data may correspond to gestures, such as head gestures, that may be interpreted by the speech processing system to determine and execute commands. The device may associate the motion data with the audio data so the remote server knows what motion data corresponds to what portion of audio data for purposes of interpreting and executing commands. Metadata sent with the audio data and/or motion data may include association data such as timestamps, session identifiers, message identifiers, etc.

    NON-SPEECH INPUT TO SPEECH PROCESSING SYSTEM

    公开(公告)号:US20210035552A1

    公开(公告)日:2021-02-04

    申请号:US16902992

    申请日:2020-06-16

    Inventor: Travis Grizzel

    Abstract: A system and method for associating motion data with utterance audio data for use with a speech processing system. A device, such as a wearable device, may be capable of capturing utterance audio data and sending it to a remote server for speech processing, for example for execution of a command represented in the utterance. The device may also capture motion data using motion sensors of the device. The motion data may correspond to gestures, such as head gestures, that may be interpreted by the speech processing system to determine and execute commands. The device may associate the motion data with the audio data so the remote server knows what motion data corresponds to what portion of audio data for purposes of interpreting and executing commands. Metadata sent with the audio data and/or motion data may include association data such as timestamps, session identifiers, message identifiers, etc.

    Non-speech input to speech processing system

    公开(公告)号:US11990120B2

    公开(公告)日:2024-05-21

    申请号:US16902992

    申请日:2020-06-16

    Inventor: Travis Grizzel

    Abstract: A system and method for associating motion data with utterance audio data for use with a speech processing system. A device, such as a wearable device, may be capable of capturing utterance audio data and sending it to a remote server for speech processing, for example for execution of a command represented in the utterance. The device may also capture motion data using motion sensors of the device. The motion data may correspond to gestures, such as head gestures, that may be interpreted by the speech processing system to determine and execute commands. The device may associate the motion data with the audio data so the remote server knows what motion data corresponds to what portion of audio data for purposes of interpreting and executing commands. Metadata sent with the audio data and/or motion data may include association data such as timestamps, session identifiers, message identifiers, etc.

    Non-speech input to speech processing system

    公开(公告)号:US10692485B1

    公开(公告)日:2020-06-23

    申请号:US15389742

    申请日:2016-12-23

    Inventor: Travis Grizzel

    Abstract: A system and method for associating motion data with utterance audio data for use with a speech processing system. A device, such as a wearable device, may be capable of capturing utterance audio data and sending it to a remote server for speech processing, for example for execution of a command represented in the utterance. The device may also capture motion data using motion sensors of the device. The motion data may correspond to gestures, such as head gestures, that may be interpreted by the speech processing system to determine and execute commands. The device may associate the motion data with the audio data so the remote server knows what motion data corresponds to what portion of audio data for purposes of interpreting and executing commands. Metadata sent with the audio data and/or motion data may include association data such as timestamps, session identifiers, message identifiers, etc.

    Non-speech input to speech processing system

    公开(公告)号:US10692489B1

    公开(公告)日:2020-06-23

    申请号:US15389623

    申请日:2016-12-23

    Inventor: Travis Grizzel

    Abstract: A system and method for incorporating motion into a speech processing system. A wearable device that is capable of both capturing spoken utterances and capturing motion data may be used to interact with a speech processing system. In certain circumstances, such as when voice communication are unreliable (due to noise) or when controlling the system by motion is desired, motion of a device may be used to provide input to a speech processing system. For example, sensor data or gesture data resulting from movement of a device may be processed and input into a natural language system as representative of a spoken command portion or other input. The motion information may be interpreted to provide prompts to the system (e.g., “yes,” “no,” etc.), to perform certain commands (skip, forward, back, cancel) or to otherwise control the system.

    Non-speech input to speech processing system

    公开(公告)号:US10515623B1

    公开(公告)日:2019-12-24

    申请号:US15389574

    申请日:2016-12-23

    Inventor: Travis Grizzel

    Abstract: A system and method for a wearable device capable of detecting a wake gesture for purposes of capturing and forwarding audio data corresponding to a spoken utterance. The device may wake for purposes of capturing utterance audio data in response to a combination of a wake gesture and wakeword. The wake gesture may enable a wakeword detector. The device may also attempt to detect a wakeword utterance in a noisy environment. In response to determining the noisy environment, the device may receive motion data from a motion sensor; determining the motion data corresponds to a wake gesture, and send the audio data corresponding to an utterance to a remote device for processing. The device may also wake based on a combined confidence of wakeword and wake gesture detection.

Patent Agency Ranking