-
公开(公告)号:US11942085B1
公开(公告)日:2024-03-26
申请号:US17085011
申请日:2020-10-30
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , Isaac Michael Taylor
CPC classification number: G10L15/22 , G06F3/167 , G10L2015/223
Abstract: Techniques for naming devices via voice commands are described herein. For instance, a user may issue a voice command to a voice-controlled device stating, “you are the kitchen device”. Thereafter, the device may respond to voice commands directed, by name, to this device. For instance, the user may issue a voice command requesting to “play music on my kitchen device”. Given that the user has configured the device to respond to this name, the device may respond to the command by outputting the requested music.
-
公开(公告)号:US20230148355A1
公开(公告)日:2023-05-11
申请号:US18149196
申请日:2023-01-03
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , Rongzhou Shen , Vibhunandan Gavini , Hassan Haider Malik
IPC: G06F16/9535 , G06F40/00 , G06F16/9032 , G06F16/332
CPC classification number: G06F16/9535 , G06F40/00 , G06F16/90332 , G06F16/3329 , G10L15/1815
Abstract: Techniques for performing outputting additional content associated with but nonresponsive to an input command are described. A system receives input data from a device. The system determines an intent representing the input data and receives first output data responsive to the input data. The system determines, based on context data, that additional content associated with the first output data but nonresponsive to the input data should be output. The system receives second output data associated with but nonresponsive to the input data thereafter. The system then presents first content corresponding to the first output data and second content corresponding to the second output data.
-
公开(公告)号:US11100922B1
公开(公告)日:2021-08-24
申请号:US15716477
申请日:2017-09-26
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , Vibhav Salgaonkar , Philip Lee , Bo Li , Vibhu Gavini
IPC: G10L15/04 , G06F40/00 , G10L15/22 , G10L15/18 , G10L17/22 , H04L12/28 , G10L13/00 , G10L15/14 , G10L15/08
Abstract: This disclosure is directed to systems, methods, and devices related to providing the execution of multi-operation sequences based on a trigger occurring which may be a voice-controlled utterance or execution may be based on a trigger occurring and a condition occurring. In accordance with various principles disclosed herein, multi-operation sequences may be executed based on voice-controlled commands and the identification that a trigger has occurred. The voice-controlled electronic devices can be configured to communicate with, and to directly control the operation of, a wide array of other devices. These devices can include, without limitation, outlets that can be turned ON and OFF remotely such that anything plugged into them can be controlled, turning lights ON and OFF, setting the temperature of a network accessible thermostat, etc.
-
公开(公告)号:US20200168239A1
公开(公告)日:2020-05-28
申请号:US16775228
申请日:2020-01-28
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Rohan Mutagi , Vikram Kumar Gundeti , Frederic Johan Georges Deramat
IPC: G10L21/06
Abstract: A speech-based system includes a local device in a user premises and a network-based control service that directs the local device to perform actions for a user. The control service may specify a first action that is to be performed upon detection by the local device of a stimulus. In some cases, performing the first action may rely on the availability of network communications with the control service or with another service. In these cases, the control service also specifies a second, fallback action that does not rely upon network communications. Upon detecting the stimulus, the local device performs the first action if network communications are available. If network communications are not available, the local device performs the second, fallback action.
-
公开(公告)号:US20200082823A1
公开(公告)日:2020-03-12
申请号:US16569780
申请日:2019-09-13
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , Felix Wu , Rongzhou Shen , Neelam Satish Agrawal , Vibhunandan Gavini , Pablo Carballude Gonzalez
Abstract: Configurable core domains of a speech processing system are described. A core domain output data format for a given command is originally configured with default content portions. When a user indicates additional content should be output for the command, the speech processing system creates a new output data format for the core domain. The new output data format is user specific and includes both default content portions as well as user preferred content portions.
-
公开(公告)号:US10027662B1
公开(公告)日:2018-07-17
申请号:US15370103
申请日:2016-12-06
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , Chintan Gohil , Sai Sailesh Kopuri , Philip Alexander Lee , Felix Wu , Nancy Yi Liang
Abstract: Systems, methods, and devices for dynamically authenticating a user are disclosed. A speech-controlled device captures a spoken command, and sends audio data corresponding thereto to a server. The server determines the audio data includes a spoken command to receive content, and therefrom determines a source storing the content. The server also determines threshold user authentication confidence score data associated with the content source. Based at least in part on the threshold user authentication confidence score data, the server determines a user authentication technique, and a device configured to capture user authentication data. The server determines user authentication confidence score data using user authentication data received from the device, and determines weighted user authentication confidence score data therefrom. If the weighted user authentication confidence score data satisfies the threshold user authentication confidence score data, the server receives the requested content from the content source.
-
公开(公告)号:US20240233726A1
公开(公告)日:2024-07-11
申请号:US18615766
申请日:2024-03-25
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , Isaac Michael Taylor
CPC classification number: G10L15/22 , G06F3/167 , G10L2015/223
Abstract: Techniques for naming devices via voice commands are described herein. For instance, a user may issue a voice command to a voice-controlled device stating, “you are the kitchen device”. Thereafter, the device may respond to voice commands directed, by name, to this device. For instance, the user may issue a voice command requesting to “play music on my kitchen device”. Given that the user has configured the device to respond to this name, the device may respond to the command by outputting the requested music.
-
公开(公告)号:US12014117B2
公开(公告)日:2024-06-18
申请号:US17301703
申请日:2021-04-12
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , He Lu , Willy Lew Yuk Vong , Michael Dale Whiteley , Fred Torok , Shikher Sitoke , David Ross Bronaugh , Bo Li
CPC classification number: G06F3/167 , G10L15/02 , G10L15/18 , G10L15/22 , G10L15/26 , G10L17/22 , H04L12/2816 , G10L2015/223
Abstract: Techniques for creating groups of devices for controlling these groups with voice commands are described herein. For instance, an environment may include an array of secondary devices (or “smart appliances”, or simply “devices”) that are configured to perform an array of operations. Users may request to create different groups of these devices, such that the users may control entire groups at a single time with individual voice commands.
-
公开(公告)号:US11996092B1
公开(公告)日:2024-05-28
申请号:US17516227
申请日:2021-11-01
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson , Rohan Mutagi
IPC: G10L15/02 , G10L15/20 , G10L15/22 , G10L15/26 , G10L17/00 , G10L21/0208 , G10L21/0272 , G10L25/78 , G10L25/84 , G10L25/87
CPC classification number: G10L15/20 , G10L15/22 , G10L15/26 , G10L17/00 , G10L21/0208 , G10L21/0272 , G10L25/84 , G10L2015/223 , G10L2021/02087 , G10L2025/783
Abstract: A system has multiple audio-enabled devices that communicate with one another over an open microphone mode of communication. When a user says a trigger word, the nearest device validates the trigger word and opens a communication channel with another device. As the user talks, the device receives the speech and generates an audio signal representation that includes the user speech and may additionally include other background or interfering sound from the environment. The device transmits the audio signal to the other device as part of a conversation, while continually analyzing the audio signal to detect when the user stops talking. This analysis may include watching for a lack of speech in the audio signal for a period of time, or an abrupt change in context of the speech (indicating the speech is from another source), or canceling noise or other interfering sound to isolate whether the user is still speaking. Once the device confirms that the user has stopped talking, the device transitions from a transmission mode to a reception mode to await a reply in the conversation.
-
公开(公告)号:US20240079005A1
公开(公告)日:2024-03-07
申请号:US18369291
申请日:2023-09-18
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , Felix Wu , Rongzhou Shen , Neelam Satish Agrawal , Vibhunandan Gavini , Pablo Carballude Gonzalez
CPC classification number: G10L15/22 , G06F3/167 , G06F16/00 , G10L13/08 , G10L15/1815 , G10L15/30 , G10L17/22 , G10L13/00 , G10L2015/223
Abstract: Configurable core domains of a speech processing system are described. A core domain output data format for a given command is originally configured with default content portions. When a user indicates additional content should be output for the command, the speech processing system creates a new output data format for the core domain. The new output data format is user specific and includes both default content portions as well as user preferred content portions.
-
-
-
-
-
-
-
-
-