-
公开(公告)号:US11816394B1
公开(公告)日:2023-11-14
申请号:US18123849
申请日:2023-03-20
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson , Rohan Mutagi
CPC classification number: G06F3/167 , G10L13/00 , G10L13/027 , G10L15/22 , H04M3/56 , H04M3/563 , H04M3/568 , H04N7/147 , H04N7/15 , G10L2015/223 , H04M2201/40 , H04M2203/5009 , H04M2250/74
Abstract: Techniques for joining a device of a third user to a communication between a device of a first user and a device of a second user are described herein. For instance, two or more users may utilize respective computing devices to engage in a telephone call, a video call, an instant-messaging session, or any other type of communication in which the users communicate with each other audibly and/or visually. In some instances, a first user of the two users may issue a voice command requesting to join a device of a third user to the communication. One or more computing devices may recognize this voice command and may attempt to join a device of a third user to the communication.
-
公开(公告)号:US11798556B2
公开(公告)日:2023-10-24
申请号:US17575699
申请日:2022-01-14
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , Felix Wu , Rongzhou Shen , Neelam Satish Agrawal , Vibhunandan Gavini , Pablo Carballude Gonzalez
IPC: G10L15/22 , G10L15/30 , G10L17/22 , G10L15/18 , G10L13/08 , G06F3/16 , G06F16/00 , G10L13/00 , G10L15/00
CPC classification number: G10L15/22 , G06F3/167 , G06F16/00 , G10L13/08 , G10L15/1815 , G10L15/30 , G10L17/22 , G10L13/00 , G10L15/00 , G10L2015/223
Abstract: Configurable core domains of a speech processing system are described. A core domain output data format for a given command is originally configured with default content portions. When a user indicates additional content should be output for the command, the speech processing system creates a new output data format for the core domain. The new output data format is user specific and includes both default content portions as well as user preferred content portions.
-
公开(公告)号:US11580182B2
公开(公告)日:2023-02-14
申请号:US17099295
申请日:2020-11-16
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , Rongzhou Shen , Vibhunandan Gavini , Hassan Haider Malik
IPC: G06F16/90 , G06F16/9535 , G06F40/00 , G06F16/9032 , G06F16/332 , G10L15/18
Abstract: Techniques for performing outputting additional content associated with but nonresponsive to an input command are described. A system receives input data from a device. The system determines an intent representing the input data and receives first output data responsive to the input data. The system determines, based on context data, that additional content associated with the first output data but nonresponsive to the input data should be output. The system receives second output data associated with but nonresponsive to the input data thereafter. The system then presents first content corresponding to the first output data and second content corresponding to the second output data.
-
公开(公告)号:US11429345B2
公开(公告)日:2022-08-30
申请号:US16657938
申请日:2019-10-18
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , Michael Dale Whiteley , He Lu , Brian James Butler , Fred Torok , Willy Lew Yuk Vong , David Ross Bronaugh , Christopher Ryan Nies , Shikher Sitoke
Abstract: Techniques for remotely executing a secondary-device driver for generating commands for a secondary device are described herein. For instance, a secondary device (or “appliance”) may reside within an environment, along with a device to which the secondary device communicatively couples. The device may be configured to send control signals to the secondary device for causing the secondary device to perform certain operations. For instance, a user in the environment may provide, to the device, a request that the secondary device perform a certain operation. The device, which may lack some or all of a device driver associated with the secondary device, may then work with a remote service that executes the device driver for the purpose of receiving a command from the device driver and sending the command along to the secondary device. Upon receiving the command, the secondary device may perform the operation.
-
公开(公告)号:US11422772B1
公开(公告)日:2022-08-23
申请号:US16424285
申请日:2019-05-28
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , He Lu , Fred Torok , Willy Lew Yuk Vong , David Ross Bronaugh , Bo Li
Abstract: Techniques for causing different devices to perform different operations using a single voice command are described herein. In some instances, a user may define a “scene”, in which a user sets different devices to different states and then associates an utterance with those states or with the operations performed by the devices to reach those states. For instance, a user may dim a light, turn on his television, and turn on his set-top box before sending a request to a local device or to a remote service to associate those settings with a predefined utterance, such as “my movie scene”. Thereafter, the user may cause the light to dim, the television to turn on, and the set-top box to turn on simply by issuing the voice command “execute my movie scene”.
-
公开(公告)号:US10976996B1
公开(公告)日:2021-04-13
申请号:US16042389
申请日:2018-07-23
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , He Lu , Willy Lew Yuk Vong , Michael Dale Whiteley , Fred Torok , Shikher Sitoke , David Ross Bronaugh , Bo Li
Abstract: Techniques for creating groups of devices for controlling these groups with voice commands are described herein. For instance, an environment may include an array of secondary devices (or “smart appliances”, or simply “devices”) that are configured to perform an array of operations. Users may request to create different groups of these devices, such that the users may control entire groups at a single time with individual voice commands.
-
公开(公告)号:US10825454B1
公开(公告)日:2020-11-03
申请号:US16031909
申请日:2018-07-10
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , Isaac Michael Taylor
Abstract: Techniques for naming devices via voice commands are described herein. For instance, a user may issue a voice command to a voice-controlled device stating, “you are the kitchen device”. Thereafter, the device may respond to voice commands directed, by name, to this device. For instance, the user may issue a voice command requesting to “play music on my kitchen device”. Given that the user has configured the device to respond to this name, the device may respond to the command by outputting the requested music.
-
公开(公告)号:US10235129B1
公开(公告)日:2019-03-19
申请号:US14753933
申请日:2015-06-29
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson , Rohan Mutagi
IPC: G06Q10/10 , G06F17/28 , H04L29/08 , H04W88/06 , H04W88/02 , H04M3/56 , G06F3/16 , G10L13/027 , G10L15/22 , H04N7/15 , G10L13/04
Abstract: Techniques for joining a device of a third user to a communication between a device of a first user and a device of a second user are described herein. For instance, two or more users may utilize respective computing devices to engage in a telephone call, a video call, an instant-messaging session, or any other type of communication in which the users communicate with each other audibly and/or visually. In some instances, a first user of the two users may issue a voice command requesting to join a device of a third user to the communication. One or more computing devices may recognize this voice command and may attempt to join a device of a third user to the communication.
-
公开(公告)号:US10217461B1
公开(公告)日:2019-02-26
申请号:US15498086
申请日:2017-04-26
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson , Rohan Mutagi
IPC: A61F11/06 , H03B29/00 , G10K11/16 , G10L15/20 , G10L25/84 , G10L21/0272 , G10L21/0208 , G10L15/22 , G10L25/78
Abstract: A system has multiple audio-enabled devices that communicate with one another over an open microphone mode of communication. When a user says a trigger word, the nearest device validates the trigger word and opens a communication channel with another device. As the user talks, the device receives the speech and generates an audio signal representation that includes the user speech and may additionally include other background or interfering sound from the environment. The device transmits the audio signal to the other device as part of a conversation, while continually analyzing the audio signal to detect when the user stops talking. This analysis may include watching for a lack of speech in the audio signal for a period of time, or an abrupt change in context of the speech (indicating the speech is from another source), or canceling noise or other interfering sound to isolate whether the user is still speaking. Once the device confirms that the user has stopped talking, the device transitions from a transmission mode to a reception mode to await a reply in the conversation.
-
公开(公告)号:US10163437B1
公开(公告)日:2018-12-25
申请号:US15171787
申请日:2016-06-02
Applicant: Amazon Technologies, Inc.
Inventor: Lindo St. Angel , Nikko Strom , Rohan Mutagi
Abstract: Techniques for training machine-learning algorithms with the aid of voice tags are described herein. An environment may include sensors configured to generate sensor data and devices configured to perform operations. Sensor data as well as indications of actions performed by devices within the environment may be collected over time and analyzed to identify one or more patterns. Over time, a model that includes an association between this sensor data and device actions may be created and trained such that one or more device actions may be automatically initiated in response to identifying sensor data matching the sensor data of the model. To aid in the training, a user may utter a predefined voice tag each time she performs a particular sequence of actions, with the voice tag indicating to the system that temporally proximate sensor data and device-activity data should be used to train a particular model.
-
-
-
-
-
-
-
-
-