-
公开(公告)号:US20230282206A1
公开(公告)日:2023-09-07
申请号:US18182762
申请日:2023-03-13
Applicant: Amazon Technologies, Inc.
Inventor: Munir Mahmood , Leopold Bushkin , Alexander Thomas Loeb , Michael Schwartz , Mohammed Arif , Rongzhou Shen , Vikram Kumar Gundeti , Shemyla Anwar , Yaser Khan , Edward Page Foyle , Bo Li
CPC classification number: G10L15/1815 , G10L15/22 , G10L15/30 , G10L13/00 , G10L2015/223
Abstract: Techniques for a natural language processing (NLP) system to implement more than one assistant are described. The NLP system may receive a natural language input corresponding to more than one user command. The NLP system may respond to a first command, of the natural language input, using a TTS voice of a first NLP system assistant. The NLP system may respond to a second command, of the natural language input, using a TTS voice of a second NLP system assistant.
-
公开(公告)号:US11626116B2
公开(公告)日:2023-04-11
申请号:US16775228
申请日:2020-01-28
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Rohan Mutagi , Vikram Kumar Gundeti , Frederic Johan Georges Deramat
Abstract: A speech-based system includes a local device in a user premises and a network-based control service that directs the local device to perform actions for a user. The control service may specify a first action that is to be performed upon detection by the local device of a stimulus. In some cases, performing the first action may rely on the availability of network communications with the control service or with another service. In these cases, the control service also specifies a second, fallback action that does not rely upon network communications. Upon detecting the stimulus, the local device performs the first action if network communications are available. If network communications are not available, the local device performs the second, fallback action.
-
公开(公告)号:US11574621B1
公开(公告)日:2023-02-07
申请号:US16734185
申请日:2020-01-03
Applicant: Amazon Technologies, Inc.
Inventor: Peter Spalding VanLund , Nicolas Anton Medhurst Hertl , Peter Paul Henri Carbon , Vikram Kumar Gundeti
Abstract: A system for enabling end user devices to access third party cloud-based resources. For example, the system may include a first party cloud-based resource for converting sound into a format accessible to the third party cloud-based resource, storing and/or maintaining state information related to the an open communication session between the end user device and the third party cloud-based resources, and converting text-based audio announcements into audio that may be output by the end user device. In some cases, the first party cloud-based resource may transmit user responses together with stored state information to the third party cloud-based resources in a manner that the third part cloud-based resources may treat each interaction with the end user as a separate communication session.
-
公开(公告)号:US11176933B1
公开(公告)日:2021-11-16
申请号:US16119803
申请日:2018-08-31
Applicant: Amazon Technologies, Inc.
Inventor: Vikram Kumar Gundeti
Abstract: Systems and methods for precomputed communication parameters are disclosed. A request to establish a communication channel may be received from a first device at a remote system. The remote system may query precached communication parameters associated with the first device to identify modalities and/or codecs associated with the first device. The remote system may also identify the second device to establish the communication channel with and may identify modalities and/or codecs associated with the second device, such as by utilizing user accounts associated with the devices. A transport-address type may be identified, such as based on whether the devices are associated with the same network access point identifier and/or based on past communication channels established between the devices.
-
公开(公告)号:US11120790B2
公开(公告)日:2021-09-14
申请号:US16580643
申请日:2019-09-24
Applicant: Amazon Technologies, Inc.
Inventor: Munir Mahmood , Leopold Bushkin , Alexander Thomas Loeb , Michael Schwartz , Mohammed Arif , Rongzhou Shen , Vikram Kumar Gundeti , Shemyla Anwar , Yaser Khan , Edward Page Foyle , Bo Li
Abstract: Techniques for a natural language processing (NLP) system to implement more than one assistant are described. The NLP system may receive a natural language input corresponding to more than one user command. The NLP system may respond to a first command, of the natural language input, using a TTS voice of a first NLP system assistant. The NLP system may respond to a second command, of the natural language input, using a TTS voice of a second NLP system assistant.
-
公开(公告)号:US20210144477A1
公开(公告)日:2021-05-13
申请号:US17157239
申请日:2021-01-25
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Michael Alan Pogue , Vikram Kumar Gundeti , Dharini Sundaram
Abstract: Synchronized output of audio on a group of devices can comprise sending audio data from an audio distribution master device to one or more slave devices in the group. Scores can be assigned to respective audio playback devices, the scores being indicative of a performance level of the respective audio playback devices acting as a master device. The device with the highest score is designated as a candidate master device and one or more remaining devices are designated as a candidate slave(s). A throughput test is conducted with the highest scoring device acting as the candidate master device. The results of the throughput test are used to determine a master device for a group of devices. Latency of the throughput test can be reduced by using a prescribed time period for completion of the throughput test, and/or by selecting a first group configuration to passes the throughput test.
-
公开(公告)号:US20210090555A1
公开(公告)日:2021-03-25
申请号:US16580643
申请日:2019-09-24
Applicant: Amazon Technologies, Inc.
Inventor: Munir Mahmood , Leopold Bushkin , Alexander Thomas Loeb , Michael Schwartz , Mohammed Arif , Rongzhou Shen , Vikram Kumar Gundeti , Shemyla Anwar , Yaser Khan , Edward Page Foyle , Bo Li
Abstract: Techniques for a natural language processing (NLP) system to implement more than one assistant are described. The NLP system may receive a natural language input corresponding to more than one user command. The NLP system may respond to a first command, of the natural language input, using a TTS voice of a first NLP system assistant. The NLP system may respond to a second command, of the natural language input, using a TTS voice of a second NLP system assistant.
-
公开(公告)号:US10904665B2
公开(公告)日:2021-01-26
申请号:US16377044
申请日:2019-04-05
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Michael Alan Pogue , Vikram Kumar Gundeti , Dharini Sundaram
Abstract: Synchronized output of audio on a group of devices can comprise sending audio data from an audio distribution master device to one or more slave devices in the group. Scores can be assigned to respective audio playback devices, the scores being indicative of a performance level of the respective audio playback devices acting as a master device. The device with the highest score is designated as a candidate master device and one or more remaining devices are designated as a candidate slave(s). A throughput test is conducted with the highest scoring device acting as the candidate master device. The results of the throughput test are used to determine a master device for a group of devices. Latency of the throughput test can be reduced by using a prescribed time period for completion of the throughput test, and/or by selecting a first group configuration to passes the throughput test.
-
公开(公告)号:US10178185B2
公开(公告)日:2019-01-08
申请号:US15589589
申请日:2017-05-08
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Frederic Johan Georges Deramat , Vikram Kumar Gundeti , Peter Spalding VanLund
Abstract: Techniques for creating a persistent connection between client devices and one or more remote computing resources, which may form a portion of a network-accessible computing platform. This connection may be considered “permanent” or “nearly permanent” to allow the client device to both send data to and receive data from the remote resources at nearly any time. In addition, both the client device and the remote resources may establish virtual channels over this single connection. If no data is exchanged between the client device and the remote computing resources for a threshold amount of time, then the connection may be severed and the client device may attempt to establish a new connection with the remote computing resources.
-
公开(公告)号:US20180233137A1
公开(公告)日:2018-08-16
申请号:US15433953
申请日:2017-02-15
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Michael Alan Pogue , Vikram Kumar Gundeti , Dharini Sundaram
CPC classification number: G10L15/22 , G06F3/167 , G10L13/08 , G10L15/183 , G10L15/30 , G10L2015/025 , G10L2015/223
Abstract: A user can utter a voice command in an environment where multiple audio playback devices are located to have audio output on a single device, or a predefined group of devices in a synchronized manner. In instances when the voice command uttered by the user does not specify a target for audio output, an implicit target selection algorithm can evaluate one or more criteria to determine an appropriate target for output of the audio corresponding to the voice command. An example criterion is met if a predetermined time period has lapsed since a last utterance was detected by a device in the environment. However, other criteria can be evaluated for determining a target output device(s).
-
-
-
-
-
-
-
-
-