-
公开(公告)号:US10055190B2
公开(公告)日:2018-08-21
申请号:US14107931
申请日:2013-12-16
Applicant: Amazon Technologies, Inc.
Inventor: Vikram Kumar Gundeti , Fred Torok , Peter Spalding VanLund , Frederic Johan Georges Deramat
CPC classification number: G06F3/165
Abstract: A speech-based system includes a local device in a user premises and a remote service that uses the local device to conduct speech dialogs with a user. The local device may also be directed to play audio such as music, audio books, etc. When designating audio for playing by the local device, the remote service may specify that the audio is either background audio or foreground audio. For background audio, the service indicates whether the background audio is mixable. For foreground audio, the service indicates an interrupt behavior. When the local device is playing background audio and receives foreground audio, the background audio is paused, attenuated, or not changed based on the indicated interrupt behavior of the foreground audio and whether the background audio has been designated as being mixable.
-
公开(公告)号:US09864594B1
公开(公告)日:2018-01-09
申请号:US14501601
申请日:2014-09-30
Applicant: Amazon Technologies, Inc.
Inventor: Michael Dale Whiteley , Anastasios Iliopoulos , Rohan Mutagi , Bo Li , Fred Torok , Ian Daniel Lehmann
CPC classification number: G06F8/65 , G06F9/44505 , G06F11/3688 , H03M13/09 , H04B5/0037 , H04B5/0075 , H04B7/26 , H04W52/0229 , H04W52/0274 , Y02D70/00
Abstract: Embodiments of the disclosure permit upgrading software and/or testing operation of an electronic device within an unopened package. In one embodiment, an electronic device can be powered on inductively while contained in its unopened packaging. In other aspects, the powered on electronic device can receive a software upgrade and/or test information. In addition, the electronic device can validate the software upgrade, and can replace software present in the electronic device with the received software upgrade. The electronic device also can validate at least a portion of the test information, and can implement one or more tests as conveyed in the test information. Further, the electronic device can communicate information wirelessly in response to the test(s). Such information can be indicative or otherwise representative of one or more results of the implemented test(s).
-
公开(公告)号:US20140180697A1
公开(公告)日:2014-06-26
申请号:US13723026
申请日:2012-12-20
Applicant: AMAZON TECHNOLOGIES, INC.
Inventor: Fred Torok , Frédéric Johan Georges Deramat , Vikram Kumar Gundeti
IPC: G10L15/22
CPC classification number: G10L15/26 , G06F17/30684 , G06F17/3074 , G06F17/30746 , G06F17/30778 , G10L15/08 , G10L15/222 , G10L15/30
Abstract: Features are disclosed for generating markers for elements or other portions of an audio presentation so that a speech processing system may determine which portion of the audio presentation a user utterance refers to. For example, an utterance may include a pronoun with no explicit antecedent. The marker may be used to associate the utterance with the corresponding content portion for processing. The markers can be provided to a client device with a text-to-speech (“TTS”) presentation. The markers may then be provided to a speech processing system along with a user utterance captured by the client device. The speech processing system, which may include automatic speech recognition (“ASR”) modules and/or natural language understanding (“NLU”) modules, can generate hints based on the marker. The hints can be provided to the ASR and/or NLU modules in order to aid in processing the meaning or intent of a user utterance.
Abstract translation: 公开了用于为音频呈现的元件或其他部分生成标记的特征,使得语音处理系统可以确定用户话语所指的音频呈现的哪一部分。 例如,话语可能包括没有明确先行词的代词。 标记可以用于将话语与相应的内容部分相关联以进行处理。 可以将标记提供给具有文本到语音(“TTS”)呈现的客户端设备。 然后可以将标记与客户端设备捕获的用户话语一起提供给语音处理系统。 可以包括自动语音识别(“ASR”)模块和/或自然语言理解(“NLU”)模块的语音处理系统可以基于标记产生提示。 可以将提示提供给ASR和/或NLU模块,以帮助处理用户话语的含义或意图。
-
公开(公告)号:US12051415B1
公开(公告)日:2024-07-30
申请号:US18224259
申请日:2023-07-20
Applicant: Amazon Technologies, Inc.
Inventor: Gonzalo Alvarez Barrio , Shantanu Vikas Kurhekar , Bharath Bhimanaik Kumar , Fred Torok , Frederic J Deramat
CPC classification number: G10L15/22 , G10L15/1815 , G10L15/30 , H04W8/005 , G10L2015/223 , G10L2015/228 , H04W88/08
Abstract: Systems and methods for integration of speech processing functionality with organization systems are disclosed. For example, a voice interface application may be created to enable a voice interface functionality for devices associated with an organization. Space identifiers of spaces of the organization may be created and associated with the voice interface application. Devices associated with the space identifiers may be enabled for utilizing the voice interface application and may be set up utilizing wireless network identifiers associated with the spaces and/or the organization.
-
公开(公告)号:US12014117B2
公开(公告)日:2024-06-18
申请号:US17301703
申请日:2021-04-12
Applicant: Amazon Technologies, Inc.
Inventor: Rohan Mutagi , He Lu , Willy Lew Yuk Vong , Michael Dale Whiteley , Fred Torok , Shikher Sitoke , David Ross Bronaugh , Bo Li
CPC classification number: G06F3/167 , G10L15/02 , G10L15/18 , G10L15/22 , G10L15/26 , G10L17/22 , H04L12/2816 , G10L2015/223
Abstract: Techniques for creating groups of devices for controlling these groups with voice commands are described herein. For instance, an environment may include an array of secondary devices (or “smart appliances”, or simply “devices”) that are configured to perform an array of operations. Users may request to create different groups of these devices, such that the users may control entire groups at a single time with individual voice commands.
-
公开(公告)号:US11626116B2
公开(公告)日:2023-04-11
申请号:US16775228
申请日:2020-01-28
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Rohan Mutagi , Vikram Kumar Gundeti , Frederic Johan Georges Deramat
Abstract: A speech-based system includes a local device in a user premises and a network-based control service that directs the local device to perform actions for a user. The control service may specify a first action that is to be performed upon detection by the local device of a stimulus. In some cases, performing the first action may rely on the availability of network communications with the control service or with another service. In these cases, the control service also specifies a second, fallback action that does not rely upon network communications. Upon detecting the stimulus, the local device performs the first action if network communications are available. If network communications are not available, the local device performs the second, fallback action.
-
公开(公告)号:US20210144477A1
公开(公告)日:2021-05-13
申请号:US17157239
申请日:2021-01-25
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Michael Alan Pogue , Vikram Kumar Gundeti , Dharini Sundaram
Abstract: Synchronized output of audio on a group of devices can comprise sending audio data from an audio distribution master device to one or more slave devices in the group. Scores can be assigned to respective audio playback devices, the scores being indicative of a performance level of the respective audio playback devices acting as a master device. The device with the highest score is designated as a candidate master device and one or more remaining devices are designated as a candidate slave(s). A throughput test is conducted with the highest scoring device acting as the candidate master device. The results of the throughput test are used to determine a master device for a group of devices. Latency of the throughput test can be reduced by using a prescribed time period for completion of the throughput test, and/or by selecting a first group configuration to passes the throughput test.
-
公开(公告)号:US10904665B2
公开(公告)日:2021-01-26
申请号:US16377044
申请日:2019-04-05
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Michael Alan Pogue , Vikram Kumar Gundeti , Dharini Sundaram
Abstract: Synchronized output of audio on a group of devices can comprise sending audio data from an audio distribution master device to one or more slave devices in the group. Scores can be assigned to respective audio playback devices, the scores being indicative of a performance level of the respective audio playback devices acting as a master device. The device with the highest score is designated as a candidate master device and one or more remaining devices are designated as a candidate slave(s). A throughput test is conducted with the highest scoring device acting as the candidate master device. The results of the throughput test are used to determine a master device for a group of devices. Latency of the throughput test can be reduced by using a prescribed time period for completion of the throughput test, and/or by selecting a first group configuration to passes the throughput test.
-
公开(公告)号:US10178185B2
公开(公告)日:2019-01-08
申请号:US15589589
申请日:2017-05-08
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Frederic Johan Georges Deramat , Vikram Kumar Gundeti , Peter Spalding VanLund
Abstract: Techniques for creating a persistent connection between client devices and one or more remote computing resources, which may form a portion of a network-accessible computing platform. This connection may be considered “permanent” or “nearly permanent” to allow the client device to both send data to and receive data from the remote resources at nearly any time. In addition, both the client device and the remote resources may establish virtual channels over this single connection. If no data is exchanged between the client device and the remote computing resources for a threshold amount of time, then the connection may be severed and the client device may attempt to establish a new connection with the remote computing resources.
-
公开(公告)号:US20180233137A1
公开(公告)日:2018-08-16
申请号:US15433953
申请日:2017-02-15
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Michael Alan Pogue , Vikram Kumar Gundeti , Dharini Sundaram
CPC classification number: G10L15/22 , G06F3/167 , G10L13/08 , G10L15/183 , G10L15/30 , G10L2015/025 , G10L2015/223
Abstract: A user can utter a voice command in an environment where multiple audio playback devices are located to have audio output on a single device, or a predefined group of devices in a synchronized manner. In instances when the voice command uttered by the user does not specify a target for audio output, an implicit target selection algorithm can evaluate one or more criteria to determine an appropriate target for output of the audio corresponding to the voice command. An example criterion is met if a predetermined time period has lapsed since a last utterance was detected by a device in the environment. However, other criteria can be evaluated for determining a target output device(s).
-
-
-
-
-
-
-
-
-