-
Publication number: US09324322B1
Publication date: 2016-04-26
Application number: US13920446
Application date: 2013-06-18
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok, Stan Weidner Salvador
CPC classification number: G10L15/22, G10L17/00, G10L25/51, G10L2015/223, H03G3/20, H03G3/342, H04M9/082, H04R3/02, H04R29/00, H04R2430/01
Abstract: A speech recognition system that also automatically recognizes and acts in response to significant audio interruptions. Received audio is compared with stored acoustic signatures of noises which may trigger a change in device operation, such as pausing, loudening or attenuating of content playback after hearing a certain audio interruption, such as a doorbell, etc. If the received audio matches a stored acoustic model, the system alters an operational state of one or more devices, which may or may not include itself.
-
Publication number: US20230421955A1
Publication date: 2023-12-28
Application number: US18360402
Application date: 2023-07-27
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok, Michael Alan Pogue, Vikram Kumar Gundeti, Dharini Sundaram
CPC classification number: H04R3/12, G06F3/165, H04R27/00, G06F3/167, H04R2420/07, H04R2420/03, H04R2227/005
Abstract: Synchronized output of audio on a group of devices can comprise sending audio data from an audio distribution master device to one or more slave devices in the group. Scores can be assigned to respective audio playback devices, the scores being indicative of a performance level of the respective audio playback devices acting as a master device. The device with the highest score is designated as a candidate master device and one or more remaining devices are designated as candidate slave(s). A throughput test is conducted with the highest scoring device acting as the candidate master device. The results of the throughput test are used to determine a master device for a group of devices. Latency of the throughput test can be reduced by using a prescribed time period for completion of the throughput test, and/or by selecting a first group configuration that passes the throughput test.
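The selection flow in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Device` class, the `elect_master` function, and the `passes_throughput_test` callback are hypothetical names, and falling back to the next-highest score when a candidate fails the test is an assumption made here to show "selecting a first group configuration that passes".

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence


@dataclass(frozen=True)
class Device:
    name: str
    score: float  # hypothetical score: fitness to act as the audio distribution master


def elect_master(
    devices: Sequence[Device],
    passes_throughput_test: Callable[[Device, List[Device]], bool],
) -> Optional[Device]:
    """Return the first candidate master whose group configuration passes the test.

    Candidates are tried from highest to lowest score (assumed fallback order).
    Returns None if no configuration passes.
    """
    ranked = sorted(devices, key=lambda d: d.score, reverse=True)
    for candidate in ranked:
        # Every other device in the group is a candidate slave for this trial.
        slaves = [d for d in ranked if d is not candidate]
        if passes_throughput_test(candidate, slaves):
            return candidate  # stop at the first passing configuration
    return None


group = [Device("kitchen", 0.4), Device("living-room", 0.9), Device("bedroom", 0.7)]
master = elect_master(group, lambda m, slaves: True)
```

Stopping at the first passing configuration avoids testing every possible master, which is one way to read the abstract's remark about reducing the latency of the throughput test.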
-
Publication number: US11758328B2
Publication date: 2023-09-12
Application number: US17751061
Application date: 2022-05-23
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok, Michael Alan Pogue, Vikram Kumar Gundeti, Dharini Sundaram
CPC classification number: H04R3/12, G06F3/165, G06F3/167, H04R27/00, H04R2227/005, H04R2420/03, H04R2420/07
Abstract: Synchronized output of audio on a group of devices can comprise sending audio data from an audio distribution master device to one or more slave devices in the group. Scores can be assigned to respective audio playback devices, the scores being indicative of a performance level of the respective audio playback devices acting as a master device. The device with the highest score is designated as a candidate master device and one or more remaining devices are designated as candidate slave(s). A throughput test is conducted with the highest scoring device acting as the candidate master device. The results of the throughput test are used to determine a master device for a group of devices. Latency of the throughput test can be reduced by using a prescribed time period for completion of the throughput test, and/or by selecting a first group configuration that passes the throughput test.
-
Publication number: US11756550B1
Publication date: 2023-09-12
Application number: US18087133
Application date: 2022-12-22
Applicant: Amazon Technologies, Inc.
Inventor: Gonzalo Alvarez Barrio, Shantanu Vikas Kurhekar, Bharath Bhimanaik Kumar, Fred Torok, Frederic J Deramat
CPC classification number: G10L15/22, G10L15/1815, G10L15/30, H04W8/005, G10L2015/223, G10L2015/228, H04W88/08
Abstract: Systems and methods for integration of speech processing functionality with organization systems are disclosed. For example, a voice interface application may be created to enable a voice interface functionality for devices associated with an organization. Space identifiers of spaces of the organization may be created and associated with the voice interface application. Devices associated with the space identifiers may be enabled for utilizing the voice interface application and may be set up utilizing wireless network identifiers associated with the spaces and/or the organization.
-
Publication number: US20200168239A1
Publication date: 2020-05-28
Application number: US16775228
Application date: 2020-01-28
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok, Rohan Mutagi, Vikram Kumar Gundeti, Frederic Johan Georges Deramat
IPC: G10L21/06
Abstract: A speech-based system includes a local device in a user premises and a network-based control service that directs the local device to perform actions for a user. The control service may specify a first action that is to be performed upon detection by the local device of a stimulus. In some cases, performing the first action may rely on the availability of network communications with the control service or with another service. In these cases, the control service also specifies a second, fallback action that does not rely upon network communications. Upon detecting the stimulus, the local device performs the first action if network communications are available. If network communications are not available, the local device performs the second, fallback action.
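The primary/fallback behaviour described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patent's implementation: `on_stimulus` and its three callback parameters are hypothetical names, and how network availability is actually detected is left to the caller.

```python
from typing import Callable, TypeVar

T = TypeVar("T")


def on_stimulus(
    network_available: Callable[[], bool],
    primary_action: Callable[[], T],
    fallback_action: Callable[[], T],
) -> T:
    """Run the network-dependent action if possible, else the local fallback.

    Both actions are assumed to have been specified in advance by the control
    service, so the local device can choose between them while offline.
    """
    if network_available():
        return primary_action()
    return fallback_action()


# Example: an alarm that streams audio when online and plays a stored tone otherwise.
result = on_stimulus(
    network_available=lambda: False,
    primary_action=lambda: "stream alarm audio from service",
    fallback_action=lambda: "play locally stored tone",
)
```

Because the fallback is delivered together with the primary action, the decision at stimulus time needs no round trip to the control service.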
-
Publication number: US10264358B2
Publication date: 2019-04-16
Application number: US15433874
Application date: 2017-02-15
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok, Michael Alan Pogue, Vikram Kumar Gundeti, Dharini Sundaram
Abstract: Synchronized output of audio on a group of devices can comprise sending audio data from an audio distribution master device to one or more slave devices in the group. Scores can be assigned to respective audio playback devices, the scores being indicative of a performance level of the respective audio playback devices acting as a master device. The device with the highest score is designated as a candidate master device and one or more remaining devices are designated as candidate slave(s). A throughput test is conducted with the highest scoring device acting as the candidate master device. The results of the throughput test are used to determine a master device for a group of devices. Latency of the throughput test can be reduced by using a prescribed time period for completion of the throughput test, and/or by selecting a first group configuration that passes the throughput test.
-
Publication number: US20180234765A1
Publication date: 2018-08-16
Application number: US15433874
Application date: 2017-02-15
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok, Michael Alan Pogue, Vikram Kumar Gundeti, Dharini Sundaram
CPC classification number: H04R3/12, G06F3/165, G06F3/167, H04R27/00, H04R2227/005, H04R2420/03, H04R2420/07
Abstract: Synchronized output of audio on a group of devices can comprise sending audio data from an audio distribution master device to one or more slave devices in the group. Scores can be assigned to respective audio playback devices, the scores being indicative of a performance level of the respective audio playback devices acting as a master device. The device with the highest score is designated as a candidate master device and one or more remaining devices are designated as candidate slave(s). A throughput test is conducted with the highest scoring device acting as the candidate master device. The results of the throughput test are used to determine a master device for a group of devices. Latency of the throughput test can be reduced by using a prescribed time period for completion of the throughput test, and/or by selecting a first group configuration that passes the throughput test.
-
Publication number: US09996148B1
Publication date: 2018-06-12
Application number: US13786254
Application date: 2013-03-05
Applicant: Amazon Technologies, Inc.
IPC: G06F3/01
CPC classification number: G06F3/01
Abstract: Features are disclosed for presenting multiple media items based on one or more rules defining how the items are to be presented. One media item may be presented, and during presentation any number of additional media items may be received or scheduled for presentation. Rules may define which media items have priority over others, which media items may interrupt others or be interrupted, which media items may be delayed or presented early, whether particular media items are time-critical such that they are not to be delayed but rather should take presentation priority over others, etc. Metadata may be associated with particular media items or categories thereof. The metadata can provide details regarding how the rules should be applied to those media items. User feedback may also be obtained, and may affect the further application of the rules.
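One way to picture rule-driven presentation is a small decision function over per-item metadata. The sketch below is illustrative only: the `MediaItem` fields and the specific rule order in `should_interrupt` are assumptions chosen for the example, not rules taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MediaItem:
    name: str
    priority: int                # hypothetical metadata: higher means more important
    can_interrupt: bool = False  # may this item cut in on another item?
    interruptible: bool = True   # may this item be cut off by another item?
    time_critical: bool = False  # must not be delayed


def should_interrupt(current: MediaItem, incoming: MediaItem) -> bool:
    """Decide whether `incoming` should replace `current` right away.

    Assumed rule order: an uninterruptible item always finishes, a time-critical
    item is never delayed, and otherwise an item that is allowed to interrupt
    wins only if its priority is strictly higher.
    """
    if not current.interruptible:
        return False
    if incoming.time_critical:
        return True
    return incoming.can_interrupt and incoming.priority > current.priority


music = MediaItem("music", priority=1)
timer = MediaItem("timer alert", priority=5, time_critical=True)
news = MediaItem("news briefing", priority=2)
```

With these example items, the timer alert interrupts the music immediately, while the news briefing waits because it is not allowed to interrupt.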
-
Publication number: US09712625B2
Publication date: 2017-07-18
Application number: US13858753
Application date: 2013-04-08
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok, Frederic Johan Georges Deramat, Vikram Kumar Gundeti, Peter Spalding VanLund
Abstract: Techniques for creating a persistent connection between client devices and one or more remote computing resources, which may form a portion of a network-accessible computing platform. This connection may be considered “permanent” or “nearly permanent” to allow the client device to both send data to and receive data from the remote resources at nearly any time. In addition, both the client device and the remote resources may establish virtual channels over this single connection. If no data is exchanged between the client device and the remote computing resources for a threshold amount of time, then the connection may be severed and the client device may attempt to establish a new connection with the remote computing resources.
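The idle-threshold behaviour in the last sentence of this abstract can be sketched without any real networking. Everything here is hypothetical: the `IdleTrackedConnection` class, its method names, and the injectable `clock` parameter exist only for illustration, and actual socket handling and virtual channels are omitted.

```python
import time
from typing import Callable


class IdleTrackedConnection:
    """Tracks activity on one long-lived connection and re-establishes it when idle."""

    def __init__(self, idle_timeout_s: float, clock: Callable[[], float] = time.monotonic):
        self._idle_timeout_s = idle_timeout_s
        self._clock = clock
        self.connect_count = 0
        self._connect()

    def _connect(self) -> None:
        # Placeholder for opening the real connection to the remote resources.
        self.connect_count += 1
        self._last_activity = self._clock()

    def record_activity(self) -> None:
        """Call whenever data is sent or received on any virtual channel."""
        self._last_activity = self._clock()

    def check(self) -> bool:
        """Sever and re-establish the connection if idle for the threshold time.

        Returns True if a reconnection happened.
        """
        if self._clock() - self._last_activity >= self._idle_timeout_s:
            self._connect()
            return True
        return False


conn = IdleTrackedConnection(idle_timeout_s=30.0)
conn.record_activity()
```

Passing the clock in as a parameter is a design choice for testability: a fake clock lets the timeout logic be exercised without waiting in real time.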
-
Publication number: US09240187B2
Publication date: 2016-01-19
Application number: US14642365
Application date: 2015-03-09
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok, Frédéric Johan Georges Deramat, Vikram Kumar Gundeti
CPC classification number: G10L15/26, G06F17/30684, G06F17/3074, G06F17/30746, G06F17/30778, G10L15/08, G10L15/222, G10L15/30
Abstract: Features are disclosed for generating markers for elements or other portions of an audio presentation so that a speech processing system may determine which portion of the audio presentation a user utterance refers to. For example, an utterance may include a pronoun with no explicit antecedent. The marker may be used to associate the utterance with the corresponding content portion for processing. The markers can be provided to a client device with a text-to-speech (“TTS”) presentation. The markers may then be provided to a speech processing system along with a user utterance captured by the client device. The speech processing system, which may include automatic speech recognition (“ASR”) modules and/or natural language understanding (“NLU”) modules, can generate hints based on the marker. The hints can be provided to the ASR and/or NLU modules in order to aid in processing the meaning or intent of a user utterance.