-
Publication number: US20230223023A1
Publication date: 2023-07-13
Application number: US18149181
Application date: 2023-01-03
Applicant: Amazon Technologies, Inc.
Inventor: Ariya Rastrow, Eli Joshua Fidler, Roland Maximilian Rolf Maas, Nikko Strom, Aaron Eakin, Diamond Bishop, Bjorn Hoffmeister, Sanjeev Mishra
CPC classification number: G10L15/22, G10L15/26, G10L15/1815, G10L2015/088, G10L2015/223, G10L2015/228
Abstract: A speech interface device is configured to detect an interrupt event and process a voice command without detecting a wakeword. The device includes on-device interrupt architecture configured to detect when device-directed speech is present and send audio data to a remote system for speech processing. This architecture includes an interrupt detector that detects an interrupt event (e.g., device-directed speech) with low latency, enabling the device to quickly lower a volume of output audio and/or perform other actions in response to a potential voice command. In addition, the architecture includes a device directed classifier that processes an entire utterance and corresponding semantic information and detects device-directed speech with high accuracy. Using the device directed classifier, the device may reject the interrupt event and increase a volume of the output audio or may accept the interrupt event, causing the output audio to end and performing speech processing on the audio data.
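
The two-stage flow in this abstract (a fast interrupt detector that lowers the volume, then a slower device-directed classifier that accepts or rejects the interrupt) can be illustrated with a minimal Python sketch. Everything named here (`Player`, `InterruptHandler`, `Decision`, the `ducked_volume` level, and the `sent_to_remote` list standing in for the remote speech-processing system) is a hypothetical placeholder and is not taken from the patent.

```python
from dataclasses import dataclass, field
from enum import Enum, auto
from typing import List, Optional


class Decision(Enum):
    """Outcome of the (assumed) device-directed classifier."""
    ACCEPT = auto()
    REJECT = auto()


@dataclass
class Player:
    """Hypothetical stand-in for the device's audio output."""
    volume: float = 1.0
    playing: bool = True


@dataclass
class InterruptHandler:
    player: Player
    ducked_volume: float = 0.2  # assumed level; the abstract gives no value
    sent_to_remote: List[bytes] = field(default_factory=list)
    _saved_volume: Optional[float] = None

    def on_interrupt_event(self) -> None:
        """Fast path: lower the output volume as soon as an interrupt is detected."""
        if self._saved_volume is None:
            self._saved_volume = self.player.volume
            self.player.volume = min(self.player.volume, self.ducked_volume)

    def on_classifier_decision(self, decision: Decision, audio: bytes) -> None:
        """Slow path: the classifier has seen the whole utterance."""
        if self._saved_volume is None:
            return  # no pending interrupt, nothing to resolve
        if decision is Decision.REJECT:
            # Not device-directed: restore the previous volume.
            self.player.volume = self._saved_volume
        else:
            # Device-directed: end the output and forward audio for speech processing.
            self.player.playing = False
            self.sent_to_remote.append(audio)
        self._saved_volume = None
```

Splitting the work this way lets the volume change happen immediately, while the final accept or reject decision waits for the more accurate classifier.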
-
Publication number: US12236950B2
Publication date: 2025-02-25
Application number: US18149181
Application date: 2023-01-03
Applicant: Amazon Technologies, Inc.
Inventor: Ariya Rastrow, Eli Joshua Fidler, Roland Maximilian Rolf Maas, Nikko Strom, Aaron Eakin, Diamond Bishop, Bjorn Hoffmeister, Sanjeev Mishra
Abstract: A speech interface device is configured to detect an interrupt event and process a voice command without detecting a wakeword. The device includes on-device interrupt architecture configured to detect when device-directed speech is present and send audio data to a remote system for speech processing. This architecture includes an interrupt detector that detects an interrupt event (e.g., device-directed speech) with low latency, enabling the device to quickly lower a volume of output audio and/or perform other actions in response to a potential voice command. In addition, the architecture includes a device directed classifier that processes an entire utterance and corresponding semantic information and detects device-directed speech with high accuracy. Using the device directed classifier, the device may reject the interrupt event and increase a volume of the output audio or may accept the interrupt event, causing the output audio to end and performing speech processing on the audio data.
-
Publication number: US20210295833A1
Publication date: 2021-09-23
Application number: US16822744
Application date: 2020-03-18
Applicant: Amazon Technologies, Inc.
Inventor: Ariya Rastrow, Eli Joshua Fidler, Roland Maximilian Rolf Maas, Nikko Strom, Aaron Eakin, Diamond Bishop, Bjorn Hoffmeister, Sanjeev Mishra
Abstract: A speech interface device is configured to detect an interrupt event and process a voice command without detecting a wakeword. The device includes on-device interrupt architecture configured to detect when device-directed speech is present and send audio data to a remote system for speech processing. This architecture includes an interrupt detector that detects an interrupt event (e.g., device-directed speech) with low latency, enabling the device to quickly lower a volume of output audio and/or perform other actions in response to a potential voice command. In addition, the architecture includes a device directed classifier that processes an entire utterance and corresponding semantic information and detects device-directed speech with high accuracy. Using the device directed classifier, the device may reject the interrupt event and increase a volume of the output audio or may accept the interrupt event, causing the output audio to end and performing speech processing on the audio data.
-
Publication number: US09571357B1
Publication date: 2017-02-14
Application number: US14725759
Application date: 2015-05-29
Applicant: Amazon Technologies, Inc.
Inventor: Richard Wasserman, Yusuf Bootwala, Thomas Park, Aaron Eakin
CPC classification number: H04L43/04, G06F21/6254, H04L51/046, H04L51/12, H04L51/28, H04L63/0421, H04L67/02, H04L67/10
Abstract: Disclosed are various embodiments related to intercepting, modifying, and analyzing messages being exchanged between two entities. In one embodiment, among others, a method comprises intercepting messages being exchanged between a first entity and a second entity. The intercepted messages are modified to substitute sender information identifying the sender with proxy sender information such that the sender is anonymous to the intended recipient. The messages are also analyzed to determine at least one performance metric associated with the messages being exchanged.
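
The sender substitution and performance metric described in this abstract can be sketched in Python as follows. This is only an illustration under stated assumptions: the `Message` type, the `proxy_for` hashing scheme, the `proxy.example` domain, and the choice of mean reply delay as the metric are all hypothetical and do not come from the patent. A truncated hash is also not strong anonymization; a real system would more plausibly keep a stored random mapping.

```python
import hashlib
from dataclasses import dataclass, replace
from typing import List, Optional


@dataclass(frozen=True)
class Message:
    sender: str
    recipient: str
    body: str
    sent_at: float  # seconds since an arbitrary start


def proxy_for(sender: str, domain: str = "proxy.example") -> str:
    """Derive a stable proxy address for a sender (assumed scheme)."""
    digest = hashlib.sha256(sender.encode("utf-8")).hexdigest()[:12]
    return f"{digest}@{domain}"


def anonymize(msg: Message) -> Message:
    """Return a copy of the intercepted message with the sender replaced by its proxy."""
    return replace(msg, sender=proxy_for(msg.sender))


def mean_reply_delay(messages: List[Message]) -> Optional[float]:
    """One possible metric: average gap between a message and the next one from the other party."""
    ordered = sorted(messages, key=lambda m: m.sent_at)
    delays = [
        later.sent_at - earlier.sent_at
        for earlier, later in zip(ordered, ordered[1:])
        if later.sender != earlier.sender
    ]
    return sum(delays) / len(delays) if delays else None
```

Because `proxy_for` is deterministic, the same sender always maps to the same proxy address, so a conversation stays threaded even though the recipient never sees the real sender.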
-
Publication number: US11551685B2
Publication date: 2023-01-10
Application number: US16822744
Application date: 2020-03-18
Applicant: Amazon Technologies, Inc.
Inventor: Ariya Rastrow, Eli Joshua Fidler, Roland Maximilian Rolf Maas, Nikko Strom, Aaron Eakin, Diamond Bishop, Bjorn Hoffmeister, Sanjeev Mishra
Abstract: A speech interface device is configured to detect an interrupt event and process a voice command without detecting a wakeword. The device includes on-device interrupt architecture configured to detect when device-directed speech is present and send audio data to a remote system for speech processing. This architecture includes an interrupt detector that detects an interrupt event (e.g., device-directed speech) with low latency, enabling the device to quickly lower a volume of output audio and/or perform other actions in response to a potential voice command. In addition, the architecture includes a device directed classifier that processes an entire utterance and corresponding semantic information and detects device-directed speech with high accuracy. Using the device directed classifier, the device may reject the interrupt event and increase a volume of the output audio or may accept the interrupt event, causing the output audio to end and performing speech processing on the audio data.
-
Publication number: US11211058B1
Publication date: 2021-12-28
Application number: US16577394
Application date: 2019-09-20
Applicant: Amazon Technologies, Inc.
Inventor: Aaron Eakin, Angela Sun, Ankur Gandhe, Ariya Rastrow, Chenlei Guo, Xing Fan
IPC: G10L15/197, G10L15/30, G10L15/22
Abstract: Described herein is a system for prompting a user for clarification when an automatic speech recognition (ASR) system encounters ambiguity with respect to the user's input. The feedback provided by the user is used to retrain machine-learning models and/or to generate new machine-learning models. Based on the type of ambiguity, the system may determine to retrain one or more ASR models that are widely used by the system or to generate/update one or more user-specific models that are used to process inputs from one or more particular users.
-
Publication number: US09049102B1
Publication date: 2015-06-02
Application number: US13918235
Application date: 2013-06-14
Applicant: Amazon Technologies, Inc.
Inventor: Rich Wasserman, Yusuf Bootwala, Thomas Park, Aaron Eakin
CPC classification number: H04L43/04, G06F21/6254, H04L51/046, H04L51/12, H04L51/28, H04L63/0421, H04L67/02, H04L67/10
Abstract: Disclosed are various embodiments of a system. In one embodiment, among others, a method comprises intercepting a communication between a first party and a second party in a communication forum. The communication includes first party proxy information as an intended recipient information. The method further comprises accessing a communication pair using the intended recipient information. The intended recipient information is associated with second party proxy information. The second party proxy information is associated with second party information. Additionally, the method comprises determining whether the identity of the sender is valid.