-
公开(公告)号:US20180061403A1
公开(公告)日:2018-03-01
申请号:US15254458
申请日:2016-09-01
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
Abstract: Systems, methods, and devices for outputting visual indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to a voice message to a second speech-controlled device. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server performs speech processing on the audio data to determine a recipient and message content. The server then determines a second speech-controlled device associated with the recipient and sends the message content to the recipient's second speech-controlled device. Thereafter, the server receives an indication from the recipient's speech-controlled device that the second device is detecting speech, presumably in response to the original message. The server then causes a visual indication to be output by the first speech-controlled device, with the visual indication representing the recipient-speech controlled device is detecting speech.
-
公开(公告)号:US11862159B2
公开(公告)日:2024-01-02
申请号:US17464754
申请日:2021-09-02
Applicant: Amazon Technologies, Inc.
Inventor: Shambhavi Sathyanarayana Rao , Anna Chen Santos , Tony Roy Hardie
CPC classification number: G10L15/22 , G06F40/30 , G10L15/1815 , G10L2015/088 , G10L2015/223
Abstract: A system and method establishes a communication connection between a first device of a first user and a second device of a second user. Request data corresponding to a request to establish a communication connection with a second user is received, and a user profile associated with the second user is determined. One or more sensors of the second device receive input data corresponding to the environment of the second device, and an identity of the second user is determined based thereon. The communication connection is established and, based on the identity, the second device tracks movement of the second user in the environment.
-
公开(公告)号:US11368497B1
公开(公告)日:2022-06-21
申请号:US16134643
申请日:2018-09-18
Applicant: AMAZON TECHNOLOGIES, INC.
Inventor: Shambhavi Sathyanarayana Rao , Tony Roy Hardie , Anna Chen Santos
Abstract: An autonomous mobile device (AMD) may be used in an environment as a communication endpoint for voice or video communications. An incoming request for communication may initiate a process in which the AMD finds a user within the environment. Information obtained from sensors onboard the AMD or in the environment may be used to determine the whereabouts of the user. If an existing communication endpoint is not available to the user or cannot support a requested communication modality, the AMD may travel to permitted areas within the environment to find the user, while avoiding areas designated as private. Once found, communication may be established with the user. If the incoming request expires, the AMD may present information indicative of the request to the user.
-
公开(公告)号:US20220059090A1
公开(公告)日:2022-02-24
申请号:US17464754
申请日:2021-09-02
Applicant: Amazon Technologies, Inc.
Inventor: Shambhavi Sathyanarayana Rao , Anna Chen Santos , Tony Roy Hardie
Abstract: A system and method establishes a communication connection between a first device of a first user and a second device of a second user. Request data corresponding to a request to establish a communication connection with a second user is received, and a user profile associated with the second user is determined. One or more sensors of the second device receive input data corresponding to the environment of the second device, and an identity of the second user is determined based thereon. The communication connection is established and, based on the identity, the second device tracks movement of the second user in the environment.
-
公开(公告)号:US11240331B2
公开(公告)日:2022-02-01
申请号:US16458932
申请日:2019-07-01
Applicant: Amazon Technologies, Inc.
Inventor: Mario Chenier , Tony Roy Hardie , Nawdesh Uppal , Brian Oliver , Ran Mokady
Abstract: Methods and devices for causing a communications session between a first device and a second device to end based on lack of speech activity are described herein. In some embodiments, non-speech may be detected for both the first device and the second device. If the non-speech associated with the first device is determined to occur at a substantially same time as the non-speech associated with the second device, then this may indicate that no individuals are talking within earshot of their respective devices. Furthermore, the non-speech detected by the first device and the non-speech detected by the second device may both be of an amount of time that is greater than a predefined temporal threshold. If so, then the communications session may be caused to end because speech activity has not been detected by either device for more than the predefined temporal threshold.
-
公开(公告)号:US11232808B2
公开(公告)日:2022-01-25
申请号:US16394717
申请日:2019-04-25
Applicant: Amazon Technologies, Inc.
Inventor: Zhaoqing Ma , Tony Roy Hardie , Christo Frank Devaraj
Abstract: A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.
-
公开(公告)号:US20190325892A1
公开(公告)日:2019-10-24
申请号:US16458932
申请日:2019-07-01
Applicant: Amazon Technologies, Inc.
Inventor: Mario Chenier , Tony Roy Hardie , Nawdesh Uppal , Brian Oliver , Ran Mokady
Abstract: Methods and devices for causing a communications session between a first device and a second device to end based on lack of speech activity are described herein. In some embodiments, non-speech may be detected for both the first device and the second device. If the non-speech associated with the first device is determined to occur at a substantially same time as the non-speech associated with the second device, then this may indicate that no individuals are talking within earshot of their respective devices. Furthermore, the non-speech detected by the first device and the non-speech detected by the second device may both be of an amount of time that is greater than a predefined temporal threshold. If so, then the communications session may be caused to end because speech activity has not been detected by either device for more than the predefined temporal threshold.
-
公开(公告)号:US20190318758A1
公开(公告)日:2019-10-17
申请号:US16394717
申请日:2019-04-25
Applicant: Amazon Technologies, Inc.
Inventor: Zhaoqing Ma , Tony Roy Hardie , Christo Frank Devaraj
Abstract: A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.
-
公开(公告)号:US10339957B1
公开(公告)日:2019-07-02
申请号:US15385265
申请日:2016-12-20
Applicant: Amazon Technologies, Inc.
Inventor: Mario Chenier , Tony Roy Hardie , Nawdesh Uppal , Brian Oliver , Ran Mokady
Abstract: Methods and devices for causing a communications session between a first device and a second device to end based on lack of speech activity are described herein. In some embodiments, non-speech may be detected for both the first device and the second device. If the non-speech associated with the first device is determined to occur at a substantially same time as the non-speech associated with the second device, then this may indicate that no individuals are talking within earshot of their respective devices. Furthermore, the non-speech detected by the first device and the non-speech detected by the second device may both be of an amount of time that is greater than a predefined temporal threshold. If so, then the communications session may be caused to end because speech activity has not been detected by either device for more than the predefined temporal threshold.
-
公开(公告)号:US10325599B1
公开(公告)日:2019-06-18
申请号:US15392271
申请日:2016-12-28
Applicant: AMAZON TECHNOLOGIES, INC.
Inventor: Pathivada Rajsekhar Naidu , Tony Roy Hardie
Abstract: Systems and methods for extracting contact information from a message are described. A system can receive a message for a recipient, where the message originates from a message source having a first contact identifier (i.e., phone number, text address, etc.). The system can determine text data associated with the content of that message and process the text data to determine that the message refers to a second contact identifier that is different from the first contact identifier. The system may output the message to a recipient device (such as using text-to-speech, etc.) and may store an association between the message source and the second contact identifier. When the recipient speaks a command to reply to the first message or contact the message source, the system may determine the reply is intended for the message source and may route the reply using the second contact identifier included in the first message.
-
-
-
-
-
-
-
-
-