-
Publication number: US11996092B1
Publication date: 2024-05-28
Application number: US17516227
Application date: 2021-11-01
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson , Rohan Mutagi
IPC: G10L15/02 , G10L15/20 , G10L15/22 , G10L15/26 , G10L17/00 , G10L21/0208 , G10L21/0272 , G10L25/78 , G10L25/84 , G10L25/87
CPC classification number: G10L15/20 , G10L15/22 , G10L15/26 , G10L17/00 , G10L21/0208 , G10L21/0272 , G10L25/84 , G10L2015/223 , G10L2021/02087 , G10L2025/783
Abstract: A system has multiple audio-enabled devices that communicate with one another over an open microphone mode of communication. When a user says a trigger word, the nearest device validates the trigger word and opens a communication channel with another device. As the user talks, the device receives the speech and generates an audio signal representation that includes the user speech and may additionally include other background or interfering sound from the environment. The device transmits the audio signal to the other device as part of a conversation, while continually analyzing the audio signal to detect when the user stops talking. This analysis may include watching for a lack of speech in the audio signal for a period of time, or an abrupt change in context of the speech (indicating the speech is from another source), or canceling noise or other interfering sound to isolate whether the user is still speaking. Once the device confirms that the user has stopped talking, the device transitions from a transmission mode to a reception mode to await a reply in the conversation.
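The first analysis named in the abstract above (watching for a lack of speech for a period of time) can be illustrated with a minimal sketch. This is not the patented implementation: the energy-based silence test, the `energy_threshold` and `max_silent_frames` parameters, and the names `Mode` and `run_turn` are all assumptions made for illustration.

```python
from enum import Enum
from math import sqrt


class Mode(Enum):
    TRANSMIT = "transmit"
    RECEIVE = "receive"


def rms(frame):
    """Root-mean-square energy of one frame of audio samples."""
    return sqrt(sum(s * s for s in frame) / len(frame)) if frame else 0.0


def run_turn(frames, energy_threshold=0.02, max_silent_frames=3):
    """Forward frames until the talker appears to have stopped.

    Hypothetical rule: a frame counts as silent when its RMS energy is
    below energy_threshold; after max_silent_frames consecutive silent
    frames the device switches from transmit mode to receive mode.
    Returns (final mode, number of frames forwarded).
    """
    mode = Mode.TRANSMIT
    silent = 0
    sent = 0
    for frame in frames:
        sent += 1  # the frame is forwarded while in transmit mode
        if rms(frame) < energy_threshold:
            silent += 1
        else:
            silent = 0  # speech resumed, so restart the silence timer
        if silent >= max_silent_frames:
            mode = Mode.RECEIVE
            break
    return mode, sent


speech = [0.5, -0.5]
quiet = [0.0, 0.0]
mode, sent = run_turn([speech] * 4 + [quiet] * 3 + [speech])
print(mode.value, sent)
```

A short pause that is shorter than `max_silent_frames` resets the counter, so the device stays in transmit mode. The other analyses the abstract mentions (a change in speech context, or noise cancellation) are not modeled here.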
-
Publication number: US10963216B1
Publication date: 2021-03-30
Application number: US16356968
Application date: 2019-03-18
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson , Rohan Mutagi
Abstract: Techniques for joining a device of a third user to a communication between a device of a first user and a device of a second user are described herein. For instance, two or more users may utilize respective computing devices to engage in a telephone call, a video call, an instant-messaging session, or any other type of communication in which the users communicate with each other audibly and/or visually. In some instances, a first user of the two users may issue a voice command requesting to join a device of a third user to the communication. One or more computing devices may recognize this voice command and may attempt to join a device of a third user to the communication.
-
Publication number: US12260153B1
Publication date: 2025-03-25
Application number: US18387377
Application date: 2023-11-06
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson , Rohan Mutagi
Abstract: Techniques for joining a device of a third user to a communication between a device of a first user and a device of a second user are described herein. For instance, two or more users may utilize respective computing devices to engage in a telephone call, a video call, an instant-messaging session, or any other type of communication in which the users communicate with each other audibly and/or visually. In some instances, a first user of the two users may issue a voice command requesting to join a device of a third user to the communication. One or more computing devices may recognize this voice command and may attempt to join a device of a third user to the communication.
-
Publication number: US20240312454A1
Publication date: 2024-09-19
Application number: US18674519
Application date: 2024-05-24
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson , Rohan Mutagi
IPC: G10L15/20 , G10L15/22 , G10L15/26 , G10L17/00 , G10L21/0208 , G10L21/0272 , G10L25/78 , G10L25/84
CPC classification number: G10L15/20 , G10L15/22 , G10L15/26 , G10L17/00 , G10L21/0208 , G10L21/0272 , G10L25/84 , G10L2015/223 , G10L2021/02087 , G10L2025/783
Abstract: A system has multiple audio-enabled devices that communicate with one another over an open microphone mode of communication. When a user says a trigger word, the nearest device validates the trigger word and opens a communication channel with another device. As the user talks, the device receives the speech and generates an audio signal representation that includes the user speech and may additionally include other background or interfering sound from the environment. The device transmits the audio signal to the other device as part of a conversation, while continually analyzing the audio signal to detect when the user stops talking. This analysis may include watching for a lack of speech in the audio signal for a period of time, or an abrupt change in context of the speech (indicating the speech is from another source), or canceling noise or other interfering sound to isolate whether the user is still speaking. Once the device confirms that the user has stopped talking, the device transitions from a transmission mode to a reception mode to await a reply in the conversation.
-
Publication number: US11609740B1
Publication date: 2023-03-21
Application number: US17214639
Application date: 2021-03-26
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson , Rohan Mutagi
Abstract: Techniques for joining a device of a third user to a communication between a device of a first user and a device of a second user are described herein. For instance, two or more users may utilize respective computing devices to engage in a telephone call, a video call, an instant-messaging session, or any other type of communication in which the users communicate with each other audibly and/or visually. In some instances, a first user of the two users may issue a voice command requesting to join a device of a third user to the communication. One or more computing devices may recognize this voice command and may attempt to join a device of a third user to the communication.
-
Publication number: US09959771B1
Publication date: 2018-05-01
Application number: US14975547
Application date: 2015-12-18
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson
CPC classification number: G08G5/0039 , B64C39/024 , B64C2201/128 , B64D47/08 , G06Q10/083 , G08G5/0034 , G08G5/0091
Abstract: Weather data is used to create and/or update a flight plan prior to and/or during flight by an unmanned aerial vehicle (UAV). The weather data may be received using sensors onboard the UAV and/or the weather data may be received from other sources, such as weather aggregators, other UAVs, other vehicles, and/or local weather stations. In some embodiments, a UAV may be prematurely grounded after initiating flight toward a destination in response to some weather conditions identified in near real-time weather data, such as heavy winds and/or heavy precipitation. In various embodiments, the UAVs may leverage air stream information included in the weather data to cause flight along with an air stream, and thereby reduce power resources used to fly to a destination.
-
Publication number: US20170169811A1
Publication date: 2017-06-15
Application number: US14963912
Application date: 2015-12-09
Applicant: Amazon Technologies, Inc.
Inventor: Mahesh Babu Sabbavarapu , Ty Loren Carlson , Vijayabaskar Gangadaran
CPC classification number: G10L13/08 , G06F3/165 , G06F15/0291 , G09B5/06 , G09B21/006 , G10L13/04 , G10L15/22
Abstract: A system and method for performing text-to-speech (TTS) processing of textual works, such as literary works. The system and method process text of these works and determine offsets corresponding to one or more of chapters, paragraphs, sentences, words, sections of dialogue, and sections of other context. Using these offsets, the system and method determine which portion and how much of a work to process using TTS processing at a time to produce a high quality audio output. This audio output may then be sent to a user device to allow the user device to play the audio output of the TTS processing.
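The offset-based selection described in the abstract above can be sketched as follows. This is an illustrative assumption, not the method claimed in the application: it handles sentence offsets only, detects sentences with a simple punctuation regex, and uses a hypothetical `max_chars` budget to decide how much text goes to the TTS engine at a time. The function names `sentence_offsets` and `chunk_for_tts` are invented for this example.

```python
import re


def sentence_offsets(text):
    """Return (start, end) character offsets for each sentence in text.

    Assumption: a sentence ends at '.', '!' or '?', or at end of text.
    """
    return [m.span() for m in re.finditer(r"\S[^.!?]*(?:[.!?]+|$)", text)]


def chunk_for_tts(text, max_chars=60):
    """Group whole sentences into chunks of at most max_chars characters.

    A single sentence longer than max_chars becomes its own chunk, so a
    chunk never starts or ends in the middle of a sentence.
    """
    chunks = []
    start = end = None
    for s, e in sentence_offsets(text):
        if start is None:
            start, end = s, e
        elif e - start <= max_chars:
            end = e  # the sentence still fits in the current chunk
        else:
            chunks.append((start, end))
            start, end = s, e
    if start is not None:
        chunks.append((start, end))
    return chunks


text = "One. Two. Three is longer. Four."
for s, e in chunk_for_tts(text, max_chars=12):
    print(repr(text[s:e]))
```

Cutting only at sentence boundaries is one plausible way to keep synthesized prosody natural. The abstract also lists chapter, paragraph, word, and dialogue offsets, which this sketch does not compute.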
-
Publication number: US11170766B1
Publication date: 2021-11-09
Application number: US16284085
Application date: 2019-02-25
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson , Rohan Mutagi
IPC: G10L15/00 , G10L15/20 , G10L15/22 , G10L15/26 , G10L17/00 , G10L21/0272 , G10L21/0208 , G10L25/84 , G10L25/78
Abstract: A system has multiple audio-enabled devices that communicate with one another over an open microphone mode of communication. When a user says a trigger word, the nearest device validates the trigger word and opens a communication channel with another device. As the user talks, the device receives the speech and generates an audio signal representation that includes the user speech and may additionally include other background or interfering sound from the environment. The device transmits the audio signal to the other device as part of a conversation, while continually analyzing the audio signal to detect when the user stops talking. This analysis may include watching for a lack of speech in the audio signal for a period of time, or an abrupt change in context of the speech (indicating the speech is from another source), or canceling noise or other interfering sound to isolate whether the user is still speaking. Once the device confirms that the user has stopped talking, the device transitions from a transmission mode to a reception mode to await a reply in the conversation.
-
Publication number: US10417345B1
Publication date: 2019-09-17
Application number: US14579214
Application date: 2014-12-22
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson , Hsuan-Cheng Lai
IPC: G06F17/28 , G06F17/27 , G06F16/638 , G06F16/9535
Abstract: A system in which a customer service agent (CSA) is able to assist a customer with obtaining a desired response from a speech-controlled appliance while protecting customer data. The customer service agent submits queries to a natural language understanding (NLU) processor that performs entity resolution using personalized library information stored in an entity library based on the customer identity information and/or a device identifier. The CSA is shielded from the entity library itself, as well as data stored on the speech-controlled appliance. The CSA can instruct the NLU processor to deliver results to multiple endpoints, including both the customer's appliance and the CSA's console.
-
Publication number: US10212066B1
Publication date: 2019-02-19
Application number: US15395393
Application date: 2016-12-30
Applicant: Amazon Technologies, Inc.
Inventor: Ty Loren Carlson
Abstract: A speech-based system is configured to use its audio-based user interface to present various types of device status information such as wireless signal strengths, communication parameters, battery levels, and so forth. In described embodiments, the system is configured to understand spoken user requests for device and system status. For example, the user may speak a request to obtain the current wireless signal strength of the speech-based system. The speech-based system may respond by determining the signal strength and by playing speech or other sound informing the user of the signal strength. Furthermore, the system may monitor operational parameters to detect conditions that may degrade the user experience, and may report such conditions using generated speech or other sounds.