-
公开(公告)号:US12100396B2
公开(公告)日:2024-09-24
申请号:US17583672
申请日:2022-01-25
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
IPC分类号: G10L15/22 , G06F3/16 , G06F40/35 , G10L15/30 , H04L51/10 , H04L51/224 , H04L67/306 , G06V40/10 , G10L13/00 , G10L15/08
CPC分类号: G10L15/22 , G06F3/167 , G06F40/35 , G10L15/30 , H04L51/10 , H04L51/224 , H04L67/306 , G06V40/10 , G10L13/00 , G10L15/08 , G10L2015/088 , G10L2015/223
摘要: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
-
公开(公告)号:US11875820B1
公开(公告)日:2024-01-16
申请号:US17484300
申请日:2021-09-24
CPC分类号: G10L25/87 , G10L15/193 , G10L15/20 , G06F40/30 , G09B19/04 , G10L15/063
摘要: This disclosure describes, in part, context-driven device arbitration techniques to select a speech interface device from multiple speech interface devices to provide a response to a command included in a speech utterance of a user. In some examples, the context-driven arbitration techniques may include executing multiple pipeline instances to analyze audio signals and device metadata received from each of the multiple speech interface devices which detected the speech utterance. A remote speech processing service may execute the multiple pipeline instances and analyze the audio signals and/or metadata, at various stages of the pipeline instances, to determine which speech interface device is to respond to the speech utterance.
-
公开(公告)号:US11172001B1
公开(公告)日:2021-11-09
申请号:US16365476
申请日:2019-03-26
发明人: Vaidyanathan Puthucode Krishnamoorthy , Tony Roy Hardie , Rohit Lohani , Roopali Vasant Kaujalgi
摘要: Techniques for announcing a communications session after the communications session is established between multiple user devices are described. In an example, a computer system may instruct a first user device to establish a communications session with a second user device. The computer system may receive, from the second user device, data indicating a request of the first user device for the communications session. Based at least in part on the data, the computer system may generate content associated with the first user device. The computer system may also instruct the second user device to accept the request and present the content after the communications session is established between the first user device and the second user device.
-
公开(公告)号:US20240087567A1
公开(公告)日:2024-03-14
申请号:US18512606
申请日:2023-11-17
CPC分类号: G10L15/22 , G06F40/30 , G10L15/1815 , G10L2015/088
摘要: A system and method establishes a communication connection between a first device of a first user and a second device of a second user. Request data corresponding to a request to establish a communication connection with a second user is received, and a user profile associated with the second user is determined. One or more sensors of the second device receive input data corresponding to the environment of the second device, and an identity of the second user is determined based thereon. The communication connection is established and, based on the identity, the second device tracks movement of the second user in the environment.
-
公开(公告)号:US10453449B2
公开(公告)日:2019-10-22
申请号:US15254458
申请日:2016-09-01
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
摘要: Systems, methods, and devices for outputting visual indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to a voice message to a second speech-controlled device. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server performs speech processing on the audio data to determine a recipient and message content. The server then determines a second speech-controlled device associated with the recipient and sends the message content to the recipient's second speech-controlled device. Thereafter, the server receives an indication from the recipient's speech-controlled device that the second device is detecting speech, presumably in response to the original message. The server then causes a visual indication to be output by the first speech-controlled device, with the visual indication representing the recipient-speech controlled device is detecting speech.
-
公开(公告)号:US10276185B1
公开(公告)日:2019-04-30
申请号:US15677659
申请日:2017-08-15
摘要: A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.
-
公开(公告)号:US10074369B2
公开(公告)日:2018-09-11
申请号:US15254359
申请日:2016-09-01
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
CPC分类号: G10L15/22 , G06F17/2765 , G06F17/278 , G10L13/00 , G10L15/1822 , G10L17/22 , G10L17/24 , G10L2015/225 , G10L2015/227 , H04L67/306
摘要: Systems, methods, and devices for escalating voice-based interactions via speech-controlled devices are described. Speech-controlled devices capture audio, including wakeword portions and payload portions, for sending to a server to relay messages between speech-controlled devices. In response to determining the occurrence of an escalation event, such as repeated messages between the same two devices, the system may automatically change a mode of a speech-controlled device, such as no longer requiring a wakeword, no longer requiring an indication of a desired recipient, or automatically connecting two speech-controlled devices in a voice-chat mode. In response to determining the occurrence of further escalation events, the system may initiate a real-time call between the speech-controlled devices.
-
公开(公告)号:US20180061403A1
公开(公告)日:2018-03-01
申请号:US15254458
申请日:2016-09-01
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
摘要: Systems, methods, and devices for outputting visual indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to a voice message to a second speech-controlled device. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server performs speech processing on the audio data to determine a recipient and message content. The server then determines a second speech-controlled device associated with the recipient and sends the message content to the recipient's second speech-controlled device. Thereafter, the server receives an indication from the recipient's speech-controlled device that the second device is detecting speech, presumably in response to the original message. The server then causes a visual indication to be output by the first speech-controlled device, with the visual indication representing the recipient-speech controlled device is detecting speech.
-
公开(公告)号:US20220165268A1
公开(公告)日:2022-05-26
申请号:US17583672
申请日:2022-01-25
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
IPC分类号: G10L15/22 , H04L51/224 , H04L51/10 , G10L15/30 , H04L67/306 , G06F3/16 , G06F40/35
摘要: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
-
公开(公告)号:US11145301B1
公开(公告)日:2021-10-12
申请号:US16141172
申请日:2018-09-25
摘要: A system and method establishes a communication connection between a first device of a first user and a second device of a second user. Request data corresponding to a request to establish a communication connection with a second user is received, and a user profile associated with the second user is determined. One or more sensors of the second device receive input data corresponding to the environment of the second device, and an identity of the second user is determined based thereon. The communication connection is established and, based on the identity, the second device tracks movement of the second user in the environment.
-
-
-
-
-
-
-
-
-