-
公开(公告)号:US12100396B2
公开(公告)日:2024-09-24
申请号:US17583672
申请日:2022-01-25
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
IPC分类号: G10L15/22 , G06F3/16 , G06F40/35 , G10L15/30 , H04L51/10 , H04L51/224 , H04L67/306 , G06V40/10 , G10L13/00 , G10L15/08
CPC分类号: G10L15/22 , G06F3/167 , G06F40/35 , G10L15/30 , H04L51/10 , H04L51/224 , H04L67/306 , G06V40/10 , G10L13/00 , G10L15/08 , G10L2015/088 , G10L2015/223
摘要: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
-
公开(公告)号:US20180061402A1
公开(公告)日:2018-03-01
申请号:US15254359
申请日:2016-09-01
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
CPC分类号: G10L15/22 , G06F17/2765 , G06F17/278 , G10L13/00 , G10L15/1822 , G10L17/22 , G10L17/24 , G10L2015/225 , G10L2015/227 , H04L67/306
摘要: Systems, methods, and devices for escalating voice-based interactions via speech-controlled devices are described. Speech-controlled devices capture audio, including wakeword portions and payload portions, for sending to a server to relay messages between speech-controlled devices. In response to determining the occurrence of an escalation event, such as repeated messages between the same two devices, the system may automatically change a mode of a speech-controlled device, such as no longer requiring a wakeword, no longer requiring an indication of a desired recipient, or automatically connecting two speech-controlled devices in a voice-chat mode. In response to determining the occurrence of further escalation events, the system may initiate a real-time call between the speech-controlled devices.
-
公开(公告)号:US11264030B2
公开(公告)日:2022-03-01
申请号:US16732943
申请日:2020-01-02
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
IPC分类号: G10L21/00 , G10L15/22 , H04L51/224 , H04L51/10 , G10L15/30 , H04L67/306 , G06F3/16 , G06F40/35 , G10L15/08 , G10L13/00 , G06K9/00
摘要: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
-
公开(公告)号:US10453449B2
公开(公告)日:2019-10-22
申请号:US15254458
申请日:2016-09-01
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
摘要: Systems, methods, and devices for outputting visual indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to a voice message to a second speech-controlled device. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server performs speech processing on the audio data to determine a recipient and message content. The server then determines a second speech-controlled device associated with the recipient and sends the message content to the recipient's second speech-controlled device. Thereafter, the server receives an indication from the recipient's speech-controlled device that the second device is detecting speech, presumably in response to the original message. The server then causes a visual indication to be output by the first speech-controlled device, with the visual indication representing the recipient-speech controlled device is detecting speech.
-
公开(公告)号:US10074369B2
公开(公告)日:2018-09-11
申请号:US15254359
申请日:2016-09-01
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
CPC分类号: G10L15/22 , G06F17/2765 , G06F17/278 , G10L13/00 , G10L15/1822 , G10L17/22 , G10L17/24 , G10L2015/225 , G10L2015/227 , H04L67/306
摘要: Systems, methods, and devices for escalating voice-based interactions via speech-controlled devices are described. Speech-controlled devices capture audio, including wakeword portions and payload portions, for sending to a server to relay messages between speech-controlled devices. In response to determining the occurrence of an escalation event, such as repeated messages between the same two devices, the system may automatically change a mode of a speech-controlled device, such as no longer requiring a wakeword, no longer requiring an indication of a desired recipient, or automatically connecting two speech-controlled devices in a voice-chat mode. In response to determining the occurrence of further escalation events, the system may initiate a real-time call between the speech-controlled devices.
-
公开(公告)号:US20180061403A1
公开(公告)日:2018-03-01
申请号:US15254458
申请日:2016-09-01
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
摘要: Systems, methods, and devices for outputting visual indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to a voice message to a second speech-controlled device. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server performs speech processing on the audio data to determine a recipient and message content. The server then determines a second speech-controlled device associated with the recipient and sends the message content to the recipient's second speech-controlled device. Thereafter, the server receives an indication from the recipient's speech-controlled device that the second device is detecting speech, presumably in response to the original message. The server then causes a visual indication to be output by the first speech-controlled device, with the visual indication representing the recipient-speech controlled device is detecting speech.
-
公开(公告)号:US11574633B1
公开(公告)日:2023-02-07
申请号:US16586457
申请日:2019-09-27
发明人: Sandra Lemon , Nancy Yi Liang
IPC分类号: G10L15/22 , G06F3/04817 , G06F3/0482 , G06F3/0488 , G10L25/81 , G10L25/90 , H04N7/18 , G10L15/26 , G06V40/20 , G06F40/47 , G06F40/40 , G06F40/58
摘要: Enhanced graphical user interfaces for transcription of audio and video messages is disclosed. Audio data may be transcribed, and the transcription may include emphasized words and/or punctuation corresponding to emphasis of user speech. Additionally, the transcription may be translated into a second language. A message spoken by a user depicted in one or more images of video data may also be transcribed and provided to one or more devices.
-
公开(公告)号:US20220165268A1
公开(公告)日:2022-05-26
申请号:US17583672
申请日:2022-01-25
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
IPC分类号: G10L15/22 , H04L51/224 , H04L51/10 , G10L15/30 , H04L67/306 , G06F3/16 , G06F40/35
摘要: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
-
公开(公告)号:US10580404B2
公开(公告)日:2020-03-03
申请号:US15254600
申请日:2016-09-01
发明人: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
IPC分类号: G10L21/00 , G10L15/22 , H04L12/58 , G10L15/30 , H04L29/08 , G06F17/27 , G06F3/16 , G10L15/08 , G10L13/00 , G06K9/00
摘要: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.
-
公开(公告)号:US10431216B1
公开(公告)日:2019-10-01
申请号:US15394433
申请日:2016-12-29
发明人: Sandra Lemon , Nancy Yi Liang
IPC分类号: G10L15/22 , G10L15/26 , G06F17/28 , G06F3/0481 , G06F3/0482 , G06F3/0488 , G10L25/81 , G10L25/90 , G06F17/21 , H04N7/18 , G06K9/00
摘要: Enhanced graphical user interfaces for transcription of audio and video messages is disclosed. Audio data may be transcribed, and the transcription may include emphasized words and/or punctuation corresponding to emphasis of user speech. Additionally, the transcription may be translated into a second language. A message spoken by a user depicted in one or more images of video data may also be transcribed and provided to one or more devices.
-
-
-
-
-
-
-
-
-