Patent search ap:("GOOGLE LLC") AND inv:"Joseph Smarr"

1.

Granted invention patent
Enabling natural conversations with soft endpointing for an automated assistant (in force)

Publication number: US12020703B2

Publication date: 2024-06-25

Application number: US17532819

Filing date: 2021-11-22

Applicant: GOOGLE LLC

Inventor: Jaclyn Konzelmann , Trevor Strohman , Jonathan Bloom , Johan Schalkwyk , Joseph Smarr

IPC: G10L15/22 , G06N20/00 , G08B5/36 , G10L15/08 , G10L15/18

CPC classification number: G10L15/22 , G06N20/00 , G08B5/36 , G10L15/18 , G10L2015/088 , G10L2015/223

Abstract: As part of a dialog session between a user and an automated assistant, implementations can process, using a streaming ASR model, a stream of audio data that captures a portion of a spoken utterance to generate ASR output, process, using an NLU model, the ASR output to generate NLU output, and cause, based on the NLU output, a stream of fulfillment data to be generated. Further, implementations can determine, based on processing the stream of audio data, audio-based characteristics associated with the portion of the spoken utterance captured in the stream of audio data. Based on the audio-based characteristics and/or the stream of NLU output, implementations can determine whether the user has paused in providing the spoken utterance or has completed providing the spoken utterance. If the user has paused, implementations can cause natural conversation output to be provided for presentation to the user.
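
The pause-versus-completion decision described in this abstract can be illustrated with a minimal sketch. This is not the patented method: the abstract does not list which audio-based characteristics or NLU signals are used, so the feature names (`trailing_silence_s`, `ended_on_filler`, `missing_slots`), the silence threshold, and the simple rule-based logic below are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class AudioCharacteristics:
    # Hypothetical features; the abstract does not enumerate them.
    trailing_silence_s: float      # silence since the last detected speech
    ended_on_filler: bool = False  # e.g. the user trailed off with "uhm"


@dataclass
class NLUOutput:
    intent: str
    # Hypothetical: required parameters the utterance has not yet supplied.
    missing_slots: List[str] = field(default_factory=list)


def classify_endpoint(audio: AudioCharacteristics,
                      nlu: NLUOutput,
                      min_silence_s: float = 0.5) -> str:
    """Return "speaking", "paused", or "completed" (assumed labels)."""
    if audio.trailing_silence_s < min_silence_s:
        return "speaking"
    # Silence plus an unfinished request is treated as a pause (soft endpoint).
    if audio.ended_on_filler or nlu.missing_slots:
        return "paused"
    return "completed"


def natural_conversation_output(state: str) -> str:
    """Return a short back-channel prompt only while the user is paused."""
    return "Mmhmm?" if state == "paused" else ""
```

For example, a request such as "call ... uhm" followed by one second of silence would be classified as `"paused"` under these assumptions, so the assistant would offer a brief prompt instead of acting on an incomplete request.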

2.

Published invention application
ENABLING NATURAL CONVERSATIONS WITH SOFT ENDPOINTING FOR AN AUTOMATED ASSISTANT (under examination, published)

Publication number: US20240312460A1

Publication date: 2024-09-19

Application number: US18674479

Filing date: 2024-05-24

Applicant: GOOGLE LLC

Inventor: Jaclyn Konzelmann , Trevor Strohman , Jonathan Bloom , Johan Schalkwyk , Joseph Smarr

IPC: G10L15/22 , G06N20/00 , G08B5/36 , G10L15/08 , G10L15/18

CPC classification number: G10L15/22 , G06N20/00 , G08B5/36 , G10L15/18 , G10L2015/088 , G10L2015/223

Abstract: As part of a dialog session between a user and an automated assistant, implementations can process, using a streaming ASR model, a stream of audio data that captures a portion of a spoken utterance to generate ASR output, process, using an NLU model, the ASR output to generate NLU output, and cause, based on the NLU output, a stream of fulfillment data to be generated. Further, implementations can determine, based on processing the stream of audio data, audio-based characteristics associated with the portion of the spoken utterance captured in the stream of audio data. Based on the audio-based characteristics and/or the stream of NLU output, implementations can determine whether the user has paused in providing the spoken utterance or has completed providing the spoken utterance. If the user has paused, implementations can cause natural conversation output to be provided for presentation to the user.

3.

Invention application
ENABLING NATURAL CONVERSATIONS WITH SOFT ENDPOINTING FOR AN AUTOMATED ASSISTANT (in force)

Publication number: US20230053341A1

Publication date: 2023-02-23

Application number: US17532819

Filing date: 2021-11-22

Applicant: GOOGLE LLC

Inventor: Jaclyn Konzelmann , Trevor Strohman , Jonathan Bloom , Johan Schalkwyk , Joseph Smarr

IPC: G10L15/22 , G10L15/18 , G08B5/36 , G06N20/00

Abstract: As part of a dialog session between a user and an automated assistant, implementations can process, using a streaming ASR model, a stream of audio data that captures a portion of a spoken utterance to generate ASR output, process, using an NLU model, the ASR output to generate NLU output, and cause, based on the NLU output, a stream of fulfillment data to be generated. Further, implementations can determine, based on processing the stream of audio data, audio-based characteristics associated with the portion of the spoken utterance captured in the stream of audio data. Based on the audio-based characteristics and/or the stream of NLU output, implementations can determine whether the user has paused in providing the spoken utterance or has completed providing the spoken utterance. If the user has paused, implementations can cause natural conversation output to be provided for presentation to the user.

4.

Invention application
ENABLING NATURAL CONVERSATIONS FOR AN AUTOMATED ASSISTANT (in force)

Publication number: US20220366905A1

Publication date: 2022-11-17

Application number: US17537122

Filing date: 2021-11-29

Applicant: GOOGLE LLC

Inventor: Joseph Smarr , David Eisenberg , Hugo Santos , David Elson

IPC: G10L15/22 , G10L15/18 , G10L15/30 , G10L15/32 , G10L13/02 , G10L25/78

Abstract: As part of a dialog session between a user and an automated assistant, implementations can process, using a streaming ASR model, a stream of audio data to generate ASR output, process, using an NLU model, the ASR output to generate NLU output, and generate, based on the NLU output, a stream of fulfillment data. Further, implementations can determine, based on processing the stream of audio data, audio-based characteristics associated with spoken utterance(s) captured in the stream of audio data. Based on a current state of the stream of NLU output, the stream of fulfillment data, and the audio-based characteristics, implementations can determine whether a next interaction state to be implemented is: (i) causing fulfillment output to be implemented; (ii) causing natural conversation output to be audibly rendered; or (iii) refraining from causing any interaction to be implemented, and can cause the next interaction state to be implemented.
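
The three-way choice of next interaction state in this abstract can be sketched as a small decision function. The abstract does not say how the decision is computed, so the sketch substitutes plain threshold rules. The snapshot fields (`nlu_confidence`, `fulfillment_ready`, `silence_s`) and both thresholds are hypothetical names and values chosen for illustration, not details taken from the application.

```python
from dataclasses import dataclass
from enum import Enum, auto


class NextState(Enum):
    FULFILL = auto()               # (i) cause fulfillment output to be implemented
    NATURAL_CONVERSATION = auto()  # (ii) audibly render natural conversation output
    REFRAIN = auto()               # (iii) refrain from any interaction


@dataclass
class DialogSnapshot:
    # Hypothetical summary of the current state of each stream.
    nlu_confidence: float    # from the stream of NLU output
    fulfillment_ready: bool  # from the stream of fulfillment data
    silence_s: float         # an audio-based characteristic


def next_interaction_state(s: DialogSnapshot,
                           min_silence_s: float = 0.7,
                           min_confidence: float = 0.8) -> NextState:
    """Pick the next interaction state using assumed threshold rules."""
    if s.silence_s < min_silence_s:
        # The user appears to still be speaking, so do not interrupt.
        return NextState.REFRAIN
    if s.fulfillment_ready and s.nlu_confidence >= min_confidence:
        return NextState.FULFILL
    # Silence without a confident, ready fulfillment: prompt the user instead.
    return NextState.NATURAL_CONVERSATION
```

Calling `next_interaction_state` each time new audio arrives lets the chosen state change as the three streams are updated. A learned classifier could replace the threshold rules without changing the interface.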
