Patent search ap:("Google LLC") AND inv:"Michiel Bacchiani" Page 1

1.

发明授权
Query endpointing based on lip detection 有权

公开(公告)号：US11308963B2

公开(公告)日：2022-04-19

申请号：US16936948

申请日：2020-07-23

Applicant: Google LLC

Inventor： Chanwoo Kim , Rajeev Nongpiur , Michiel Bacchiani

IPC: H04N19/176 , G06F3/16 , G10L15/22 , G10L15/04 , G10L25/78 , G10L15/25 , G10L15/26 , G06K9/00

Abstract: Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.

Patent Agency Ranking