Invention Grant
- Patent Title: Query endpointing based on lip detection
-
Application No.: US16412677Application Date: 2019-05-15
-
Publication No.: US10755714B2Publication Date: 2020-08-25
- Inventor: Chanwoo Kim , Rajeev Conrad Nongpiur , Michiel A. U. Bacchiani
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: GOOGLE LLC
- Current Assignee: GOOGLE LLC
- Current Assignee Address: US CA Mountain View
- Agency: Middleton Reutlinger
- Main IPC: G10L15/22
- IPC: G10L15/22 ; G06F40/30 ; G10L15/04 ; G10L25/78 ; G10L15/25 ; G06K9/00 ; G10L15/26

Abstract:
Systems and methods are described for improving endpoint detection of a voice query submitted by a user. In some implementations, a synchronized video data and audio data is received. A sequence of frames of the video data that includes images corresponding to lip movement on a face is determined. The audio data is endpointed based on first audio data that corresponds to a first frame of the sequence of frames and second audio data that corresponds to a last frame of the sequence of frames. A transcription of the endpointed audio data is generated by an automated speech recognizer. The generated transcription is then provided for output.
Public/Granted literature
- US20190333507A1 QUERY ENDPOINTING BASED ON LIP DETECTION Public/Granted day:2019-10-31
Information query