Invention Application
- Patent Title: SYSTEM AND METHOD FOR DISAMBIGUATING A SOURCE OF SOUND BASED ON DETECTED LIP MOVEMENT
-
Application No.: PCT/US2019/018212Application Date: 2019-02-15
-
Publication No.: WO2019161196A3Publication Date: 2019-08-22
- Inventor: SHUKLA, Nishant , DHARNE, Ashwin
- Applicant: DMAI, INC.
- Applicant Address: 10940 Wilshire Blvd, Suite 1100 Los Angeles, California 90024 US
- Assignee: DMAI, INC.
- Current Assignee: DMAI, INC.
- Current Assignee Address: 10940 Wilshire Blvd, Suite 1100 Los Angeles, California 90024 US
- Agency: GADKAR, Arush
- Priority: US62/630,988 20180215
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G10L15/00 ; G10L15/04 ; G10L17/00 ; G10L21/00
Abstract:
The present teaching relates to method, system, medium, and implementations for detecting a source of speech sound in a dialogue. A visual signal acquired from a dialogue scene is first received, where the visual signal captures a person present in the dialogue scene. A human lip associated with the person is detected from the visual signal and tracked to detect whether lip movement is observed. If lip movement is detected, a first candidate source of sound is generated corresponding to an area in the dialogue scene where the lip movement occurred.
Public/Granted literature
- WO2019161196A2 SYSTEM AND METHOD FOR DISAMBIGUATING A SOURCE OF SOUND BASED ON DETECTED LIP MOVEMENT Public/Granted day:2019-08-22
Information query