THREE-DIMENSIONAL FACE ANIMATION FROM SPEECH

    Publication number: US20230419579A1

    Publication date: 2023-12-28

    Application number: US18462310

    Application date: 2023-09-06

    Abstract: A method for training a three-dimensional face animation model from speech is provided. The method includes determining a first correlation value for a facial feature based on an audio waveform from a first subject, generating a first mesh for a lower portion of a human face based on the facial feature and the first correlation value, updating the first correlation value when a difference between the first mesh and a ground truth image of the first subject is greater than a pre-selected threshold, and providing a three-dimensional model of the human face animated by speech to an immersive reality application accessed by a client device, based on the difference between the first mesh and the ground truth image of the first subject. A non-transitory, computer-readable medium storing instructions to cause a system to perform the above method, and the system, are also provided.

    APPARATUS AND METHOD FOR GENERATING LIP SYNC IMAGE

    Publication number: US20230178095A1

    Publication date: 2023-06-08

    Application number: US17764324

    Application date: 2021-06-03

    CPC classification number: G10L21/10 G06T13/80 G06T13/40 G10L2021/105

    Abstract: An apparatus for generating a lip sync image according to a disclosed embodiment has one or more processors and a memory which stores one or more programs executed by the one or more processors. The apparatus includes a first artificial neural network model configured to generate an utterance synthesis image by using a person background image and an utterance audio signal corresponding to the person background image as an input, and generate a silence synthesis image by using only the person background image as an input, and a second artificial neural network model configured to output, from a preset utterance maintenance image and the first artificial neural network model, classification values for the preset utterance maintenance image and the silence synthesis image by using the silence synthesis image as an input.

    Voice band detection and implementation

    Publication number: US10062394B2

    Publication date: 2018-08-28

    Application number: US14674493

    Application date: 2015-03-31

    Inventor: Lee Zamir

    CPC classification number: G10L21/10 G10L25/78 G10L2021/105

    Abstract: A system encourages experimentation with audio frequency and speaker technologies while causing an inanimate object to appear to lip-sync. The system applies a bandpass filter to an incoming audio stream to determine a magnitude of audio content in a frequency band of interest. For example, the system may filter results directed at the voice band, associated with speech. A controller controls a strobe light to flash at a particular point of travel of a platform reciprocating at a known frequency. An illusion is created that a sculpture, such as a piece of paper formed into a ring, is lip-synching to music.
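The band-magnitude step in the abstract above can be illustrated with a minimal sketch. It is not the patented implementation: in place of a time-domain bandpass filter it measures in-band energy with a plain discrete Fourier transform, and the function name `band_magnitude` and the 300 to 3400 Hz default limits (a common telephony convention for the voice band) are assumptions that do not appear in the abstract.

```python
import cmath
import math


def band_magnitude(samples, sample_rate, low_hz=300.0, high_hz=3400.0):
    """Return the RMS magnitude of the part of `samples` inside [low_hz, high_hz].

    Hypothetical helper: the band limits are assumed, not taken from the patent.
    """
    n = len(samples)
    total = 0.0
    # Visit only positive-frequency bins; the factor 2 below accounts for
    # the mirrored negative-frequency bins of a real-valued signal.
    for k in range(1, n // 2):
        freq = k * sample_rate / n
        if low_hz <= freq <= high_hz:
            coeff = sum(
                x * cmath.exp(-2j * math.pi * k * i / n)
                for i, x in enumerate(samples)
            )
            total += abs(coeff) ** 2
    # Parseval's relation turns summed bin energy into an RMS amplitude.
    return math.sqrt(2.0 * total) / n


if __name__ == "__main__":
    sr, n = 8000, 400
    tone = [math.sin(2 * math.pi * 1000 * i / sr) for i in range(n)]
    print(band_magnitude(tone, sr))
```

A controller could compare this value against a threshold of its choosing; how the patented system actually uses the magnitude is not specified in the abstract.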

    VOICE RECEIVING METHOD AND DEVICE
    Invention application

    Publication number: US20170345437A1

    Publication date: 2017-11-30

    Application number: US15607419

    Application date: 2017-05-26

    Inventor: YU ZHANG

    Abstract: A voice receiving device configured for accurate listening includes a microphone array, a camera, a capturing module, a determining module, a time module, a calculating module, and a de-noising module. The microphone array captures a first voice signal and a second voice signal, and the camera captures mouth pictures of a user. The determining module determines whether the first voice signal is synchronized with the mouth pictures and, if so, compares the first voice signal to a model preset voice signal of a user to determine a target voice signal. The time module obtains the time delay difference of a single voice reaching different microphones. The calculating module calculates the position of the sound source of the target voice signal. According to the position of the sound source, the de-noising module de-noises by reference to the second voice signal. The disclosure further provides a voice receiving method.
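The time-delay step described in the abstract above is commonly estimated by cross-correlating the signals from two microphones. The following is a minimal sketch of that general technique, not the method claimed in the application; the function names `estimate_delay` and `delay_seconds` and the `max_lag` search window are assumptions introduced for illustration.

```python
def estimate_delay(sig_a, sig_b, max_lag):
    """Return the lag in samples by which sig_b trails sig_a.

    Hypothetical helper: picks the lag that maximizes the raw
    cross-correlation within [-max_lag, max_lag].
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, a in enumerate(sig_a):
            j = i + lag
            if 0 <= j < len(sig_b):
                score += a * sig_b[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def delay_seconds(lag_samples, sample_rate):
    """Convert a lag in samples to seconds."""
    return lag_samples / sample_rate


if __name__ == "__main__":
    mic_a = [0.0] * 10 + [1.0, 2.0, 3.0, 2.0, 1.0] + [0.0] * 20
    mic_b = [0.0] * 4 + mic_a[:-4]  # same pulse arriving 4 samples later
    lag = estimate_delay(mic_a, mic_b, max_lag=8)
    print(lag, delay_seconds(lag, 16000))
```

With delays from several microphone pairs and known microphone spacing, a source position can be triangulated; the abstract does not say which localization formula the calculating module uses.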
