-
公开(公告)号:US20190130628A1
公开(公告)日:2019-05-02
申请号:US15858992
申请日:2017-12-29
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US12182919B2
公开(公告)日:2024-12-31
申请号:US18064140
申请日:2022-12-09
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US20230161419A1
公开(公告)日:2023-05-25
申请号:US18151857
申请日:2023-01-09
Applicant: Snap Inc.
Inventor: Yuncheng Li , Jonathan M. Rodriguez, II , Zehao Xue , Yingying Wang
IPC: G06F3/01 , G06T7/73 , G06T7/20 , H04N13/204 , G06V40/10 , G06F18/214 , G06V10/764 , G06V10/774 , G06V10/82 , G06V10/26
CPC classification number: G06F3/017 , G06T7/73 , G06T7/20 , G06F3/011 , H04N13/204 , G06V40/11 , G06F18/214 , G06V10/764 , G06V10/774 , G06V10/82 , G06V10/267 , G06T2207/20081 , G06T2207/20132 , G06T2207/20084 , G06T2207/30196 , G06T2207/10012
Abstract: Systems and methods herein describe using a neural network to identify a first set of joint location coordinates and a second set of joint location coordinates and identifying a three-dimensional hand pose based on both the first and second sets of joint location coordinates.
-
公开(公告)号:US11599741B1
公开(公告)日:2023-03-07
申请号:US16774796
申请日:2020-01-28
Applicant: Snap Inc.
Abstract: Systems and methods are provided for analyzing, by a computing device, location data associated with a location of the computing device to determine that an image or video captured using a messaging application on the computing device is captured near a food-related venue or event, receiving input related to food associated with the food-related venue or event, sending the image or video and the input related to food associated with the food-related venue or event to a computing system to train a machine learning model for food detection, and updating the messaging application to comprise the trained machine learning model for food detection.
-
公开(公告)号:US20220414985A1
公开(公告)日:2022-12-29
申请号:US17823764
申请日:2022-08-31
Applicant: Snap Inc.
Inventor: Liuhao Ge , Zhou Ren , Yuncheng Li , Zehao Xue , Yingying Wang
Abstract: Aspects of the present disclosure involve a system comprising a computer-readable storage medium storing a program and a method for receiving a monocular image that includes a depiction of a hand and extracting features of the monocular image using a plurality of machine learning techniques. The program and method further include modeling, based on the extracted features, a pose of the hand depicted in the monocular image by adjusting skeletal joint positions of a three-dimensional (3D) hand mesh using a trained graph convolutional neural network (CNN); modeling, based on the extracted features, a shape of the hand in the monocular image by adjusting blend shape values of the 3D hand mesh representing surface features of the hand depicted in the monocular image using the trained graph CNN; and generating, for display, the 3D hand mesh adjusted to model the pose and shape of the hand depicted in the monocular image.
-
公开(公告)号:US20210225077A1
公开(公告)日:2021-07-22
申请号:US17222176
申请日:2021-04-05
Applicant: Snap Inc.
Inventor: Liuhao Ge , Zhou Ren , Yuncheng LI , Zehao Xue , Yingying Wang
Abstract: Aspects of the present disclosure involve a system comprising a computer-readable storage medium storing a program and a method for receiving a monocular image that includes a depiction of a hand and extracting features of the monocular image using a plurality of machine learning techniques. The program and method further include modeling, based on the extracted features, a pose of the hand depicted in the monocular image by adjusting skeletal joint positions of a three-dimensional (3D) hand mesh using a trained graph convolutional neural network (CNN); modeling, based on the extracted features, a shape of the hand in the monocular image by adjusting blend shape values of the 3D hand mesh representing surface features of the hand depicted in the monocular image using the trained graph CNN; and generating, for display, the 3D hand mesh adjusted to model the pose and shape of the hand depicted in the monocular image.
-
公开(公告)号:US11030454B1
公开(公告)日:2021-06-08
申请号:US16777098
申请日:2020-01-30
Applicant: Snap Inc.
Inventor: Xuehan Xiong , Zehao Xue
Abstract: A machine learning scheme can be trained on a set of labeled training images of a subject in different poses, with different textures, and with different background environments. The label or marker data of the subject may be stored as metadata to a 3D model of the subject or rendered images of the subject. The machine learning scheme may be implemented as a supervised learning scheme that can automatically identify the labeled data to create a classification model. The classification model can classify a depicted subject in many different environments and arrangements (e.g., poses).
-
公开(公告)号:US10586368B2
公开(公告)日:2020-03-10
申请号:US15858992
申请日:2017-12-29
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US20250086868A1
公开(公告)日:2025-03-13
申请号:US18955286
申请日:2024-11-21
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US11886966B2
公开(公告)日:2024-01-30
申请号:US18151268
申请日:2023-01-06
Applicant: Snap Inc.
CPC classification number: G06N20/00 , G06F18/214 , G06F18/40 , G06V20/20 , G06V20/70 , G06V20/68 , G06V2201/10
Abstract: Systems and methods are provided for analyzing, by a computing device, location data associated with a location of the computing device to determine that an image or video captured using a messaging application on the computing device is captured near a food-related venue or event, receiving input related to food associated with the food-related venue or event, sending the image or video and the input related to food associated with the food-related venue or event to a computing system to train a machine learning model for food detection, and updating the messaging application to comprise the trained machine learning model for food detection.
-
-
-
-
-
-
-
-
-