-
公开(公告)号:US12169765B2
公开(公告)日:2024-12-17
申请号:US18244016
申请日:2023-09-08
Applicant: Snap Inc.
Inventor: Xuehan Xiong , Zehao Xue
Abstract: A machine learning scheme can be trained on a set of labeled training images of a subject in different poses, with different textures, and with different background environments. The label or marker data of the subject may be stored as metadata to a 3D model of the subject or rendered images of the subject. The machine learning scheme may be implemented as a supervised learning scheme that can automatically identify the labeled data to create a classification model. The classification model can classify a depicted subject in many different environments and arrangements (e.g., poses).
-
公开(公告)号:US11790276B2
公开(公告)日:2023-10-17
申请号:US17322609
申请日:2021-05-17
Applicant: Snap Inc.
Inventor: Xuehan Xiong , Zehao Xue
CPC classification number: G06N20/10 , G06N20/00 , G06T17/20 , G06V10/772 , G06V20/10 , G06V20/80 , G06V40/103 , G06V40/107
Abstract: A machine learning scheme can be trained on a set of labeled training images of a subject in different poses, with different textures, and with different background environments. The label or marker data of the subject may be stored as metadata to a 3D model of the subject or rendered images of the subject. The machine learning scheme may be implemented as a supervised learning scheme that can automatically identify the labeled data to create a classification model. The classification model can classify a depicted subject in many different environments and arrangements (e.g., poses).
-
公开(公告)号:US11610354B2
公开(公告)日:2023-03-21
申请号:US17349015
申请日:2021-06-16
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US10757319B1
公开(公告)日:2020-08-25
申请号:US15624277
申请日:2017-06-15
Applicant: Snap Inc.
Inventor: Linjie Luo , Chongyang Ma , Zehao Xue
Abstract: A dolly zoom effect can be applied to one or more images captured via a resource-constrained device (e.g., a mobile smartphone) by manipulating the size of a target feature while the background in the one or more images changes due to physical movement of the resource-constrained device. The target feature can be detected using facial recognition or shape detection techniques. The target feature can be resized before the size is manipulated as the background changes (e.g., changes perspective).
-
公开(公告)号:US20200160580A1
公开(公告)日:2020-05-21
申请号:US16749753
申请日:2020-01-22
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US11880509B2
公开(公告)日:2024-01-23
申请号:US18151857
申请日:2023-01-09
Applicant: Snap Inc.
Inventor: Yuncheng Li , Jonathan M. Rodriguez, II , Zehao Xue , Yingying Wang
IPC: G06F3/01 , G06T7/73 , G06T7/20 , H04N13/204 , G06V40/10 , G06F18/214 , G06V10/764 , G06V10/774 , G06V10/82 , G06V10/26
CPC classification number: G06F3/017 , G06F3/011 , G06F18/214 , G06T7/20 , G06T7/73 , G06V10/267 , G06V10/764 , G06V10/774 , G06V10/82 , G06V40/11 , H04N13/204 , G06T2207/10012 , G06T2207/20081 , G06T2207/20084 , G06T2207/20132 , G06T2207/30196
Abstract: Systems and methods herein describe using a neural network to identify a first set of joint location coordinates and a second set of joint location coordinates and identifying a three-dimensional hand pose based on both the first and second sets of joint location coordinates.
-
公开(公告)号:US20230419188A1
公开(公告)日:2023-12-28
申请号:US18244016
申请日:2023-09-08
Applicant: Snap Inc.
Inventor: Xuehan Xiong , Zehao Xue
CPC classification number: G06N20/10 , G06T17/20 , G06N20/00 , G06V20/80 , G06V10/772 , G06V20/10 , G06V40/103 , G06V40/107
Abstract: A machine learning scheme can be trained on a set of labeled training images of a subject in different poses, with different textures, and with different background environments. The label or marker data of the subject may be stored as metadata to a 3D model of the subject or rendered images of the subject. The machine learning scheme may be implemented as a supervised learning scheme that can automatically identify the labeled data to create a classification model. The classification model can classify a depicted subject in many different environments and arrangements (e.g., poses).
-
公开(公告)号:US20230153396A1
公开(公告)日:2023-05-18
申请号:US18151268
申请日:2023-01-06
Applicant: Snap Inc.
IPC: G06F18/40 , G06N20/00 , G06F18/214
CPC classification number: G06F18/40 , G06N20/00 , G06F18/214 , G06V2201/10 , G06V20/68
Abstract: Systems and methods are provided for analyzing, by a computing device, location data associated with a location of the computing device to determine that an image or video captured using a messaging application on the computing device is captured near a food-related venue or event, receiving input related to food associated with the food-related venue or event, sending the image or video and the input related to food associated with the food-related venue or event to a computing system to train a machine learning model for food detection, and updating the messaging application to comprise the trained machine learning model for food detection.
-
公开(公告)号:US20210312681A1
公开(公告)日:2021-10-07
申请号:US17349015
申请日:2021-06-16
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US10719968B2
公开(公告)日:2020-07-21
申请号:US16387092
申请日:2019-04-17
Applicant: Snap Inc.
Abstract: Embodiments described herein relate to an augmented expression system to generate and cause display of a specially configured interface to present an augmented reality perspective. The augmented expression system receives image and video data of a user and tracks facial landmarks of the user based on the image and video data, in real-time to generate and present a 3-dimensional (3D) bitmoji of the user.
-
-
-
-
-
-
-
-
-