-
公开(公告)号:US12182919B2
公开(公告)日:2024-12-31
申请号:US18064140
申请日:2022-12-09
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US10586368B2
公开(公告)日:2020-03-10
申请号:US15858992
申请日:2017-12-29
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US11620001B2
公开(公告)日:2023-04-04
申请号:US16948018
申请日:2020-08-27
Applicant: Snap Inc.
Inventor: William Brendel , Francesco Barbieri , Xin Chen , Wei Chu , Venkata Satya Pradeep Karuturi , Luis Carlos Dos Santos Marujo , Leonardo Ribas Machado das Neves
IPC: G06F40/166 , G06F3/023 , G06N3/084 , G06K9/62 , G06F3/04817 , H04L51/04 , G06F40/274
Abstract: Symbol prediction can be implemented using a multi-task system trained for different tasks. The tasks may include a single symbol prediction, symbol category prediction, and symbol subcategory prediction. Categories of symbols can be generated by clustering sets of training data using a clustering scheme.
-
公开(公告)号:US11120597B2
公开(公告)日:2021-09-14
申请号:US16749753
申请日:2020-01-22
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US20250086868A1
公开(公告)日:2025-03-13
申请号:US18955286
申请日:2024-11-21
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US20210312681A1
公开(公告)日:2021-10-07
申请号:US17349015
申请日:2021-06-16
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US10818308B1
公开(公告)日:2020-10-27
申请号:US15965378
申请日:2018-04-27
Applicant: Snap Inc.
Inventor: Wei Chu
IPC: G10L13/00 , G10L21/013 , G10L15/26 , G10L13/033 , G10H1/06 , G10L13/04 , G10L15/02
Abstract: Systems, devices, media, and methods are presented for converting sounds in an audio stream. The systems and methods receive an audio conversion request initiating conversion of one or more sound characteristics of an audio stream from a first state to a second state. The systems and methods access an audio conversion model associated with an audio signature for the second state. The audio stream is converted based on the audio conversion model and an audio construct is compiled from the converted audio stream and a base audio segment. The compiled audio construct is presented at a client device.
-
公开(公告)号:US20190130628A1
公开(公告)日:2019-05-02
申请号:US15858992
申请日:2017-12-29
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US11610354B2
公开(公告)日:2023-03-21
申请号:US17349015
申请日:2021-06-16
Applicant: Snap Inc.
Abstract: The present invention relates to a joint automatic audio visual driven facial animation system that in some example embodiments includes a full scale state of the art Large Vocabulary Continuous Speech Recognition (LVCSR) with a strong language model for speech recognition and obtained phoneme alignment from the word lattice.
-
公开(公告)号:US10788900B1
公开(公告)日:2020-09-29
申请号:US16023912
申请日:2018-06-29
Applicant: Snap Inc.
Inventor: William Brendel , Francesco Barbieri , Xin Chen , Wei Chu , Venkata Satya Pradeep Karuturi , Luis Carlos Dos Santos Marujo , Leonardo Ribas Machado das Neves
IPC: G06F17/27 , G06F3/023 , G06N3/08 , G06K9/62 , G06F3/0481 , H04L12/58 , G06F40/166 , G06F40/274
Abstract: Symbol prediction can be implemented using a multi-task system trained for different tasks. The tasks may include a single symbol prediction, symbol category prediction, and symbol subcategory prediction. Categories of symbols can be generated by clustering sets of training data using a clustering scheme.
-
-
-
-
-
-
-
-
-