-
Publication No.: US11902705B2
Publication date: 2024-02-13
Application No.: US16558620
Filing date: 2019-09-03
Applicant: Nvidia Corporation
Inventor: Kevin Shih, Aysegul Dundar, Animesh Garg, Robert Pottorff, Andrew Tao, Bryan Catanzaro
CPC classification number: H04N7/0135, G06F18/214, G06F18/217, G06N3/044, G06N3/045, G06N3/08
Abstract: Apparatuses, systems, and techniques to enhance video are disclosed. In at least one embodiment, one or more neural networks are used to create, from a first video, a second video having one or more additional video frames.
-
Publication No.: US20230113950A1
Publication date: 2023-04-13
Application No.: US17496569
Filing date: 2021-10-07
Applicant: Nvidia Corporation
Inventor: Kevin Shih, Jose Rafael Valle Gomes da Costa, Rohan Badlani, Adrian Lancucki, Wei Ping, Bryan Catanzaro
IPC: G10L13/047, G10L25/90
Abstract: Generation of synthetic speech from an input text sequence may be difficult when durations of individual phonemes forming the input text sequence are unknown. A predominantly parallel process may model speech rhythm as a separate generative distribution such that phoneme duration may be sampled at inference. Additional information such as pitch or energy may also be sampled to provide improved diversity for synthetic speech generation.
-
Publication No.: US20230110905A1
Publication date: 2023-04-13
Application No.: US17496636
Filing date: 2021-10-07
Applicant: Nvidia Corporation
Inventor: Kevin Shih, Jose Rafael Valle Gomes da Costa, Rohan Badlani, Adrian Lancucki, Wei Ping, Bryan Catanzaro
IPC: G10L13/08, G10L13/047, G10L13/033, G06N3/08, G06N3/04
Abstract: Generation of synthetic speech from an input text sequence may be difficult when durations of individual phonemes forming the input text sequence are unknown. A predominantly parallel process may model speech rhythm as a separate generative distribution such that phoneme duration may be sampled at inference. Additional information such as pitch or energy may also be sampled to provide improved diversity for synthetic speech generation.
-
Publication No.: US20220148256A1
Publication date: 2022-05-12
Application No.: US17095556
Filing date: 2020-11-11
Applicant: Nvidia Corporation
Inventor: Shiqiu Liu, Robert Pottorff, Andrew Tao, Bryan Catanzaro
Abstract: Apparatuses, systems, and techniques are presented to reconstruct one or more images. In at least one embodiment, one or more neural networks are used to determine one or more blending weights for one or more images based, at least in part, upon one or more pixel value masks for the one or more images.
-
Publication No.: US20220114702A1
Publication date: 2022-04-14
Application No.: US17406902
Filing date: 2021-08-19
Applicant: Nvidia Corporation
Inventor: Shiqiu Liu, Robert Pottorff, Guilin Liu, Karan Sapra, Jon Barker, David Tarjan, Pekka Janis, Edvard Fagerholm, Lei Yang, Kevin Jonathan Shih, Marco Salvi, Timo Roman, Andrew Tao, Bryan Catanzaro
Abstract: Apparatuses, systems, and techniques are presented to generate images. In at least one embodiment, one or more neural networks are used to generate one or more images using one or more pixel weights.
-
Publication No.: US20220114701A1
Publication date: 2022-04-14
Application No.: US17172330
Filing date: 2021-02-10
Applicant: Nvidia Corporation
Inventor: Shiqiu Liu, Robert Pottorff, Guilin Liu, Karan Sapra, Jon Barker, David Tarjan, Pekka Janis, Edvard Fagerholm, Lei Yang, Kevin Shih, Marco Salvi, Timo Roman, Andrew Tao, Bryan Catanzaro
Abstract: Apparatuses, systems, and techniques are presented to generate images. In at least one embodiment, one or more neural networks are used to generate one or more images using one or more pixel weights determined based, at least in part, on one or more sub-pixel offset values.
-
Publication No.: US20240095447A1
Publication date: 2024-03-21
Application No.: US17846866
Filing date: 2022-06-22
Applicant: Nvidia Corporation
Inventor: Wei Ping, Boxin Wang, Chaowei Xiao, Mohammad Shoeybi, Mostofa Patwary, Anima Anandkumar, Bryan Catanzaro
IPC: G06F40/279, G06F40/205, G06F40/55
CPC classification number: G06F40/279, G06F40/205, G06F40/55
Abstract: Apparatuses, systems, and techniques are presented to identify and prevent generation of restricted content. In at least one embodiment, one or more neural networks are used to identify restricted content based only on the restricted content.
-
Publication No.: US20240038212A1
Publication date: 2024-02-01
Application No.: US18099840
Filing date: 2023-01-20
Applicant: NVIDIA Corporation
Inventor: Kevin Shih, José Rafael Valle Gomes da Costa, Rohan Badlani, João Felipe Santos, Bryan Catanzaro
IPC: G10L13/027, G10L13/08, G10L25/30
CPC classification number: G10L13/027, G10L13/08, G10L25/30
Abstract: Disclosed are apparatuses, systems, and techniques that may use machine learning for implementing generative text-to-speech models. The techniques include identifying a mapping of speech characteristics (SC) on a target distribution of a latent variable using a non-linear transformation for at least a subset of the SC. Parameters of the non-linear transformation are determined using a neural network that approximates a statistics of the SC with a statistics predicted for the SC based on the identified mapping and the target distribution of the latent variable.
-
Publication No.: US20230035306A1
Publication date: 2023-02-02
Application No.: US17382027
Filing date: 2021-07-21
Applicant: Nvidia Corporation
Inventor: Ming-Yu Liu, Koki Nagano, Yeongho Seol, Jose Rafael Valle Gomes da Costa, Jaewoo Seo, Ting-Chun Wang, Arun Mallya, Sameh Khamis, Wei Ping, Rohan Badlani, Kevin Jonathan Shih, Bryan Catanzaro, Simon Yuen, Jan Kautz
Abstract: Apparatuses, systems, and techniques are presented to generate media content. In at least one embodiment, a first neural network is used to generate first video information based, at least in part, upon voice information corresponding to one or more users, and a second neural network is used to generate second video information corresponding to the one or more users based, at least in part, upon the first video information and one or more images corresponding to the one or more users.
-
Publication No.: US20220156883A1
Publication date: 2022-05-19
Application No.: US17665412
Filing date: 2022-02-04
Applicant: Nvidia Corporation
Inventor: Robert Pottorff, David Tarjan, Andrew Tao, Bryan Catanzaro
Abstract: Apparatuses, systems, and techniques are presented to generate images with one or more visual effects applied. In at least one embodiment, one or more visual effects are applied to one or more images having a resolution that is less than a first resolution and those visual effects approximated for one or more images having a resolution that is greater than or equal to the first resolution.