-
公开(公告)号:US20240323415A1
公开(公告)日:2024-09-26
申请号:US18188070
申请日:2023-03-22
Applicant: QUALCOMM Incorporated
Inventor: David Wilson ROMERO GUZMAN , Gabriele CESA , Guillaume Konrad SAUTIERE , Yunfan ZHANG , Taco Sebastiaan COHEN , Auke Joris WIGGERS
IPC: H04N19/42 , G06T3/40 , H04N19/182
CPC classification number: H04N19/42 , G06T3/4046 , H04N19/182
Abstract: Certain aspects of the present disclosure provide techniques and apparatus for encoding content using a neural network. An example method generally includes encoding video content into a latent space representation through an encoder implemented by a first machine learning model. A code is generated by upsampling the latent space representation of the video content. A prior is calculated based on a conditional probability of obtaining the upsampled latent space representation conditioned by the latent space representation of the video content. A compressed version of the video content is generated based on a probabilistic model implemented by a second machine learning model, the generated code, and the calculated prior, and the compressed version of the video content is output for transmission.