-
公开(公告)号:US11496746B2
公开(公告)日:2022-11-08
申请号:US17165680
申请日:2021-02-02
Applicant: QUALCOMM Incorporated
Inventor: Mahant Siddaramanna , Naveen Srinivasamurthy , Apoorva Nagarajan , Prasant Shekhar Singh , Pawan Kumar Baheti , Narendranath Malayath
IPC: H04N19/147 , H04N19/159 , H04N19/19 , G06N3/04 , H04N19/176
Abstract: Systems and techniques are described for data encoding using a machine learning approach to generate a distortion prediction {circumflex over (D)} and a predicted bit rate {circumflex over (R)}, and to use {circumflex over (D)} and {circumflex over (R)} to perform rate-distortion optimization (RDO). For example, a video encoder can generate the distortion prediction {circumflex over (D)} and the bit rate residual prediction based on outputs of the one or more neural networks in response to the one or more neural networks receiving a residual portion of a block of a video frame as input. The video encoder can determine bit rate metadata prediction based on metadata associated with a mode of compression, and determine {circumflex over (R)} to be the sum of and . The video encoder can determine a rate-distortion cost prediction Ĵ as a function of {circumflex over (D)} and {circumflex over (R)}, and can determine a prediction mode for compressing the block based on Ĵ.