-
公开(公告)号:US20230169694A1
公开(公告)日:2023-06-01
申请号:US17975471
申请日:2022-10-27
Applicant: QUALCOMM Incorporated
Inventor: Hoang Cong Minh LE , Reza POURREZA , Yang YANG , Yinhao ZHU , Amir SAID , Taco Sebastiaan COHEN
IPC: G06T9/00
CPC classification number: G06T9/002
Abstract: A processor-implemented method for video compression using an artificial neural network (ANN) includes receiving a video via the ANN. The ANN extracts a first set of features of a current frame of the video and a second set of features of a reference frame of the video. The ANN determines an estimate of correlation features between the first set of features of the current frame and the second set of features of the reference frame. The estimate of the correlation features are encoded and transmitted to a receiver.
-
公开(公告)号:US20230156207A1
公开(公告)日:2023-05-18
申请号:US17987844
申请日:2022-11-15
Applicant: QUALCOMM Incorporated
Inventor: Yang YANG , Hoang Cong Minh LE , Yinhao ZHU , Reza POURREZA , Amir SAID , Yizhe ZHANG , Taco Sebastiaan COHEN
IPC: H04N19/436 , H04N19/124 , H04N19/147 , H04N19/17 , H04N19/119
CPC classification number: H04N19/436 , H04N19/124 , H04N19/147 , H04N19/17 , H04N19/119
Abstract: A processor-implemented method for image compression using an artificial neural network (ANN) includes receiving, at an encoder of the ANN, an image and a spatial segmentation map corresponding to the image. The spatial segmentation map indicates one or more regions of interest. The encoder compresses the image according to a controllable spatial bit allocation. The controllable spatial bit allocation is based on a learned quantization bin size.
-
公开(公告)号:US20220303568A1
公开(公告)日:2022-09-22
申请号:US17207244
申请日:2021-03-19
Applicant: QUALCOMM Incorporated
Inventor: Reza POURREZA , Amir SAID , Yang YANG , Yinhao ZHU , Taco Sebastiaan COHEN
IPC: H04N19/51 , H04N19/172 , H04N19/137 , H04N19/107 , H04N19/593 , G06N3/08
Abstract: Systems and techniques are described for encoding and/or decoding data based on motion estimation that applies variable-scale warping. An encoding device can receive an input frame and a reference frame that depict a scene at different times. The encoding device can generate an optical flow identifying movements in the scene between the two frames. The encoding device can generate a weight map identifying how finely or coarsely the reference frame can be warped for input frame prediction. The encoding device can generate encoded video data based on the optical flow and the weight map. A decoding device can generate a reconstructed optical flow and a reconstructed weight map from the encoded data. A decoding device can generate a prediction frame by warping the reference frame based on the reconstructed optical flow and the reconstructed weight map. The decoding device can generate a reconstructed input frame based on the prediction frame.
-
公开(公告)号:US20220292725A1
公开(公告)日:2022-09-15
申请号:US17200694
申请日:2021-03-12
Applicant: QUALCOMM Incorporated
Inventor: Hoang Cong Minh LE , Reza POURREZA , Yang YANG , Yinhao ZHU , Amir SAID , Yizhe ZHANG , Taco Sebastiaan COHEN
Abstract: A method of image compression includes receiving an image. Multiple quantized latent representations are generated to represent features of the image. Each of the quantized latent representations has a different resolution and is generated at staggered timings. Each of the later generated quantized latent representations is conditioned on each of the prior generated quantized latent representations. The multiple quantized latent representations are decoded to reconstruct the image.
-
公开(公告)号:US20220237740A1
公开(公告)日:2022-07-28
申请号:US17648808
申请日:2022-01-24
Applicant: QUALCOMM Incorporated
Inventor: Yadong LU , Yang YANG , Yinhao ZHU , Amir SAID , Taco Sebastiaan COHEN
Abstract: Certain aspects of the present disclosure provide techniques for compressing content using a neural network. An example method generally includes receiving content for compression. The content is encoded into a first latent code space through an encoder implemented by an artificial neural network trained to generate a latent space representation of the content. A first compressed version of the encoded content is generated using a first quantization bin size of a series of quantization bin sizes. A refined compressed version of the encoded content is generated by scaling the first compressed version of the encoded content into one or more second quantization bin sizes smaller than the first quantization bin size, conditioned at least on a value of the first compressed version of the encoded content. The refined compressed version of the encoded content is output for transmission.
-
公开(公告)号:US20240364925A1
公开(公告)日:2024-10-31
申请号:US18636126
申请日:2024-04-15
Applicant: QUALCOMM Incorporated
Inventor: Hoang Cong Minh LE , Qiqi HOU , Farzad FARHADZADEH , Amir SAID , Auke Joris WIGGERS , Guillaume Konrad SAUTIERE , Reza POURREZA
IPC: H04N19/597 , H04N19/137 , H04N19/436
CPC classification number: H04N19/597 , H04N19/137 , H04N19/436
Abstract: Systems and techniques are described herein for processing video data. For example, a machine-learning based stereo video coding system can obtain video data including at least a right-view image of a right view of a scene and a left-view image of a left view of the scene. The machine-learning based stereo video coding system can compress the right-view image and the left-view image in parallel to generate a latent representation of the right-view image and the left-view image. The right-view image and the left-view image can be compressed in parallel based on inter-view information between the right-view image and the left-view image, determined using one or more parallel autoencoders.
-
公开(公告)号:US20240013441A1
公开(公告)日:2024-01-11
申请号:US17862149
申请日:2022-07-11
Applicant: QUALCOMM Incorporated
Inventor: Hoang Cong Minh LE , Reza POURREZA , Amir SAID
CPC classification number: G06T9/00 , G06T3/40 , G06T7/248 , G06T7/50 , G06T3/0093 , G06T2207/20224 , G06T2207/20084
Abstract: Systems and techniques are provided for coding (e.g., encoding and/or decoding) video data using camera motion information. For example, a decoding device can obtain a frame of encoded video data associated with an input frame, the frame of encoded video data including camera information associated with generating the video data and a residual. A camera motion compensated frame can be generated based on a reference frame and the camera information. Optical flow information associated with object motion determined based on at least the input frame and the reference frame can be generated. A motion compensated frame can be generated by warping the camera motion compensated frame based on the optical flow information. A reconstructed input frame can be generated based on the motion compensated frame and the residual.
-
公开(公告)号:US20190007705A1
公开(公告)日:2019-01-03
申请号:US16020511
申请日:2018-06-27
Applicant: QUALCOMM Incorporated
Inventor: Xin ZHAO , Vadim SEREGIN , Amir SAID , Marta KARCZEWICZ
IPC: H04N19/61 , H04N19/176 , H04N19/50 , H04N19/18
Abstract: Techniques are described in which a decoder is configured to receive an input data block and apply an inverse non-separable transform to at least part of the input data block to generate an inverse non-separable transform output coefficient block. The applying the inverse non-separable transform comprises assigning a window, assigning a weight for each position inside the assigned window, and determining the inverse non-separable transform output coefficient block based on the assigned weights. The decoder is further configured to forming a decoded video block based on the determined inverse non-separable transform output coefficient block, wherein forming the decoded video block comprises summing the residual video block with one or more predictive blocks.
-
9.
公开(公告)号:US20240364890A1
公开(公告)日:2024-10-31
申请号:US18608802
申请日:2024-03-18
Applicant: QUALCOMM Incorporated
Inventor: Amir SAID , Hoang Cong Minh LE , Farzad FARHADZADEH
IPC: H04N19/13 , H04N19/146 , H04N19/167 , H04N19/184 , H04N19/91
CPC classification number: H04N19/13 , H04N19/146 , H04N19/167 , H04N19/184 , H04N19/91
Abstract: Systems and techniques are described herein for processing video data. For example, an encoding device can obtain a sequence of video data and determine a minimum value in the sequence of video data. The encoding device can, based on the minimum value, identify positions in the sequence of video data associated with entry points for individually entropy codable parcels of a parallel entropy codable sequence of video data. The encoding device can generate the parallel entropy codable sequence of video data. The encoding device can further generate an index for the parallel entropy codable sequence of video data, the index identifying the individually entropy codable parcels within the parallel entropy codable sequence of video data.
-
公开(公告)号:US20220224926A1
公开(公告)日:2022-07-14
申请号:US17573568
申请日:2022-01-11
Applicant: QUALCOMM Incorporated
Inventor: Yadong LU , Yang YANG , Yinhao ZHU , Amir SAID , Reza POURREZA , Taco Sebastiaan COHEN
IPC: H04N19/42 , H04N19/30 , H04N19/13 , H04N19/136 , H04N19/124
Abstract: A computer-implemented method for operating an artificial neural network (ANN) includes receiving an input by the ANN. The ANN generates a latent representation of the input. The latent representation is communicated according to a bit rate based on a learned latent scaling parameter. The latent scaling parameter is learned based on a channel index and a tradeoff parameter value that corresponds to a value that balances the bit rate and a distortion.
-
-
-
-
-
-
-
-
-