-
公开(公告)号:US20250119556A1
公开(公告)日:2025-04-10
申请号:US18889977
申请日:2024-09-19
Applicant: QUALCOMM Incorporated
Inventor: Yun Li , Dmytro Rusanovskyy , Thomas Alexander Ryder , Samuel James Eadie , Marta Karczewicz
IPC: H04N19/176 , G06T9/00 , H04N19/117 , H04N19/159 , H04N19/82
Abstract: A method of processing video data includes receiving a picture; and filtering a current block of the picture, through a neural network and based on local correlations of proximate samples and distant, non-local correlations of non-proximate samples relative to the current block, to generate a filtered current block. The neural network comprises one or more backbone blocks and one or more transformer blocks. Each of the one or more transformer blocks is associated with a backbone block of the one or more backbone blocks. At least one of the backbone blocks is configured to capture the local correlations, relative to the current block and the proximate samples of the current block, and at least one of the transformer blocks is configured to generate features, based on applying an attention mechanism, that capture the distant, non-local correlations, relative to the current block and the non-proximate samples, in the picture for processing.
-
公开(公告)号:US20250119592A1
公开(公告)日:2025-04-10
申请号:US18888423
申请日:2024-09-18
Applicant: QUALCOMM Incorporated
Inventor: Thomas Alexander Ryder , Dmytro Rusanovskyy , Samuel James Eadie , Yun Li , Marta Karczewicz
IPC: H04N19/82 , H04N19/117 , H04N19/132 , H04N19/159 , H04N19/176
Abstract: Methods and devices for decoding video data are described. An example method includes in-loop filtering a current block of the video data using a neural network-based in-loop filter to generate an in-loop filtered current block, wherein the neural network-based in-loop filter is trained using an architecture comprising a U-Net architecture comprising one or more residual blocks and one or more transform blocks; and outputting the in-loop filtered current block.
-
公开(公告)号:US20250016339A1
公开(公告)日:2025-01-09
申请号:US18744171
申请日:2024-06-14
Applicant: QUALCOMM Incorporated
Inventor: Thomas Alexander Ryder , Samuel James Eadie , Marta Karczewicz , Muhammed Zeyd Coban , Vadim Seregin
IPC: H04N19/31 , H04N19/107 , H04N19/167 , H04N19/176
Abstract: An example device for decoding video data includes a processing system comprising one or more processors implemented in circuitry and configured to: determine that a first temporal layer identifier of a first picture of the video data is included in a first set of temporal layers; in response to the first temporal layer identifier being included in the first set of temporal layers, decode blocks of the first picture on a block by block basis; determine that a second temporal layer identifier of a second picture of the video data is included in a second set of temporal layers, the second set of temporal layers being higher than the first set of temporal layers; and in response to the second temporal layer identifier being included in the second set of temporal layers, execute a neural network-based video decoder to decode the second picture.
-
公开(公告)号:US20240323416A1
公开(公告)日:2024-09-26
申请号:US18188268
申请日:2023-03-22
Applicant: QUALCOMM Incorporated
Inventor: Thomas Alexander Ryder , Muhammed Zeyd Coban , Marta Karczewicz
IPC: H04N19/42 , H04N19/124 , H04N19/159 , H04N19/172 , H04N19/503
CPC classification number: H04N19/42 , H04N19/124 , H04N19/159 , H04N19/172 , H04N19/503
Abstract: A device for encoding video data can be configured to encode a set of input frames of the video data with an image analysis neural network to generate a corresponding set of output frames, wherein the set of output frames includes a first output frame and additional output frames, wherein the first output frame temporally precedes the additional output frames; determine a modification to the first output frame that results in an optimized amount of distortion between the set of input frames and decoded versions of the corresponding set of output frames; update the first output frame based on the determined modification; and output a bitstream that includes the updated first output frame.
-
-
-