-
公开(公告)号:US20240289590A1
公开(公告)日:2024-08-29
申请号:US18572100
申请日:2022-06-16
Applicant: Nokia Technologies Oy
Inventor: Francesco CRICRÌ , Nannan ZOU , Honglei ZHANG , Hamed REZAZADEGAN TAVAKOLI
IPC: G06N3/045
CPC classification number: G06N3/045
Abstract: Various embodiments provide a method, an apparatus, and computer program product. The method comprising: defining an attention block comprising: a set of initial neural network layers, wherein each layer is caused to process an output of a previous layer, and wherein a first layer processes an input of a dense split attention block; core attention blocks process one or more outputs of the set of initial neural network layers; a concatenation block for concatenating one or more outputs of the core attention blocks and at least one intermediate output of the set of initial neural network layers; one or more final neural network layers process at least the output of the concatenation block; and a summation block caused to sum an output of the final neural network layers and an input to the attention block; and providing an output of the summation block as a final output of the attention block.
-
公开(公告)号:US20240249514A1
公开(公告)日:2024-07-25
申请号:US18560430
申请日:2022-05-13
Applicant: Nokia Technologies Oy
Inventor: Jani LAINEMA , Francesco CRICRÌ , Honglei ZHANG , Hamed REZAZADEGAN TAVAKOLI , Yat Hong LAM , Miska Matias HANNUKSELA , Nannan ZOU
IPC: G06V10/82 , G06V10/771 , H04N19/117 , H04N19/159 , H04N19/172 , H04N19/70 , H04N19/82
CPC classification number: G06V10/82 , G06V10/771 , H04N19/117 , H04N19/159 , H04N19/172 , H04N19/70 , H04N19/82
Abstract: Various embodiments provide an apparatus, a method, and a computer program product. The apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform; train or finetune one or more additional parameters of at least one neural network (NN) or a portion of the at least one NN, wherein the one or more additional parameters comprise one or more scaling parameters; and encode or decode one or more media elements based on the at least one neural network or a portion of the at least one NN comprising the trained or finetuned one or more additional parameters.
-
3.
公开(公告)号:US20240146938A1
公开(公告)日:2024-05-02
申请号:US18548130
申请日:2022-03-09
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Nannan ZOU , Honglei ZHANG , Francesco CRICRÌ , Hamed REZAZADEGAN TAVAKOLI , Ramin GHAZNAVI YOUVALARI
IPC: H04N19/159 , H04N19/172
CPC classification number: H04N19/159 , H04N19/172
Abstract: Various embodiments provide an apparatus, a method and a computer program product for end-to-end learned predictive coding of media frames. An example apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: encode or decode one or more media frames for at least one neural network; wherein an inter-frame codec is applied to at least one media frame of the one or more media frames; and wherein a first decoded reference frame and a second decoded reference frame refer to reference frames for the at least one media frame.
-
4.
公开(公告)号:US20240265240A1
公开(公告)日:2024-08-08
申请号:US18567736
申请日:2022-06-17
Applicant: Nokia Technologies Oy
Inventor: Honglei ZHANG , Francesco CRICRÌ , Ramin GHAZNAVI YOUVALARI , Hamed REZAZADEGAN TAVAKOLI , Nannan ZOU , Vinod Kumar MALAMAL VADAKITAL , Miska Matias HANNUKSELA , Yat Hong LAM , Jani LAINEMA , Emre Baris AKSU
IPC: G06N3/0455
CPC classification number: G06N3/0455
Abstract: An example apparatus includes at least one processor; and at least one non-transitory memory comprising computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform; learn importance of one or more parameters by using a training dataset; define one or more masks for indicating the importance of the one or more parameters for a model finetuning; share at least one mask of the one or more masks with at least one of an encoder or a decoder; finetune at least one parameter of the one or more parameters based at least on the at least one mask; send or signal one or more weight updates corresponding to the at least one parameter in a bitstream to the decoder.
-
5.
公开(公告)号:US20230269387A1
公开(公告)日:2023-08-24
申请号:US18001987
申请日:2021-06-11
Applicant: Nokia Technologies Oy
Inventor: Francesco CRICRÌ , Hamed REZAZADEGAN TAVAKOLI , Honglei ZHANG , Nannan ZOU
IPC: H04N19/42 , H04N19/124 , H04N19/176 , G06N3/0455 , G06N3/0985
CPC classification number: H04N19/42 , G06N3/0455 , G06N3/0985 , H04N19/124 , H04N19/176
Abstract: In example embodiments, an apparatus, a method, and a computer program product are provided. An example apparatus include processing circuitry; and at least one memory including computer program code, the at least one memory and the computer program code configured to, with the processing circuitry, cause the apparatus at least to: overfit a neural network on each media item, from a batch of media items, for a number of iterations to obtain an overfitted neural network model for the each media item; evaluate the overfitted neural network model on the each media item to obtain evaluation errors; and update parameters of the neural network to be based on the evaluation errors.
-
公开(公告)号:US20240267543A1
公开(公告)日:2024-08-08
申请号:US18425693
申请日:2024-01-29
Applicant: Nokia Technologies Oy
Inventor: Nannan ZOU , Francesco CRICRÌ , Honglei ZHANG
IPC: H04N19/30 , H04N19/172 , H04N19/88
CPC classification number: H04N19/30 , H04N19/172 , H04N19/88
Abstract: An example method includes: receiving a target frame and one or more reference frames; extracting a first feature map from a first predicted target frame predicted from a first reference frame, and a second feature map from a second predicted frame predicted from a second target frame, wherein the first predicted target frame is a backward predicted target frame and the second predicted target frame is a forward predicted target frame; generating a refined residual feature based at least on the first feature map, the second feature map, and a third feature map extracted from a feature decoder net module or circuit; generating a frame residual based at least on the refined residual feature; and generating an output reconstructed frame based at least on the frame residual and an average frame, wherein the average frame represents an average of the first predicted target frame and the second predicted target frame.
-
公开(公告)号:US20230196072A1
公开(公告)日:2023-06-22
申请号:US17644622
申请日:2021-12-16
Applicant: Nokia Technologies Oy
Inventor: Nannan ZOU , Francesco CRICRÌ , Honglei ZHANG , Hamed REZAZADEGAN TAVAKOLI , Jani LAINEMA , Miska Matias HANNUKSELA
CPC classification number: G06N3/0454 , G06N3/08
Abstract: Various embodiments provide an apparatus, a method, and a computer program product. An example apparatus includes at least one processor; and at least one non-transitory memory comprising computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: determine a subset of parameters to overfit from a set of candidate parameters of decoder side neural network to be overfitted (OPs); wherein the subset of parameters to overfit is smaller than the set of candidate parameters to be overfitted; and overfit the determined subset of parameters.
-
公开(公告)号:US20230169372A1
公开(公告)日:2023-06-01
申请号:US17457136
申请日:2021-12-01
Applicant: Nokia Technologies Oy
Inventor: Nannan ZOU , Francesco CRICRÌ , Honglei ZHANG , Hamed REZAZADEGAN TAVAKOLI , Jani LAINEMA , Miska Matias HANNUKSELA
CPC classification number: G06N7/005 , G06V10/84 , G06V10/467 , G06V10/44 , H04N19/176 , H04N19/61
Abstract: Various embodiments provide an apparatus, a method, and a computer program product. 1. An apparatus incudes at least one processor; and at least one non-transitory memory includes computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to: perform an overfitting operation, at an encoder side, to obtain an overfitted probability model, wherein overfitting comprises one or more training operations applied to a probability model, wherein one or more parameters of the probability model are trained; use the overfitted probability model to provide probability estimates to a lossless codec or a substantially lossless codec for encoding data or a portion of the data; and signal information to a decoder on whether to perform the overfitting operation at the decoder side.
-
-
-
-
-
-
-