Abstract:
Disclosed herein are a method, apparatus, system, and computer-readable recording medium for image compression. An encoding apparatus performs preprocessing of feature map information, frame packing, frame classification, and encoding. A decoding apparatus performs decoding, frame depacking, and postprocessing in order to reconstruct feature map information. By encoding the feature map information, inter-prediction and intra-block prediction for a frame are performed. The encoding apparatus provides the decoding apparatus with a feature map information bitstream for reconstructing the feature map information along with an image information bitstream.
Abstract:
Disclosed herein are a video decoding method and apparatus and a video encoding method and apparatus. A transformed block is generated by performing a first transformation that uses a prediction block for a target block. A reconstructed block for the target block is generated by performing a second transformation that uses the transformed block. The prediction block may be a block present in a reference image, or a reconstructed block present in a target image. The first transformation and the second transformation may be respectively performed by neural networks. Since each transformation is automatically performed by the corresponding neural network, information required for a transformation may be excluded from a bitstream.
Abstract:
Disclosed herein are a method and apparatus for video decoding and a method and apparatus for video encoding. A prediction block for a target block is generated by predicting the target block using a prediction network, and a reconstructed block for the target block is generated based on the prediction block and a reconstructed residual block. The prediction network includes an intra-prediction network and an inter-prediction network and uses a spatial reference block and/or a temporal reference block when it performs prediction. For learning in the prediction network, a loss function is defined, and learning in the prediction network is performed based on the loss function.
Abstract:
Disclosed herein are an inter-prediction method and apparatus using a reference frame generated based on deep learning. In the inter-prediction method and apparatus, a reference frame is selected, and a virtual reference frame is generated based on the selected reference frame. A reference picture list is configured to include the generated virtual reference frame, and inter prediction for a target block is performed based on the virtual reference frame. The virtual reference frame may be generated based on a deep-learning network architecture, and may be generated based on video interpolation and/or video extrapolation that use the selected reference frame.
Abstract:
Disclosed herein are a method, an apparatus and a storage medium for image encoding/decoding. In typical image encoding/decoding methods, a decoder-side motion information derivation method may be limitedly used. Therefore, the improvement of encoding efficiency attributable to the decoder-side motion information derivation method may also be limited. In embodiments, a motion information search method used in inter-prediction is disclosed. With the use of various motion search methods, encoding efficiency in inter-prediction may be improved.
Abstract:
Disclosed herein are a method and apparatus for compressing learning parameters for training of a deep-learning model and transmitting the compressed parameters in a distributed processing environment. Multiple electronic devices in the distributed processing system perform training of a neural network. By performing training, parameters are updated. The electronic device may share the updated parameter thereof with additional electronic devices. In order to efficiently share the parameter, the residual of the parameter is provided to the additional electronic devices. When the residual of the parameter is provided, the additional electronic devices update the parameter using the residual of the parameter.
Abstract:
A method and apparatus for image compression using a latent variable are provided. The multiple components of the latent variable may be sorted in order of importance. Through sorting, when the feature information of only some of the multiple components is used, the quality of a reconstructed image may be improved. In order to generate a latent variable, the components of which are sorted in order of importance, learning may be performed in various manners. Also, less important information may be eliminated from the latent variable, and processing, such as quantization, may be applied to the latent variable. Through elimination and processing, the amount of data for the latent variable may be reduced.
Abstract:
Disclosed herein are a video decoding method and apparatus and a video encoding method and apparatus. A virtual frame is generated by a video generation network including a generation encoder and a generation decoder. The virtual frame is used as a reference frame in inter prediction for a target. Further, a video generation network for inter prediction may be selected from among multiple video generation networks, and inter prediction that uses the selected video generation network may be performed.
Abstract:
Disclosed are an apparatus and a method for analyzing propagation of an electromagnetic wave by effectively using a ray tracing scheme in a radio wave system, in which; an electromagnetic wave scattered in an electromagnetic wave incident to an interface in a service area is detected; average scattering power for the interface is calculated by the scattered electromagnetic wave; the propagation of the electromagnetic wave in the service area is analyzed based on the average scattering power.
Abstract:
An encoding apparatus extracts features of an image by applying multiple padding operations and multiple downscaling operations to an image represented by data and transmits feature information indicating the features to a decoding apparatus. The multiple padding operations and the multiple downscaling operations are applied to the image in an order in which one padding operation is applied and thereafter one downscaling operation corresponding to the padding operation is applied. A decoding method receives feature information from an encoding apparatus, and generates a reconstructed image by applying multiple upscaling operations and multiple trimming operations to an image represented by the feature information. The multiple upscaling operations and the multiple trimming operations are applied to the image in an order in which one upscaling operation is applied and thereafter one trimming operation corresponding to the upscaling operation is applied.