IMAGE PROCESSING METHOD, TRAINING METHOD, AND APPARATUS

    Publication number: US20240029406A1

    Publication date: 2024-01-25

    Application number: US18481096

    Application date: 2023-10-04

    Abstract: In an image processing method, a sending node obtains a to-be-processed image, performs feature extraction on the to-be-processed image by using a feature extraction layer included in a convolutional neural network, to obtain a first feature map, and compresses the first feature map by using a feature compression layer included in the convolutional neural network, to obtain a second feature map, where a channel quantity of the second feature map is less than a channel quantity of the first feature map. The sending node then sends the second feature map to a receiving node. The first feature map of the to-be-processed image is obtained through extraction by using the feature extraction layer, and the first feature map is compressed by using the feature compression layer, to obtain the second feature map.
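
    The channel reduction described in this abstract can be illustrated with a minimal sketch. The abstract does not say how the feature compression layer is built, so the sketch assumes a 1x1 convolution (a per-pixel weighted sum across channels); the function name compress_channels and the example weights are hypothetical.

```python
def compress_channels(feature_map, weights):
    """Assumed 1x1 convolution: mixes C_in channels into C_out channels.

    feature_map: list of C_in channels, each an H x W list of lists.
    weights: C_out rows, each holding C_in coefficients (hypothetical values).
    """
    height = len(feature_map[0])
    width = len(feature_map[0][0])
    return [
        [
            [
                sum(w * feature_map[c][y][x] for c, w in enumerate(row))
                for x in range(width)
            ]
            for y in range(height)
        ]
        for row in weights
    ]


# First feature map: 4 channels of size 2x2, channel c filled with c + 1.
first_feature_map = [[[c + 1.0] * 2 for _ in range(2)] for c in range(4)]
# Each output channel averages two input channels, so 4 channels become 2.
weights = [[0.5, 0.5, 0.0, 0.0], [0.0, 0.0, 0.5, 0.5]]
second_feature_map = compress_channels(first_feature_map, weights)
```

    Here the second feature map keeps the 2x2 spatial size but has half the channels, which is the property the abstract requires before the map is sent to the receiving node.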

    Bidirectional inter prediction method and apparatus

    Publication number: US11838535B2

    Publication date: 2023-12-05

    Application number: US17827361

    Application date: 2022-05-27

    CPC classification number: H04N19/513 H04N19/176 H04N19/577 H04N19/70

    Abstract: Embodiments of this application relate to the field of video picture coding technologies, and disclose a bidirectional inter prediction method and apparatus, to improve coding efficiency. The method includes: obtaining a first motion vector difference of a current picture block; determining a second motion vector difference of the current picture block based on the first motion vector difference, where the first motion vector difference belongs to motion information of the current picture block in a first direction, and the second motion vector difference belongs to motion information of the current picture block in a second direction; and determining prediction samples of the current picture block based on the obtained first motion vector difference and the determined second motion vector difference.
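
    As an illustration of deriving the second motion vector difference from the first, the following sketch assumes a mirroring rule (the second difference is the negation of the first), which is plausible when the two reference pictures lie at equal temporal distance on opposite sides of the current picture. The abstract only says "based on", so the rule, the type MotionVector, and the function names are assumptions.

```python
from typing import NamedTuple


class MotionVector(NamedTuple):
    x: int
    y: int


def derive_second_mvd(first_mvd):
    # Assumed rule: mirror the first-direction difference.
    return MotionVector(-first_mvd.x, -first_mvd.y)


def reconstruct_motion_vectors(pred0, pred1, first_mvd):
    """Add each direction's difference to that direction's predictor."""
    second_mvd = derive_second_mvd(first_mvd)
    mv0 = MotionVector(pred0.x + first_mvd.x, pred0.y + first_mvd.y)
    mv1 = MotionVector(pred1.x + second_mvd.x, pred1.y + second_mvd.y)
    return mv0, mv1


mv0, mv1 = reconstruct_motion_vectors(
    MotionVector(10, 4), MotionVector(-8, -4), MotionVector(3, -2)
)
```

    Under this assumption only the first motion vector difference needs to be carried in the bitstream, which is one way the coding-efficiency gain mentioned in the abstract could arise.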

    ENCODING METHOD, DECODING METHOD, AND DEVICE
    Invention publication

    Publication number: US20230388490A1

    Publication date: 2023-11-30

    Application number: US18447885

    Application date: 2023-08-10

    CPC classification number: H04N19/105 H04N19/159 H04N19/176 H04N19/70

    Abstract: This application discloses an encoding method, a decoding method, and a device. The encoding method includes: determining an index of an intra prediction mode syntax element value set of a current picture block; performing, by using a neural network, probability estimation on input data that represents a feature of the current picture block, to obtain a probability distribution of a plurality of candidate intra prediction mode syntax element value sets, where the probability distribution represents respective probability values of the plurality of candidate intra prediction mode syntax element value sets; and performing entropy encoding on a probability value related to the index of the intra prediction mode syntax element value set of the current picture block based on the probability distribution of the plurality of candidate intra prediction mode syntax element value sets, to obtain a bitstream.
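
    The link between the estimated probability distribution and the size of the bitstream can be sketched as follows. The neural network is replaced here by a hypothetical list of output scores (logits), and no arithmetic coder is implemented: the sketch only computes the ideal entropy-coding cost, -log2(p), of the selected index. All names are illustrative.

```python
import math


def softmax(logits):
    """Turn raw scores into a probability distribution."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ideal_code_length_bits(probabilities, index):
    """Ideal cost in bits of entropy-coding `index` under `probabilities`."""
    return -math.log2(probabilities[index])


# Four candidate syntax element value sets with equal scores cost 2 bits each.
uniform = softmax([0.0, 0.0, 0.0, 0.0])
uniform_cost = ideal_code_length_bits(uniform, 0)

# When the (hypothetical) network favours candidate 0, coding it gets cheaper.
skewed = softmax([3.0, 0.0, 0.0, 0.0])
likely_cost = ideal_code_length_bits(skewed, 0)
unlikely_cost = ideal_code_length_bits(skewed, 1)
```

    The better the estimated distribution matches the index actually chosen, the fewer bits an entropy coder needs for it.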

    MOTION VECTOR PREDICTION METHOD AND RELATED APPARATUS

    Publication number: US20230370607A1

    Publication date: 2023-11-16

    Application number: US18318731

    Application date: 2023-05-17

    CPC classification number: H04N19/139 H04N19/119 H04N19/176 H04N19/61

    Abstract: A motion vector prediction method includes parsing a bitstream to obtain an index value of a candidate motion vector list and constructing the candidate motion vector list that includes candidate motion vectors of K control points of a current block. The candidate motion vectors of K control points are obtained based on a 2N-parameter affine transform model used for a neighboring block of the current block, where N and K are integers greater than or equal to 2 and less than or equal to 4 with N not being equal to K. The method further includes determining, in the candidate motion vector list, target candidate motion vectors of the K control points based on the index value and obtaining a predicted motion vector of each subblock of the current block based on the target candidate motion vectors of the K control points.
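
    The last step of this abstract, obtaining a motion vector for each subblock from the control-point motion vectors, can be sketched for the simplest case. The sketch assumes K = 2 control points (top-left and top-right corners, i.e. a 4-parameter affine model), floating-point arithmetic, and 4x4 subblocks sampled at their centres; the function name and these choices are assumptions, not taken from the patent.

```python
def subblock_motion_vectors(cp0, cp1, width, height, sub=4):
    """Evaluate a 4-parameter affine model at each subblock centre.

    cp0, cp1: (vx, vy) at the top-left and top-right corners of the block.
    Returns a dict mapping the centre (x, y) of each subblock to its (vx, vy).
    """
    a = (cp1[0] - cp0[0]) / width  # scaling term
    b = (cp1[1] - cp0[1]) / width  # rotation term
    mvs = {}
    for y in range(sub // 2, height, sub):
        for x in range(sub // 2, width, sub):
            mvs[(x, y)] = (a * x - b * y + cp0[0], b * x + a * y + cp0[1])
    return mvs


# Pure translation: both control points share one vector, so every subblock does.
translation = subblock_motion_vectors((3.0, -1.0), (3.0, -1.0), 8, 8)

# Zoom: the top-right corner moves 8 units further right than the top-left.
zoom = subblock_motion_vectors((0.0, 0.0), (8.0, 0.0), 8, 8)
```

    Real codecs use fixed-point arithmetic and rounding rules that this sketch leaves out.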

    MOTION VECTOR PREDICTION METHOD AND RELATED APPARATUS

    Publication number: US20230370606A1

    Publication date: 2023-11-16

    Application number: US18318730

    Application date: 2023-05-17

    CPC classification number: H04N19/139 H04N19/119 H04N19/176 H04N19/61

    Abstract: A motion vector prediction method includes parsing a bitstream to obtain an index value of a candidate motion vector list and constructing the candidate motion vector list that includes candidate motion vectors of K control points of a current block. The candidate motion vectors of K control points are obtained based on a 2N-parameter affine transform model used for a neighboring block of the current block, where N and K are integers greater than or equal to 2 and less than or equal to 4 with N not being equal to K. The method further includes determining, in the candidate motion vector list, target candidate motion vectors of the K control points based on the index value and obtaining a predicted motion vector of each subblock of the current block based on the target candidate motion vectors of the K control points.
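
    The case where N differs from K can be illustrated by deriving K = 3 control-point candidates for the current block from a neighboring block that uses a 4-parameter model (N = 2). The sketch evaluates the neighbor's model at three corners of the current block. The dictionary layout, the corner choice (top-left, top-right, bottom-left), and the floating-point arithmetic are assumptions made for illustration.

```python
def affine4_mv(model, x, y):
    """Motion vector of a 4-parameter affine model at picture position (x, y).

    model: dict with the neighbor's top-left position "origin", its "width",
    and the vectors "v0" (top-left corner) and "v1" (top-right corner).
    """
    x0, y0 = model["origin"]
    v0, v1 = model["v0"], model["v1"]
    a = (v1[0] - v0[0]) / model["width"]
    b = (v1[1] - v0[1]) / model["width"]
    dx, dy = x - x0, y - y0
    return (a * dx - b * dy + v0[0], b * dx + a * dy + v0[1])


def inherit_control_points(neighbor, cur_x, cur_y, cur_w, cur_h):
    """Candidate vectors for the top-left, top-right, bottom-left corners."""
    corners = [(cur_x, cur_y), (cur_x + cur_w, cur_y), (cur_x, cur_y + cur_h)]
    return [affine4_mv(neighbor, x, y) for x, y in corners]


# Neighbor occupies x in [0, 8); the current 8x8 block starts at (8, 0).
neighbor = {"origin": (0, 0), "width": 8, "v0": (1.0, 0.0), "v1": (1.0, 2.0)}
candidates = inherit_control_points(neighbor, 8, 0, 8, 8)
```

    The three resulting vectors form one entry of the candidate motion vector list; the index value parsed from the bitstream would then select among such entries.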

    Video picture prediction method and apparatus

    Publication number: US11736715B2

    Publication date: 2023-08-22

    Application number: US17858567

    Application date: 2022-07-06

    Abstract: This application provides video picture prediction methods and apparatuses. In an implementation, a method for encoding a video picture comprises generating a bitstream for video signals, the bitstream comprising a plurality of syntax elements, wherein the plurality of syntax elements comprises a first identifier indicating that an affine motion model based motion compensation is enabled for a video sequence including a picture block to be processed, wherein a second identifier is conditionally signaled at least based on a value of the first identifier, wherein a false value of the second identifier indicates that a 6-parameter affine motion model based motion compensation is disabled for the video sequence, and wherein a true value of the second identifier indicates that the 6-parameter affine motion model based motion compensation is enabled for the video sequence.
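
    The conditional signaling of the second identifier can be sketched as a pair of write and read routines. The sketch assumes that the second identifier is written only when the first identifier is true and is inferred to be false otherwise; the abstract says only that signaling depends "at least" on the first identifier, so this rule, the one-bit-per-flag layout, and the function names are assumptions.

```python
def write_affine_flags(affine_enabled, six_param_enabled):
    """Return the flag bits in signaling order."""
    bits = [1 if affine_enabled else 0]
    if affine_enabled:
        # The second identifier is present only when affine is enabled.
        bits.append(1 if six_param_enabled else 0)
    return bits


def read_affine_flags(bits):
    """Parse the flags; an absent second identifier is inferred to be False."""
    reader = iter(bits)
    affine_enabled = bool(next(reader))
    six_param_enabled = bool(next(reader)) if affine_enabled else False
    return affine_enabled, six_param_enabled


affine_off = write_affine_flags(False, True)
affine_on_four_param_only = write_affine_flags(True, False)
```

    With affine motion compensation disabled for the sequence, one bit is written instead of two, and the decoder never needs to look for the 6-parameter flag.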

    Spatial varying transform for video coding

    Publication number: US11606571B2

    Publication date: 2023-03-14

    Application number: US16848555

    Application date: 2020-04-14

    Abstract: A video decoding device receives a bitstream including a prediction block and a residual block with coefficients transformed by a Spatial Varying Transform (SVT). The video decoding device determines a type of SVT employed to transform the coefficients in the residual block and determines a position of the SVT relative to the residual block by determining a candidate position step size and a position index for the SVT. The video decoding device applies an inverse transform to the coefficients based on the SVT type and position to create a reconstructed residual block. The video decoding device applies the reconstructed residual block to the prediction block to reconstruct a video block and reconstructs a video sequence for display, the video sequence including a video frame that includes the reconstructed video block.
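
    How a step size and a position index can locate the transform inside the residual block is sketched below for one dimension. The sketch assumes that the offset is simply the position index multiplied by the candidate position step size and that the transform must fit inside the residual block; the abstract does not give the exact formula, so both the rule and the function names are assumptions.

```python
def svt_offset(position_index, step_size, residual_size, transform_size):
    """Offset of the transform block along one dimension of the residual block."""
    offset = position_index * step_size
    if offset < 0 or offset + transform_size > residual_size:
        raise ValueError("transform block would fall outside the residual block")
    return offset


def candidate_offsets(residual_size, transform_size, step_size):
    """All offsets reachable with the given step size."""
    return list(range(0, residual_size - transform_size + 1, step_size))


# A 16-sample residual dimension, an 8-sample transform, and a step of 4.
offsets = candidate_offsets(16, 8, 4)
chosen = svt_offset(2, 4, 16, 8)
```

    A larger step size leaves fewer candidate positions, so the position index can be coded with fewer bits at the cost of coarser placement.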

    Encoding Method and Apparatus
    Invention application

    Publication number: US20230069387A1

    Publication date: 2023-03-02

    Application number: US17973151

    Application date: 2022-10-25

    Abstract: A video processing method includes: obtaining prediction information of a CU; obtaining, when the CU comprises only one residual TU and a size of the residual TU is less than a size of the CU, a TU partitioning mode of the CU and a residual position of the residual TU, wherein the TU partitioning mode and the residual position are used to determine a horizontal transform type and a vertical transform type; obtaining transform coefficients of the residual TU based on the horizontal transform type and the vertical transform type; and generating a bitstream that is for storing or transmitting and that includes the prediction information, a first flag that indicates the TU partitioning mode, a second flag that indicates the residual position, and the transform coefficients.
