-
Publication No.: US20240314345A1
Publication date: 2024-09-19
Application No.: US18577831
Filing date: 2021-07-15
Applicant: GOOGLE LLC
Inventor: Hui Su , Debargha Mukherjee
IPC: H04N19/513 , H04N19/105 , H04N19/119 , H04N19/139 , H04N19/159 , H04N19/176
CPC classification number: H04N19/513 , H04N19/105 , H04N19/119 , H04N19/139 , H04N19/159 , H04N19/176
Abstract: A method for inter-prediction includes coding a first block of a current frame using a first motion vector (MV) and a reference frame type; storing, in at least one MV buffer, the first MV and the reference frame type; identifying MV candidates for coding a current block using the reference frame type; responsive to a determination that a cardinality of the MV candidates is less than a maximum number of MV candidates, identifying the first MV in the at least one MV buffer, and responsive to a determination that the first MV is not included in the MV candidates, adding the first MV as an MV candidate; and selecting one of the MV candidates for coding the current block.
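The candidate top-up step in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `MVBuffer` class, the `extend_candidates` function, the cap of 4 candidates, the buffer capacity, the reference type labels, and the most-recent-first scan order are all assumptions made for illustration.

```python
from collections import deque

MAX_MV_CANDIDATES = 4  # hypothetical cap; the abstract only says "a maximum number"


class MVBuffer:
    """Hypothetical buffer of (motion vector, reference frame type) pairs."""

    def __init__(self, capacity=8):
        # Oldest entries are dropped first once the capacity is reached.
        self.entries = deque(maxlen=capacity)

    def store(self, mv, ref_type):
        self.entries.append((mv, ref_type))


def extend_candidates(candidates, buffer, ref_type, max_candidates=MAX_MV_CANDIDATES):
    """Top up a candidate list from the buffer, skipping MVs already present."""
    result = list(candidates)
    # Scan the most recently stored MVs first (an assumption, not from the abstract).
    for mv, stored_type in reversed(buffer.entries):
        if len(result) >= max_candidates:
            break
        if stored_type == ref_type and mv not in result:
            result.append(mv)
    return result


buf = MVBuffer()
buf.store((3, -1), "LAST")
buf.store((0, 2), "GOLDEN")
buf.store((5, 5), "LAST")
cands = extend_candidates([(3, -1)], buf, "LAST")
```

Here `cands` keeps the existing candidate and gains only the buffered MV that shares the reference frame type and is not already listed.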
-
Publication No.: US12075089B2
Publication date: 2024-08-27
Application No.: US17793618
Filing date: 2020-05-14
Applicant: GOOGLE LLC
Inventor: James Bankoski , Debargha Mukherjee
IPC: H04N19/593 , H04N19/105 , H04N19/11 , H04N19/159 , H04N19/167 , H04N19/176 , H04N19/182 , H04N19/196 , H04N19/59
CPC classification number: H04N19/593 , H04N19/105 , H04N19/11 , H04N19/159 , H04N19/167 , H04N19/176 , H04N19/182 , H04N19/196 , H04N19/59
Abstract: A method for coding a current block using an intra-prediction mode includes obtaining a focal point, the focal point having coordinates (a, b) in a coordinate system; and generating, using first peripheral pixels and second peripheral pixels, a prediction block for the current block, where the first peripheral pixels form a first peripheral pixel line constituting an x-axis, and where the second peripheral pixels form a second peripheral pixel line constituting a y-axis. Generating the prediction block includes, for each location of the prediction block at a location (i, j) of the prediction block, determining at least one of an x-intercept or a y-intercept; and determining a prediction pixel value for the each location of the prediction block using the at least one of the x-intercept or the y-intercept.
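The intercept computation can be sketched geometrically. The abstract does not say how the intercepts are derived or how pixel values are read from the peripheral lines, so the sketch below assumes that each intercept is where the straight line through the focal point (a, b) and the location (i, j) crosses an axis, and that a peripheral pixel line is sampled by linear interpolation. The function names `intercepts` and `sample` are hypothetical.

```python
def intercepts(a, b, i, j):
    """Return (x_intercept, y_intercept) of the line through (a, b) and (i, j).

    An intercept is None when the line is parallel to that axis.
    """
    x_int = None if b == j else i - j * (a - i) / (b - j)  # crossing of y = 0
    y_int = None if a == i else j - i * (b - j) / (a - i)  # crossing of x = 0
    return x_int, y_int


def sample(line, pos):
    """Linearly interpolate a pixel line at a fractional position, clamped to its ends."""
    pos = min(max(pos, 0.0), len(line) - 1.0)
    lo = int(pos)
    hi = min(lo + 1, len(line) - 1)
    frac = pos - lo
    return (1.0 - frac) * line[lo] + frac * line[hi]


top_line = [10, 20, 30, 40]  # stand-in for the peripheral pixels along the x-axis
x_int, y_int = intercepts(a=6, b=-3, i=2, j=1)
predicted = sample(top_line, x_int)
```

With the focal point (6, -3) and the location (2, 1), the line is x + y = 3, so both intercepts are 3 and the sampled value is the last pixel of `top_line`.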
-
Publication No.: US11870993B2
Publication date: 2024-01-09
Application No.: US17359767
Filing date: 2021-06-28
Applicant: GOOGLE LLC
Inventor: Urvang Joshi , Debargha Mukherjee
IPC: H04N19/122 , H04N19/176 , H04N19/82 , H04N19/18 , H04N19/159 , H04N19/124
CPC classification number: H04N19/122 , H04N19/124 , H04N19/159 , H04N19/176 , H04N19/18 , H04N19/82
Abstract: Improved transforms are used to encode and decode large video and image blocks. During encoding, a prediction residual block having a large size (e.g., larger than 32×32) is generated. The pixel values of the prediction residual block are transformed to produce transform coefficients. After determining that the transform coefficients exceed a threshold cardinality representative of a maximum transform block size (e.g., 32×32), a number of the transform coefficients are discarded such that a remaining number of transform coefficients does not exceed the threshold cardinality. A transform block is then generated using the remaining number. During decoding, after determining that the transform coefficients exceed the threshold cardinality, a number of new coefficients are added to the transform coefficients such that a total number of transform coefficients exceeds the threshold cardinality. The transform coefficients are then inverse transformed into a prediction residual block having a large size.
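The discard-and-restore idea can be sketched in a few lines of Python. The abstract does not say which coefficients are discarded or what values the added coefficients take; this sketch assumes the top-left (low-frequency) 32×32 region is kept and the decoder fills the remaining positions with zeros. The function names are hypothetical.

```python
MAX_TX = 32  # maximum transform block size given as an example in the abstract


def truncate_coefficients(coeffs, max_size=MAX_TX):
    """Encoder side: keep only the top-left max_size x max_size coefficients."""
    return [row[:max_size] for row in coeffs[:max_size]]


def pad_coefficients(coeffs, full_size):
    """Decoder side: rebuild the full grid, filling discarded positions with zeros."""
    padded = [list(row) + [0] * (full_size - len(row)) for row in coeffs]
    padded += [[0] * full_size for _ in range(full_size - len(coeffs))]
    return padded


# Stand-in 64x64 coefficient grid; a real encoder would produce these from a transform.
full = [[r * 64 + c for c in range(64)] for r in range(64)]
kept = truncate_coefficients(full)
restored = pad_coefficients(kept, 64)
```

The restored grid has the original dimensions, so an inverse transform of the large block size can be applied to it.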
-
Publication No.: US20230058845A1
Publication date: 2023-02-23
Application No.: US17793618
Filing date: 2020-05-14
Applicant: GOOGLE LLC
Inventor: James Bankoski , Debargha Mukherjee
IPC: H04N19/593 , H04N19/11 , H04N19/176 , H04N19/182
Abstract: A method for coding a current block using an intra-prediction mode includes obtaining a focal point, the focal point having coordinates (a, b) in a coordinate system; and generating, using first peripheral pixels and second peripheral pixels, a prediction block for the current block, where the first peripheral pixels form a first peripheral pixel line constituting an x-axis, and where the second peripheral pixels form a second peripheral pixel line constituting a y-axis. Generating the prediction block includes, for each location of the prediction block at a location (i, j) of the prediction block, determining at least one of an x-intercept or a y-intercept; and determining a prediction pixel value for the each location of the prediction block using the at least one of the x-intercept or the y-intercept.
-
Publication No.: US11558631B2
Publication date: 2023-01-17
Application No.: US16836078
Filing date: 2020-03-31
Applicant: GOOGLE LLC
Inventor: Urvang Joshi , Debargha Mukherjee , Andrew Simpson
IPC: H04N19/117 , H04N19/176 , H04N19/50 , H04N7/01 , H04N19/184 , H04N19/61
Abstract: A super-resolution coding mode is described. An encoded image can be decoded by decoding, from an encoded bitstream, a flag indicating whether an image was encoded using the super-resolution mode. The image is encoded at a first resolution. Responsive to the flag indicating that the image was encoded using the super-resolution mode, bits indicating an amount of scaling of the image are decoded. The image is decoded from the encoded bitstream to obtain a reconstructed image at the first resolution, and the reconstructed image is upscaled to a second resolution using the amount of scaling to obtain an upscaled reconstructed image. The second resolution is higher than the first resolution. Loop restoration filtering is applied to the upscaled reconstructed image using loop restoration parameters to obtain a loop restored image at the second resolution.
-
Publication No.: US20220094950A1
Publication date: 2022-03-24
Application No.: US17543126
Filing date: 2021-12-06
Applicant: Google LLC
Inventor: Rohit Singh , Debargha Mukherjee , Elliott Karpilovsky , Lester Lu
IPC: H04N19/184 , H04N19/124 , H04N19/61
Abstract: Transform modes are derived for inter-predicted blocks using side information. A prediction residual is generated for a current video block using a reference frame. Side information associated with one or both of the current video block or the reference frame is identified. A trained transform is determined from amongst multiple trained transforms based on the side information, in which each of the trained transforms is determined using individual side information types and combinations of the individual side information types and the side information represents values of one of the individual side information types or one of the combinations of the individual side information types. The prediction residual is transformed according to the trained transform, and data associated with the transformed prediction residual and the side information are encoded to a bitstream.
-
Publication No.: US11284071B2
Publication date: 2022-03-22
Application No.: US16712057
Filing date: 2019-12-12
Applicant: GOOGLE LLC
Inventor: Lester Lu , Debargha Mukherjee , Elliott Karpilovsky
IPC: H04N19/107 , H04N19/122 , H04N19/176 , H04N19/50
Abstract: Coding a block of video data includes determining a prediction mode for the block, which is an inter-prediction or intra-prediction mode, determining a transform type for the block, and coding the block using the prediction mode and the transform type. The transform type is one of a first plurality of transform types when the prediction mode is the inter-prediction mode, and is one of a second plurality of transform types when the prediction mode is the intra-prediction mode. The first plurality of transform types includes first fixed transform types and first mode-dependent transform types that are based on a first learned transform generated using inter-predicted blocks. The second plurality of transform types includes second fixed transform types and second mode-dependent transform types that are based on a second learned transform generated using intra-predicted blocks. The first and second fixed transform types have at least some fixed transform types in common.
-
Publication No.: US10771783B2
Publication date: 2020-09-08
Application No.: US16004929
Filing date: 2018-06-11
Applicant: GOOGLE LLC
Inventor: Urvang Joshi , Debargha Mukherjee
IPC: H04N19/122 , H04N19/176 , H04N19/82 , H04N19/18 , H04N19/159 , H04N19/124
Abstract: Improved transforms are used to encode and decode large video and image blocks. During encoding, a prediction residual block having a large size (e.g., larger than 32×32) is generated. The pixel values of the prediction residual block are transformed to produce transform coefficients. After determining that the transform coefficients exceed a threshold cardinality representative of a maximum transform block size (e.g., 32×32), a number of the transform coefficients are discarded such that a remaining number of transform coefficients does not exceed the threshold cardinality. A transform block is then generated using the remaining number. During decoding, after determining that the transform coefficients exceed the threshold cardinality, a number of new coefficients are added to the transform coefficients such that a total number of transform coefficients exceeds the threshold cardinality. The transform coefficients are then inverse transformed into a prediction residual block having a large size.
-
Publication No.: US10757408B2
Publication date: 2020-08-25
Application No.: US15789400
Filing date: 2017-10-20
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee , Yue Chen , Sarah Parker
IPC: H04N11/02 , H04N19/117 , H04N19/14 , H04N19/82 , H04N19/86 , H04N19/192 , H04N19/174 , H04N19/46 , H04N19/154 , H04N19/176 , H04N19/895
Abstract: Restoring a degraded tile of a degraded frame resulting from reconstruction is disclosed. A method includes, for a scaling factor of at least some scaling factors, recursively filtering the degraded tile using the scaling factor to generate a respective restored tile, and determining a respective error for the respective restored tile with respect to the source tile. The method also includes selecting an optimal scaling factor from the at least some scaling factors and encoding, in an encoded bitstream, a scaling parameter based on the optimal scaling factor. The optimal scaling factor corresponds to a smallest respective error. An apparatus includes a processor and non-transitory memory storing instructions. The instructions cause the processor to determine, from an encoded bitstream, a scaling factor, which determines how strongly edges in the degraded tile affect filtering operations, and recursively filter, resulting in a restored tile, the degraded tile using the scaling factor.
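The encoder-side search over scaling factors can be sketched as follows. The abstract does not give the filter equations, so `recursive_filter` below is an assumed toy form: a single left-to-right pass over one row of pixels in which a larger scaling factor makes jumps between neighbours reduce the smoothing weight more. The function names, the `base` parameter, the squared-error metric, and the candidate factors are all hypothetical.

```python
def recursive_filter(row, scale, base=0.5):
    """One pass of a toy edge-aware recursive filter over a row of pixels (assumed form)."""
    out = [float(row[0])]
    for i in range(1, len(row)):
        # A bigger jump between neighbours, or a bigger scale, shrinks the weight
        # given to the previous output, so edges are smoothed less.
        weight = base ** (1.0 + scale * abs(row[i] - row[i - 1]))
        out.append((1.0 - weight) * row[i] + weight * out[-1])
    return out


def squared_error(a, b):
    """Sum of squared differences between two equal-length rows."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def select_scaling_factor(degraded, source, factors):
    """Filter with each candidate factor and return the one with the smallest error."""
    errors = {s: squared_error(recursive_filter(degraded, s), source) for s in factors}
    return min(errors, key=errors.get), errors


source = [10, 10, 10, 50, 50, 50]    # stand-in source row
degraded = [11, 9, 10, 51, 49, 50]   # stand-in reconstructed row
best, errors = select_scaling_factor(degraded, source, [0.0, 0.1, 1.0])
```

A real codec would filter two-dimensional tiles and encode a parameter derived from `best` in the bitstream; the sketch only shows the try-each-factor, keep-the-smallest-error selection.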
-
Publication No.: US20200186809A1
Publication date: 2020-06-11
Application No.: US16516784
Filing date: 2019-07-19
Applicant: GOOGLE LLC
Inventor: Debargha Mukherjee , Urvang Joshi , Yue Chen , Sarah Parker
IPC: H04N19/147 , G06N3/04 , H04N19/59 , H04N19/184
Abstract: A hybrid apparatus for coding a video stream includes a first encoder. The first encoder includes a neural network having at least one hidden layer, and the neural network receives source data from the video stream at a first hidden layer of the at least one hidden layer, receives side information correlated with the source data at the first hidden layer, and generates guided information using the source data and the side information. The first encoder outputs the guided information and the side information for a decoder to reconstruct the source data.