REFERENCE PICTURE MANAGEMENT AND LIST CONSTRUCTION

    公开(公告)号:US20220166969A1

    公开(公告)日:2022-05-26

    申请号:US17540356

    申请日:2021-12-02

    Applicant: Apple Inc.

    Abstract: Techniques for encoding video with motion compensation include a compressed bitstream syntax that includes a list of all motion prediction reference frames without distinguishing between short-term reference frame and long-term reference frames. The list of reference frames may be provided in a slice header and may apply to encoded data video data within the corresponding slice. The list may be prefaced with a single number indicating the total number of reference frames. In an aspect delta POC reference numbers may be encoded with a flag indicating the sign of the delta POC when the absolute value of the POC is not equal to zero. In another aspect, a flag may be encoded for every reference frame indicating if POC information should be used when scaling prediction references, and a weighting parameter may be included when POC information should be used.

    MULTIDIMENSIONAL QUANTIZATION TECHNIQUES FOR VIDEO CODING/DECODING SYSTEMS

    公开(公告)号:US20220103851A1

    公开(公告)日:2022-03-31

    申请号:US17493607

    申请日:2021-10-04

    Applicant: Apple Inc.

    Abstract: Video compression and decompression techniques are disclosed that provide improved bandwidth control for video compression and decompression systems. In particular, video coding and decoding techniques quantize input video in multiple dimensions. According to these techniques, pixel residuals may be generated from a comparison of an array of input data to an array of prediction data. The pixel residuals may be quantized in a first dimension. After the quantization, the quantized pixel residuals may be transformed to an array of transform coefficients. The transform coefficients may be quantized in a second dimension and entropy coded. Decoding techniques invert these processes. In still other embodiments, multiple quantizers may be provided upstream of the transform stage, either in parallel or in cascade, which provide greater flexibility to video coders to quantize data in different dimensions in an effort to balance the competing interest in compression efficiency and quality of reconstructed video.

    Hierarchical point cloud compression

    公开(公告)号:US11252441B2

    公开(公告)日:2022-02-15

    申请号:US17163204

    申请日:2021-01-29

    Applicant: Apple Inc.

    Abstract: A system comprises an encoder configured to compress attribute information for a point cloud and/or a decoder configured to decompress compressed attribute information for the point cloud. To compress the attribute information, multiple levels of detail are generated based on an ordering of the points according to a space filling curve and attribute values are predicted. The attribute values may be predicted simultaneously while points are being assigned to different levels of detail. A decoder follows a similar prediction process based on level of details. Also, attribute correction values may be determined to correct predicted attribute values and may be used by a decoder to decompress a point cloud compressed using level of detail attribute compression. In some embodiments, attribute correction values may take into account an influence factor of respective points in a given level of detail on attributes in other levels of detail.

    Point Cloud Compression with Adaptive Filtering

    公开(公告)号:US20210256735A1

    公开(公告)日:2021-08-19

    申请号:US17306863

    申请日:2021-05-03

    Applicant: Apple Inc.

    Abstract: A system comprises an encoder configured to compress attribute information and/or spatial for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image based representation of a point cloud. A processing/filtering element utilizes occupancy map information and/or auxiliary patch information to determine relationships between patches in image frames and adjusts encoding/decoding and/or filtering or pre/post-processing parameters based on the determined relationships.

    Geometry information signaling for occluded points in an occupancy map video

    公开(公告)号:US20210211703A1

    公开(公告)日:2021-07-08

    申请号:US17143882

    申请日:2021-01-07

    Applicant: Apple Inc.

    Abstract: In an example method, points that represent three-dimensional visual volumetric content are received, and patches are determined, where each patch corresponds to a respective portion of the visual volumetric content. A patch image representing a set of points corresponding to the patch projected onto a respective patch plane is generated for each patch. The patch images are packed into image frames, and the image frames are encoded. An occupancy map corresponding to the image frames is generated. The occupancy map indicates, for each image frame: locations of the patch images in the image frame, and depth information of sets of points corresponding to the patch images in the image frame. The depth information indicates, for each patch image, depths of the set of points corresponding to the patch image in a direction perpendicular to a patch plane of the patch image.

    MACHINE LEARNING VIDEO PROCESSING SYSTEMS AND METHODS

    公开(公告)号:US20210211683A1

    公开(公告)日:2021-07-08

    申请号:US17210478

    申请日:2021-03-23

    Applicant: Apple Inc.

    Abstract: System and method for improving video encoding and/or video decoding. In embodiments, a video encoding pipeline includes a main encoding pipeline that compresses source image data corresponding with an image frame by processing the source image data based at least in part on encoding parameters to generate encoded image data. Additionally the video encoding pipeline includes a machine learning block communicatively coupled to the main encoding pipeline, in which the machine learning block analyzes content of the image frame by processing the source image data based at least in part on machine learning parameters implemented in the machine learning block when the machine learning block is enabled by the encoding parameters; and the video encoding pipeline adaptively adjusts the encoding parameters based at least in part on the content expected to be present in the image frame to facilitate improving encoding efficiency.

    Adaptive distance based point cloud compression

    公开(公告)号:US11010928B2

    公开(公告)日:2021-05-18

    申请号:US16380920

    申请日:2019-04-10

    Applicant: Apple Inc.

    Abstract: A system comprises an encoder configured to compress attribute information for a point cloud and/or a decoder configured to decompress compressed attribute for the point cloud. To compress the attribute information, attribute values are predicted using one of a plurality of prediction strategies, wherein a selected prediction strategy is selected based at least in part on attribute variability of points in a neighborhood of points. A decoder follows a similar prediction process. Also, attribute correction values may be determined to correct predicted attribute values and may be used by a decoder to decompress a point cloud, wherein the decoder applies the same prediction strategy applied at the encoder.

    Three-Dimensional Mesh Compression Using a Video Encoder

    公开(公告)号:US20210090301A1

    公开(公告)日:2021-03-25

    申请号:US17030264

    申请日:2020-09-23

    Applicant: Apple Inc.

    Abstract: A system comprises an encoder configured to compress and encode data for a three-dimensional mesh using a video encoding technique. To compress the three-dimensional mesh, the encoder determines sub-meshes and for each sub-mesh: texture patches and geometry patches. Also the encoder determines patch connectivity information and patch texture coordinates for the texture patches and geometry patches. The texture patches and geometry patches are packed into video image frames and encoded using a video codec. Additionally, the encoder determines boundary stitching information for the sub-meshes. A decoder receives a bit stream as generated by the encoder and reconstructs the three-dimensional mesh.

Patent Agency Ranking