SPHERE PROJECTED MOTION ESTIMATION/COMPENSATION AND MODE DECISION

    公开(公告)号:WO2018118159A1

    公开(公告)日:2018-06-28

    申请号:PCT/US2017/051542

    申请日:2017-09-14

    Applicant: APPLE INC.

    CPC classification number: H04N19/597 H04N19/105 H04N19/176 H04N19/547

    Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.

    PROCESSING OF EQUIRECTANGULAR OBJECT DATA TO COMPENSATE FOR DISTORTION BY SPHERICAL PROJECTIONS

    公开(公告)号:WO2018151978A1

    公开(公告)日:2018-08-23

    申请号:PCT/US2018/017124

    申请日:2018-02-06

    Applicant: APPLE INC.

    Abstract: Multi-directional image data often contains distortions of image content that cause problems when processed by video coders that are designed to process traditional, "flat" image content. Embodiments of the present disclosure provide techniques for coding multi-directional image data using such coders. For each pixel block in a frame to be coded, an encoder may transform reference picture data within a search window about a location of the input pixel block based on displacement respectively between the location of the input pixel block and portions of the reference picture within the search window. The encoder may perform a prediction search among the transformed reference picture data to identify a match between the input pixel block and a portion of the transformed reference picture and, when a match is identified, the encoder may code the input pixel block differentially with respect to the matching portion of the transformed reference picture. The transform may counter-act distortions imposed on image content of the reference picture data by the multi-directional format, which aligns the content with image content of the input picture. The techniques apply both for intra-coding and inter-coding.

    SCALABILITY OF MULTI-DIRECTIONAL VIDEO STREAMING

    公开(公告)号:WO2020055655A1

    公开(公告)日:2020-03-19

    申请号:PCT/US2019/049678

    申请日:2019-09-05

    Applicant: APPLE INC.

    Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. First streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.

    VIDEO CODING TECHNIQUES FOR MULTI-VIEW VIDEO

    公开(公告)号:WO2018156403A1

    公开(公告)日:2018-08-30

    申请号:PCT/US2018/018246

    申请日:2018-02-14

    Applicant: APPLE INC.

    Abstract: Techniques are disclosed for coding and decoding video captured as cube map images. According to these techniques, padded reference images are generated for use during predicting input data, A reference image is stored in a cube map format, A padded reference image is generated from the reference image in which image data of a first view contained in reference image is replicated and placed adjacent to a second view contained in the cube map image. When coding a pixel block of an input image, a prediction search may be performed between the input pixel block and content of the padded reference image. When the prediction search identifies a match, the pixel block may be coded with respect to matching data from the padded reference image. Presence of replicated data in the padded reference image is expected to increase the likelihood that adequate prediction matches will be identified for input pixel block data, which will increase overall efficiency of the video coding.

Patent Agency Ranking