Sphere projected motion estimation/compensation and mode decision

    公开(公告)号:US11818394B2

    公开(公告)日:2023-11-14

    申请号:US17207267

    申请日:2021-03-19

    Applicant: Apple Inc.

    CPC classification number: H04N19/597 H04N19/105 H04N19/176 H04N19/547

    Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.

    Immersive Video Streaming Using View-Adaptive Prefetching and Buffer Control

    公开(公告)号:US20220377304A1

    公开(公告)日:2022-11-24

    申请号:US17326204

    申请日:2021-05-20

    Applicant: Apple Inc.

    Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.

    Smoothed directional and DC intra prediction

    公开(公告)号:US11388419B2

    公开(公告)日:2022-07-12

    申请号:US16834693

    申请日:2020-03-30

    Applicant: Apple Inc.

    Abstract: Improved video coding techniques include deriving prediction pixel blocks for use in intra-coding video and combined inter- and intra-coding video. In a first aspect, the techniques may include deriving value(s) for pixel location(s) of the prediction pixel block by, when a prediction direction vector assigned to the prediction vector points to quadrants I or III of a Cartesian plane, deriving the pixel location's value from pixel values in two regions of previously-decoded pixel data intercepted by extending the prediction direction vector in two opposite directions through the pixel location. When the prediction direction vector points toward quadrants II of the Cartesian plane, deriving the pixel location's value from pixel values in one region intercepted by the prediction direction vector through the pixel location, and from a second region intercepted by a vector that is orthogonal to the prediction direction vector.

    SCALABILITY OF MULTI-DIRECTIONAL VIDEO STREAMING

    公开(公告)号:US20210227236A1

    公开(公告)日:2021-07-22

    申请号:US17221299

    申请日:2021-04-02

    Applicant: Apple Inc.

    Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.

Patent Agency Ranking