Object tracking in multi-view video

    公开(公告)号:US11093752B2

    公开(公告)日:2021-08-17

    申请号:US15613130

    申请日:2017-06-02

    Applicant: Apple Inc.

    Abstract: Techniques are disclosed for managing display of content from multi-view video data. According to these techniques, an object may be identified from content of the multi-view video. The object's location may be tracked across a sequence of multi-view video. The technique may extract a sub-set of video that is contained within a view window that is shifted in an image space of the multi-view video in correspondence to the tracked object's location. These techniques may be implemented either in an image source device or an image sink device.

    Scalability of multi-directional video streaming

    公开(公告)号:US10999583B2

    公开(公告)日:2021-05-04

    申请号:US16132219

    申请日:2018-09-14

    Applicant: Apple Inc.

    Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.

    Sphere Projected Motion Estimation/Compensation and Mode Decision

    公开(公告)号:US20180184121A1

    公开(公告)日:2018-06-28

    申请号:US15390202

    申请日:2016-12-23

    Applicant: Apple Inc.

    CPC classification number: H04N19/597 H04N19/105 H04N19/176 H04N19/547

    Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.

Patent Agency Ranking