Object tracking in multi-view video

    公开(公告)号:US11093752B2

    公开(公告)日:2021-08-17

    申请号:US15613130

    申请日:2017-06-02

    Applicant: Apple Inc.

    Abstract: Techniques are disclosed for managing display of content from multi-view video data. According to these techniques, an object may be identified from content of the multi-view video. The object's location may be tracked across a sequence of multi-view video. The technique may extract a sub-set of video that is contained within a view window that is shifted in an image space of the multi-view video in correspondence to the tracked object's location. These techniques may be implemented either in an image source device or an image sink device.

    Scalability of multi-directional video streaming

    公开(公告)号:US10999583B2

    公开(公告)日:2021-05-04

    申请号:US16132219

    申请日:2018-09-14

    Applicant: Apple Inc.

    Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.

    Sphere Projected Motion Estimation/Compensation and Mode Decision

    公开(公告)号:US20180184121A1

    公开(公告)日:2018-06-28

    申请号:US15390202

    申请日:2016-12-23

    Applicant: Apple Inc.

    CPC classification number: H04N19/597 H04N19/105 H04N19/176 H04N19/547

    Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.

    Dynamic Video Configurations
    19.
    发明申请

    公开(公告)号:US20170359590A1

    公开(公告)日:2017-12-14

    申请号:US15585581

    申请日:2017-05-03

    Applicant: Apple Inc.

    Abstract: Techniques are disclosed for managing memory allocations when coding video data according to multiple codec configurations. According to these techniques, devices may negotiate parameters of a coding session that include parameters of a plurality of different codec configurations that may be used during the coding session. A device may estimate sizes of decoded picture buffers for each of the negotiated codec configurations and allocate in its memory a portion of memory sized according to a largest size of the estimated decoded picture buffers. Thereafter, the devices may exchange coded video data. The exchange may involve decoding coded data of reference pictures and storing the decoded reference pictures in the allocated memory. During the coding session, the devices may toggle among the different negotiated codec configurations. As they do, reallocations of memory may be avoided.

    SELECTIVE PACKET AND DATA DROPPING TO REDUCE DELAY IN REAL-TIME VIDEO COMMUNICATION
    20.
    发明申请
    SELECTIVE PACKET AND DATA DROPPING TO REDUCE DELAY IN REAL-TIME VIDEO COMMUNICATION 审中-公开
    选择性分组和数据丢弃,以减少实时视频通信中的延迟

    公开(公告)号:US20160360220A1

    公开(公告)日:2016-12-08

    申请号:US14730830

    申请日:2015-06-04

    Applicant: Apple Inc.

    Abstract: Techniques are described for responding to changes in bandwidth that are available to transmit coded video data between an encoder and a decoder. When such changes in bandwidth occur, estimates may be derived of visual significance of coded video data that has not yet been transmitted and also video data that is next to be coded. These estimates may be compared to each other. When the estimated visual significance of the coded video data that has not yet been transmitted is greater than the estimated visual significance of the video data that is next to be coded, transmission of the coded video data that has not yet been transmitted may be prioritized over coding of the video data that is next to be coded. When the estimated visual significance of the video data that is next to be coded is greater than the estimated visual significance of the coded video data that has not yet been transmitted, coding of the video data that is next to be coded may be prioritized over transmission of the coded video data that has not yet been transmitted. Resources may be allocated to the prioritized coder operation.

    Abstract translation: 描述了用于响应可用于在编码器和解码器之间传输编码视频数据的带宽变化的技术。 当这种带宽变化发生时,可能导出尚未被发送的编码视频数据的视觉重要性的估计,以及接下来被编码的视频数据。 这些估计可以相互比较。 当尚未被发送的编码视频数据的估计视觉含义大于接下来要被编码的视频数据的估计视觉有效性时,还没有发送的编码视频数据的传输可以优先于 对接下来被编码的视频数据进行编码。 当接下来被编码的视频数据的估计视觉含义大于尚未发送的编码视频数据的估计视觉有效性时,下一个被编码的视频数据的编码可以通过传输优先化 的尚未被发送的编码视频数据。 可以将资源分配给优先编码器操作。

Patent Agency Ranking