-
Publication No.: WO2018118159A1
Publication date: 2018-06-28
Application No.: PCT/US2017/051542
Filing date: 2017-09-14
Applicant: APPLE INC.
Inventor: KIM, Jae Hoon , ZHOU, Xiaosong , ZHANG, Dazhong , YUAN, Hang , ZHAI, Jiefu , CHUNG, Chris Y. , WU, Hsi-Jung
IPC: H04N19/597 , H04N19/547
CPC classification number: H04N19/597 , H04N19/105 , H04N19/176 , H04N19/547
Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projections of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block and a matching portion of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.
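The last step of the abstract above, turning a spherical-domain offset into a source-domain motion vector, can be illustrated with a minimal sketch. It is not taken from the patent: it assumes the source domain is an equirectangular picture, treats the spherical offset as a simple change in longitude and latitude, and ignores offsets that cross a pole. All function names are hypothetical.

```python
import math

def pixel_to_sphere(x, y, width, height):
    """Map an equirectangular pixel to (longitude, latitude) in radians."""
    lon = (x / width) * 2.0 * math.pi - math.pi
    lat = math.pi / 2.0 - (y / height) * math.pi
    return lon, lat

def sphere_to_pixel(lon, lat, width, height):
    """Inverse mapping; x wraps around the horizontal seam."""
    x = (lon + math.pi) / (2.0 * math.pi) * width
    y = (math.pi / 2.0 - lat) / math.pi * height
    return x % width, y

def spherical_offset_to_motion_vector(x, y, dlon, dlat, width, height):
    """Convert an angular offset (assumed form) to a pixel motion vector."""
    lon, lat = pixel_to_sphere(x, y, width, height)
    ref_x, ref_y = sphere_to_pixel(lon + dlon, lat + dlat, width, height)
    # Take the shortest horizontal displacement across the wrap-around seam.
    dx = (ref_x - x + width / 2.0) % width - width / 2.0
    dy = ref_y - y
    return dx, dy

# A 10-degree longitude offset at the picture center of a 360x180 frame
# corresponds to a horizontal displacement of about 10 pixels.
mv = spherical_offset_to_motion_vector(180, 90, math.radians(10), 0.0, 360, 180)
```

The seam handling matters because a block near the right edge of an equirectangular picture may match content that appears at the left edge of the reference picture.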
-
Publication No.: WO2020112321A2
Publication date: 2020-06-04
Application No.: PCT/US2019/060238
Filing date: 2019-11-07
Applicant: APPLE INC.
Inventor: YANG, Xiaohua , TOURAPIS, Alexandros , ZHANG, Dazhong , YUAN, Hang , WU, Hsi-Jung , KIM, Jae Hoon , ZHAI, Jiefu , CHEN, Ming , ZHOU, Xiaosong
IPC: H04N21/81 , H04N21/4728 , H04N21/6587 , H04N21/63 , H04N21/2343 , H04N21/84 , H04N21/845 , H04N21/442 , H04N19/00
Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile(s), in which the viewport is not estimated to be located, may be requested at a second service tier, lower than the first service tier.
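The tier selection described in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the patent's method: the viewport is reduced to a yaw angle, its future position is estimated by linear extrapolation of the last two samples, and the picture is split into equal-width yaw tiles. The function names and the tier labels "high" and "low" are hypothetical.

```python
import math

def predict_yaw(yaw_samples, times, future_time):
    """Linearly extrapolate yaw (degrees) from the last two samples."""
    t0, t1 = times[-2], times[-1]
    y0, y1 = yaw_samples[-2], yaw_samples[-1]
    # Use the shortest arc so a 359 -> 1 degree move counts as +2 degrees.
    delta = (y1 - y0 + 180.0) % 360.0 - 180.0
    rate = delta / (t1 - t0)
    return (y1 + rate * (future_time - t1)) % 360.0

def tiles_for_viewport(center_yaw, fov, num_tiles):
    """Indices of equal-width yaw tiles overlapped by the viewport."""
    tile_width = 360.0 / num_tiles
    first = math.floor((center_yaw - fov / 2.0) / tile_width)
    last = math.floor((center_yaw + fov / 2.0) / tile_width)
    return {i % num_tiles for i in range(first, last + 1)}

def assign_tiers(predicted_tiles, num_tiles, high="high", low="low"):
    """Request predicted tiles at the higher tier and all others at the lower."""
    return {i: (high if i in predicted_tiles else low) for i in range(num_tiles)}

yaw = predict_yaw([10.0, 20.0], [0.0, 1.0], future_time=2.0)
tiers = assign_tiers(tiles_for_viewport(yaw, fov=90.0, num_tiles=8), num_tiles=8)
```

With these inputs the estimated yaw is 30 degrees, so the 90-degree viewport spans tiles 7, 0 and 1, which are requested at the higher tier while the remaining five tiles are requested at the lower tier.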
-
Publication No.: WO2018151978A1
Publication date: 2018-08-23
Application No.: PCT/US2018/017124
Filing date: 2018-02-06
Applicant: APPLE INC.
Inventor: KIM, Jae Hoon , CHUNG, Chris Y. , ZHANG, Dazhong , YUAN, Hang , WU, Hsi-Jung , ZHAI, Jiefu , ZHOU, Xiaosong
IPC: H04N19/597 , H04N19/547 , H04N19/105 , H04N19/139 , H04N19/176 , H04N19/51
Abstract: Multi-directional image data often contains distortions of image content that cause problems when processed by video coders that are designed to process traditional, "flat" image content. Embodiments of the present disclosure provide techniques for coding multi-directional image data using such coders. For each pixel block in a frame to be coded, an encoder may transform reference picture data within a search window about a location of the input pixel block, based on the respective displacements between the location of the input pixel block and portions of the reference picture within the search window. The encoder may perform a prediction search among the transformed reference picture data to identify a match between the input pixel block and a portion of the transformed reference picture and, when a match is identified, the encoder may code the input pixel block differentially with respect to the matching portion of the transformed reference picture. The transform may counteract distortions imposed on image content of the reference picture data by the multi-directional format, which aligns the content with image content of the input picture. The techniques apply to both intra-coding and inter-coding.
-
Publication No.: WO2020055655A1
Publication date: 2020-03-19
Application No.: PCT/US2019/049678
Filing date: 2019-09-05
Applicant: APPLE INC.
Inventor: TOURAPIS, Alexandros , ZHANG, Dazhong , YUAN, Hang , WU, Hsi-Jung , KIM, Jae Hoon , ZHAI, Jiefu , CHEN, Ming , ZHOU, Xiaosong
IPC: H04N19/167 , H04N19/187 , H04N21/218
Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. First streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponds to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, the techniques include retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.
-
Publication No.: WO2018156403A1
Publication date: 2018-08-30
Application No.: PCT/US2018/018246
Filing date: 2018-02-14
Applicant: APPLE INC.
Inventor: KIM, Jae Hoon , CHUNG, Chris Y. , ZHANG, Dazhong , YUAN, Hang , WU, Hsi-Jung , ZHAI, Jiefu , ZHOU, Xiaosong
IPC: H04N19/597
Abstract: Techniques are disclosed for coding and decoding video captured as cube map images. According to these techniques, padded reference images are generated for use during prediction of input data. A reference image is stored in a cube map format. A padded reference image is generated from the reference image, in which image data of a first view contained in the reference image is replicated and placed adjacent to a second view contained in the cube map image. When coding a pixel block of an input image, a prediction search may be performed between the input pixel block and content of the padded reference image. When the prediction search identifies a match, the pixel block may be coded with respect to matching data from the padded reference image. Presence of replicated data in the padded reference image is expected to increase the likelihood that adequate prediction matches will be identified for input pixel block data, which will increase overall efficiency of the video coding.
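The padding idea in the abstract above can be illustrated with a deliberately simplified sketch. It assumes each cube map view is a list of pixel rows and that the neighboring view is already oriented so its leftmost columns continue the content at the right edge of the view being padded; a real coder would also have to handle face rotation and projection geometry. The function name `pad_face_right` is hypothetical.

```python
def pad_face_right(face, neighbor, pad):
    """Append the first `pad` columns of `neighbor` to each row of `face`."""
    if len(face) != len(neighbor):
        raise ValueError("faces must have the same number of rows")
    # Replicate border data from the neighboring view next to this view.
    return [row + n_row[:pad] for row, n_row in zip(face, neighbor)]

front = [[1, 2],
         [3, 4]]
right = [[5, 6],
         [7, 8]]
padded_front = pad_face_right(front, right, pad=1)
# padded_front is [[1, 2, 5], [3, 4, 7]]
```

A prediction search over `padded_front` can then match blocks that straddle the boundary between the two views, which is the benefit the abstract attributes to the replicated data.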
-