-
Publication No.: US11025933B2
Publication date: 2021-06-01
Application No.: US15585581
Filing date: 2017-05-03
Applicant: Apple Inc.
Inventor: Dazhong Zhang , Hang Yuan , Peikang Song , Jae Hoon Kim , Xing Wen , Sudeng Hu , Xiaosong Zhou , Chris Y. Chung , Hsi-Jung Wu
IPC: H04N19/423 , H04N19/463 , H04N21/44 , H04N21/6379 , H04N21/6373
Abstract: Techniques are disclosed for managing memory allocations when coding video data according to multiple codec configurations. According to these techniques, devices may negotiate parameters of a coding session that include parameters of a plurality of different codec configurations that may be used during the coding session. A device may estimate sizes of decoded picture buffers for each of the negotiated codec configurations and allocate in its memory a portion of memory sized according to a largest size of the estimated decoded picture buffers. Thereafter, the devices may exchange coded video data. The exchange may involve decoding coded data of reference pictures and storing the decoded reference pictures in the allocated memory. During the coding session, the devices may toggle among the different negotiated codec configurations. As they do, reallocations of memory may be avoided.
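The allocation strategy in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the `CodecConfig` fields, the size formula (frame size times reference frame count), and the example configurations are all assumptions made for the sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CodecConfig:
    """Hypothetical description of one negotiated codec configuration."""
    name: str
    width: int              # luma width in pixels
    height: int             # luma height in pixels
    bytes_per_pixel: float  # assumed 1.5 for 8-bit 4:2:0 content
    max_ref_frames: int     # reference pictures the decoder must hold


def estimate_dpb_bytes(cfg: CodecConfig) -> int:
    """Assumed estimate: size of one decoded frame times the reference count."""
    frame_bytes = int(cfg.width * cfg.height * cfg.bytes_per_pixel)
    return frame_bytes * cfg.max_ref_frames


def allocate_shared_dpb(configs) -> bytearray:
    """Allocate one buffer sized for the most demanding negotiated configuration."""
    if not configs:
        raise ValueError("at least one codec configuration is required")
    return bytearray(max(estimate_dpb_bytes(c) for c in configs))


# Two illustrative configurations negotiated for the same session.
configs = [
    CodecConfig("h264_720p", 1280, 720, 1.5, 4),
    CodecConfig("hevc_1080p", 1920, 1080, 1.5, 3),
]
dpb = allocate_shared_dpb(configs)
```

Because the buffer is sized for the largest estimate, switching between the two configurations mid-session would not require a new allocation in this sketch.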
-
Publication No.: US10924747B2
Publication date: 2021-02-16
Application No.: US15443342
Filing date: 2017-02-27
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Chris Y. Chung , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jiefu Zhai , Xiaosong Zhou
IPC: H04B1/66 , H04N19/176 , H04N19/105 , H04N19/112 , H04N19/174 , H04N19/593 , H04N19/597
Abstract: Techniques are disclosed for coding and decoding video captured as cube map images. According to these techniques, padded reference images are generated for use in predicting input data. A reference image is stored in a cube map format. A padded reference image is generated from the reference image in which image data of a first view contained in the reference image is replicated and placed adjacent to a second view contained in the cube map image. When coding a pixel block of an input image, a prediction search may be performed between the input pixel block and content of the padded reference image. When the prediction search identifies a match, the pixel block may be coded with respect to matching data from the padded reference image. Presence of replicated data in the padded reference image is expected to increase the likelihood that adequate prediction matches will be identified for input pixel block data, which will increase overall efficiency of the video coding.
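A toy sketch of the padding idea, under strong simplifying assumptions: faces are small 2D lists, the neighboring face is assumed to be already oriented so that its left edge adjoins the right edge of the face being padded (real cube maps generally need per-edge rotations), and the search is a plain exhaustive sum-of-absolute-differences (SAD) search. The function names `pad_face_right` and `best_match` are hypothetical.

```python
def pad_face_right(face, neighbor, pad):
    """Copy `face` and append `pad` columns replicated from `neighbor`'s left edge."""
    if len(face) != len(neighbor):
        raise ValueError("faces must have the same height")
    if not 0 <= pad <= len(neighbor[0]):
        raise ValueError("pad must fit within the neighbor face")
    return [row + nrow[:pad] for row, nrow in zip(face, neighbor)]


def best_match(ref, block):
    """Exhaustive SAD search; returns (sad, x, y) of the best position in `ref`."""
    bh, bw = len(block), len(block[0])
    best = None
    for y in range(len(ref) - bh + 1):
        for x in range(len(ref[0]) - bw + 1):
            sad = sum(
                abs(ref[y + j][x + i] - block[j][i])
                for j in range(bh)
                for i in range(bw)
            )
            if best is None or sad < best[0]:
                best = (sad, x, y)
    return best


front = [[1, 2], [3, 4]]   # face being padded (second view)
right = [[5, 6], [7, 8]]   # adjacent face (first view), assumed pre-oriented
padded = pad_face_right(front, right, 1)

# A block that straddles the face boundary in the input image.
block = [[2, 5], [4, 7]]
unpadded_result = best_match(front, block)
padded_result = best_match(padded, block)
```

In this toy case the block straddling the boundary has no exact match in the unpadded face, while the padded reference yields a zero-SAD match at column 1.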
-
Publication No.: US10812832B2
Publication date: 2020-10-20
Application No.: US14732393
Filing date: 2015-06-05
Applicant: Apple Inc.
Inventor: Hang Yuan , Chris Y. Chung , Jae Hoon Kim , Yeping Su , Jiefu Zhai , Xiaosong Zhou , Hsi-Jung Wu
IPC: H04N19/85 , H04N19/33 , H04N19/88 , H04N19/503
Abstract: Coding techniques for image data may cause a still image to be converted to a “phantom” video sequence, which is coded by motion compensated prediction techniques. Thus, coded video data obtained from the coding operation may include temporal prediction references between frames of the video sequence. Metadata may be generated that identifies allocations of content from the still image to the frames of the video sequence. The coded data and the metadata may be transmitted to another device, whereupon the coded data may be decoded by motion compensated prediction techniques and converted back to still image data. Other techniques may involve coding an image in both a base layer representation and at least one coded enhancement layer representation. The enhancement layer representation may be coded predictively with reference to the base layer representation. The coded base layer representation may be partitioned into a plurality of individually-transmittable segments and stored. Prediction references of elements of the enhancement layer representation may be confined to segments of the base layer representation that correspond to a location of those elements. That is, when a pixel block of an enhancement layer maps to a given segment of the base layer representation, prediction references are confined to that segment and do not reference portions of the base layer representation that may be found in other segment(s).
-
Publication No.: US10652578B2
Publication date: 2020-05-12
Application No.: US15888559
Filing date: 2018-02-05
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Ming Chen , Xiaosong Zhou , Chris Y. Chung , Hsi-Jung Wu
IPC: H04N19/597 , H04N19/70 , H04N19/105 , H04N19/139 , H04N19/162 , H04N19/167 , H04N19/176 , H04N19/52
Abstract: Image processing techniques may accelerate coding of viewport data contained within multi-view image data. According to such techniques, an encoder may shift content of multi-directional image data according to viewport location data provided by a decoder. The encoder may code the shifted multi-directional image data by predictive coding, and transmit to the decoder the coded multi-directional image data and data identifying an amount of the shift. Doing so may move the viewport location to positions in the image data that are coded earlier than the positions that the viewport location naturally occupies and, thereby, may accelerate coding. On decode, a decoder may compare its present viewport location with viewport location data provided by the encoder with coded video data. The decoder may decode the coded video data and extract a portion of the decoded video data corresponding to a present viewport location for display.
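The shift described in this abstract can be illustrated with a circular shift that moves the viewport origin to the top-left corner, which a raster-order coder would reach first. This is only a sketch: the wrap-around in both dimensions, the function name `shift_image`, and the tiny integer image are assumptions for illustration, and no actual coding is performed.

```python
def shift_image(img, dx, dy):
    """Circularly shift `img` so the sample at column dx, row dy lands at (0, 0)."""
    h, w = len(img), len(img[0])
    return [
        [img[(y + dy) % h][(x + dx) % w] for x in range(w)]
        for y in range(h)
    ]


# A 3x4 stand-in for a multi-directional image.
img = [
    [0, 1, 2, 3],
    [4, 5, 6, 7],
    [8, 9, 10, 11],
]

# Assume the decoder reported a viewport whose origin is at column 2, row 1.
dx, dy = 2, 1
shifted = shift_image(img, dx, dy)        # encoder side: viewport now comes first

# The shift amount travels with the coded data; the decoder undoes it.
restored = shift_image(shifted, -dx, -dy)
```

Applying the negated shift on the decoder side recovers the original layout, which is why the abstract has the encoder transmit the shift amount alongside the coded data.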
-
Publication No.: US10523913B2
Publication date: 2019-12-31
Application No.: US15638709
Filing date: 2017-06-30
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Ming Chen , Xiaosong Zhou , Hsi-Jung Wu , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung
IPC: H04N13/161 , H04N19/597 , H04N19/70 , H04N5/232 , H04N13/243 , H04N13/139
Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
-
Publication No.: US20190007669A1
Publication date: 2019-01-03
Application No.: US15638709
Filing date: 2017-06-30
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Ming Chen , Xiaosong Zhou , Hsi-Jung Wu , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung
IPC: H04N13/00 , H04N19/597 , H04N13/02 , H04N19/70 , H04N5/232
CPC classification number: H04N13/161 , H04N5/23238 , H04N13/139 , H04N13/243 , H04N19/597 , H04N19/70
Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
-
Publication No.: US20180249164A1
Publication date: 2018-08-30
Application No.: US15443342
Filing date: 2017-02-27
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Chris Y. Chung , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jiefu Zhai , Xiaosong Zhou
IPC: H04N19/176 , H04N19/105 , H04N19/112 , H04N19/174 , H04N19/593
CPC classification number: H04N19/176 , H04N19/105 , H04N19/112 , H04N19/174 , H04N19/593 , H04N19/597
Abstract: Techniques are disclosed for coding and decoding video captured as cube map images. According to these techniques, padded reference images are generated for use in predicting input data. A reference image is stored in a cube map format. A padded reference image is generated from the reference image in which image data of a first view contained in the reference image is replicated and placed adjacent to a second view contained in the cube map image. When coding a pixel block of an input image, a prediction search may be performed between the input pixel block and content of the padded reference image. When the prediction search identifies a match, the pixel block may be coded with respect to matching data from the padded reference image. Presence of replicated data in the padded reference image is expected to increase the likelihood that adequate prediction matches will be identified for input pixel block data, which will increase overall efficiency of the video coding.
-
Publication No.: US11282543B2
Publication date: 2022-03-22
Application No.: US15917441
Filing date: 2018-03-09
Applicant: Apple Inc.
Inventor: Hang Yuan , Jiefu Zhai , Ming Chen , Jae Hoon Kim , Dazhong Zhang , Xiaosong Zhou , Chris Y. Chung , Hsi-Jung Wu
IPC: G11B27/031 , G06T5/00 , G06T7/73 , G06K9/00 , G06T19/20
Abstract: Techniques are presented for modifying images of an object in video, for example to correct for lens distortion, or to beautify a face. These techniques include extracting and validating features of an object from a source video frame, tracking those features over time, estimating a pose of the object, modifying a 3D model of the object based on the features, and rendering a modified video frame based on the modified 3D model and modified intrinsic and extrinsic matrices. These techniques may be applied in real-time to an object in a sequence of video frames.
-
Publication No.: US20210321133A1
Publication date: 2021-10-14
Application No.: US17207267
Filing date: 2021-03-19
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Xiaosong Zhou , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung , Hsi-Jung Wu
IPC: H04N19/597 , H04N19/176 , H04N19/105 , H04N19/547
Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projections of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block and a matching portion of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.
-
Publication No.: US11109042B2
Publication date: 2021-08-31
Application No.: US16420740
Filing date: 2019-05-23
Applicant: Apple Inc.
Inventor: Sudeng Hu , Xing Wen , Jae Hoon Kim , Peikang Song , Hang Yuan , Dazhong Zhang , Xiaosong Zhou , Hsi-Jung Wu , Christopher Garrido , Ming Jin , Patrick Miauton , Karthick Santhanam
IPC: H04N19/172 , H04N19/182 , H04N19/124 , H04N19/176 , H04N19/52 , H04N19/159
Abstract: Systems and methods for coding a video to be overlaid by annotations are devised. A motion compensated predictive coding is employed, wherein coding parameters of video pixel blocks are determined based on the pixel blocks' relation to the annotations. A decoder decodes the video and annotates it based on metadata, obtained from the coder or other sources, describing the annotations' appearance and rendering mode.