-
公开(公告)号:US20190279681A1
公开(公告)日:2019-09-12
申请号:US15917441
申请日:2018-03-09
Applicant: Apple Inc.
Inventor: Hang Yuan , Jiefu Zhai , Ming Chen , Jae Hoon Kim , Dazhong Zhang , Xiaosong Zhou , Chris Y. Chung , Hsi-Jung Wu
IPC: G11B27/031 , G06T5/00 , G06T7/73 , G06T19/20 , G06K9/00
Abstract: Techniques are presented for modifying images of an object in video, for example to correct for lens distortion, or to beautify a face. These techniques include extracting and validating features of an object from a source video frame, tracking those features over time, estimating a pose of the object, modifying a 3D model of the object based on the features, and rendering a modified video frame based on the modified 3D model and modified intrinsic and extrinsic matrices. These techniques may be applied in real-time to an object in a sequence of video frames.
-
12.
公开(公告)号:US20190246141A1
公开(公告)日:2019-08-08
申请号:US15888559
申请日:2018-02-05
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Ming Chen , Xiaosong Zhou , Chris Y. Chung , Hsi-Jung Wu
IPC: H04N19/597 , H04N19/139 , H04N19/176 , H04N19/105 , H04N19/52 , H04N19/167 , H04N19/162
CPC classification number: H04N19/597 , H04N19/105 , H04N19/139 , H04N19/162 , H04N19/167 , H04N19/176 , H04N19/52
Abstract: Image processing techniques may accelerate coding of viewport data contained within multi-view image data. According to such techniques, an encoder may shifting content of a multi-directional image data according to the viewport location data provided by a decoder. The encoder may code the shifted multi-directional image data by predictive coding, and transmit to the decoder, the coded multi-directional image data and data identifying an amount of the shift. Doing so may move the viewport location to positions in the image data that are coded earlier than the positions that the viewport location naturally occupies and, thereby, may accelerate coding. On decode, a decoder may compare its present viewport location with viewport location data provided by the encoder with coded video data. The decoder may decode the coded video data and extract a portion of the decoded video data corresponding to a present viewport location for display.
-
公开(公告)号:US20190014332A1
公开(公告)日:2019-01-10
申请号:US15644270
申请日:2017-07-07
Applicant: Apple Inc.
Inventor: Peikang Song , Xing Wen , Sudeng Hu , Hang Yuan , Jae Hoon Kim , Dazhong Zhang , Xiaosong Zhou , Hsi-Jung Wu
IPC: H04N19/23 , H04N19/124 , H04N19/70 , H04N19/85 , H04N19/174 , H04N19/80 , H04N19/147
Abstract: Techniques for encoding and decoding video images based on image content types are described. Techniques include determining a plurality of image content types from metadata or an image content type recognition algorithm, where each image content type corresponding to a portion of a source video, such as a spatial or temporal portion. Encoding parameters, such as quantization parameter, may be selected for portions of source by a constrained search for encoding parameters, where the constraints are based on image content type.
-
公开(公告)号:US12096044B2
公开(公告)日:2024-09-17
申请号:US18181261
申请日:2023-03-09
Applicant: Apple Inc.
Inventor: Xiaohua Yang , Alexandros Tourapis , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jae Hoon Kim , Jiefu Zhai , Ming Chen , Xiaosong Zhou
IPC: H04N19/90 , G06F3/01 , H04N19/52 , H04N19/597 , H04N21/2343
CPC classification number: H04N19/90 , G06F3/013 , H04N19/52 , H04N19/597 , H04N21/234345
Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.
-
公开(公告)号:US20220329756A1
公开(公告)日:2022-10-13
申请号:US17846896
申请日:2022-06-22
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Chris Y. Chung , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Xiaosong Zhou , Jiefu Zhai
IPC: H04N7/14 , H04N7/15 , G06F3/0488
Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift of the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.
-
公开(公告)号:US11093752B2
公开(公告)日:2021-08-17
申请号:US15613130
申请日:2017-06-02
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Ming Chen , Hang Yuan , Jiefu Zhai , Dazhong Zhang , Xiaosong Zhou , Chris Chung , Hsi-Jung Wu
Abstract: Techniques are disclosed for managing display of content from multi-view video data. According to these techniques, an object may be identified from content of the multi-view video. The object's location may be tracked across a sequence of multi-view video. The technique may extract a sub-set of video that is contained within a view window that is shifted in an image space of the multi-view video in correspondence to the tracked object's location. These techniques may be implemented either in an image source device or an image sink device.
-
公开(公告)号:US10999583B2
公开(公告)日:2021-05-04
申请号:US16132219
申请日:2018-09-14
Applicant: Apple Inc.
Inventor: Alexandros Tourapis , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jae Hoon Kim , Jiefu Zhai , Ming Chen , Xiaosong Zhou
IPC: H04N19/29 , H04N19/103 , G09G5/14 , H04N21/44
Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponding to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.
-
公开(公告)号:US10992919B2
公开(公告)日:2021-04-27
申请号:US16725245
申请日:2019-12-23
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Ming Chen , Xiaosong Zhou , Hsi-Jung Wu , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung
IPC: H04N13/161 , H04N19/597 , H04N19/70 , H04N5/232 , H04N13/243 , H04N13/139
Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
-
公开(公告)号:US10754242B2
公开(公告)日:2020-08-25
申请号:US15638848
申请日:2017-06-30
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Ming Chen , Xiaosong Zhou , Hsi-Jung Wu , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung
IPC: G03B37/04 , G06T3/40 , H04N5/232 , H04N19/172 , H04N19/159 , G06T3/00 , H04N19/597 , H04N19/105 , H04N13/161
Abstract: Techniques are described for implementing format configurations for multi-directional video and for switching between them. Source images may be assigned to formats that may change during a coding session. When a change occurs between formats, video coders and decoder may transform decoded reference frames from the first format to the second format. Thereafter, new frames in the second configuration may be coded or decoded predictively using transformed reference frame(s) as source(s) of prediction. In this manner, video coders and decoders may use intra-coding techniques and achieve high efficiency in coding.
-
公开(公告)号:US20180184121A1
公开(公告)日:2018-06-28
申请号:US15390202
申请日:2016-12-23
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Xiaosong Zhou , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung , Hsi-Jung Wu
IPC: H04N19/597 , H04N19/176 , H04N19/105
CPC classification number: H04N19/597 , H04N19/105 , H04N19/176 , H04N19/547
Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.
-
-
-
-
-
-
-
-
-