Patent search ap:("Apple Inc.") AND inv:"Hang Yuan" Page 2

11.

Patent application
REAL-TIME FACE AND OBJECT MANIPULATION (under examination, published)

Publication (announcement) number: US20190279681A1

Publication (announcement) date: 2019-09-12

Application number: US15917441

Application date: 2018-03-09

Applicant: Apple Inc.

Inventor: Hang Yuan , Jiefu Zhai , Ming Chen , Jae Hoon Kim , Dazhong Zhang , Xiaosong Zhou , Chris Y. Chung , Hsi-Jung Wu

IPC: G11B27/031 , G06T5/00 , G06T7/73 , G06T19/20 , G06K9/00

Abstract: Techniques are presented for modifying images of an object in video, for example to correct for lens distortion, or to beautify a face. These techniques include extracting and validating features of an object from a source video frame, tracking those features over time, estimating a pose of the object, modifying a 3D model of the object based on the features, and rendering a modified video frame based on the modified 3D model and modified intrinsic and extrinsic matrices. These techniques may be applied in real-time to an object in a sequence of video frames.

12.

Patent application
Processing of Multi-Directional Images in Spatially-Ordered Video Coding Applications (under examination, published)

Publication (announcement) number: US20190246141A1

Publication (announcement) date: 2019-08-08

Application number: US15888559

Application date: 2018-02-05

Applicant: Apple Inc.

Inventor: Jae Hoon Kim , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Ming Chen , Xiaosong Zhou , Chris Y. Chung , Hsi-Jung Wu

IPC: H04N19/597 , H04N19/139 , H04N19/176 , H04N19/105 , H04N19/52 , H04N19/167 , H04N19/162

CPC classification number: H04N19/597 , H04N19/105 , H04N19/139 , H04N19/162 , H04N19/167 , H04N19/176 , H04N19/52

Abstract: Image processing techniques may accelerate coding of viewport data contained within multi-view image data. According to such techniques, an encoder may shift content of multi-directional image data according to viewport location data provided by a decoder. The encoder may code the shifted multi-directional image data by predictive coding and transmit to the decoder the coded multi-directional image data and data identifying an amount of the shift. Doing so may move the viewport location to positions in the image data that are coded earlier than the positions that the viewport location naturally occupies and, thereby, may accelerate coding. On decode, a decoder may compare its present viewport location with viewport location data provided by the encoder with coded video data. The decoder may decode the coded video data and extract a portion of the decoded video data corresponding to a present viewport location for display.
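
As a rough illustration of the shift described in this abstract, the Python sketch below circularly shifts the columns of a frame so that the viewport begins at column 0, where a raster-order coder would reach it first, and then undoes the shift on the decoder side using the signalled shift amount. The function names, the column-only shift, and the list-of-rows frame representation are assumptions made for illustration; the abstract does not specify them.

```python
def shift_frame(frame, viewport_x):
    """Circularly shift every row left so column viewport_x lands at column 0.

    `frame` is a list of equal-length rows (a hypothetical stand-in for
    multi-directional image data). Returns the shifted frame and the shift
    amount that would be signalled to the decoder.
    """
    width = len(frame[0])
    shift = viewport_x % width
    shifted = [row[shift:] + row[:shift] for row in frame]
    return shifted, shift


def unshift_frame(shifted, shift):
    """Undo shift_frame using the signalled shift amount."""
    width = len(shifted[0])
    back = (width - shift) % width
    return [row[back:] + row[:back] for row in shifted]


if __name__ == "__main__":
    frame = [[0, 1, 2, 3, 4, 5],
             [10, 11, 12, 13, 14, 15]]
    shifted, amount = shift_frame(frame, viewport_x=4)
    print(shifted[0])                             # viewport columns now come first
    print(unshift_frame(shifted, amount) == frame)
```

A circular shift is used here because multi-directional content wraps around horizontally, so no samples are lost; a real coder would also need to handle vertical shifts and block alignment, which this sketch ignores.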

13.

Patent application
CONTENT-AWARE VIDEO CODING (under examination, published)

Publication (announcement) number: US20190014332A1

Publication (announcement) date: 2019-01-10

Application number: US15644270

Application date: 2017-07-07

Applicant: Apple Inc.

Inventor: Peikang Song , Xing Wen , Sudeng Hu , Hang Yuan , Jae Hoon Kim , Dazhong Zhang , Xiaosong Zhou , Hsi-Jung Wu

IPC: H04N19/23 , H04N19/124 , H04N19/70 , H04N19/85 , H04N19/174 , H04N19/80 , H04N19/147

Abstract: Techniques for encoding and decoding video images based on image content types are described. Techniques include determining a plurality of image content types from metadata or an image content type recognition algorithm, where each image content type corresponds to a portion of a source video, such as a spatial or temporal portion. Encoding parameters, such as a quantization parameter, may be selected for portions of the source video by a constrained search for encoding parameters, where the constraints are based on image content type.

14.

Granted patent
Adaptive coding and streaming of multi-directional video (in force)

Publication (announcement) number: US12096044B2

Publication (announcement) date: 2024-09-17

Application number: US18181261

Application date: 2023-03-09

Applicant: Apple Inc.

Inventor: Xiaohua Yang , Alexandros Tourapis , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jae Hoon Kim , Jiefu Zhai , Ming Chen , Xiaosong Zhou

IPC: H04N19/90 , G06F3/01 , H04N19/52 , H04N19/597 , H04N21/2343

CPC classification number: H04N19/90 , G06F3/013 , H04N19/52 , H04N19/597 , H04N21/234345

Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile(s) in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.
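
The tiering idea in this abstract can be sketched in Python as follows. This is only a minimal illustration under stated assumptions: the viewport is reduced to a horizontal angle (yaw) with a fixed field of view, the future location is estimated by linear extrapolation of the last two yaw samples, and the image is split into equal-width vertical tiles. None of these choices, nor the names `predict_yaw` and `assign_tiers`, come from the patent record.

```python
def predict_yaw(yaw_prev, yaw_now, dt_past, dt_future):
    """Linearly extrapolate the viewport centre yaw, in degrees in [0, 360)."""
    # Shortest signed angular change between the two samples.
    delta = (yaw_now - yaw_prev + 180.0) % 360.0 - 180.0
    rate = delta / dt_past
    return (yaw_now + rate * dt_future) % 360.0


def assign_tiers(center_yaw, fov, num_tiles):
    """Label each vertical tile 'high' if it overlaps the estimated viewport."""
    tile_width = 360.0 / num_tiles
    tiers = []
    for i in range(num_tiles):
        tile_center = (i + 0.5) * tile_width
        # Angular distance between tile centre and viewport centre.
        dist = abs((tile_center - center_yaw + 180.0) % 360.0 - 180.0)
        # Two arcs overlap when their centres are closer than the sum
        # of their half-widths.
        overlaps = dist < fov / 2.0 + tile_width / 2.0
        tiers.append("high" if overlaps else "low")
    return tiers


if __name__ == "__main__":
    yaw = predict_yaw(yaw_prev=350.0, yaw_now=355.0, dt_past=1.0, dt_future=2.0)
    print(yaw)
    print(assign_tiers(yaw, fov=90.0, num_tiles=8))
```

A client built along these lines would request the 'high' tiles at the first service tier and the remaining tiles at the lower tier; a practical predictor would likely also account for pitch and for prediction uncertainty.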

15.

Patent application
SYSTEMS AND METHODS FOR PERSPECTIVE SHIFTING IN VIDEO CONFERENCING SESSION (in force)

Publication (announcement) number: US20220329756A1

Publication (announcement) date: 2022-10-13

Application number: US17846896

Application date: 2022-06-22

Applicant: Apple Inc.

Inventor: Jae Hoon Kim , Chris Y. Chung , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Xiaosong Zhou , Jiefu Zhai

IPC: H04N7/14 , H04N7/15 , G06F3/0488

Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.

16.

Granted patent
Object tracking in multi-view video (in force)

Publication (announcement) number: US11093752B2

Publication (announcement) date: 2021-08-17

Application number: US15613130

Application date: 2017-06-02

Applicant: Apple Inc.

Inventor: Jae Hoon Kim , Ming Chen , Hang Yuan , Jiefu Zhai , Dazhong Zhang , Xiaosong Zhou , Chris Chung , Hsi-Jung Wu

IPC: G06K9/00 , G06T7/292 , G06K9/32

Abstract: Techniques are disclosed for managing display of content from multi-view video data. According to these techniques, an object may be identified from content of the multi-view video. The object's location may be tracked across a sequence of multi-view video. The technique may extract a sub-set of video that is contained within a view window that is shifted in an image space of the multi-view video in correspondence to the tracked object's location. These techniques may be implemented either in an image source device or an image sink device.

17.

Granted patent
Scalability of multi-directional video streaming (in force)

Publication (announcement) number: US10999583B2

Publication (announcement) date: 2021-05-04

Application number: US16132219

Application date: 2018-09-14

Applicant: Apple Inc.

Inventor: Alexandros Tourapis , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jae Hoon Kim , Jiefu Zhai , Ming Chen , Xiaosong Zhou

IPC: H04N19/29 , H04N19/103 , G09G5/14 , H04N21/44

Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponds to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, the techniques include retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.

18.

Granted patent
Packed image format for multi-directional video (in force)

Publication (announcement) number: US10992919B2

Publication (announcement) date: 2021-04-27

Application number: US16725245

Application date: 2019-12-23

Applicant: Apple Inc.

Inventor: Jae Hoon Kim , Ming Chen , Xiaosong Zhou , Hsi-Jung Wu , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung

IPC: H04N13/161 , H04N19/597 , H04N19/70 , H04N5/232 , H04N13/243 , H04N13/139

Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.

19.

Granted patent
Adaptive resolution and projection format in multi-direction video (in force)

Publication (announcement) number: US10754242B2

Publication (announcement) date: 2020-08-25

Application number: US15638848

Application date: 2017-06-30

Applicant: Apple Inc.

Inventor: Jae Hoon Kim , Ming Chen , Xiaosong Zhou , Hsi-Jung Wu , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung

IPC: G03B37/04 , G06T3/40 , H04N5/232 , H04N19/172 , H04N19/159 , G06T3/00 , H04N19/597 , H04N19/105 , H04N13/161

Abstract: Techniques are described for implementing format configurations for multi-directional video and for switching between them. Source images may be assigned to formats that may change during a coding session. When a change occurs between formats, video coders and decoders may transform decoded reference frames from the first format to the second format. Thereafter, new frames in the second configuration may be coded or decoded predictively using transformed reference frame(s) as source(s) of prediction. In this manner, video coders and decoders may use inter-coding techniques and achieve high efficiency in coding.

20.

Patent application
Sphere Projected Motion Estimation/Compensation and Mode Decision (under examination, published)

Publication (announcement) number: US20180184121A1

Publication (announcement) date: 2018-06-28

Application number: US15390202

Application date: 2016-12-23

Applicant: Apple Inc.

Inventor: Jae Hoon Kim , Xiaosong Zhou , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung , Hsi-Jung Wu

IPC: H04N19/597 , H04N19/176 , H04N19/105

CPC classification number: H04N19/597 , H04N19/105 , H04N19/176 , H04N19/547

Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projections of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block and a matching portion of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.
