-
Publication No.: US11605224B2
Publication Date: 2023-03-14
Application No.: US16882959
Filing Date: 2020-05-26
Applicant: Apple Inc.
Inventor: Bartlomiej Rymkowski, Robert Bailey, Ethan Tira-Thompson, Shuang Gao, Ben Englert, Emilie Kim, Shujie Liu, Ke Zhang, Vinay Sharma, Xiaosong Zhou
Abstract: Techniques are disclosed for managing video captured by an imaging device. The disclosed methods capture a video in response to a capture command received at the imaging device. Following video capture, techniques are disclosed for classifying the captured video based on one or more features extracted from it, for marking the captured video based on the classification, and for generating a media item from the captured video according to the marking. Accordingly, the captured video may be classified as representing a static event, in which case a still image may be generated as the media item. Otherwise, the captured video may be classified as representing a dynamic event, in which case a video may be generated as the media item.
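The classify-then-generate flow in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the single motion feature (mean absolute difference between consecutive frames), the threshold value, the choice of the middle frame as the still image, and all function names are assumptions, since the abstract does not say which features or classifier are used.

```python
from typing import List, Tuple, Union

Frame = List[List[int]]  # grayscale frame: rows of pixel intensities (0-255)


def motion_score(frames: List[Frame]) -> float:
    """Assumed feature: mean absolute per-pixel difference between consecutive frames."""
    total, count = 0, 0
    for prev, cur in zip(frames, frames[1:]):
        for row_prev, row_cur in zip(prev, cur):
            for a, b in zip(row_prev, row_cur):
                total += abs(a - b)
                count += 1
    # A single frame (or empty frames) carries no motion information.
    return total / count if count else 0.0


def classify(frames: List[Frame], threshold: float = 4.0) -> str:
    """Mark the capture as 'static' or 'dynamic' (threshold is a hypothetical value)."""
    return "dynamic" if motion_score(frames) > threshold else "static"


def generate_media_item(
    frames: List[Frame], threshold: float = 4.0
) -> Tuple[str, Union[Frame, List[Frame]]]:
    """Return ('still', one frame) for a static event, ('video', all frames) otherwise."""
    if not frames:
        raise ValueError("at least one captured frame is required")
    if classify(frames, threshold) == "static":
        # Assumption: use the middle frame as the representative still image.
        return ("still", frames[len(frames) // 2])
    return ("video", frames)
```

A capture of three identical frames would come back as a still image, while a capture whose frames differ strongly would be kept as a video.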
-
Publication No.: US20220329756A1
Publication Date: 2022-10-13
Application No.: US17846896
Filing Date: 2022-06-22
Applicant: Apple Inc.
Inventor: Jae Hoon Kim, Chris Y. Chung, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Xiaosong Zhou, Jiefu Zhai
IPC: H04N7/14, H04N7/15, G06F3/0488
Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.
-
Publication No.: US11240492B2
Publication Date: 2022-02-01
Application No.: US16254528
Filing Date: 2019-01-22
Applicant: Apple Inc.
Inventor: Jiefu Zhai, Xingyu Zhang, Xiaosong Zhou, Jun Xin, Hsi-Jung Wu, Yeping Su
IPC: H04N19/105, H04N19/61, H04N19/147, H04N19/159, H04N19/176, G06N3/08
Abstract: Systems and methods are disclosed for video compression that utilize neural networks for predictive video coding. The processes employed combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.
-
Publication No.: US11093752B2
Publication Date: 2021-08-17
Application No.: US15613130
Filing Date: 2017-06-02
Applicant: Apple Inc.
Inventor: Jae Hoon Kim, Ming Chen, Hang Yuan, Jiefu Zhai, Dazhong Zhang, Xiaosong Zhou, Chris Chung, Hsi-Jung Wu
Abstract: Techniques are disclosed for managing display of content from multi-view video data. According to these techniques, an object may be identified from content of the multi-view video. The object's location may be tracked across a sequence of multi-view video. The technique may extract a sub-set of video that is contained within a view window that is shifted in an image space of the multi-view video in correspondence to the tracked object's location. These techniques may be implemented either in an image source device or an image sink device.
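The view-window extraction described in this abstract might look like the sketch below. It assumes the multi-view frame wraps around horizontally (as a 360-degree equirectangular image would) and that the tracked object's location is already available as pixel coordinates; the function name and the clamping behavior at the top and bottom edges are illustrative choices, not details taken from the patent.

```python
from typing import List

Frame = List[List[int]]  # frame as rows of pixel values


def extract_view_window(
    frame: Frame, center_x: int, center_y: int, win_w: int, win_h: int
) -> Frame:
    """Crop a win_w x win_h window centered on the tracked object's location.

    Columns wrap around horizontally (assumed 360-degree content);
    rows are clamped so the window stays inside the frame.
    """
    height, width = len(frame), len(frame[0])
    if win_h > height or win_w > width:
        raise ValueError("view window must fit inside the frame")
    top = min(max(center_y - win_h // 2, 0), height - win_h)
    left = center_x - win_w // 2
    return [
        [frame[row][(left + col) % width] for col in range(win_w)]
        for row in range(top, top + win_h)
    ]
```

Calling this once per frame with the tracker's latest coordinates shifts the window through the image space in step with the object, at either the source or the sink device.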
-
Publication No.: US10999583B2
Publication Date: 2021-05-04
Application No.: US16132219
Filing Date: 2018-09-14
Applicant: Apple Inc.
Inventor: Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
IPC: H04N19/29, H04N19/103, G09G5/14, H04N21/44
Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponds to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, the techniques include retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.
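To make the tile-and-viewport relationship concrete, the sketch below maps a viewport to the tiles it overlaps and computes which tiles become newly needed when the viewport moves. The layout is an assumption for illustration: tiles are equal-width columns spanning 360 degrees of yaw, and the viewport is described by a center yaw and a horizontal field of view. The function names are hypothetical, and the actual stream retrieval and the report back to the source are left out.

```python
import math
from typing import List, Set


def tiles_for_viewport(yaw_deg: float, fov_deg: float, num_tiles: int) -> Set[int]:
    """Indices of the tile columns overlapped by a viewport centered at yaw_deg."""
    if fov_deg <= 0:
        raise ValueError("field of view must be positive")
    if fov_deg >= 360.0:
        return set(range(num_tiles))
    tile_width = 360.0 / num_tiles
    left = (yaw_deg - fov_deg / 2.0) % 360.0   # left edge, wrapped into [0, 360)
    right = left + fov_deg                     # right edge, may exceed 360
    first = math.floor(left / tile_width)
    last = math.ceil(right / tile_width) - 1   # a tile only touched at its edge is excluded
    return {index % num_tiles for index in range(first, last + 1)}


def tiles_to_fetch(old_yaw: float, new_yaw: float, fov_deg: float, num_tiles: int) -> List[int]:
    """Tiles that enter the viewport after it moves and therefore must be retrieved."""
    before = tiles_for_viewport(old_yaw, fov_deg, num_tiles)
    after = tiles_for_viewport(new_yaw, fov_deg, num_tiles)
    return sorted(after - before)
```

With eight 45-degree tiles and a 90-degree field of view, turning from yaw 0 to yaw 60 keeps tile 0 on screen and brings tiles 1 and 2 into view, so only those two need to be fetched.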
-
Publication No.: US10992919B2
Publication Date: 2021-04-27
Application No.: US16725245
Filing Date: 2019-12-23
Applicant: Apple Inc.
Inventor: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
IPC: H04N13/161, H04N19/597, H04N19/70, H04N5/232, H04N13/243, H04N13/139
Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
-
Publication No.: US10972753B1
Publication Date: 2021-04-06
Application No.: US16569725
Filing Date: 2019-09-13
Applicant: Apple Inc.
Inventor: Fanyi Duanmu, Eduardo Asbun, Xiaosong Zhou, Jun Xin, Hsi-Jung Wu, John Su, Samir Gehani, Christopher Flick, Shalini Sahoo
IPC: H04N19/597, H04N19/187, H04N19/176
Abstract: Techniques are disclosed for coding and delivering multi-view video in which the video is represented as a manifest file identifying a plurality of segments of the video available for download. The multi-view video may be partitioned spatially into a plurality of tiles that, in aggregate, encompass the entire spatial area of the video. The tiles are coded as segments, each containing coded video representing content within its respective tile. Tiles may be given different sizes based on saliency of the content within their respective regions. In this manner, tiles with high levels of interest may have relatively large spatial areas, which can lead to efficient coding in the presence of content motion.
-
Publication No.: US10754242B2
Publication Date: 2020-08-25
Application No.: US15638848
Filing Date: 2017-06-30
Applicant: Apple Inc.
Inventor: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
IPC: G03B37/04, G06T3/40, H04N5/232, H04N19/172, H04N19/159, G06T3/00, H04N19/597, H04N19/105, H04N13/161
Abstract: Techniques are described for implementing format configurations for multi-directional video and for switching between them. Source images may be assigned to formats that may change during a coding session. When a change occurs between formats, video coders and decoders may transform decoded reference frames from the first format to the second format. Thereafter, new frames in the second configuration may be coded or decoded predictively using transformed reference frame(s) as source(s) of prediction. In this manner, video coders and decoders may use intra-coding techniques and achieve high efficiency in coding.
-
Publication No.: US10536731B2
Publication Date: 2020-01-14
Application No.: US14710520
Filing Date: 2015-05-12
Applicant: Apple Inc.
Inventor: Yeping Su, Jiefu Zhai, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu, Chris Y. Chung
IPC: H04N19/50, H04N21/2343, H04N21/234, H04N21/2385, H04N21/438, H04N21/44, H04N21/4402, H04N19/186, H04N19/184
Abstract: Systems and methods are provided for processing high quality video data, such as data having a higher than standard bit depth, a high dynamic range, or a wide or custom color gamut, to be compatible with conventional encoders and decoders without significant loss of quality. High quality data is encoded into a plurality of layers with a base layer having the standard quality data and one or more higher quality layers. Decoding systems and methods may map the base layer to the dynamic range or color gamut of the enhancement layer, combine the layers, and map the combined layers to a dynamic range or color gamut appropriate for the target display. Each of the standard quality and the high quality data may be encoded as a plurality of tiers of increasing quality, and may reference lower-level tiers as sources of prediction during predictive coding.
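The base-plus-enhancement idea can be illustrated with a deliberately simplified bit-depth example. The sketch assumes 10-bit source samples, an 8-bit base layer obtained by dropping the low bits, and an enhancement layer holding the residual; the decoder maps the base layer back up by a left shift and adds the residual. The bit-shift mapping and the function names are assumptions for illustration. The abstract does not specify the mapping, and a real codec would also compress each layer lossily.

```python
from typing import List, Optional, Tuple


def encode_layers(
    samples: List[int], high_bits: int = 10, base_bits: int = 8
) -> Tuple[List[int], List[int]]:
    """Split high-bit-depth samples into a standard-depth base layer and a residual layer."""
    shift = high_bits - base_bits
    base = [s >> shift for s in samples]                      # what a conventional decoder sees
    enhancement = [s - (b << shift) for s, b in zip(samples, base)]
    return base, enhancement


def decode_layers(
    base: List[int],
    enhancement: Optional[List[int]] = None,
    high_bits: int = 10,
    base_bits: int = 8,
) -> List[int]:
    """Map the base layer up to the higher range, then add the enhancement layer if present."""
    shift = high_bits - base_bits
    predicted = [b << shift for b in base]
    if enhancement is None:
        return predicted                                      # base-only reconstruction
    return [p + e for p, e in zip(predicted, enhancement)]
```

A decoder that understands only the base layer still gets a usable 8-bit picture, while a decoder that also receives the enhancement layer recovers the original 10-bit values in this lossless toy setting.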
-
Publication No.: US20190342351A1
Publication Date: 2019-11-07
Application No.: US16405864
Filing Date: 2019-05-07
Applicant: Apple Inc.
Inventor: Christopher M. Garrido, Dazhong Zhang, Karthick Santhanam, Patrick Miauton, Xiaoxiao Zheng, Bess Chan, Peter Shiang, Sudeng Hu, Peikang Song, Xiaosong Zhou
IPC: H04L29/06, H04N21/235, H04N21/6583, H04L29/08
Abstract: Techniques presented herein provide an improved relay user experience and improved management of scarce computing and network resources as the number of relay endpoints increases. A sourcing endpoint device may generate a media feed, such as a video and/or audio feed, representing a contribution from a conference participant. The sourcing endpoint device may generate a priority value for the media feed, and the priority value may be transmitted to other members of the relay along with the input feed. Priority values of the different relay participants may be used by other devices, for example, intermediate servers or receiving endpoint devices, to manage aspects of the relay. For example, a relay server may prune streams from select endpoint devices based on relative priority values received from those devices. Alternatively, receiving endpoint devices may alter presentation of received feeds based on their associated priority values.
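The server-side pruning mentioned in this abstract could be sketched as below. The `Feed` type, the `prune_feeds` function, and the "keep the top N" policy are hypothetical; the abstract says only that streams may be pruned based on relative priority values, and does not say how the priority value is computed or how many streams are retained.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Feed:
    endpoint_id: str   # identifies the sourcing endpoint device
    priority: float    # priority value generated by that endpoint (higher = more important)


def prune_feeds(feeds: List[Feed], max_streams: int) -> List[Feed]:
    """Keep the max_streams highest-priority feeds; the rest are pruned by the relay.

    sorted() is stable, so feeds with equal priority keep their arrival order.
    """
    if max_streams < 0:
        raise ValueError("max_streams must be non-negative")
    ranked = sorted(feeds, key=lambda feed: feed.priority, reverse=True)
    return ranked[:max_streams]
```

A receiving endpoint could reuse the same ranking to decide presentation instead of pruning, for example by giving the highest-priority feed the largest window.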