-
Publication No.: US11240492B2
Publication Date: 2022-02-01
Application No.: US16254528
Application Date: 2019-01-22
Applicant: Apple Inc.
Inventor: Jiefu Zhai, Xingyu Zhang, Xiaosong Zhou, Jun Xin, Hsi-Jung Wu, Yeping Su
IPC: H04N19/105, H04N19/61, H04N19/147, H04N19/159, H04N19/176, G06N3/08
Abstract: Systems and methods are disclosed for video compression that use neural networks for predictive video coding. The disclosed processes combine multiple banks of neural networks with codec system components to code and decode video data.
-
Publication No.: US11093752B2
Publication Date: 2021-08-17
Application No.: US15613130
Application Date: 2017-06-02
Applicant: Apple Inc.
Inventor: Jae Hoon Kim, Ming Chen, Hang Yuan, Jiefu Zhai, Dazhong Zhang, Xiaosong Zhou, Chris Chung, Hsi-Jung Wu
Abstract: Techniques are disclosed for managing display of content from multi-view video data. According to these techniques, an object may be identified from content of the multi-view video. The object's location may be tracked across a sequence of multi-view video. The technique may extract a sub-set of video that is contained within a view window that is shifted in an image space of the multi-view video in correspondence to the tracked object's location. These techniques may be implemented either in an image source device or an image sink device.
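Illustration (not part of the patent record): the view-window extraction described in this abstract can be sketched as a crop that follows a tracked location from frame to frame. The function names, the list-of-lists frame representation, and the clamping of the window to the frame edges are all assumptions made for this sketch; the abstract does not specify how tracking or edge handling is done.

```python
def extract_view_window(frame, center, win_h, win_w):
    """Crop a win_h x win_w window centred on `center` = (row, col).

    The window is clamped so it stays inside the frame (an assumption;
    a real multi-view format might wrap around horizontally instead).
    """
    rows, cols = len(frame), len(frame[0])
    top = min(max(center[0] - win_h // 2, 0), rows - win_h)
    left = min(max(center[1] - win_w // 2, 0), cols - win_w)
    return [row[left:left + win_w] for row in frame[top:top + win_h]]


def follow_object(frames, locations, win_h, win_w):
    """Shift the view window across a sequence in step with tracked locations.

    `locations` is assumed to come from some external object tracker.
    """
    return [
        extract_view_window(frame, loc, win_h, win_w)
        for frame, loc in zip(frames, locations)
    ]


# Usage: a 4x6 frame whose samples encode their own position as row*10 + col.
frame = [[r * 10 + c for c in range(6)] for r in range(4)]
windows = follow_object([frame, frame], [(1, 2), (3, 5)], 2, 2)
```

Because the same function can run wherever the decoded frames are available, the sketch is consistent with the abstract's note that the technique may live in either the source or the sink device.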
-
Publication No.: US11017566B1
Publication Date: 2021-05-25
Application No.: US16459483
Application Date: 2019-07-01
Applicant: Apple Inc.
Inventor: Alexandros Tourapis, Khaled Mammou, Yeping Su, Jungsun Kim, Valery G. Valentin, David W. Singer, Fabrice A. Robinet, Hsi-Jung Wu
Abstract: A system comprises an encoder configured to compress attribute and/or spatial information for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image-based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image-based representation of a point cloud. A processing/filtering element uses occupancy map information and/or auxiliary patch information to determine relationships between patches in image frames, and adjusts encoding/decoding and/or filtering or pre/post-processing parameters based on the determined relationships.
-
Publication No.: US10999583B2
Publication Date: 2021-05-04
Application No.: US16132219
Application Date: 2018-09-14
Applicant: Apple Inc.
Inventor: Alexandros Tourapis, Dazhong Zhang, Hang Yuan, Hsi-Jung Wu, Jae Hoon Kim, Jiefu Zhai, Ming Chen, Xiaosong Zhou
IPC: H04N19/29, H04N19/103, G09G5/14, H04N21/44
Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponds to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, the techniques include retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.
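Illustration (not part of the patent record): a minimal sketch of the receiver-side logic in this abstract, assuming a uniform grid of tiles and a viewport identified by a single (x, y) location. The names `tile_for_viewport` and `on_viewport_change`, the row-major tile numbering, and the action tuples are hypothetical; the abstract only says that tiles cover predetermined spatial regions.

```python
def tile_for_viewport(x, y, frame_w, frame_h, cols, rows):
    """Return the row-major index of the tile containing location (x, y)."""
    col = min(int(x * cols / frame_w), cols - 1)
    row = min(int(y * rows / frame_h), rows - 1)
    return row * cols + col


def on_viewport_change(current_tile, x, y, frame_w, frame_h, cols, rows):
    """Decide what the receiver does when the viewport moves to (x, y).

    Returns the tile now in view and an ordered list of actions.
    """
    new_tile = tile_for_viewport(x, y, frame_w, frame_h, cols, rows)
    if new_tile == current_tile:
        # Still inside the tile that is already decoded: just keep displaying.
        return current_tile, [("display", current_tile)]
    return new_tile, [
        ("retrieve_and_decode", new_tile),  # fetch streams for the new tile
        ("display", new_tile),              # show content at the new location
        ("report_viewport", (x, y)),        # tell the source where we look now
    ]


# Usage: a 3840x1920 frame split into a 4x2 tile grid.
tile, actions = on_viewport_change(1, 3000, 1500, 3840, 1920, 4, 2)
```

Reporting the new location last mirrors the order of steps in the abstract: the receiver switches locally first and then informs the source.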
-
Publication No.: US10992919B2
Publication Date: 2021-04-27
Application No.: US16725245
Application Date: 2019-12-23
Applicant: Apple Inc.
Inventor: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
IPC: H04N13/161, H04N19/597, H04N19/70, H04N5/232, H04N13/243, H04N13/139
Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.
-
Publication No.: US10972753B1
Publication Date: 2021-04-06
Application No.: US16569725
Application Date: 2019-09-13
Applicant: Apple Inc.
Inventor: Fanyi Duanmu, Eduardo Asbun, Xiaosong Zhou, Jun Xin, Hsi-Jung Wu, John Su, Samir Gehani, Christopher Flick, Shalini Sahoo
IPC: H04N19/597, H04N19/187, H04N19/176
Abstract: Techniques are disclosed for coding and delivering multi-view video in which the video is represented as a manifest file identifying a plurality of segments of the video available for download. The multi-view video may be partitioned spatially into a plurality of tiles that, in aggregate, encompass the entire spatial area of the video. The tiles are coded as segments, each containing coded video representing the content within its respective tile. Tiles may be given different sizes based on saliency of the content within their respective regions. In this manner, tiles with high levels of interest may have relatively large spatial areas, which can lead to efficient coding in the presence of content motion.
-
Publication No.: US20210012536A1
Publication Date: 2021-01-14
Application No.: US17032711
Application Date: 2020-09-25
Applicant: Apple Inc.
Inventor: Khaled Mammou, Yeping Su, David W. Singer, Alexandros Tourapis, Fabrice A. Robinet, Hsi-Jung Wu
IPC: G06T9/00, G06T7/20, G06T7/50, H04N19/172
Abstract: A system comprises an encoder configured to compress attribute and/or spatial information for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image-based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image-based representation of a point cloud. The encoder is configured to project the point cloud onto patch planes to compress the point cloud, and supports multiple layered patch planes. For example, some point clouds may have a depth, and points at different depths may be assigned to different layered patch planes.
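Illustration (not part of the patent record): one way to picture "points at different depths may be assigned to different layered patch planes" is to project points along a single axis and let each layer hold the next-nearest depth at every pixel. The projection along z, the fixed layer count, and the name `project_to_layers` are assumptions for this sketch, not details taken from the abstract.

```python
from collections import defaultdict


def project_to_layers(points, num_layers=2):
    """Project (x, y, z) points onto the xy plane, one depth map per layer.

    Layer 0 keeps the nearest depth at each pixel, layer 1 the next nearest,
    and so on. Points deeper than `num_layers` at a pixel are dropped here.
    """
    depths_by_pixel = defaultdict(list)
    for x, y, z in points:
        depths_by_pixel[(x, y)].append(z)

    layers = [dict() for _ in range(num_layers)]
    for pixel, depths in depths_by_pixel.items():
        # zip stops at the shorter sequence, so surplus depths are ignored.
        for layer, depth in zip(layers, sorted(depths)):
            layer[pixel] = depth
    return layers


# Usage: two points share pixel (0, 0) at different depths.
layers = project_to_layers([(0, 0, 5), (0, 0, 2), (1, 0, 7)])
```

Each layer is then an ordinary 2D depth image, which is what lets an image or video codec compress it.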
-
Publication No.: US10754242B2
Publication Date: 2020-08-25
Application No.: US15638848
Application Date: 2017-06-30
Applicant: Apple Inc.
Inventor: Jae Hoon Kim, Ming Chen, Xiaosong Zhou, Hsi-Jung Wu, Dazhong Zhang, Hang Yuan, Jiefu Zhai, Chris Y. Chung
IPC: G03B37/04, G06T3/40, H04N5/232, H04N19/172, H04N19/159, G06T3/00, H04N19/597, H04N19/105, H04N13/161
Abstract: Techniques are described for implementing format configurations for multi-directional video and for switching between them. Source images may be assigned to formats that may change during a coding session. When a change occurs between formats, video coders and decoders may transform decoded reference frames from the first format to the second format. Thereafter, new frames in the second format may be coded or decoded predictively using the transformed reference frame(s) as source(s) of prediction. In this manner, video coders and decoders may use inter-coding techniques and achieve high efficiency in coding.
-
Publication No.: US10699444B2
Publication Date: 2020-06-30
Application No.: US16198635
Application Date: 2018-11-21
Applicant: Apple Inc.
Inventor: Khaled Mammou, Yeping Su, David W. Singer, Alexandros Tourapis, Fabrice A. Robinet, Hsi-Jung Wu
Abstract: A system comprises an encoder configured to compress attribute and/or spatial information for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image-based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image-based representation of a point cloud. A block/sub-block organization scheme is used to encode blocks and sub-blocks of an occupancy map used in compressing the point cloud. Binary values are assigned to blocks/sub-blocks based on whether they contain patches projected on the point cloud. A traversal path is chosen that takes advantage of run-length encoding strategies to reduce the size of the encoded occupancy map. Also, auxiliary information is used to further improve occupancy map compression.
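Illustration (not part of the patent record): the idea of choosing a traversal path that suits run-length encoding can be sketched by encoding a block-level binary occupancy map under several scan orders and keeping the one with the fewest runs. Only row-major and column-major scans are tried here, and all function names are hypothetical; the abstract does not list the candidate paths or the exact run-length format.

```python
def run_lengths(bits):
    """Run-length encode a sequence of 0/1 values as (value, count) pairs."""
    runs = []
    for bit in bits:
        if runs and runs[-1][0] == bit:
            runs[-1][1] += 1
        else:
            runs.append([bit, 1])
    return [(value, count) for value, count in runs]


def traversals(grid):
    """Candidate scan orders over a 2D occupancy grid (assumed set of two)."""
    rows, cols = len(grid), len(grid[0])
    return {
        "row_major": [grid[r][c] for r in range(rows) for c in range(cols)],
        "column_major": [grid[r][c] for c in range(cols) for r in range(rows)],
    }


def best_traversal(grid):
    """Pick the scan order whose run-length encoding has the fewest runs."""
    encoded = {name: run_lengths(bits) for name, bits in traversals(grid).items()}
    name = min(encoded, key=lambda key: len(encoded[key]))
    return name, encoded[name]


# Usage: 1 marks a block that contains patch data, 0 an empty block.
occupancy = [
    [1, 0, 0],
    [1, 0, 0],
    [1, 0, 0],
]
order, runs = best_traversal(occupancy)
```

For this map the column-major scan groups the occupied blocks into one run, so it yields two runs where the row-major scan yields six. A decoder would need to be told which scan order was chosen.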
-
Publication No.: US20200099942A1
Publication Date: 2020-03-26
Application No.: US16583153
Application Date: 2019-09-25
Applicant: Apple Inc.
Inventor: Alexandros Tourapis, Yeping Su, Khaled Mammou, Jungsun Kim, David W. Singer, Fabrice A. Robinet, Guy Cote, Jim C. Chou, Hsi-Jung Wu
IPC: H04N19/186, H04N19/124, H04N19/176, H04N19/159
Abstract: Support for additional components may be specified in a coding scheme for image data. A layer of a coding scheme that specifies color components may also specify additional components. Characteristics of the components may be specified in the same layer or a different layer of the coding scheme. An encoder or decoder may identify the specified components and determine the respective characteristics to perform encoding and decoding of image data.