-
Publication number: US20240323442A1
Publication date: 2024-09-26
Application number: US18603138
Application date: 2024-03-12
Applicant: Apple Inc.
Inventor: Alican Nalci , Guoxin Jin , Hilmi Enes Egilmez , Hsi-Jung Wu , Jun Xin , Yixin Du , Yunfei Zheng , Yeqing Wu
IPC: H04N19/625 , H04N19/176 , H04N19/70
CPC classification number: H04N19/625 , H04N19/176 , H04N19/70
Abstract: In an example method, a decoder accesses a bitstream representing video content, and parses one or more flexible coefficient position (FCP) syntax elements from the bitstream, where the one or more FCP syntax elements indicate one or more index values. The decoder further determines side information representing one or more characteristics of an encoded portion of the video content. The decoder interprets the one or more FCP syntax elements based on the side information, including determining a coefficient position with respect to the encoded portion of the video content based on the one or more index values and the side information. The decoder decodes the encoded portion of the video content according to the coefficient position.
-
Publication number: US11818394B2
Publication date: 2023-11-14
Application number: US17207267
Application date: 2021-03-19
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Xiaosong Zhou , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung , Hsi-Jung Wu
IPC: H04N19/176 , H04N19/105 , H04N19/597 , H04N19/547
CPC classification number: H04N19/597 , H04N19/105 , H04N19/176 , H04N19/547
Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projections of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block and a matching portion of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.
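The last step of this abstract, turning a spherical-domain offset into a source-domain motion vector, can be illustrated with a minimal sketch. It assumes an equirectangular source picture and an offset expressed as a longitude/latitude difference; the abstract specifies neither, and the function names `to_sphere`, `to_source`, and `spherical_offset_to_mv` are hypothetical.

```python
import math

def to_sphere(x, y, width, height):
    """Map an equirectangular pixel (assumed projection) to (lon, lat) in radians."""
    lon = (x / width) * 2.0 * math.pi - math.pi
    lat = math.pi / 2.0 - (y / height) * math.pi
    return lon, lat

def to_source(lon, lat, width, height):
    """Inverse mapping: (lon, lat) in radians back to equirectangular pixel coordinates."""
    x = (lon + math.pi) / (2.0 * math.pi) * width
    y = (math.pi / 2.0 - lat) / math.pi * height
    return x, y

def spherical_offset_to_mv(x, y, dlon, dlat, width, height):
    """Apply a spherical offset at pixel (x, y) and return the source-domain motion vector."""
    lon, lat = to_sphere(x, y, width, height)
    # Wrap longitude into [-pi, pi) and clamp latitude to the poles.
    lon2 = (lon + dlon + math.pi) % (2.0 * math.pi) - math.pi
    lat2 = max(-math.pi / 2.0, min(math.pi / 2.0, lat + dlat))
    x2, y2 = to_source(lon2, lat2, width, height)
    return x2 - x, y2 - y
```

For a 360x180 picture, a 10 degree longitude offset at the picture centre maps to a motion vector of roughly (10, 0) pixels. Other projections, such as cube maps, would need a different pair of mapping functions.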
-
Publication number: US11711527B2
Publication date: 2023-07-25
Application number: US17404452
Application date: 2021-08-17
Applicant: Apple Inc.
Inventor: Alexandros Tourapis , Yeping Su , David W. Singer , Hsi-Jung Wu
IPC: H04N19/186 , H04N19/189 , H04N9/64 , H04N11/20 , H04N19/70 , H04N19/46 , H04N19/117 , H04N19/132 , H04N19/154 , H04N19/59 , H04N9/67 , H04N19/82 , H04N19/147 , H04N19/80 , H04N19/85 , H04N19/44
CPC classification number: H04N19/186 , H04N9/64 , H04N9/67 , H04N11/20 , H04N19/117 , H04N19/132 , H04N19/147 , H04N19/154 , H04N19/189 , H04N19/44 , H04N19/46 , H04N19/59 , H04N19/70 , H04N19/80 , H04N19/82 , H04N19/85
Abstract: A method of adaptive chroma downsampling is presented. The method comprises converting a source image to a converted image in an output color format, applying a plurality of downsample filters to the converted image, estimating a distortion for each filter, and choosing the filter that produces the minimum distortion. The distortion estimation includes applying an upsample filter, and a pixel is output based on the chosen filter. Methods for closed loop conversions are also presented.
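The selection loop described in this abstract can be sketched on a single row of chroma samples. The candidate filters, the nearest-neighbour upsampler, and the squared-error distortion measure below are all assumptions chosen for illustration; the abstract does not name any of them, and the patent may select filters per pixel or per region rather than per row.

```python
# Hypothetical candidate downsample filters as (offset, weight) taps.
CANDIDATES = {
    "decimate": [(0, 1.0)],
    "average": [(0, 0.5), (1, 0.5)],
    "smooth": [(-1, 0.25), (0, 0.5), (1, 0.25)],
}

def downsample(row, taps):
    """Filter at every second sample, clamping tap positions at the row edges."""
    out = []
    for i in range(0, len(row), 2):
        acc = 0.0
        for offset, weight in taps:
            j = min(max(i + offset, 0), len(row) - 1)
            acc += weight * row[j]
        out.append(acc)
    return out

def upsample(low, n):
    """Nearest-neighbour upsampling back to n samples (assumed upsample filter)."""
    return [low[i // 2] for i in range(n)]

def distortion(a, b):
    """Sum of squared errors (assumed distortion measure)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def choose_filter(row):
    """Return (name, distortion, downsampled row) for the minimum-distortion filter."""
    best = None
    for name, taps in CANDIDATES.items():
        low = downsample(row, taps)
        err = distortion(row, upsample(low, len(row)))
        if best is None or err < best[1]:
            best = (name, err, low)
    return best
```

On the alternating row `[0, 10, 0, 10]`, plain decimation loses every second sample entirely, so the averaging filter yields the lowest reconstruction error and is selected.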
-
Publication number: US20230188738A1
Publication date: 2023-06-15
Application number: US18076166
Application date: 2022-12-06
Applicant: Apple Inc.
Inventor: Alican Nalci , Alexandros Tourapis , Hilmi Enes Egilmez , Hsi-Jung Wu , Jun Xin , Yeqing Wu , Yixin Du , Yunfei Zheng
IPC: H04N19/44 , H04N19/184
CPC classification number: H04N19/44 , H04N19/184
Abstract: In an example method, a decoder obtains a data stream representing video content. The video content is partitioned into one or more logical units, and each of the logical units is partitioned into one or more respective logical sub-units. The decoder determines that the data stream includes first data indicating that a first logical unit has been encoded according to a flexible skip coding scheme. In response, the decoder determines a first set of decoding parameters based on the first data, and decodes each of the logical sub-units of the first logical unit according to the first set of decoding parameters.
-
Publication number: US11627343B2
Publication date: 2023-04-11
Application number: US17188473
Application date: 2021-03-01
Applicant: Apple Inc.
Inventor: Xiaohua Yang , Alexandros Tourapis , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jae Hoon Kim , Jiefu Zhai , Ming Chen , Xiaosong Zhou
IPC: H04N19/52 , H04N19/90 , H04N19/597 , H04N21/2343 , G06F3/01
Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and other tile(s) in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.
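A minimal sketch of the tier assignment in this abstract follows. It assumes a one-dimensional layout of equal-width tiles around 360 degrees of yaw and a constant-velocity viewport predictor; the abstract does not specify the prediction method or the tile geometry, and `predict_yaw` and `assign_tiers` are hypothetical names.

```python
def predict_yaw(yaw_now, yaw_rate, dt):
    """Constant-velocity estimate of the viewport centre (degrees) dt seconds ahead."""
    return (yaw_now + yaw_rate * dt) % 360.0

def assign_tiers(yaw_now, yaw_rate, dt, fov=90.0, n_tiles=8):
    """Label each tile 'high' if the predicted viewport overlaps it, else 'low'."""
    center = predict_yaw(yaw_now, yaw_rate, dt)
    width = 360.0 / n_tiles
    tiers = {}
    for tile in range(n_tiles):
        tile_center = (tile + 0.5) * width
        # Shortest angular distance between tile centre and viewport centre.
        dist = abs((tile_center - center + 180.0) % 360.0 - 180.0)
        # Two angular intervals overlap when their centres are closer than
        # half the sum of their widths.
        tiers[tile] = "high" if dist < (fov + width) / 2.0 else "low"
    return tiers
```

With a viewer at yaw 0 turning at 30 degrees per second, the viewport predicted 1.5 seconds ahead is centred at 45 degrees, so only tiles 0 and 1 (covering 0 to 90 degrees) are requested at the higher tier.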
-
Publication number: US11514611B2
Publication date: 2022-11-29
Application number: US17119893
Application date: 2020-12-11
Applicant: Apple Inc.
Inventor: Khaled Mammou , Yeping Su , David W. Singer , Alexandros Tourapis , Fabrice A. Robinet , Hsi-Jung Wu
Abstract: A system comprises an encoder configured to compress attribute and/or spatial information for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image based representation of a point cloud. A closed-loop color conversion process is used to improve compression while taking into consideration distortion introduced throughout the point cloud compression process.
-
Publication number: US20220377304A1
Publication date: 2022-11-24
Application number: US17326204
Application date: 2021-05-20
Applicant: Apple Inc.
Inventor: Fanyi Duanmu , Jun Xin , Hsi-Jung Wu , Xiaosong Zhou
IPC: H04N13/117 , H04N21/218 , H04N13/178 , H04N21/24
Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second, higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a video buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.
-
Publication number: US20220360814A1
Publication date: 2022-11-10
Application number: US17736881
Application date: 2022-05-04
Applicant: Apple Inc.
Inventor: Yeqing Wu , Alexandros Tourapis , Yunfei Zheng , Hsi-Jung Wu , Jun Xin , Albert E. Keinath , Mei Guo , Alican Nalci
IPC: H04N19/52 , H04N19/176 , H04N19/105 , H04N19/159 , H04N19/124
Abstract: An encoder or decoder can perform enhanced motion vector prediction by receiving an input block of data for encoding or decoding and accessing stored motion information for at least one other block of data. Based on the stored motion information, the encoder or decoder can generate a list of one or more motion vector predictor candidates for the input block in accordance with an adaptive list construction order. The encoder or decoder can predict a motion vector for the input block based on at least one of the one or more motion vector predictor candidates.
-
Publication number: US11388419B2
Publication date: 2022-07-12
Application number: US16834693
Application date: 2020-03-30
Applicant: Apple Inc.
Inventor: Mei Guo , Jun Xin , Hsi-Jung Wu , Alexandros Tourapis , Krishnakanth Rapaka
IPC: H04N19/159 , H04N19/182 , H04N19/176
Abstract: Improved video coding techniques include deriving prediction pixel blocks for use in intra-coding video and combined inter- and intra-coding video. In a first aspect, the techniques may include deriving value(s) for pixel location(s) of the prediction pixel block by, when a prediction direction vector assigned to the prediction pixel block points to quadrants I or III of a Cartesian plane, deriving the pixel location's value from pixel values in two regions of previously-decoded pixel data intercepted by extending the prediction direction vector in two opposite directions through the pixel location. When the prediction direction vector points toward quadrant II of the Cartesian plane, the pixel location's value is derived from pixel values in one region intercepted by the prediction direction vector through the pixel location, and from a second region intercepted by a vector that is orthogonal to the prediction direction vector.
-
Publication number: US20210227236A1
Publication date: 2021-07-22
Application number: US17221299
Application date: 2021-04-02
Applicant: Apple Inc.
Inventor: Alexandros Tourapis , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jae Hoon Kim , Jiefu Zhai , Ming Chen , Xiaosong Zhou
IPC: H04N19/29 , H04N19/103 , G09G5/14 , H04N21/44
Abstract: Aspects of the present disclosure provide techniques for reducing latency and improving image quality of a viewport extracted from multi-directional video communications. According to such techniques, first streams of coded video data are received from a source. The first streams include coded data for each of a plurality of tiles representing a multi-directional video, where each tile corresponds to a predetermined spatial region of the multi-directional video, and at least one tile of the plurality of tiles in the first streams contains a current viewport location at a receiver. The techniques include decoding the first streams and displaying the tile containing the current viewport location. When the viewport location at the receiver changes to include a new tile of the plurality of tiles, the techniques include retrieving and decoding first streams for the new tile, displaying the decoded content for the changed viewport location, and transmitting the changed viewport location to the source.