-
Publication No.: US12192440B2
Publication Date: 2025-01-07
Application No.: US17568266
Application Date: 2022-01-04
Applicant: Apple Inc.
Inventor: Jiefu Zhai, Xingyu Zhang, Xiaosong Zhou, Jun Xin, Hsi-Jung Wu, Yeping Su
IPC: H04N19/105, G06N3/08, H04N19/147, H04N19/159, H04N19/176, H04N19/61
Abstract: Systems and methods are disclosed for video compression, utilizing neural networks for predictive video coding. The processes employed combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.
-
Publication No.: US12149699B2
Publication Date: 2024-11-19
Application No.: US17492789
Application Date: 2021-10-04
Applicant: Apple Inc.
Inventor: Yeping Su, Xingyu Zhang, Chris Chung, Jun Xin, Hsi-Jung Wu
IPC: H04N19/136, G06N20/00, H04N19/103, H04N19/179, H04N21/2343, H04N21/845
Abstract: Techniques are disclosed by which a coding parameter is determined to encode video data resulting in encoded video data possessing a highest possible video quality. Features may be extracted from an input video sequence. The extracted features may be compared to features described in a model of coding parameters generated by a machine learning algorithm from reviews of previously-coded videos, extracted features of the previously-coded videos, and coding parameters of the previously-coded videos. When a match is detected between the extracted features of the input video sequence and extracted features represented in the model, a determination may be made as to whether coding parameters that correspond to the matching extracted feature correspond to a tier of service to which the input video sequence is to be coded. When the coding parameters that correspond to the matching extracted feature correspond to the tier of service to which the input video sequence is to be coded, the input video sequence may be coded according to the coding parameters.
-
Publication No.: US20240195978A1
Publication Date: 2024-06-13
Application No.: US18065220
Application Date: 2022-12-13
Applicant: Apple Inc.
Inventor: Yeqing Wu, Yunfei Zheng, Alexandros Tourapis, Yixin Du, Alican Nalci, Hilmi E. Egilmez, Guoxin Jin, Jun Xin, Hsi-Jung Wu
IPC: H04N19/137, H04N19/105, H04N19/172, H04N19/176
CPC classification number: H04N19/137, H04N19/105, H04N19/172, H04N19/176
Abstract: The present disclosure describes techniques for efficient coding of motion vectors developed for multi-hypothesis coding applications. According to these techniques, when coding hypotheses are developed, each having a motion vector identifying a source of prediction for a current pixel block, a motion vector for a first one of the coding hypotheses may be predicted from the motion vector of a second coding hypothesis. The first motion vector may be represented by coding a motion vector residual, which represents a difference between the developed motion vector for the first coding hypothesis and the predicted motion vector for the first coding hypothesis, and outputting the coded residual to a channel. In another embodiment, a motion vector residual may be generated for a motion vector of a first coding hypothesis, and the first motion vector and the motion vector residual may be used to predict a second motion vector and a predicted motion vector residual. The second hypothesis's motion vector may be coded as a difference between the motion vector, the predicted second motion vector, and the predicted motion vector residual. In a further embodiment, a single motion vector residual may be output for the motion vectors of two coding hypotheses representing a difference between the motion vector of one of the hypotheses and a predicted motion vector for that hypothesis.
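The first embodiment in this abstract (predicting one hypothesis's motion vector from the other and coding only the residual) can be illustrated with a minimal sketch. The identity predictor, the function names, and the integer `(dx, dy)` representation are assumptions made for illustration; the abstract does not specify how the prediction is formed.

```python
from typing import Tuple

# A motion vector as (dx, dy); integer units are an assumption.
MV = Tuple[int, int]


def predict_mv(reference_mv: MV) -> MV:
    """Hypothetical predictor: reuse the other hypothesis's vector unchanged."""
    return reference_mv


def encode_residual(mv_first: MV, mv_second: MV) -> MV:
    """Residual = developed vector of the first hypothesis minus its prediction."""
    pred = predict_mv(mv_second)
    return (mv_first[0] - pred[0], mv_first[1] - pred[1])


def decode_mv(residual: MV, mv_second: MV) -> MV:
    """Decoder side: rebuild the first vector from the same prediction plus residual."""
    pred = predict_mv(mv_second)
    return (residual[0] + pred[0], residual[1] + pred[1])
```

When the two hypotheses point to similar locations, the residual is small and so cheaper to code than the full vector, which is the efficiency the abstract refers to.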
-
Publication No.: US11570417B2
Publication Date: 2023-01-31
Application No.: US17326204
Application Date: 2021-05-20
Applicant: Apple Inc.
Inventor: Fanyi Duanmu, Jun Xin, Hsi-Jung Wu, Xiaosong Zhou
IPC: H04N13/00, H04N21/24, H04N21/218, H04N13/117, H04N13/178
Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.
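The two-stage buffering described in this abstract can be sketched roughly as follows. Everything specific here is an assumption for illustration: content is split into equal-width tiles around 360 degrees, the viewport is predicted by linear extrapolation of head yaw, and the names `predict_yaw`, `tiles_in_viewport`, and `fill_buffer` are hypothetical.

```python
from typing import Dict, List, Set, Tuple


def predict_yaw(yaw_samples: List[Tuple[float, float]], lookahead: float) -> float:
    """Extrapolate yaw (degrees) linearly from the last two (time, yaw) samples."""
    (t0, y0), (t1, y1) = yaw_samples[-2:]
    rate = (y1 - y0) / (t1 - t0)
    return (y1 + rate * lookahead) % 360.0


def tiles_in_viewport(yaw: float, fov: float, tile_count: int) -> Set[int]:
    """Indices of tiles whose angular extent overlaps the predicted viewport."""
    width = 360.0 / tile_count
    selected = set()
    for i in range(tile_count):
        center = (i + 0.5) * width
        # Shortest angular distance between tile centre and viewport centre.
        distance = abs((center - yaw + 180.0) % 360.0 - 180.0)
        if distance <= fov / 2 + width / 2:
            selected.add(i)
    return selected


def fill_buffer(low: Dict[int, str], high: Dict[int, str],
                yaw_samples: List[Tuple[float, float]],
                lookahead: float, fov: float) -> Dict[int, str]:
    """First store low-detail data, then overwrite predicted tiles with high detail."""
    buffer = dict(low)  # first times: low level of detail for every tile
    predicted = predict_yaw(yaw_samples, lookahead)
    for i in tiles_in_viewport(predicted, fov, len(low)):
        buffer[i] = high[i]  # second times: high detail for the predicted viewport
    return buffer
```

Keeping the low-detail data in the buffer means something can still be displayed if the viewport prediction turns out to be wrong.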
-
Publication No.: US10200687B2
Publication Date: 2019-02-05
Application No.: US15612880
Application Date: 2017-06-02
Applicant: Apple Inc.
Inventor: Mei Guo, Jun Xin, Yeping Su, Chris Y. Chung, Xiaosong Zhou, Hsi-Jung Wu
IPC: H04N19/117, H04N19/136, H04N19/186, H04N19/46, H04N19/157, H04N19/80
Abstract: Methods of sample adaptive offset (SAO) for high dynamic range (HDR) video compression are disclosed. For band offset (BO), the sample range of each component is unequally divided into a number of bands based on the distribution of samples, the electro-optical transfer function (EOTF) of the HDR source, and one perceptually uniform EOTF. Each component may have a different number of bands, a different number of signaled bands, and a different range value for signaled offsets in BO mode. For edge offset (EO), Cb and Cr may use separate syntax elements. In the sample classification for EO, the difference between the current sample and its neighbor is compared with a threshold to determine whether it is a local valley, a local peak, or a corner. A distortion measure for HDR source images is also disclosed.
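The thresholded edge-offset classification mentioned in this abstract might look like the sketch below. It assumes a one-dimensional neighbourhood (one neighbour on each side along the chosen edge direction) and borrows the concave/convex corner naming from conventional SAO; the function names and the exact comparison rule are illustrative assumptions, not taken from the patent.

```python
def thresholded_sign(diff: int, threshold: int) -> int:
    """Return +1 or -1 only when the difference exceeds the threshold, else 0."""
    if diff > threshold:
        return 1
    if diff < -threshold:
        return -1
    return 0


def classify_edge(left: int, current: int, right: int, threshold: int) -> str:
    """Classify the current sample against its two neighbours along one direction."""
    score = (thresholded_sign(current - left, threshold)
             + thresholded_sign(current - right, threshold))
    categories = {
        -2: "local valley",    # clearly below both neighbours
        -1: "concave corner",  # below one neighbour, within threshold of the other
        0: "none",             # flat within threshold, or monotonic
        1: "convex corner",    # above one neighbour, within threshold of the other
        2: "local peak",       # clearly above both neighbours
    }
    return categories[score]
```

With a threshold of zero this reduces to a plain sign comparison; a larger threshold keeps small sample differences from being treated as edges.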
-
Publication No.: US20240323442A1
Publication Date: 2024-09-26
Application No.: US18603138
Application Date: 2024-03-12
Applicant: Apple Inc.
Inventor: Alican Nalci, Guoxin Jin, Hilmi Enes Egilmez, Hsi-Jung Wu, Jun Xin, Yixin Du, Yunfei Zheng, Yeqing Wu
IPC: H04N19/625, H04N19/176, H04N19/70
CPC classification number: H04N19/625, H04N19/176, H04N19/70
Abstract: In an example method, a decoder accesses a bitstream representing video content, and parses one or more flexible coefficient position (FCP) syntax from the bitstream, where the one or more FCP syntax indicate one or more index values. The decoder further determines side information representing one or more characteristics of an encoded portion of the video content. The decoder interprets the one or more FCP syntax based on the side information, including determining a coefficient position with respect to the encoded portion of the video content based on the one or more index values and the side information. The decoder decodes the encoded portion of the video content according to the coefficient position.
-
Publication No.: US20230188738A1
Publication Date: 2023-06-15
Application No.: US18076166
Application Date: 2022-12-06
Applicant: Apple Inc.
Inventor: Alican Nalci, Alexandros Tourapis, Hilmi Enes Egilmez, Hsi-Jung Wu, Jun Xin, Yeqing Wu, Yixin Du, Yunfei Zheng
IPC: H04N19/44, H04N19/184
CPC classification number: H04N19/44, H04N19/184
Abstract: In an example method, a decoder obtains a data stream representing video content. The video content is partitioned into one or more logical units, and each of the logical units is partitioned into one or more respective logical sub-units. The decoder determines that the data stream includes first data indicating that a first logical unit has been encoded according to a flexible skip coding scheme. In response, the decoder determines a first set of decoding parameters based on the first data, and decodes each of the logical sub-units of the first logical unit according to the first set of decoding parameters.
-
Publication No.: US20220377304A1
Publication Date: 2022-11-24
Application No.: US17326204
Application Date: 2021-05-20
Applicant: Apple Inc.
Inventor: Fanyi Duanmu, Jun Xin, Hsi-Jung Wu, Xiaosong Zhou
IPC: H04N13/117, H04N21/218, H04N13/178, H04N21/24
Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.
-
Publication No.: US20220360814A1
Publication Date: 2022-11-10
Application No.: US17736881
Application Date: 2022-05-04
Applicant: Apple Inc.
Inventor: Yeqing Wu, Alexandros Tourapis, Yunfei Zheng, Hsi-Jung Wu, Jun Xin, Albert E. Keinath, Mei Guo, Alican Nalci
IPC: H04N19/52, H04N19/176, H04N19/105, H04N19/159, H04N19/124
Abstract: An encoder or decoder can perform enhanced motion vector prediction by receiving an input block of data for encoding or decoding and accessing stored motion information for at least one other block of data. Based on the stored motion information, the encoder or decoder can generate a list of one or more motion vector predictor candidates for the input block in accordance with an adaptive list construction order. The encoder or decoder can predict a motion vector for the input block based on at least one of the one or more motion vector predictor candidates.
-
Publication No.: US11388419B2
Publication Date: 2022-07-12
Application No.: US16834693
Application Date: 2020-03-30
Applicant: Apple Inc.
Inventor: Mei Guo, Jun Xin, Hsi-Jung Wu, Alexandros Tourapis, Krishnakanth Rapaka
IPC: H04N19/159, H04N19/182, H04N19/176
Abstract: Improved video coding techniques include deriving prediction pixel blocks for use in intra-coding video and combined inter- and intra-coding video. In a first aspect, the techniques may include deriving value(s) for pixel location(s) of the prediction pixel block by, when a prediction direction vector assigned to the prediction vector points to quadrants I or III of a Cartesian plane, deriving the pixel location's value from pixel values in two regions of previously-decoded pixel data intercepted by extending the prediction direction vector in two opposite directions through the pixel location. When the prediction direction vector points toward quadrants II of the Cartesian plane, the techniques may include deriving the pixel location's value from pixel values in one region intercepted by the prediction direction vector through the pixel location, and from a second region intercepted by a vector that is orthogonal to the prediction direction vector.