-
公开(公告)号:US20190104315A1
公开(公告)日:2019-04-04
申请号:US15724798
申请日:2017-10-04
Applicant: Apple Inc.
Inventor: Mei Guo , Jun Xin , Yeping Su , Chris Y. Chung , Xiaosong Zhou , Hsi-Jung Wu
IPC: H04N19/142 , H04N19/172 , H04N19/66 , H04N19/14 , H04N19/44 , H04N19/115
Abstract: The present disclosure describes techniques for coding video data in a manner that provides consistency to portions of the video that have similar content. According to such techniques, a video sequence may be parsed into partitions and content of the partitions may be analyzed. Partitions may be grouped together based on detected similarities in content. Coding parameters may be selected for each partition based on the partition's membership in the groups. Thus, when the video sequence is coded, coding parameters for frames of two commonly-grouped partitions may be similar, which causes coded video data to have similar presentation.
-
公开(公告)号:US10212456B2
公开(公告)日:2019-02-19
申请号:US15612806
申请日:2017-06-02
Applicant: Apple Inc.
Inventor: Mei Guo , Feng Yi , Jun Xin , Chris Y. Chung , Yeping Su , Xiaosong Zhou , Jun Xu , Hsi-Jung Wu , Jingteng Xue
IPC: H04N19/86 , H04N19/117 , H04N19/159
Abstract: Methods of in-loop deblocking filter for high dynamic range (HDR) video compression are disclosed. HDR processing and standard dynamic range (SDR) processing adopt different electro-optical transfer function (EOTF) to convert digital code words to linear luminance. For compressing HDR video, EOTF is proposed to be involved in the selection of two deblocking parameters, β and tC, which control the strength of deblocking filter. In local activity checking for filter decisions, the calculated local signal characteristics and the thresholds are adjusted according to EOTF. After deblocking filter, the clipping range is modified based on EOTF. The chroma deblocking filter is also extended to inter-inter block boundary.
-
公开(公告)号:US20180352225A1
公开(公告)日:2018-12-06
申请号:US15612880
申请日:2017-06-02
Applicant: Apple Inc.
Inventor: Mei Guo , Jun Xin , Yeping Su , Chris Y. Chung , Xiaosong Zhou , Hsi-Jung Wu
IPC: H04N19/117 , H04N19/136 , H04N19/186 , H04N19/46 , H04N19/157 , H04N19/80
CPC classification number: H04N19/117 , H04N19/136 , H04N19/157 , H04N19/186 , H04N19/46 , H04N19/80
Abstract: Methods of sample adaptive offset (SAO) for high dynamic range (HDR) video compression are disclosed. For band offset (BO), the sample range of each component is unequally divided into a number of bands based on the distribution of samples, the electro-optical transfer function (EOTF) of HDR source and one perceptually uniform EOTF. Each component may have a different number of bands, different number of signaled bands, and different range value for signaled offsets in BO mode. For edge offset (EO), Cb and Cr may use separate syntax elements. In the sample classification for EO, the difference between the current sample and its neighbor is compared with a threshold to determine whether it is a local valley, a local peak, or corners. A distortion measure for HDR source images are also disclosed.
-
公开(公告)号:US20170359580A1
公开(公告)日:2017-12-14
申请号:US15614881
申请日:2017-06-06
Applicant: Apple Inc.
Inventor: Yeping Su , Xingyu Zhang , Chris Chung , Jun Xin , Hsi-Jung Wu
IPC: H04N19/139 , H04N19/103 , H04N19/44 , G06N99/00 , H04N21/845
CPC classification number: G06N20/00 , H04N19/103 , H04N19/136 , H04N19/179 , H04N21/23439 , H04N21/8456
Abstract: Techniques are disclosed by which a coding parameter is determined to encode video data resulting in encoded video data possessing a highest possible video quality. Features may be extracted from an input video sequence. The extracted features may be compared to features described in a model of coding parameters generated by a machine learning algorithm from reviews of previously-coded videos, extracted features of the previously-coded videos, and coding parameters of the previously-coded videos. When a match is detected between the extracted features of the input video sequence and extracted features represented in the model, a determination may be made as to whether coding parameters that correspond to the matching extracted feature correspond to a tier of service to which the input video sequence is to be coded. When the coding parameters that correspond to the matching extracted feature correspond to the tier of service to which the input video sequence is to be coded, the input video sequence may be coded according to the coding parameters.
-
公开(公告)号:US20170353704A1
公开(公告)日:2017-12-07
申请号:US15611636
申请日:2017-06-01
Applicant: Apple Inc.
Inventor: Yeping Su , Chris Chung , Hsi-Jung Wu , Xiaosong Zhou , Jun Xin , Jun Xu
Abstract: Embodiments of the present disclosure provide techniques for environment aware supervised HDR tone mapping. According to those techniques, responsive to metadata provided with HDR source video data, the received metadata may be compared to sensor date representing viewing conditions at a display device. Tone mapping corrections to the HDR source video data may be derived from the comparison of the received metadata to the sensor data. The HDR source video data may be altered based on the tone mapping corrections. The altered HDR source video data may be used to drive the display device.
-
公开(公告)号:US12137199B2
公开(公告)日:2024-11-05
申请号:US18407198
申请日:2024-01-08
Applicant: Apple Inc.
Inventor: Fanyi Duanmu , Jun Xin , Hsi-Jung Wu , Xiaosong Zhou
IPC: H04N13/117 , H04N13/178 , H04N21/218 , H04N21/24
Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.
-
公开(公告)号:US20240048776A1
公开(公告)日:2024-02-08
申请号:US17956444
申请日:2022-09-29
Applicant: Apple Inc.
Inventor: Yixin Du , Alexandros Tourapis , Alican Nalci , Guoxin Jin , Hilmi Enes Egilmez , Hsi-Jung Wu , Jun Xin , Yeqing Wu , Yunfei Zheng
IPC: H04N19/86 , H04N19/117 , H04N19/139 , H04N19/176
CPC classification number: H04N19/86 , H04N19/117 , H04N19/139 , H04N19/176
Abstract: Disclosed is a method that includes receiving an image frame having a plurality of coded blocks, determining a prediction unit (PU) from the plurality of coded blocks, determining one or more motion compensation units arranged in an array within the PU, and applying a filter to one or more boundaries of the one or more motion compensation units. Also disclosed is a method that includes receiving a reference frame that includes a reference block, determining a timing for deblocking a current block, performing motion compensation on the reference frame to obtain a predicted frame that includes a predicted block, performing reconstruction on the predicted frame to obtain a reconstructed frame that includes a reconstructed PU, and applying, at the timing for deblocking the current block, a deblocking filter based on one or more parameters to the reference block, the predicted block, or the reconstructed PU.
-
公开(公告)号:US20240040151A1
公开(公告)日:2024-02-01
申请号:US18312077
申请日:2023-05-04
Applicant: Apple Inc.
Inventor: Alican Nalci , Yunfei Zheng , Hilmi E. Egilmez , Yeqing WU , Yixin Du , Alexis Tourapis , Jun Xin , Hsi-Jung Wu
IPC: H04N19/60 , H04N19/124 , H04N19/18 , H04N19/172
CPC classification number: H04N19/60 , H04N19/124 , H04N19/18 , H04N19/172
Abstract: Techniques are described for express and implied signaling of transform mode selections in video coding. Information derived from coefficient samples in a given transform unit (TU) or prediction unit (PU) may constrain or modify signaling of certain syntax elements at the coding block (CB), TU, or PU levels. For instance, based on the spatial locations of decoded coefficients, the spatial patterns of coefficients, or the correlation with the coefficients in neighboring blocks, various syntax elements such as the transform type and related flags/indices or secondary transform modes/flags indices, a residual coding mode, intra and inter prediction modes, and scanning order may be disabled or constrained. In another case, if the coefficient samples match a desired spatial pattern or have other desired properties then a default transform type, a default secondary transform type, a default intra and inter prediction mode or other block level modes may be inferred at the decoder side. Similar ideas may be extended to infer intra-block copy (IBC) flags/modes/indices from coefficient samples, or to infer flags and modes related to predictive coding techniques such as block-wise differential pulse code modulation (BDPCM), i.e., horizontal and vertical BDPCM modes, palette mode syntax and other block level flags/indices.
-
公开(公告)号:US20230117742A1
公开(公告)日:2023-04-20
申请号:US18083468
申请日:2022-12-16
Applicant: Apple Inc.
Inventor: Fanyi Duanmu , Jun Xin , Hsi-Jung Wu , Xiaosong Zhou
IPC: H04N13/117 , H04N21/218 , H04N21/24 , H04N13/178
Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.
-
30.
公开(公告)号:US11323730B2
公开(公告)日:2022-05-03
申请号:US16561478
申请日:2019-09-05
Applicant: Apple Inc.
Inventor: Mei Guo , Jun Xin , Yeping Su , Hsi-Jung Wu , Alexandros Tourapis
IPC: G01R31/34 , H02P29/024 , H01H71/70 , H04N19/44 , H04N19/159 , H04N19/177 , H04N19/172 , H04N19/46
Abstract: Techniques for video exchange are disclosed in which a video source sends redundant copies of video coded at a given presentation time. A decoder may determine, from metadata stored with a first decoded frame, whether other decoded frames have been designated as correlated to the first decoded frame. If so, the decoder may fetch other decoding frames using time indicator values of other decoded frames in metadata stored with the first decoded frame. When other decoded frame(s) are found, the decoder may blend content of the first coded frame and the other decoded frame(s). The decoder may render the blended frame. When one such coded frame is an intra-coded frame, the proposed technique may alleviate beating artifacts by blending recovered data of the intra-coded frame with recovered data of another frame, which may be inter-coded.
-
-
-
-
-
-
-
-
-