-
公开(公告)号:US10574997B2
公开(公告)日:2020-02-25
申请号:US15796313
申请日:2017-10-27
Applicant: Apple Inc.
Inventor: Chris Y. Chung , Mei Guo , Hsi-Jung Wu , Jingteng Xue , Jun Xin
IPC: H04N19/176 , H04N19/117 , H04N19/147 , H04N19/82 , H04N19/105 , H04N19/196
Abstract: Systems and methods are disclosed for coding pixel blocks of an input frame in which coding costs of a plurality of candidate coding modes are estimated. A coding cost of a candidate coding mode may be estimated based on noise estimate associated with the candidate coding mode. A coding mode for the input pixel block may be selected based on a comparison of the estimated coding costs of the plurality of candidate coding modes. The input pixel block may then be coded according to the selected coding mode.
-
公开(公告)号:US10432946B2
公开(公告)日:2019-10-01
申请号:US14964965
申请日:2015-12-10
Applicant: Apple Inc.
Inventor: Yeping Su , Chris Y. Chung , Hsi-Jung Wu , Xiaosong Zhou , Jiefu Zhai
IPC: H04N7/12 , H04N19/154 , H04N19/115 , H04N19/137 , H04N19/187 , H04N19/177 , H04N7/01 , H04N19/117 , H04N19/31 , H04N19/587 , H04N19/46 , H04N19/85
Abstract: Judder artifacts are remedied in video coding system by employing frame rate conversion at an encoder. A source video sequence may be coded as base layer coded video at a first frame rate. An encoder may identify a portion of the coded video sequence that likely will exhibit judder effects when decoded. For those portions that likely will exhibit judder effects, video data representing the portion of the source video may be coded at a higher frame rate than a frame rate of the coded base layer data as enhancement layer data. Moreover, an encoder may generate metadata representing “FRC hints”—techniques that a decoder should employ when performing decoder-side frame rate conversion. An encoding terminal may transmit the base layer coded video and either the enhancement layer coded video or the FRC hints to a decoder. Thus, encoder infrastructure may mitigate against judder artifacts that may arise during decoding.
-
公开(公告)号:US20190004414A1
公开(公告)日:2019-01-03
申请号:US15638848
申请日:2017-06-30
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Ming Chen , Xiaosong Zhou , Hsi-Jung Wu , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung
IPC: G03B37/04 , G06T3/40 , H04N5/232 , H04N13/02 , H04N19/172 , H04N19/159
Abstract: Techniques are described for implementing format configurations for multi-directional video and for switching between them. Source images may be assigned to formats that may change during a coding session. When a change occurs between formats, video coders and decoder may transform decoded reference frames from the first format to the second format. Thereafter, new frames in the second configuration may be coded or decoded predictively using transformed reference frame(s) as source(s) of prediction. In this manner, video coders and decoders may use intra-coding techniques and achieve high efficiency in coding.
-
公开(公告)号:US20180262714A1
公开(公告)日:2018-09-13
申请号:US15455890
申请日:2017-03-10
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Chris Y. Chung , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Xiaosong Zhou , Jiefu Zhai
IPC: H04N7/14 , H04N7/15 , G06T7/194 , G06F3/0488
CPC classification number: H04N7/147 , G06F3/0488 , H04N7/15 , H04N13/239 , H04N2007/145 , H04N2013/0092
Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift of the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.
-
55.
公开(公告)号:US20180234700A1
公开(公告)日:2018-08-16
申请号:US15433505
申请日:2017-02-15
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Chris Y. Chung , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jiefu Zhai , Xiaosong Zhou
IPC: H04N19/597 , H04N19/61 , H04N19/105 , H04N19/513 , H04N19/159 , H04N19/124 , H04N19/13
Abstract: Multi-directional image data often contains distortions of image content that cause problems when processed by video coders that are designed to process traditional, “flat” image content. Embodiments of the present disclosure provide techniques for coding multi-directional image data using such coders. For each pixel block in a frame to be coded, an encoder may transform reference picture data within a search window about a location of the input pixel block based on displacement respectively between the location of the input pixel block and portions of the reference picture within the search window. The encoder may perform a prediction search among the transformed reference picture data to identify a match between the input pixel block and a portion of the transformed reference picture and, when a match is identified, the encoder may code the input pixel block differentially with respect to the matching portion of the transformed reference picture. The transform may counter-act distortions imposed on image content of the reference picture data by the multi-directional format, which aligns the content with image content of the input picture. The techniques apply both for intra-coding and inter-coding.
-
公开(公告)号:US10021411B2
公开(公告)日:2018-07-10
申请号:US14704707
申请日:2015-05-05
Applicant: Apple Inc.
Inventor: Yeping Su , Chris Y. Chung , Hsi-Jung Wu , Jiefu Zhai , Ke Zhang , Xiaosong Zhou
IPC: H04N7/12 , H04N19/503 , H04N19/124 , H04N19/172 , H04N19/147 , H04N19/30
CPC classification number: H04N19/503 , H04N19/124 , H04N19/147 , H04N19/172 , H04N19/30
Abstract: A scalable coding system codes video as a base layer representation and an enhancement layer representation. A base layer coder may code an LDR representation of a source video. A predictor may predict an HDR representation of the source video from the coded base layer data. A comparator may generate prediction residuals which represent a difference between an HDR representation of the source video and the predicted HDR representation of the source video. A quantizer may quantize the residuals down to an LDR representation. An enhancement layer coder may code the LDR residuals. In other scalable coding systems, the enhancement layer coder may code LDR-converted HDR video directly.
-
公开(公告)号:US09992450B1
公开(公告)日:2018-06-05
申请号:US15468919
申请日:2017-03-24
Applicant: Apple Inc.
Inventor: Hang Yuan , Chris Y. Chung , Dazhong Zhang , Hsi-Jung Wu , Xiaosong Zhou , Jae Hoon Kim , Jiefu Zhai
CPC classification number: G06K9/4604 , G06K9/00711 , G06K2009/4666 , H04N5/262 , H04N7/147
Abstract: Embodiments of the present disclosure provide systems and methods for background concealment in a video conferencing session. In one exemplary method, a video stream may be captured and provided to a first terminal participating in a video chat session. A background element and a foreground element may be determined in the video stream. A border region may additionally be determined in the video stream. The border region may define a boundary between the foreground element and the background element. The background region may be modified based, at least in part, on video content of the border region. The modified video stream may be transmitted to a second terminal participating in the video conferencing session.
-
公开(公告)号:US09860485B2
公开(公告)日:2018-01-02
申请号:US14968025
申请日:2015-12-14
Applicant: Apple Inc.
Inventor: Chris Y. Chung , Wen Zhao , Luciano M. Verger , Hyeonkuk Jeong , Gang Lu , Srinivasan Vasudevan , Xiaosong Zhou , Hsi-Jung Wu , James O. Normile
CPC classification number: H04N7/15 , H04L12/1827 , H04L65/1069 , H04N5/23241 , H04N5/372 , H04W52/02 , H04W52/0209 , H04W52/0216 , Y02D70/00 , Y02D70/1224 , Y02D70/1242 , Y02D70/1262 , Y02D70/142 , Y02D70/144 , Y02D70/146 , Y02D70/164 , Y02D70/168 , Y02D70/24
Abstract: In video conferencing over a radio network, the radio equipment is a major power consumer especially in cellular networks such as LTE. In order to reduce the radio power consumption in video conferencing, it is important to introduce an enough radio inactive time. Several types of data buffering and bundling can be employed within a reasonable range of latency that doesn't significantly disrupt the real-time nature of video conferencing. In addition, the data transmission can be synchronized to the data reception in a controlled manner, which can result in an even longer radio inactive time and thus take advantage of radio power saving modes such as LTE C-DRX.
-
公开(公告)号:US09769473B2
公开(公告)日:2017-09-19
申请号:US13755928
申请日:2013-01-31
Applicant: Apple Inc.
Inventor: Yao-Chung Lin , Xiaosong Zhou , Hsi-Jung Wu , Douglas Scott Price , Chris Y. Chung , Dazhong Zhang
IPC: H04N19/503 , H04N19/51
CPC classification number: H04N19/51 , H04N19/503
Abstract: Video coders may perform perspective transformation of reference frames during coding in a manner that conserves processing resources. When a new input frame is available for coding, a camera position for the input frame may be estimated. A video coder may search for reference pictures having similar camera positions as the position of the input frame and, for each reference picture identified, the video coder may perform a prediction search to identify a reference picture that is the best prediction match for the input frame. Once the video coder identifies a reference picture to serve as a prediction source for the input frame, the video coder may derive a transform to match the reference frame data to the input frame data and may transform the reference picture accordingly. The video coder may code the input frame using the transformed reference picture as a prediction reference and may transmit coded frame data and the camera position of the input frame to a decoder. Thus, the video coder may perform derivation and execution of transforms on a limited basis which conserves system resources.
-
公开(公告)号:US09716871B2
公开(公告)日:2017-07-25
申请号:US14603109
申请日:2015-01-22
Applicant: Apple Inc.
Inventor: Jiefu Zhai , Yeping Su , Hsi-Jung Wu , Chris Y. Chung , Xiaosong Zhou , Ke Zhang
Abstract: An encoder may include a luma transform, a transformer, and a chroma transform. The luma transform may determine a linear luminance value based upon a plurality of primary color values of a pixel. The transformer may generate a transformed luminance value based upon the linear luminance value and a plurality of transformed color values based upon corresponding more than one of the primary color values of the pixel. The chroma transform may determine a plurality of chroma values based upon corresponding plurality of transformed color values and the transformed luminance value of the pixel.
-
-
-
-
-
-
-
-
-