-
21.
Publication No.: US20240340438A1
Publication date: 2024-10-10
Application No.: US18593281
Filing date: 2024-03-01
Applicant: Apple Inc.
Inventor: Yeqing WU , Yunfei ZHENG , Yixin DU , Alican NALCI , Hilmi E. EGILMEZ , Guoxin JIN , Alexandros TOURAPIS , Jun XIN , Hsi-Jung WU
IPC: H04N19/52 , H04N19/105 , H04N19/124 , H04N19/127 , H04N19/154 , H04N19/172 , H04N19/176
CPC classification number: H04N19/52 , H04N19/105 , H04N19/124 , H04N19/127 , H04N19/154 , H04N19/172 , H04N19/176
Abstract: Techniques are proposed to improve temporal motion projection in video coding. Candidate reference frames available for use in temporal motion projection are sorted in processing order according to scores assigned based on estimates of the reference frames' suitability for prediction. Such estimates may be based on temporal distance between each candidate reference frame and that reference frame's prediction references. Estimates may also be based, for each candidate reference frame, on an estimate of coding quality of a reference frame from which the respective candidate reference frame makes a prediction reference. Once sorted, the candidate reference frames may be processed in the sorted order to supply prediction data to a current frame that is to be coded from the candidate reference frames. Additionally, hardware-friendly designs of motion field hole filling and motion vector smoothing operations are proposed. Such designs can reduce hardware implementation complexity and benefit hardware parallel processing in several ways: by removing the dependency among different processing block rows for hole filling and motion vector smoothing, so that hardware parallel processing becomes easier to achieve; by reducing the hardware bandwidth loading overhead; by improving hardware pipeline throughput; and/or by avoiding a line buffer to store data from the above row, since a line buffer increases hardware cost.
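The score-then-sort step in this abstract can be illustrated with a minimal Python sketch. The abstract does not give a scoring formula, so everything here is an assumption made for illustration: the `CandidateRef` fields, the weights, and the convention that a shorter temporal distance and a higher reference quality yield a higher score.

```python
from dataclasses import dataclass


@dataclass
class CandidateRef:
    name: str           # hypothetical label for a candidate reference frame
    order: int          # display order of the candidate
    ref_order: int      # display order of the frame the candidate predicts from
    ref_quality: float  # assumed quality estimate of that prediction reference


def score(c: CandidateRef, distance_weight: float = 1.0,
          quality_weight: float = 1.0) -> float:
    """Illustrative score: closer prediction references and higher-quality
    prediction references are assumed to be more suitable."""
    temporal_distance = abs(c.order - c.ref_order)
    return quality_weight * c.ref_quality - distance_weight * temporal_distance


def processing_order(candidates):
    """Return candidates sorted from highest to lowest score.
    sorted() is stable, so ties keep their input order."""
    return sorted(candidates, key=score, reverse=True)


cands = [
    CandidateRef("A", order=8, ref_order=0, ref_quality=0.5),
    CandidateRef("B", order=4, ref_order=3, ref_quality=0.5),
    CandidateRef("C", order=6, ref_order=4, ref_quality=0.9),
]
print([c.name for c in processing_order(cands)])
```

A stable sort keeps the processing order deterministic when two candidates tie, which matters because an encoder and decoder would need to derive the same order.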
-
Publication No.: US20230096567A1
Publication date: 2023-03-30
Application No.: US17951919
Filing date: 2022-09-23
Applicant: Apple Inc.
Inventor: Alican NALCI , Alexandros TOURAPIS , Hsi-Jung WU , Jiefu ZHAI , Jingteng XUE , Jun XIN , Mei GUO , Xingyu ZHANG , Yeqing WU , Yunfei ZHENG , Jean Begaint
IPC: H04N19/147 , H04N19/42 , H04N19/172 , H04N19/176 , H04N19/186 , H04N19/119 , H04N19/91 , H04N19/70 , H04N19/124 , H04N19/60
Abstract: Improved neural-network-based image and video coding techniques are presented, including hybrid techniques that include both tools of a host codec and neural-network-based tools. In these improved techniques, the host coding tools may include conventional video coding standards such as H.266 (VVC). In an aspect, source frames may be partitioned and either host or neural-network-based tools may be selected per partition. Coding parameter decisions for a partition may be constrained based on the partitioning and coding tool selection. Rate control for host and neural network tools may be combined. Multi-stage processing of neural network output may use a checkerboard prediction pattern.
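As a rough illustration of a checkerboard pattern, the sketch below splits a grid of positions into two interleaved stages. The two-stage split, the function names, and the idea that second-stage positions are predicted from first-stage neighbours are assumptions; the abstract only states that a checkerboard prediction pattern may be used.

```python
def checkerboard_stages(height, width):
    """Split a height x width grid into two interleaved stages.
    Positions where (row + col) is even go to the first stage (assumed)."""
    anchors, dependents = [], []
    for r in range(height):
        for c in range(width):
            (anchors if (r + c) % 2 == 0 else dependents).append((r, c))
    return anchors, dependents


def neighbours(pos, height, width):
    """4-connected neighbours of pos that fall inside the grid."""
    r, c = pos
    cand = [(r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)]
    return [(y, x) for y, x in cand if 0 <= y < height and 0 <= x < width]


anchors, dependents = checkerboard_stages(2, 3)
print(anchors)
print(dependents)
```

With this split, every 4-connected neighbour of a second-stage position belongs to the first stage, so all second-stage positions could be processed in parallel once the first stage is done.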
-
Publication No.: US20200177927A1
Publication date: 2020-06-04
Application No.: US16204792
Filing date: 2018-11-29
Applicant: Apple Inc.
Inventor: Xiaohua YANG , Alexandros TOURAPIS , Dazhong ZHANG , Hang YUAN , Hsi-Jung WU , Jae Hoon KIM , Jiefu ZHAI , Ming CHEN , Xiaosong ZHOU
IPC: H04N19/90 , H04N21/2343 , G06F3/01 , H04N19/597 , H04N19/52
Abstract: In communication applications, aggregate source image data at a transmitter exceeds the data that is needed to display a rendering of a viewport at a receiver. Improved streaming techniques are described that include estimating a location of a viewport at a future time. According to such techniques, the viewport may represent a portion of an image from a multi-directional video to be displayed at the future time, and tile(s) of the image may be identified in which the viewport is estimated to be located. In these techniques, the image data of tile(s) in which the viewport is estimated to be located may be requested at a first service tier, and the other tile(s) in which the viewport is not estimated to be located may be requested at a second service tier, lower than the first service tier.
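The tile selection described here might look like the following sketch. It assumes a flat tile grid without wrap-around, a viewport given by its top-left corner, and a simple linear extrapolation of viewport position; the abstract does not specify the estimator, and the tier names `tier1` and `tier2` are hypothetical.

```python
def tiles_for_viewport(vx, vy, vw, vh, tile_w, tile_h, cols, rows):
    """Return the (row, col) tiles overlapped by a viewport rectangle,
    clamped to the tile grid."""
    first_col = max(0, int(vx // tile_w))
    last_col = min(cols - 1, int((vx + vw - 1) // tile_w))
    first_row = max(0, int(vy // tile_h))
    last_row = min(rows - 1, int((vy + vh - 1) // tile_h))
    return {(r, c)
            for r in range(first_row, last_row + 1)
            for c in range(first_col, last_col + 1)}


def plan_requests(pos, velocity, dt, viewport_size, tile_size, grid):
    """Map every tile to a service tier for the estimated future viewport.
    Linear extrapolation of the viewport position is an assumption."""
    future_x = pos[0] + velocity[0] * dt
    future_y = pos[1] + velocity[1] * dt
    cols, rows = grid
    hit = tiles_for_viewport(future_x, future_y, *viewport_size,
                             *tile_size, cols, rows)
    return {(r, c): ("tier1" if (r, c) in hit else "tier2")
            for r in range(rows) for c in range(cols)}


plan = plan_requests(pos=(0, 0), velocity=(100, 0), dt=1.0,
                     viewport_size=(200, 100), tile_size=(128, 128),
                     grid=(4, 2))
print(sorted(t for t, tier in plan.items() if tier == "tier1"))
```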
-
Publication No.: US20170359596A1
Publication date: 2017-12-14
Application No.: US15178304
Filing date: 2016-06-09
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Xiaosong ZHOU , Sudeng Hu , Chris CHUNG , Dazhong ZHANG , Hsi-Jung WU
IPC: H04N19/59 , H04N19/61 , H04N19/167 , H04N19/182
CPC classification number: H04N19/59 , H04N19/103 , H04N19/124 , H04N19/132 , H04N19/146 , H04N19/167 , H04N19/176 , H04N19/182 , H04N19/187 , H04N19/31 , H04N19/33 , H04N19/61 , H04N19/85
Abstract: Video coding techniques are disclosed that can accommodate low bandwidth events and preserve visual quality, at least in areas of an image that have high significance to a viewer. Region(s) of interest may be identified from content of an input frame that will be coded. Two representations of the input frame may be generated at different resolutions. A low resolution representation of the input frame may be coded according to predictive coding techniques in which a portion outside the region of interest is coded at higher quality than a portion inside the region of interest. A high resolution representation of the input frame may be coded according to predictive coding techniques in which a portion inside the region of interest is coded at higher quality than a portion outside the region of interest. Doing so preserves visual quality, at least in areas of the input image that correspond to the region of interest.
-
Publication No.: US20170347112A1
Publication date: 2017-11-30
Application No.: US15168872
Filing date: 2016-05-31
Applicant: Apple Inc.
Inventor: Dazhong ZHANG , Jae Hoon KIM , Peikang SONG , Xiaosong ZHOU , Chris CHUNG , Hsi-Jung WU
IPC: H04N19/46 , H04L12/741 , H04L5/00 , H04L29/06 , H04N19/68
CPC classification number: H04N19/46 , H04L5/0055 , H04L45/745 , H04L65/602 , H04L65/607 , H04L65/608 , H04L65/80 , H04N19/68
Abstract: Coding and decoding techniques are disclosed in which a plurality of coding parameter sets is transmitted between an encoder and a decoder, each set being distinguishable from the others by a respective identifier. When a new frame of video is to be coded, an encoder may identify a coding parameter set to be applied during coding, it may code the new frame according to the identified coding parameter set, and it may transmit the coded frame to the decoder along with an identifier of the coding parameter set used during the coding. A plurality of coding parameter sets is persistent at the encoder and the decoder simultaneously.
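The identifier-based lookup can be sketched as below. The `ParameterSetStore` class, the `qstep` parameter, and the toy quantizer standing in for real coding are all hypothetical; the abstract only says that several parameter sets persist on both sides and that each coded frame carries the identifier of the set used.

```python
class ParameterSetStore:
    """Holds several coding parameter sets at once, keyed by identifier."""

    def __init__(self):
        self._sets = {}

    def put(self, set_id, params):
        self._sets[set_id] = dict(params)

    def get(self, set_id):
        return self._sets[set_id]


def encode_frame(store, set_id, samples):
    """Toy 'coding': quantize samples with the step from the chosen set and
    attach the set identifier to the coded frame."""
    qstep = store.get(set_id)["qstep"]
    return {"set_id": set_id, "payload": [s // qstep for s in samples]}


def decode_frame(store, coded):
    """Look up the same set by identifier and invert the toy quantizer."""
    qstep = store.get(coded["set_id"])["qstep"]
    return [q * qstep for q in coded["payload"]]


encoder_store, decoder_store = ParameterSetStore(), ParameterSetStore()
for store in (encoder_store, decoder_store):
    store.put(0, {"qstep": 2})  # both sets stay available on both sides
    store.put(1, {"qstep": 4})

coded = encode_frame(encoder_store, 1, [8, 13, 20])
print(coded, decode_frame(decoder_store, coded))
```

Because both stores keep every set, the encoder can switch sets from frame to frame by changing only the identifier, without retransmitting the parameters.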
-
26.
Publication No.: US20140321534A1
Publication date: 2014-10-30
Application No.: US14056043
Filing date: 2013-10-17
Applicant: Apple Inc.
Inventor: Yeping SU , Chris Y. CHUNG , Hao PAN , Hsi-Jung WU , Ke ZHANG
IPC: H04N19/117 , H04N19/124 , H04N19/139 , H04N19/136 , H04N19/179
CPC classification number: H04N19/117 , H04N19/136
Abstract: Video coding systems and methods are provided that protect against various artifacts (e.g., banding artifacts) in decoded frame content. Embodiments that protect against introducing artifacts include pre-processing the input video frame, controlling the encoding parameters, and/or post-processing the decoded video frame. A method for encoding a frame may include calculating a dark-scene score for the frame based on measured statistics of the frame. The dark-scene score may be compared to a threshold to determine whether banding artifacts may be present in the frame. Based on the comparison, filtering parameters for the frame may be altered to reduce the amount of noise filtered from the frame. The frame may be filtered based on the altered filtering parameters and encoded.
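A minimal sketch of the dark-scene check follows. The abstract does not define the score, so the statistic used here (the fraction of luma samples below a darkness threshold) and all numeric defaults are assumptions chosen only to show the compare-then-adjust flow.

```python
def dark_scene_score(luma, dark_level=32):
    """Assumed statistic: fraction of luma samples darker than dark_level."""
    if not luma:
        raise ValueError("frame has no samples")
    return sum(1 for v in luma if v < dark_level) / len(luma)


def select_filter_strength(luma, base_strength=1.0, score_threshold=0.6,
                           reduced_scale=0.25):
    """Weaken the denoising filter for frames judged to be dark scenes, so
    that less noise is removed from frames where banding may appear."""
    if dark_scene_score(luma) > score_threshold:
        return base_strength * reduced_scale
    return base_strength


dark_frame = [10] * 8 + [200] * 2
bright_frame = [200] * 10
print(select_filter_strength(dark_frame), select_filter_strength(bright_frame))
```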
-
Publication No.: US20130182778A1
Publication date: 2013-07-18
Application No.: US13731819
Filing date: 2012-12-31
Applicant: Apple Inc.
Inventor: Hsi-Jung WU , Barin Geoffry Haskell , Xiaojin Shi , James Oliver Normile
IPC: H04N7/26
CPC classification number: H04N19/44 , H04N19/149 , H04N19/152
Abstract: Disclosed is a system and method of controlling a video decoder, including reviewing channel data representing coded video data generated by an encoder to identify parameters of a hypothetical reference decoder (HRD) used by the encoder during coding operations. A parameter representing an exit data rate requirement of a coded picture buffer (CPB) of the HRD is compared against exit rate performance of the video decoder. If the exit rate performance of the video decoder matches the exit rate requirement of the HRD, the coded video data is decoded; otherwise, a decoding degradation scheme can be applied, including disabling the decoder from decoding the coded video data.
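The decision described here reduces to a rate comparison, sketched below. Treating "matches" as "the decoder's exit rate is at least the HRD requirement" is an interpretation, and the `Action` names and the `can_degrade` flag are hypothetical.

```python
from enum import Enum


class Action(Enum):
    DECODE = "decode"
    DEGRADE = "degrade"
    DISABLE = "disable"


def choose_action(hrd_exit_rate_bps, decoder_exit_rate_bps, can_degrade=True):
    """Compare the CPB exit-rate requirement signalled for the HRD against
    what the local decoder can sustain (rates in bits per second)."""
    if hrd_exit_rate_bps <= 0:
        raise ValueError("HRD exit rate must be positive")
    if decoder_exit_rate_bps >= hrd_exit_rate_bps:
        return Action.DECODE
    # Decoder cannot keep up: fall back to a degraded mode or refuse to decode.
    return Action.DEGRADE if can_degrade else Action.DISABLE


print(choose_action(20_000_000, 25_000_000))
print(choose_action(20_000_000, 10_000_000, can_degrade=False))
```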
-
Publication No.: US20240040124A1
Publication date: 2024-02-01
Application No.: US18358094
Filing date: 2023-07-25
Applicant: Apple Inc.
Inventor: Alican NALCI , Yunfei ZHENG , Hilmi Enes EGILMEZ , Yeqing WU , Yixin DU , Alexandros TOURAPIS , Jun XIN , Hsi-Jung WU , Arash VOSOUGHI , Dzung T. HOANG
IPC: H04N19/13 , H04N19/70 , H04N19/61 , H04N19/176
CPC classification number: H04N19/13 , H04N19/70 , H04N19/61 , H04N19/176
Abstract: A flexible coefficient coding (FCC) approach is presented. In the first aspect, spatial sub-regions are defined over a transform unit (TU) or a prediction unit (PU). These sub-regions organize the coefficient samples residing inside a TU or a PU into variable coefficient groups (VCGs). Each VCG corresponds to a sub-region inside a larger TU or PU. The shape of VCGs or the boundaries between different VCGs may be irregular, determined based on the relative distance of coefficient samples with respect to each other. Alternatively, the VCG regions may be defined according to scan ordering within a TU. Each VCG can encode 1) a different number of symbols for a given syntax element, or 2) a different number of syntax elements within the same TU or PU. Whether to code more symbols or more syntax elements may depend on the type of arithmetic coding engine used in a particular coding specification. For multi-symbol arithmetic coding (MS-AC), a VCG may encode a different number of symbols for a syntax element. For example, to encode absolute coefficient values inside a TU after performing a transform such as the discrete cosine transform (DCT), a VCG region may be defined around lower-frequency transform coefficients, and for that VCG M symbols can be used to encode the absolute coefficient values. Another VCG region can be defined around the higher-frequency transform coefficients to encode K symbols, where K may be different from M. For binary arithmetic coders (BACs), FCC allows for coding a variable number of syntax elements in different VCGs. In this case, one VCG in a TU may code M syntax elements associated with signaling the absolute coefficient value, where each one of the M syntax elements may have 2 symbols. Probability models and context derivation rules may be tailored for each VCG in a given TU or PU. Since each VCG may code a different number of symbols or syntax elements in different spatial locations of a TU or PU, different context models may be used for each VCG to provide better granularity for entropy modeling for arithmetic coding. Furthermore, different VCGs may also use different entropy coders, including combinations of arithmetic coding, Golomb-Rice coding, and Huffman coding.
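One of the options mentioned, defining VCG regions by scan order, can be sketched as follows. The diagonal scan, the boundary indices, and the per-VCG symbol counts are illustrative assumptions, not values from the application.

```python
from bisect import bisect_right


def diagonal_scan(n):
    """Positions of an n x n block ordered by anti-diagonal, so that
    lower-frequency coefficients come first (an assumed scan order)."""
    return sorted(((r, c) for r in range(n) for c in range(n)),
                  key=lambda p: (p[0] + p[1], p[0]))


def assign_vcgs(n, boundaries):
    """Map each coefficient position to a VCG index.
    boundaries lists the ascending scan indices at which each VCG starts."""
    return {pos: bisect_right(boundaries, i) - 1
            for i, pos in enumerate(diagonal_scan(n))}


# Hypothetical alphabet size per VCG: more symbols near low frequencies.
SYMBOLS_PER_VCG = {0: 8, 1: 4, 2: 2}

vcg_of = assign_vcgs(4, boundaries=[0, 3, 10])
for pos in [(0, 0), (0, 2), (3, 3)]:
    print(pos, "VCG", vcg_of[pos], "symbols", SYMBOLS_PER_VCG[vcg_of[pos]])
```

Keying context models by the VCG index returned here would give each group its own probability model, which is the granularity benefit the abstract describes.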
-
Publication No.: US20240040120A1
Publication date: 2024-02-01
Application No.: US18358111
Filing date: 2023-07-25
Applicant: Apple Inc.
Inventor: Hilmi Enes EGILMEZ , Yunfei ZHENG , Alican NALCI , Yeqing WU , Yixin DU , Guoxin JIN , Alexandros TOURAPIS , Jun XIN , Hsi-Jung WU
IPC: H04N19/122 , H04N19/61 , H04N19/176 , H04N19/70 , H04N19/159 , H04N19/136 , H04N19/18 , H04N19/186
CPC classification number: H04N19/122 , H04N19/61 , H04N19/176 , H04N19/70 , H04N19/159 , H04N19/136 , H04N19/18 , H04N19/186
Abstract: Video coders and decoders perform transform coding and decoding on blocks of video content according to an adaptively selected transform type. The transform types are organized into a hierarchy of transform sets where each transform set includes a respective number of transforms and each higher-level transform set includes the transforms of each lower-level transform set within the hierarchy. The video coders and video decoders may exchange signaling that establishes a transform set context from which a transform set that was selected for coding given block(s) may be identified. The video coders and video decoders may exchange signaling that establishes a transform decoding context from which a transform that was selected from the identified transform set, to be used for decoding the transform unit, may be identified. The block(s) may be coded and decoded by the selected transform.
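The nested hierarchy and the two-step selection can be illustrated with the sketch below. The set contents and transform names are placeholders chosen for illustration; the abstract only requires that each higher-level set contain every transform of the lower-level sets.

```python
# Hypothetical hierarchy: each level contains every transform of the level below.
TRANSFORM_SETS = [
    ["DCT2"],
    ["DCT2", "DST7"],
    ["DCT2", "DST7", "DCT8", "IDENTITY"],
]


def is_nested(sets):
    """Check the hierarchy property: every lower set is a subset of the next."""
    return all(set(lo) <= set(hi) for lo, hi in zip(sets, sets[1:]))


def resolve_transform(sets, set_index, transform_index):
    """Two-step selection: first identify the transform set, then the
    transform within that set."""
    return sets[set_index][transform_index]


print(is_nested(TRANSFORM_SETS))
print(resolve_transform(TRANSFORM_SETS, 2, 3))
```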
-
Publication No.: US20230396819A1
Publication date: 2023-12-07
Application No.: US18327364
Filing date: 2023-06-01
Applicant: Apple Inc.
Inventor: Ke ZHANG , Xiaoxia SUN , Shujie LIU , Xiaosong ZHOU , Jian LI , Xun SHI , Jiefu ZHAI , Albert E KEINATH , Hsi-Jung WU , Jingteng XUE , Xingyu ZHANG , Jun XIN
IPC: H04N21/2343 , H04N7/01 , G06V40/16 , G06V40/20 , G06V30/10 , H04N21/231
CPC classification number: H04N21/2343 , H04N7/0127 , H04N21/231 , G06V40/20 , G06V30/10 , G06V40/172
Abstract: A video delivery system generates and stores reduced bandwidth videos from source video. The system may include a track generator that executes functionality of application(s) to be used at sink devices, in which the track generator generates tracks from execution of the application(s) on source video and generates tracks having a reduced data size as compared to the source video. The track generator may execute a first instance of application functionality on the source video, which identifies region(s) of interest from the source video. The track generator further may downsample the source video according to downsampling parameters, and execute a second instance of application functionality on the downsampled video. The track generator may determine, from a comparison of outputs from the first and second instances of the application, whether the output from the second instance of application functionality is within an error tolerance of the output from the first instance of application functionality. If so, the track generator may generate a track from the downsampled video. In this manner, the system generates tracks that enable reliable application operation when processed by sink devices but also have reduced size as compared to source video.
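The compare-and-accept loop of the track generator can be sketched with a toy application. Here the "application" is a hypothetical brightest-pixel locator that reports a normalized position, and downsampling keeps every `factor`-th row and column; both are stand-ins, since the abstract does not name a specific application or downsampler.

```python
def locate_brightest(frame):
    """Toy application: normalized (row, col) of the brightest pixel.
    On ties, the first pixel in row-major order wins."""
    rows, cols = len(frame), len(frame[0])
    r, c = max(((r, c) for r in range(rows) for c in range(cols)),
               key=lambda p: frame[p[0]][p[1]])
    return r / rows, c / cols


def downsample(frame, factor):
    """Keep every factor-th row and column (assumed downsampling scheme)."""
    return [row[::factor] for row in frame[::factor]]


def generate_track(source, factor, tolerance):
    """Return the downsampled frame if the application output on it stays
    within tolerance of the output on the source, else None."""
    reference = locate_brightest(source)
    reduced = downsample(source, factor)
    candidate = locate_brightest(reduced)
    error = max(abs(a - b) for a, b in zip(reference, candidate))
    return reduced if error <= tolerance else None


source = [[0] * 8 for _ in range(8)]
source[4][6] = 255
print(generate_track(source, factor=2, tolerance=0.05) is not None)
print(generate_track(source, factor=4, tolerance=0.05) is not None)
```

In this example the factor-2 track is accepted because the bright pixel survives downsampling, while the factor-4 track is rejected because the pixel is dropped and the application output moves outside the tolerance.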