-
Publication No.: US20170358059A1
Publication date: 2017-12-14
Application No.: US15618967
Filing date: 2017-06-09
Applicant: Apple Inc.
Inventor: Ke Zhang, Jiefu Zhai, Yunfei Zheng, Shujie Liu, Albert E. Keinath, Xiaosong Zhou, Chris Y. Chung, Hsi-Jung Wu
IPC: G06T5/00, G06T7/194, G06T7/11, G11B27/031
CPC classification number: G06T7/194, G06K9/3233, G06K9/346, G06T7/11, G06T2207/20132, G11B27/031
Abstract: Techniques for cropping images containing an occlusion are presented. A method for image editing is presented comprising, when an occlusion is detected in an original digital image, determining an area occupied by the occlusion, assigning importance scores to different content elements of the original digital image, defining a cropping window around an area of the original digital image that does not include the area occupied by the occlusion and that is based on the importance scores, and cropping the original digital image to the cropping window.
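The cropping step in the abstract above can be illustrated with a minimal sketch. It assumes axis-aligned rectangles, a single occlusion, and a scoring rule that sums the importance of every content element lying fully inside a candidate window. None of these details come from the abstract, and the names `Box`, `candidate_windows`, and `choose_crop` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    left: int
    top: int
    right: int   # exclusive
    bottom: int  # exclusive

    def contains(self, other: "Box") -> bool:
        return (self.left <= other.left and self.top <= other.top
                and other.right <= self.right and other.bottom <= self.bottom)


def candidate_windows(width, height, occlusion):
    """Return the maximal image rectangles that exclude the occlusion."""
    candidates = [
        Box(0, 0, occlusion.left, height),        # everything left of it
        Box(occlusion.right, 0, width, height),   # everything right of it
        Box(0, 0, width, occlusion.top),          # everything above it
        Box(0, occlusion.bottom, width, height),  # everything below it
    ]
    # Drop degenerate rectangles (occlusion touching an image border).
    return [c for c in candidates if c.right > c.left and c.bottom > c.top]


def choose_crop(width, height, occlusion, elements):
    """Pick the occlusion-free window with the highest total importance.

    elements is a list of (Box, importance_score) pairs.
    Returns None when no occlusion-free window exists.
    """
    def score(window):
        return sum(s for box, s in elements if window.contains(box))

    return max(candidate_windows(width, height, occlusion),
               key=score, default=None)
```

For a 100x100 image with an occlusion in the top-left corner, `choose_crop` would keep whichever remaining strip holds the most important content. A practical system would likely also weigh aspect ratio and partial overlaps, which this sketch ignores.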
-
22.
Publication No.: US20160094823A1
Publication date: 2016-03-31
Application No.: US14603109
Filing date: 2015-01-22
Applicant: Apple Inc.
Inventor: Jiefu Zhai, Yeping Su, Hsi-Jung Wu, Chris Y. Chung, Xiaosong Zhou, Ke Zhang
Abstract: An encoder may include a luma transform, a transformer, and a chroma transform. The luma transform may determine a linear luminance value based upon a plurality of primary color values of a pixel. The transformer may generate a transformed luminance value based upon the linear luminance value, and a plurality of transformed color values based upon a corresponding plurality of the primary color values of the pixel. The chroma transform may determine a plurality of chroma values based upon the corresponding plurality of transformed color values and the transformed luminance value of the pixel.
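The three stages named in the abstract above can be sketched as follows. This is an illustration only: the BT.2020 luminance weights, the simple power-law transfer function, and the chroma scale factor of 2.0 are assumptions chosen for the example and are not stated in the abstract. The function name `encode_pixel` is hypothetical.

```python
def encode_pixel(r, g, b, gamma=1 / 2.4):
    """Map linear-light R, G, B in [0, 1] to (luma, cb, cr).

    Assumed details: BT.2020 luminance weights, a power-law transfer
    function, and a fixed chroma scale of 2.0.
    """
    # Luma transform: linear luminance from the primary color values.
    y_linear = 0.2627 * r + 0.6780 * g + 0.0593 * b

    def transfer(value):
        return value ** gamma

    # Transformer: apply the transfer function to the linear luminance
    # and to more than one of the primaries (here red and blue).
    y_t = transfer(y_linear)
    r_t = transfer(r)
    b_t = transfer(b)

    # Chroma transform: color differences against the transformed luminance.
    cb = (b_t - y_t) / 2.0
    cr = (r_t - y_t) / 2.0
    return y_t, cb, cr
```

Because the luminance is computed before the transfer function is applied, a neutral gray pixel yields chroma values of approximately zero under these assumptions.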
-
23.
Publication No.: US11847823B2
Publication date: 2023-12-19
Application No.: US17339249
Filing date: 2021-06-04
Applicant: Apple Inc.
Inventor: Xiaoxia Sun, Jiefu Zhai, Ke Zhang, Xiaosong Zhou, Hsi-Jung Wu
CPC classification number: G06V10/82, G06N3/045, G06T3/40, G06V10/25, G06V20/20, G06V20/46, G06V40/113, G06V40/28
Abstract: Video object and keypoint location detection techniques are presented. The system includes a detection system for generating locations of an object's keypoints along with probabilities associated with the locations, and a stability system for stabilizing keypoint locations of the detected objects. In some aspects, the generated probabilities are a two-dimensional array corresponding to locations within input images, and the stability system fits the generated probabilities to a two-dimensional probability distribution function.
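One way to picture fitting a two-dimensional probability array to a distribution is moment matching against a 2D Gaussian, sketched below. The abstract does not say which distribution or fitting method is used, so both the Gaussian and the moment-matching approach are assumptions, and `fit_gaussian_2d` is a hypothetical name.

```python
def fit_gaussian_2d(heatmap):
    """Moment-match a 2D Gaussian to a non-negative heatmap.

    heatmap is a list of rows; heatmap[y][x] is the probability mass at
    pixel (x, y). Returns ((mean_x, mean_y), (var_x, var_y, cov_xy)).
    """
    total = sum(sum(row) for row in heatmap)
    if total <= 0:
        raise ValueError("heatmap must contain positive mass")

    # First moments give a sub-pixel keypoint location.
    mean_x = sum(p * x for row in heatmap for x, p in enumerate(row)) / total
    mean_y = sum(p * y for y, row in enumerate(heatmap) for p in row) / total

    # Second central moments describe the spread around that location.
    var_x = var_y = cov_xy = 0.0
    for y, row in enumerate(heatmap):
        for x, p in enumerate(row):
            dx, dy = x - mean_x, y - mean_y
            var_x += p * dx * dx
            var_y += p * dy * dy
            cov_xy += p * dx * dy
    return (mean_x, mean_y), (var_x / total, var_y / total, cov_xy / total)
```

The fitted mean can fall between pixel centers, which is one plausible reason a fitted distribution gives steadier keypoint locations than taking the single highest-probability pixel in each frame.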
-
Publication No.: US20230147442A1
Publication date: 2023-05-11
Application No.: US17831738
Filing date: 2022-06-03
Applicant: Apple Inc.
Inventor: Shujie Liu, Jiefu Zhai, Xiaosong Zhou, Hsi-Jung Wu, Ke Zhang, Xiaoxia Sun, Jian Li
IPC: G06N3/045
CPC classification number: G06N3/045
Abstract: In an example method, a system accesses first input data and a machine learning architecture. The machine learning architecture includes a first module having a first neural network, a second module having a second neural network, and a third module having a third neural network. The system generates a first feature set representing a first portion of the first input data using the first neural network, and a second feature set representing a second portion of the first input data using the second neural network. The system generates, using the third neural network, first output data based on the first feature set and the second feature set.
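The data flow between the three modules described above can be sketched with toy networks. Each "neural network" here is a single dense layer with a ReLU, the input is split at a fixed index, and the two feature sets are joined by concatenation. All of these are assumptions made for illustration, and the names `dense_relu` and `run_architecture` are hypothetical.

```python
def dense_relu(weights, bias, inputs):
    """One dense layer followed by ReLU, using plain Python lists."""
    return [
        max(0.0, sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, bias)
    ]


def run_architecture(input_data, split, net1, net2, net3):
    """Run two feature extractors on two input portions, then fuse them.

    Each net is a (weights, bias) pair. The split index and the
    concatenation of the feature sets are assumptions of this sketch.
    """
    first_portion, second_portion = input_data[:split], input_data[split:]
    first_features = dense_relu(*net1, first_portion)     # first module
    second_features = dense_relu(*net2, second_portion)   # second module
    # Third module: output based on both feature sets.
    return dense_relu(*net3, first_features + second_features)
```

With `net1 = ([[1.0, 1.0]], [0.0])`, `net2 = ([[1.0, -1.0]], [0.0])`, and `net3 = ([[1.0, 2.0]], [0.5])`, calling `run_architecture([1.0, 2.0, 5.0, 3.0], 2, net1, net2, net3)` produces a single fused output value.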
-
Publication No.: US10021411B2
Publication date: 2018-07-10
Application No.: US14704707
Filing date: 2015-05-05
Applicant: Apple Inc.
Inventor: Yeping Su, Chris Y. Chung, Hsi-Jung Wu, Jiefu Zhai, Ke Zhang, Xiaosong Zhou
IPC: H04N7/12, H04N19/503, H04N19/124, H04N19/172, H04N19/147, H04N19/30
CPC classification number: H04N19/503, H04N19/124, H04N19/147, H04N19/172, H04N19/30
Abstract: A scalable coding system codes video as a base layer representation and an enhancement layer representation. A base layer coder may code an LDR representation of a source video. A predictor may predict an HDR representation of the source video from the coded base layer data. A comparator may generate prediction residuals which represent a difference between an HDR representation of the source video and the predicted HDR representation of the source video. A quantizer may quantize the residuals down to an LDR representation. An enhancement layer coder may code the LDR residuals. In other scalable coding systems, the enhancement layer coder may code LDR-converted HDR video directly.
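The base-layer, predictor, comparator, and quantizer chain above can be illustrated on a single sample. This sketch assumes 12-bit HDR samples, an 8-bit LDR base layer produced by a square-root tone curve, a quantizer step of 2, and an offset of 128 to store signed residuals in 8 bits. None of these values come from the abstract, actual base-layer and enhancement-layer coding is omitted, and the names are hypothetical.

```python
HDR_MAX = 4095   # assumed 12-bit HDR samples
LDR_MAX = 255    # assumed 8-bit LDR samples
STEP = 2         # assumed residual quantizer step
OFFSET = 128     # shifts signed residuals into the 8-bit range


def predict_hdr(base):
    """Predictor: inverse of the assumed square-root tone curve."""
    return round(HDR_MAX * (base / LDR_MAX) ** 2)


def encode_sample(hdr):
    """Return (base-layer sample, quantized enhancement-layer residual)."""
    base = round(LDR_MAX * (hdr / HDR_MAX) ** 0.5)   # LDR representation
    residual = hdr - predict_hdr(base)               # comparator output
    quantized = round(residual / STEP) + OFFSET      # quantize to LDR range
    return base, max(0, min(LDR_MAX, quantized))


def decode_sample(base, quantized):
    """Rebuild the HDR sample from both layers."""
    return predict_hdr(base) + (quantized - OFFSET) * STEP
```

A decoder that only understands the base layer can display `base` directly, while a decoder that reads both layers recovers the HDR sample to within one code value under these assumptions.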
-
Publication No.: US09716871B2
Publication date: 2017-07-25
Application No.: US14603109
Filing date: 2015-01-22
Applicant: Apple Inc.
Inventor: Jiefu Zhai, Yeping Su, Hsi-Jung Wu, Chris Y. Chung, Xiaosong Zhou, Ke Zhang
Abstract: An encoder may include a luma transform, a transformer, and a chroma transform. The luma transform may determine a linear luminance value based upon a plurality of primary color values of a pixel. The transformer may generate a transformed luminance value based upon the linear luminance value, and a plurality of transformed color values based upon a corresponding plurality of the primary color values of the pixel. The chroma transform may determine a plurality of chroma values based upon the corresponding plurality of transformed color values and the transformed luminance value of the pixel.
-
Publication No.: US20170109596A1
Publication date: 2017-04-20
Application No.: US15298777
Filing date: 2016-10-20
Applicant: Apple Inc.
Inventor: Shujie Liu, Jiefu Zhai, Chris Y. Chung, Hsi-Jung Wu, Yunfei Zheng, Albert E. Keinath, Xiaosong Zhou, Ke Zhang
CPC classification number: G06K9/46, G06F17/30256, G06K9/036, G06K9/6288, G06T5/003
Abstract: A method for processing media assets includes, given a first media asset, deriving characteristics from the first media asset, searching for other media assets having characteristics that correlate to the characteristics of the first media asset, when a match is found, deriving content corrections for the first media asset or a matching media asset from the other of the first media asset or the matching media asset, and correcting content of the first media asset or the matching media asset based on the content corrections.
-
Publication No.: US09571827B2
Publication date: 2017-02-14
Application No.: US13631605
Filing date: 2012-09-28
Applicant: Apple Inc.
Inventor: Yeping Su, Hsi-Jung Wu, Hao Pan, Ke Zhang
IPC: H04N19/10, H04N21/234, H04N21/2343, H04N21/242, H04N21/845, H04N7/12, G06K9/36
CPC classification number: H04N19/10, H04N21/23424, H04N21/23439, H04N21/242, H04N21/8456
Abstract: A video coding server may code a common video sequence into a plurality of coded data streams, each coded data stream representing the video sequence coded using coding parameters tailored for a respective transmission bit rate. The coding may cause a set of transmission units from among the coded data streams to include coded video data from a common point of the video sequence, and a first coded frame of each transmission unit of the set to be a synchronization frame. A manifest file may be built representing an index of transmission units of the respective coded data streams. The coded data streams and manifest file may be stored by the server for delivery to a client device. During download and decode, the chunks may be decoded efficiently even when switching among streams because the first frame in each chunk is a synchronization frame.
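The alignment property described above (every stream is cut at the same points of the video, and every transmission unit opens with a synchronization frame) can be sketched as a manifest builder. The fixed chunk length, the dictionary layout, the frame-type labels, and the `uri` naming pattern are all hypothetical. No actual video coding is performed.

```python
def build_manifest(num_frames, chunk_len, bitrates):
    """Index aligned chunks for each bit rate.

    Every stream uses the same chunk boundaries, and the first frame of
    each chunk is marked as a synchronization frame.
    """
    manifest = {}
    for rate in bitrates:
        chunks = []
        for index, start in enumerate(range(0, num_frames, chunk_len)):
            end = min(start + chunk_len, num_frames)
            chunks.append({
                "start_frame": start,
                # Sync frame first, then frames predicted from earlier ones.
                "frame_types": ["sync"] + ["predicted"] * (end - start - 1),
                "uri": f"stream_{rate}/chunk_{index}.bin",  # hypothetical path
            })
        manifest[rate] = chunks
    return manifest
```

Because chunk `k` starts at the same frame in every stream and begins with a frame that needs no earlier reference, a client could fetch chunk `k` from one bit rate and chunk `k + 1` from another without decoding artifacts at the boundary.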
-