-
Publication No.: US11606569B2
Publication Date: 2023-03-14
Application No.: US16583153
Filing Date: 2019-09-25
Applicant: Apple Inc.
Inventor: Alexandros Tourapis , Yeping Su , Khaled Mammou , Jungsun Kim , David W. Singer , Fabrice A. Robinet , Guy Cote , Jim C. Chou , Hsi-Jung Wu
IPC: H04N19/186 , H04N19/159 , H04N19/176 , H04N19/124
Abstract: Support for additional components may be specified in a coding scheme for image data. A layer of a coding scheme that specifies color components may also specify additional components. Characteristics of the components may be specified in the same layer or a different layer of the coding scheme. An encoder or decoder may identify the specified components and determine the respective characteristics to perform encoding and decoding of image data.
-
Publication No.: US20220303554A1
Publication Date: 2022-09-22
Application No.: US17837846
Filing Date: 2022-06-10
Applicant: Apple Inc.
Inventor: Mei Guo , Jun Xin , Hsi-Jung Wu , Alexandros Tourapis , Krishnakanth Rapaka
IPC: H04N19/159 , H04N19/182 , H04N19/176
Abstract: Techniques are disclosed for deriving prediction pixel blocks for use in intra-coding video and combined inter- and intra-coding video. In a first aspect, the techniques may include deriving value(s) for pixel location(s) of the prediction pixel block by, when a prediction direction vector assigned to the prediction vector points to quadrants I or III of a Cartesian plane, deriving the pixel location's value from pixel values in two regions of previously-decoded pixel data intercepted by extending the prediction direction vector in two opposite directions through the pixel location. When the prediction direction vector points toward quadrants II of the Cartesian plane, deriving the pixel location's value from pixel values in one region intercepted by the prediction direction vector through the pixel location, and from a second region intercepted by a vector that is orthogonal to the prediction direction vector.
-
Publication No.: US11282238B2
Publication Date: 2022-03-22
Application No.: US17032711
Filing Date: 2020-09-25
Applicant: Apple Inc.
Inventor: Khaled Mammou , Yeping Su , David W. Singer , Alexandros Tourapis , Fabrice A. Robinet , Hsi-Jung Wu
Abstract: A system comprises an encoder configured to compress attribute and/or spatial information for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. To compress the attribute and/or spatial information, the encoder is configured to convert a point cloud into an image-based representation. Also, the decoder is configured to generate a decompressed point cloud based on an image-based representation of a point cloud. The encoder is configured to project the point cloud onto patch planes to compress the point cloud, and supports multiple layered patch planes. For example, some point clouds may have a depth, and points at different depths may be assigned to different layered patch planes.
-
Publication No.: US11259046B2
Publication Date: 2022-02-22
Application No.: US15433505
Filing Date: 2017-02-15
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Chris Y. Chung , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jiefu Zhai , Xiaosong Zhou
IPC: H04N19/597 , H04N19/61 , H04N19/105 , H04N19/513 , H04N19/159 , H04N19/124 , H04N19/13 , H04N19/176 , H04N19/51 , H04N19/139 , H04N19/182 , H04N19/547 , H04N19/527
Abstract: Methods and systems are disclosed to counteract spatial distortions introduced by imaging processes of multi-directional video frames, where objects may be projected to spherical or equirectangular representations. Techniques are provided to invert the spatial distortions in video frames used as reference picture data in predictive coding, by spatially transforming the image content of the reference picture data before that content is used for the prediction of input video data in prediction-based coders and decoders.
-
Publication No.: US20220030244A1
Publication Date: 2022-01-27
Application No.: US17492789
Filing Date: 2021-10-04
Applicant: Apple Inc.
Inventor: Yeping Su , Xingyu Zhang , Chris Chung , Jun Xin , Hsi-Jung Wu
IPC: H04N19/136 , H04N19/103 , G06N20/00 , H04N19/179
Abstract: Techniques are disclosed by which a coding parameter is determined to encode video data resulting in encoded video data possessing a highest possible video quality. Features may be extracted from an input video sequence. The extracted features may be compared to features described in a model of coding parameters generated by a machine learning algorithm from reviews of previously-coded videos, extracted features of the previously-coded videos, and coding parameters of the previously-coded videos. When a match is detected between the extracted features of the input video sequence and extracted features represented in the model, a determination may be made as to whether coding parameters that correspond to the matching extracted feature correspond to a tier of service to which the input video sequence is to be coded. When the coding parameters that correspond to the matching extracted feature correspond to the tier of service to which the input video sequence is to be coded, the input video sequence may be coded according to the coding parameters.
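The matching flow in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the model is a hand-written list rather than the output of a machine learning algorithm, features are two-element tuples, a "match" means Euclidean distance within a hypothetical `max_distance`, and the names `MODEL`, `select_params`, and `bitrate_kbps` are invented.

```python
import math

# Hypothetical stand-in for a learned model: each entry pairs extracted
# features with coding parameters and the service tier they correspond to.
MODEL = [
    {"features": (0.2, 0.1), "params": {"bitrate_kbps": 800}, "tier": "standard"},
    {"features": (0.8, 0.6), "params": {"bitrate_kbps": 3000}, "tier": "premium"},
]


def select_params(features, tier, model=MODEL, max_distance=0.15):
    """Return coding parameters for the input features, or None.

    A match is assumed to mean "nearest model entry within max_distance".
    Parameters are returned only if the matched entry's tier equals the
    tier of service requested for the input sequence.
    """
    best = min(model, key=lambda entry: math.dist(entry["features"], features))
    if math.dist(best["features"], features) > max_distance:
        return None  # no match detected in the model
    if best["tier"] != tier:
        return None  # match found, but for a different tier of service
    return best["params"]
```

A caller would fall back to some default parameter search when the function returns `None`; the abstract does not say what happens in that case.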
-
Publication No.: US11206371B2
Publication Date: 2021-12-21
Application No.: US15495095
Filing Date: 2017-04-24
Applicant: Apple Inc.
Inventor: Chris Y. Chung , Dazhong Zhang , Hsi-Jung Wu , Xiaosong Zhou
IPC: H04N7/14 , H04N7/15 , H04L29/06 , H04N21/242 , H04N21/43 , H04N5/14 , G06F40/169 , H04N19/136
Abstract: Techniques are disclosed for overcoming communication lag between interactive operations among devices in a streaming session. According to the techniques, a first device streams video content to a second device, and an annotation entered on a first frame being displayed at the second device is communicated back to the first device. Responsive to a communication that identifies the annotation, the first device may identify an element of video content from the first frame to which the annotation applies and determine whether the identified element is present in a second frame of video content currently displayed at the first device. If so, the first device may display the annotation with the second frame in a location where the identified element is present. If not, the first device may display the annotation via an alternate technique.
-
Publication No.: US20210360192A1
Publication Date: 2021-11-18
Application No.: US17332829
Filing Date: 2021-05-27
Applicant: Apple Inc.
Inventor: Elizabeth C. Cranfill , Stephen O. Lemay , Joe S. Abuan , Hsi-Jung Wu , Xiaosong Zhou , Roberto Garcia, JR.
IPC: H04N7/14 , G09G5/14 , H04N5/225 , G06F3/0488 , G06F9/451 , G06F3/0481 , G06F3/0482 , G06F3/0486 , H04M1/72469 , H04N7/15 , G06F3/0484 , H04N5/262 , H04N5/272
Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.
-
Publication No.: US11165989B2
Publication Date: 2021-11-02
Application No.: US16689458
Filing Date: 2019-11-20
Applicant: Apple Inc.
Inventor: Johnny Trenh , Hsi-Jung Wu , Sarah K. Herrlinger , Xiaoxia Sun , Ian J. Baird , Dazhong Zhang , Xiaosong Zhou , Christopher M. Garrido
Abstract: Techniques are presented for managing the visual prominence of participants in a video conference, including conferences where participants communicate visually, such as with sign language. According to these techniques, a visual prominence indication of a participant in a video conference may be estimated, a video stream of the participant may be encoded, and the encoded video stream may be transmitted along with an indication of the estimated visual prominence to a receiving device in the video conference.
-
Publication No.: US11153594B2
Publication Date: 2021-10-19
Application No.: US15250608
Filing Date: 2016-08-29
Applicant: Apple Inc.
Inventor: Alexandros Tourapis , Yeping Su , David Singer , Hsi-Jung Wu
IPC: H04N19/50 , H04N19/184 , H04N19/167 , H04N19/436 , H04N19/186 , H04N19/132 , H04N19/124 , H04N19/13 , H04N19/15 , H04N19/30 , H04N19/42 , H04N19/48 , H04N19/182
Abstract: Video compression and decompression techniques are disclosed that provide improved bandwidth control for video compression and decompression systems. In particular, video coding and decoding techniques quantize input video in multiple dimensions. According to these techniques, pixel residuals may be generated from a comparison of an array of input data to an array of prediction data. The pixel residuals may be quantized in a first dimension. After the quantization, the quantized pixel residuals may be transformed to an array of transform coefficients. The transform coefficients may be quantized in a second dimension and entropy coded. Decoding techniques invert these processes. In still other embodiments, multiple quantizers may be provided upstream of the transform stage, either in parallel or in cascade, which provide greater flexibility to video coders to quantize data in different dimensions in an effort to balance the competing interest in compression efficiency and quality of reconstructed video.
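The ordering of stages described above (quantize the pixel residuals, transform, quantize the coefficients, then invert on decode) can be sketched as follows. This is an illustrative toy, not the patented method: it works on a 1-D block, models both "dimensions" as plain scalar step sizes `q1` and `q2` (the abstract does not define them), uses an orthonormal DCT-II as the transform, and omits entropy coding. All function and parameter names are hypothetical.

```python
import math


def dct(x):
    """Orthonormal 1-D DCT-II."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)
        out.append(scale * s)
    return out


def idct(c):
    """Inverse of dct() above (orthonormal DCT-III)."""
    n = len(c)
    out = []
    for i in range(n):
        s = c[0] * math.sqrt(1 / n)
        s += sum(
            c[k] * math.sqrt(2 / n) * math.cos(math.pi * (i + 0.5) * k / n)
            for k in range(1, n)
        )
        out.append(s)
    return out


def encode(block, prediction, q1, q2):
    """Residual -> first quantizer -> transform -> second quantizer."""
    residual = [b - p for b, p in zip(block, prediction)]
    residual_levels = [round(r / q1) for r in residual]  # pre-transform quantizer
    coeffs = dct(residual_levels)
    return [round(c / q2) for c in coeffs]  # coefficient quantizer


def decode(levels, prediction, q1, q2):
    """Invert the encoder stages in reverse order."""
    coeffs = [level * q2 for level in levels]
    residual_levels = [round(v) for v in idct(coeffs)]
    return [p + r * q1 for p, r in zip(prediction, residual_levels)]
```

With `q1 = 1` and a small `q2` the round trip is lossless for integer input; raising either step size trades reconstruction accuracy for fewer distinct levels to code.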
-
Publication No.: US10999602B2
Publication Date: 2021-05-04
Application No.: US15390202
Filing Date: 2016-12-23
Applicant: Apple Inc.
Inventor: Jae Hoon Kim , Xiaosong Zhou , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung , Hsi-Jung Wu
IPC: H04N19/176 , H04N19/105 , H04N19/597 , H04N19/547
Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projections of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block and a matching portion of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.
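The last step, turning a spherical-domain offset into a source-domain motion vector, can be sketched under a strong assumption: that the source picture is equirectangular, which this abstract does not state. The function names and the longitude/latitude convention below are hypothetical, and latitude offsets that cross a pole are not handled.

```python
import math


def to_sphere(x, y, width, height):
    """Map an equirectangular pixel position to (longitude, latitude) in radians."""
    lon = (x / width) * 2 * math.pi - math.pi
    lat = math.pi / 2 - (y / height) * math.pi
    return lon, lat


def to_source(lon, lat, width, height):
    """Map (longitude, latitude) back to pixel coordinates, wrapping longitude."""
    x = (lon + math.pi) / (2 * math.pi) * width
    y = (math.pi / 2 - lat) / math.pi * height
    return x % width, y


def motion_vector(x, y, d_lon, d_lat, width, height):
    """Convert a spherical offset at pixel (x, y) into a source-domain (dx, dy)."""
    lon, lat = to_sphere(x, y, width, height)
    ref_x, ref_y = to_source(lon + d_lon, lat + d_lat, width, height)
    # Pick the shorter horizontal displacement across the left/right seam.
    dx = (ref_x - x + width / 2) % width - width / 2
    return dx, ref_y - y
```

For an equirectangular source the mapping is linear, so the resulting vector is the same everywhere in the picture; other projections would make it depend on the block position.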