Patent search ap:("APPLE Page INC.") AND inv:"Jiefu Zhai"

51.

发明申请
PREENCODER ASSISTED VIDEO ENCODING 审中-公开
Title translation: PREENCODER辅助视频编码

公开(公告)号：US20150350686A1

公开(公告)日：2015-12-03

申请号：US14290304

申请日：2014-05-29

Applicant: Apple Inc.

Inventor： Xiaosong Zhou , Chris Y. Chung , David R. Conrad , Dazhong Zhang , Feng Yi , Hsi-Jung Wu , Jae Hoon Kim , Jiefu Zhai , Peikang Song , Yunfei Zheng

IPC: H04N19/85 , H04N19/115 , H04N19/517 , H04N19/51 , H04N19/172 , H04N19/176

CPC classification number: H04N19/42 , H04N19/103

Abstract: A method and system of using a pre-encoder to improve encoder efficiency. The encoder may conform to ITU-T H.265 and the pre-encoder may conform to ITU-T H. 264. The pre-encoder may receive source video data and provide information regarding various coding modes, candidate modes, and a selected mode for coding the source video data. In an embodiment, the encoder may directly use the mode selected by the pre-encoder. In another embodiment, the encoder may receive both the source video data and information regarding the various coding modes (e.g., motion information, macroblock size, intra prediction direction, rate-distortion cost, and block pixel statistics) to simplify and/or refine its mode decision process. For example, the information provided by the pre-encoder may indicate unlikely modes, which unlikely modes need not be tested by the encoder, thus saving power and time.

Abstract translation: 一种使用预编码器来提高编码器效率的方法和系统。编码器可以符合ITU-T H.265标准，并且预编码器可以符合ITU-T H.264的要求。预编码器可以接收源视频数据并提供关于各种编码模式，候选模式和选择模式的信息用于对源视频数据进行编码。在一个实施例中，编码器可以直接使用由预编码器选择的模式。在另一个实施例中，编码器可以接收源视频数据和关于各种编码模式的信息（例如，运动信息，宏块大小，帧内预测方向，速率失真成本和块像素统计），以简化和/或改进其模式决策过程。例如，预编码器提供的信息可能指示不太可能的模式，不可能的模式不需要被编码器测试，从而节省功率和时间。

52.

发明申请
HIGH DYNAMIC RANGE VIDEO CAPTURE CONTROL FOR VIDEO TRANSMISSION 审中-公开
Title translation: 用于视频传输的高动态范围视频捕获控制

公开(公告)号：US20150350514A1

公开(公告)日：2015-12-03

申请号：US14726331

申请日：2015-05-29

Applicant: Apple Inc.

Inventor： Jiefu Zhai , Xiaosong Zhou , Chris Y. Chung , Hsi-Jung Wu

IPC: H04N5/235 , H04N19/59

CPC classification number: H04N5/2355 , H04N5/23219 , H04N5/2351 , H04N5/2353 , H04N19/115 , H04N19/117 , H04N19/14 , H04N19/154 , H04N19/156 , H04N19/176

Abstract: Systems and methods are provided for capturing high quality video data, including data having a high dynamic range, for use with conventional encoders and decoders. High dynamic range data is captured using multiple groups of pixels where each group is captured using different exposure times to create groups of pixels. The pixels that are captured at different exposure times may be determined adaptively based on the content of the image, the parameters of the encoding system, or on the available resources within the encoding system. The transition from single exposure to using two different exposure times may be implemented gradually.

Abstract translation: 提供了系统和方法，用于捕获高质量视频数据，包括具有高动态范围的数据，用于常规编码器和解码器。使用多组像素捕获高动态范围数据，其中使用不同的曝光时间拍摄每组，以创建像素组。可以基于图像的内容，编码系统的参数或编码系统内的可用资源自适应地确定在不同曝光时间捕获的像素。从单次曝光到使用两种不同曝光时间的过渡可能逐渐实现。

53.

发明授权
Object and keypoint detection system with low spatial jitter, low latency and low power usage 有权

公开(公告)号：US11847823B2

公开(公告)日：2023-12-19

申请号：US17339249

申请日：2021-06-04

Applicant: Apple Inc.

Inventor： Xiaoxia Sun , Jiefu Zhai , Ke Zhang , Xiaosong Zhou , Hsi-Jung Wu

IPC: G06V10/82 , G06T3/40 , G06V40/20 , G06V20/40 , G06N3/045 , G06V10/25 , G06V20/20 , G06V40/10

CPC classification number: G06V10/82 , G06N3/045 , G06T3/40 , G06V10/25 , G06V20/20 , G06V20/46 , G06V40/113 , G06V40/28

Abstract: Video object and keypoint location detection techniques are presented. The system includes a detection system for generation locations of an object's keypoints along with probabilities associated with the locations, and a stability system for stabilizing keypoint locations of the detected objects. In some aspects, the generated probabilities are two-dimensional array correspond locations within input images, and stability system fits the generated probabilities to a two-dimensional probability distribution function.

54.

发明授权
Systems and methods for perspective shifting in video conferencing session 有权

公开(公告)号：US11818502B2

公开(公告)日：2023-11-14

申请号：US17846896

申请日：2022-06-22

Applicant: Apple Inc.

Inventor： Jae Hoon Kim , Chris Y. Chung , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Xiaosong Zhou , Jiefu Zhai

IPC: H04N7/14 , H04N7/15 , G06F3/0488 , H04N13/00 , H04N13/239

CPC classification number: H04N7/147 , G06F3/0488 , H04N7/15 , H04N13/239 , H04N2007/145 , H04N2013/0092

Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift of the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.

55.

发明公开
Modular Machine Learning Architecture 审中-公开

公开(公告)号：US20230147442A1

公开(公告)日：2023-05-11

申请号：US17831738

申请日：2022-06-03

Applicant: Apple Inc.

Inventor： Shujie Liu , Jiefu Zhai , Xiaosong Zhou , Hsi-Jung Wu , Ke Zhang , Xiaoxia Sun , Jian Li

IPC: G06N3/045

CPC classification number: G06N3/045

Abstract: In an example method, a system accesses first input data and a machine learning architecture. The machine learning architecture includes a first module having a first neural network, a second module having a second neural network, and a third module having a third neural network. The system generates a first feature set representing a first portion of the first input data using the first neural network, and a second feature set representing a second portion of the first input data using the second neural network. The system generates, using the third neural network, first output data based on the first feature set and the second feature set.

56.

发明授权
Processing of equirectangular object data to compensate for distortion by spherical projections 有权

公开(公告)号：US11259046B2

公开(公告)日：2022-02-22

申请号：US15433505

申请日：2017-02-15

Applicant: Apple Inc.

Inventor： Jae Hoon Kim , Chris Y. Chung , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jiefu Zhai , Xiaosong Zhou

IPC: H04N19/597 , H04N19/61 , H04N19/105 , H04N19/513 , H04N19/159 , H04N19/124 , H04N19/13 , H04N19/176 , H04N19/51 , H04N19/139 , H04N19/182 , H04N19/547 , H04N19/527

Abstract: Methods and Systems disclosed to counteract spatial distortions introduced by imaging processes of multi-directional video frames, where objects may be projected to spherical or equirectangular representations. Techniques provided to invert the spatial distortions in video frames used as reference picture data in predictive coding, by spatially transforming the image content of the reference picture data before this image content is being used for the prediction of input video data in prediction-based coders and decoders.

57.

发明授权
Sphere projected motion estimation/compensation and mode decision 有权

公开(公告)号：US10999602B2

公开(公告)日：2021-05-04

申请号：US15390202

申请日：2016-12-23

Applicant: Apple Inc.

Inventor： Jae Hoon Kim , Xiaosong Zhou , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung , Hsi-Jung Wu

IPC: H04N19/176 , H04N19/105 , H04N19/597 , H04N19/547

Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projection of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block to a matching portion of the of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.

58.

发明申请
Packed Image Format for Multi-Directional Video 审中-公开

公开(公告)号：US20200213571A1

公开(公告)日：2020-07-02

申请号：US16725245

申请日：2019-12-23

Applicant: Apple Inc.

Inventor： Jae Hoon Kim , Ming Chen , Xiaosong Zhou , Hsi-Jung Wu , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung

IPC: H04N13/161 , H04N13/243 , H04N5/232 , H04N19/70 , H04N19/597

Abstract: Frame packing techniques are disclosed for multi-directional images and video. According to an embodiment, a multi-directional source image is reformatted into a format in which image data from opposing fields of view are represented in respective regions of the packed image as flat image content. Image data from a multi-directional field of view of the source image between the opposing fields of view are represented in another region of the packed image as equirectangular image content. It is expected that use of the formatted frame will lead to coding efficiencies when the formatted image is processed by predictive video coding techniques and the like.

59.

发明授权
De-juddering techniques for coded video 有权

公开(公告)号：US10432946B2

公开(公告)日：2019-10-01

申请号：US14964965

申请日：2015-12-10

Applicant: Apple Inc.

Inventor： Yeping Su , Chris Y. Chung , Hsi-Jung Wu , Xiaosong Zhou , Jiefu Zhai

IPC: H04N7/12 , H04N19/154 , H04N19/115 , H04N19/137 , H04N19/187 , H04N19/177 , H04N7/01 , H04N19/117 , H04N19/31 , H04N19/587 , H04N19/46 , H04N19/85

Abstract: Judder artifacts are remedied in video coding system by employing frame rate conversion at an encoder. A source video sequence may be coded as base layer coded video at a first frame rate. An encoder may identify a portion of the coded video sequence that likely will exhibit judder effects when decoded. For those portions that likely will exhibit judder effects, video data representing the portion of the source video may be coded at a higher frame rate than a frame rate of the coded base layer data as enhancement layer data. Moreover, an encoder may generate metadata representing “FRC hints”—techniques that a decoder should employ when performing decoder-side frame rate conversion. An encoding terminal may transmit the base layer coded video and either the enhancement layer coded video or the FRC hints to a decoder. Thus, encoder infrastructure may mitigate against judder artifacts that may arise during decoding.

60.

发明申请
Techniques for Correction of Visual Artifacts in Multi-View Images 审中-公开

公开(公告)号：US20190005709A1

公开(公告)日：2019-01-03

申请号：US15638587

申请日：2017-06-30

Applicant: Apple Inc.

Inventor： Jae Hoon Kim , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jiefu Zhai , Ming Chen , Xiaosong Zhou

IPC: G06T15/20 , H04N13/02 , H04N5/232 , H04N19/117

Abstract: Techniques are disclosed for correcting artifacts in multi-view images that include a plurality of planar views. Image content the planar views may be projected from the planar representation to a spherical projection. Thereafter, a portion of the image content may be projected from the spherical projection to a planar representation. The image content of the planar representation may be used for display. Extensions are disclosed that correct artifacts that may arise during deblocking filtering of the multi-view images.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification