Patent search ap:("APPLE INC.") AND inv:"Xiaosong Zhou" Page 17

161.

Granted invention patent
Systems and methods for perspective shifting in video conferencing session (In force)

Publication (announcement) No.: US11818502B2

Publication (announcement) date: 2023-11-14

Application No.: US17846896

Application date: 2022-06-22

Applicant: Apple Inc.

Inventor: Jae Hoon Kim , Chris Y. Chung , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Xiaosong Zhou , Jiefu Zhai

IPC: H04N7/14 , H04N7/15 , G06F3/0488 , H04N13/00 , H04N13/239

CPC classification number: H04N7/147 , G06F3/0488 , H04N7/15 , H04N13/239 , H04N2007/145 , H04N2013/0092

Abstract: Embodiments of the present disclosure provide systems and methods for perspective shifting in a video conferencing session. In one exemplary method, a video stream may be generated. A foreground element may be identified in a frame of the video stream and distinguished from a background element of the frame. Data may be received representing a viewing condition at a terminal that will display the generated video stream. The frame of the video stream may be modified based on the received data to shift the foreground element relative to the background element. The modified video stream may be displayed at the displaying terminal.
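The modification step in this abstract can be pictured with a minimal sketch. It assumes the foreground mask and a clean background plate are already available and that the viewing condition has already been converted to a horizontal pixel shift `dx`; the function name `shift_foreground` and all of its parameters are hypothetical and do not come from the patent.

```python
def shift_foreground(frame, mask, background, dx):
    """Composite the foreground, shifted dx pixels to the right, over the background.

    frame, mask and background are 2-D lists of equal size. mask[y][x] is True
    where the pixel belongs to the foreground element (assumed given).
    """
    height, width = len(frame), len(frame[0])
    # Start from the background plate so that uncovered pixels are filled in.
    out = [row[:] for row in background]
    for y in range(height):
        for x in range(width):
            nx = x + dx
            # Foreground pixels shifted outside the frame are dropped.
            if mask[y][x] and 0 <= nx < width:
                out[y][nx] = frame[y][x]
    return out


frame = [[1, 9, 1, 1]]
mask = [[False, True, False, False]]
background = [[1, 1, 1, 1]]
shifted = shift_foreground(frame, mask, background, 1)
```

A real system would also need to estimate the background behind the foreground element; here that is simply assumed as an input.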

162.

Granted invention patent
Encoding and decoding video content (In force)

Publication (announcement) No.: US11677934B2

Publication (announcement) date: 2023-06-13

Application No.: US17485298

Application date: 2021-09-24

Applicant: Apple Inc.

Inventor: Sudeng Hu , David L. Biderman , Christopher M. Garrido , Hsi-Jung Wu , Xiaosong Zhou , Dazhong Zhang , Jinbo Qiu , Karthick Santhanam , Hang Yuan , Joshua L. Hare , Luciano M. Verger , Kevin Arthur Robertson , Sasanka Vemuri

IPC: H04N7/12 , H04N19/105 , H04N19/177 , H04N19/172 , H04N19/146 , H04N19/124

CPC classification number: H04N19/105 , H04N19/124 , H04N19/146 , H04N19/172 , H04N19/177

Abstract: In an example method, a system receives a plurality of frames of a video, and generates a data structure representing the video and representing a plurality of temporal layers. Generating the data structure includes: (i) determining a plurality of quality levels for presenting the video, where each of the quality levels corresponds to a different respective sampling period for sampling the frames of the video, (ii) assigning, based on the sampling periods, each of the frames to a respective one of the temporal layers of the data structure, and (iii) indicating, in the data structure, one or more relationships between (a) at least one of the frames assigned to at least one of the temporal layers of the data structure, and (b) at least another one of the frames assigned to at least another one of the temporal layers of the data structure. Further, the system outputs the data structure.
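Step (ii) of this abstract can be illustrated with a rough sketch. The dyadic sampling periods (4, 2, 1), the function names, and the rule that a frame goes to the first layer whose period divides its index are all assumptions made for illustration; the abstract does not specify them.

```python
def assign_temporal_layers(num_frames, sampling_periods):
    """Map each frame index to a temporal layer.

    sampling_periods is ordered from the coarsest quality level (largest
    period) to the finest (smallest period). Layer 0 holds the frames sampled
    at the coarsest level; each later layer holds the frames that are added
    when moving to the next finer level.
    """
    layers = {}
    for frame in range(num_frames):
        for layer, period in enumerate(sampling_periods):
            if frame % period == 0:
                layers[frame] = layer
                break
        else:
            raise ValueError(f"frame {frame} is not sampled at any quality level")
    return layers


def frames_for_quality(layers, max_layer):
    """Frames needed to present the video using layers 0..max_layer."""
    return sorted(f for f, layer in layers.items() if layer <= max_layer)


layers = assign_temporal_layers(8, [4, 2, 1])
half_rate = frames_for_quality(layers, 1)
```

Under these assumptions, dropping the highest layer halves the frame rate while leaving the remaining frames evenly spaced.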

163.

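Published invention application
Modular Machine Learning Architecture (Under examination, published)

Publication (announcement) No.: US20230147442A1

Publication (announcement) date: 2023-05-11

Application No.: US17831738

Application date: 2022-06-03

Applicant: Apple Inc.

Inventor: Shujie Liu , Jiefu Zhai , Xiaosong Zhou , Hsi-Jung Wu , Ke Zhang , Xiaoxia Sun , Jian Li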

IPC: G06N3/045

CPC classification number: G06N3/045

Abstract: In an example method, a system accesses first input data and a machine learning architecture. The machine learning architecture includes a first module having a first neural network, a second module having a second neural network, and a third module having a third neural network. The system generates a first feature set representing a first portion of the first input data using the first neural network, and a second feature set representing a second portion of the first input data using the second neural network. The system generates, using the third neural network, first output data based on the first feature set and the second feature set.

164.

Invention application
Encoding and Decoding Video Content (In force)

Publication (announcement) No.: US20230098082A1

Publication (announcement) date: 2023-03-30

Application No.: US17485298

Application date: 2021-09-24

Applicant: Apple Inc.

Inventor: Sudeng Hu , David L. Biderman , Christopher M. Garrido , Hsi-Jung Wu , Xiaosong Zhou , Dazhong Zhang , Jinbo Qiu , Karthick Santhanam , Hang Yuan , Joshua L. Hare , Luciano M. Verger , Kevin Arthur Robertson , Sasanka Vemuri

IPC: H04N19/105 , H04N19/177 , H04N19/124 , H04N19/146 , H04N19/172

Abstract: In an example method, a system receives a plurality of frames of a video, and generates a data structure representing the video and representing a plurality of temporal layers. Generating the data structure includes: (i) determining a plurality of quality levels for presenting the video, where each of the quality levels corresponds to a different respective sampling period for sampling the frames of the video, (ii) assigning, based on the sampling periods, each of the frames to a respective one of the temporal layers of the data structure, and (iii) indicating, in the data structure, one or more relationships between (a) at least one of the frames assigned to at least one of the temporal layers of the data structure, and (b) at least another one of the frames assigned to at least another one of the temporal layers of the data structure. Further, the system outputs the data structure.

165.

Granted invention patent
Efficient coding of source video sequences partitioned into tiles (In force)

Publication (announcement) No.: US11606574B2

Publication (announcement) date: 2023-03-14

Application No.: US16882819

Application date: 2020-05-26

Applicant: Apple Inc.

Inventor: Dazhong Zhang , Peikang Song , Beibei Wang , Giribalan Gopalan , Albert E. Keinath , Christopher M. Garrido , David R. Conrad , Hsi-Jung Wu , Ming Jin , Hang Yuan , Xiaohua Yang , Xiaosong Zhou , Vikrant Kasarabada , Davide Concion , Eric L. Chien , Bess C. Chan , Karthick Santhanam , Gurtej Singh Chandok

IPC: H04N19/507 , H04N19/65

Abstract: Techniques are disclosed for coding video data in which frames from a video source are partitioned into a plurality of tiles of common size, and the tiles are coded as a virtual video sequence according to motion-compensated prediction, each tile treated as having a respective temporal location of the virtual video sequence. The coding scheme permits relative allocation of coding resources to tiles that are likely to have greater significance in a video coding session, which may lead to certain tiles that have low complexity or low motion content being skipped during coding of the tiles for select source frames. Moreover, coding of the tiles may be ordered to achieve low coding latencies during a coding session.
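The partitioning described in this abstract can be sketched as follows. The sketch assumes frame dimensions that are exact multiples of the tile size and a raster-scan tile order; the function names and the dictionary layout are hypothetical, and the motion-compensated coding itself is not shown.

```python
def partition_into_tiles(frame, tile_h, tile_w):
    """Split a 2-D frame (list of rows) into equally sized tiles in raster order."""
    height, width = len(frame), len(frame[0])
    if height % tile_h or width % tile_w:
        raise ValueError("frame dimensions must be multiples of the tile size")
    tiles = []
    for top in range(0, height, tile_h):
        for left in range(0, width, tile_w):
            tiles.append([row[left:left + tile_w] for row in frame[top:top + tile_h]])
    return tiles


def to_virtual_sequence(frames, tile_h, tile_w):
    """Flatten the tiles of all source frames into one virtual sequence.

    Each tile is given its own temporal position t in the virtual sequence.
    """
    sequence = []
    for frame_idx, frame in enumerate(frames):
        for tile_idx, tile in enumerate(partition_into_tiles(frame, tile_h, tile_w)):
            entry = {"t": len(sequence), "frame": frame_idx, "tile": tile_idx, "pixels": tile}
            sequence.append(entry)
    return sequence


frame = [[r * 4 + c for c in range(4)] for r in range(4)]
virtual = to_virtual_sequence([frame, frame], 2, 2)
```

Skipping a low-motion tile would then amount to omitting its entry for a given source frame, although how the patent signals that is not stated in the abstract.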

166.

Granted invention patent
Processing of equirectangular object data to compensate for distortion by spherical projections (In force)

Publication (announcement) No.: US11259046B2

Publication (announcement) date: 2022-02-22

Application No.: US15433505

Application date: 2017-02-15

Applicant: Apple Inc.

Inventor: Jae Hoon Kim , Chris Y. Chung , Dazhong Zhang , Hang Yuan , Hsi-Jung Wu , Jiefu Zhai , Xiaosong Zhou

IPC: H04N19/597 , H04N19/61 , H04N19/105 , H04N19/513 , H04N19/159 , H04N19/124 , H04N19/13 , H04N19/176 , H04N19/51 , H04N19/139 , H04N19/182 , H04N19/547 , H04N19/527

Abstract: Methods and systems are disclosed to counteract spatial distortions introduced by imaging processes of multi-directional video frames, where objects may be projected to spherical or equirectangular representations. Techniques are provided to invert the spatial distortions in video frames used as reference picture data in predictive coding, by spatially transforming the image content of the reference picture data before this image content is used for the prediction of input video data in prediction-based coders and decoders.

167.

Granted invention patent
Techniques to overcome communication lag between terminals performing video mirroring and annotation operations (In force)

Publication (announcement) No.: US11206371B2

Publication (announcement) date: 2021-12-21

Application No.: US15495095

Application date: 2017-04-24

Applicant: Apple Inc.

Inventor: Chris Y. Chung , Dazhong Zhang , Hsi-Jung Wu , Xiaosong Zhou

IPC: H04N7/14 , H04N7/15 , H04L29/06 , H04N21/242 , H04N21/43 , H04N5/14 , G06F40/169 , H04N19/136

Abstract: Techniques are disclosed for overcoming communication lag between interactive operations among devices in a streaming session. According to the techniques, a first device streams video content to a second device, and an annotation is entered on a first frame being displayed at the second device and communicated back to the first device. Responsive to a communication that identifies the annotation, the first device may identify an element of video content from the first frame to which the annotation applies and determine whether the identified element is present in a second frame of video content currently displayed at the first device. If so, the first device may display the annotation with the second frame in a location where the identified element is present. If not, the first device may display the annotation via an alternate technique.
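The decision logic in this abstract can be sketched in a few lines. The sketch assumes each frame comes with a dictionary of element bounding boxes `(left, top, right, bottom)` and that an annotation is a single point; both the data layout and the names `find_element` and `place_annotation` are hypothetical. The "alternate technique" is not described in the abstract, so it is represented only by a `"fallback"` label.

```python
def find_element(position, elements):
    """Return the id of the first element whose bounding box contains position."""
    px, py = position
    for element_id, (left, top, right, bottom) in elements.items():
        if left <= px <= right and top <= py <= bottom:
            return element_id
    return None


def place_annotation(position, annotated_frame, current_frame):
    """Decide where to draw an annotation on the currently displayed frame.

    annotated_frame and current_frame map element ids to bounding boxes in the
    frame that was annotated and in the frame displayed now.
    """
    element_id = find_element(position, annotated_frame)
    if element_id is not None and element_id in current_frame:
        old_left, old_top, _, _ = annotated_frame[element_id]
        new_left, new_top, _, _ = current_frame[element_id]
        px, py = position
        # Keep the annotation's offset relative to the element's top-left corner.
        return "anchored", (new_left + px - old_left, new_top + py - old_top)
    return "fallback", None


annotated = {"cup": (10, 10, 20, 20)}
moved = place_annotation((12, 15), annotated, {"cup": (30, 10, 40, 20)})
gone = place_annotation((12, 15), annotated, {})
```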

168.

Invention application
ESTABLISHING A VIDEO CONFERENCE DURING A PHONE CALL (In force)

Publication (announcement) No.: US20210360192A1

Publication (announcement) date: 2021-11-18

Application No.: US17332829

Application date: 2021-05-27

Applicant: Apple Inc.

Inventor: Elizabeth C. Cranfill , Stephen O. Lemay , Joe S. Abuan , Hsi-Jung Wu , Xiaosong Zhou , Roberto Garcia, JR.

IPC: H04N7/14 , G09G5/14 , H04N5/225 , G06F3/0488 , G06F9/451 , G06F3/0481 , G06F3/0482 , G06F3/0486 , H04M1/72469 , H04N7/15 , G06F3/0484 , H04N5/262 , H04N5/272

Abstract: Some embodiments provide a method for initiating a video conference using a first mobile device. The method presents, during an audio call through a wireless communication network with a second device, a selectable user-interface (UI) item on the first mobile device for switching from the audio call to the video conference. The method receives a selection of the selectable UI item. The method initiates the video conference without terminating the audio call. The method terminates the audio call before allowing the first and second devices to present audio and video data exchanged through the video conference.

169.

Granted invention patent
Gesture and prominence in video conferencing (In force)

Publication (announcement) No.: US11165989B2

Publication (announcement) date: 2021-11-02

Application No.: US16689458

Application date: 2019-11-20

Applicant: Apple Inc.

Inventor: Johnny Trenh , Hsi-Jung Wu , Sarah K. Herrlinger , Xiaoxia Sun , Ian J. Baird , Dazhong Zhang , Xiaosong Zhou , Christopher M. Garrido

IPC: H04N7/14 , G06F3/01 , G06N3/08 , H04N7/15

Abstract: Techniques are presented for managing the visual prominence of participants in a video conference, including conferences where participants communicate visually, such as with sign language. According to these techniques, a visual prominence indication of a participant in a video conference may be estimated, a video stream of the participant may be encoded, and the encoded video stream may be transmitted along with an indication of the estimated visual prominence to a receiving device in the video conference.

170.

Granted invention patent
Sphere projected motion estimation/compensation and mode decision (In force)

Publication (announcement) No.: US10999602B2

Publication (announcement) date: 2021-05-04

Application No.: US15390202

Application date: 2016-12-23

Applicant: Apple Inc.

Inventor: Jae Hoon Kim , Xiaosong Zhou , Dazhong Zhang , Hang Yuan , Jiefu Zhai , Chris Y. Chung , Hsi-Jung Wu

IPC: H04N19/176 , H04N19/105 , H04N19/597 , H04N19/547

Abstract: Techniques are disclosed for coding video data predictively based on predictions made from spherical-domain projections of input pictures to be coded and reference pictures that are prediction candidates. Spherical projections of an input picture and the candidate reference pictures may be generated. Thereafter, a search may be conducted for a match between the spherical-domain representation of a pixel block to be coded and a spherical-domain representation of the reference picture. On a match, an offset may be determined between the spherical-domain representation of the pixel block and a matching portion of the reference picture in the spherical-domain representation. The spherical-domain offset may be transformed to a motion vector in a source-domain representation of the input picture, and the pixel block may be coded predictively with reference to a source-domain representation of the matching portion of the reference picture.
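The last step, turning a spherical-domain offset into a source-domain motion vector, can be sketched under a strong assumption: that the source picture is equirectangular, so pixel columns map linearly to longitude and rows to latitude. This abstract does not name the source format, and the function names below are hypothetical. Longitude wrap-around at the picture edge is ignored for brevity.

```python
import math


def pixel_to_sphere(x, y, width, height):
    """Equirectangular pixel position -> (longitude, latitude) in radians."""
    lon = (x / width) * 2.0 * math.pi - math.pi
    lat = math.pi / 2.0 - (y / height) * math.pi
    return lon, lat


def sphere_to_pixel(lon, lat, width, height):
    """(longitude, latitude) in radians -> equirectangular pixel position."""
    x = (lon + math.pi) / (2.0 * math.pi) * width
    y = (math.pi / 2.0 - lat) / math.pi * height
    return x, y


def spherical_offset_to_motion_vector(x, y, d_lon, d_lat, width, height):
    """Convert an angular offset found in the spherical domain to a pixel offset."""
    lon, lat = pixel_to_sphere(x, y, width, height)
    ref_x, ref_y = sphere_to_pixel(lon + d_lon, lat + d_lat, width, height)
    return ref_x - x, ref_y - y


# One degree of longitude on a 360x180 picture, starting at the picture centre.
mv_x, mv_y = spherical_offset_to_motion_vector(180, 90, math.radians(1.0), 0.0, 360, 180)
```

For other source projections, such as cube maps, the two conversion functions would be replaced by that projection's own forward and inverse mappings.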
