Invention Publication
- Patent Title: FEATURE RECONSTRUCTION USING NEURAL NETWORKS FOR VIDEO STREAMING SYSTEMS AND APPLICATIONS
-
Application No.: US17955754Application Date: 2022-09-29
-
Publication No.: US20240114170A1Publication Date: 2024-04-04
- Inventor: Aurobinda Maharana , Abhijit Patait
- Applicant: Nvidia Corporation
- Applicant Address: US CA Santa Clara
- Assignee: Nvidia Corporation
- Current Assignee: Nvidia Corporation
- Current Assignee Address: US CA Santa Clara
- Main IPC: H04N19/70
- IPC: H04N19/70 ; G06T5/50 ; G06T7/246 ; G06T7/60 ; G06V10/25 ; G06V10/82 ; G06V40/16 ; H04N7/15

Abstract:
Systems and methods relate to facial video encoding and reconstruction, particularly in ultra-low bandwidth settings. In embodiments, a video conferencing or other streaming application uses automatically tracked feature cropping information. A bounding shape size—used to identify the cropped region—varies and is dynamically determined to maintain a proportion for feature reconstruction, such as resizing in the event of a zoom-in on a face (or other feature of interest) or a zoom-out. The tracking scheme may be used to smooth sudden movements, including lateral ones, to generate more natural transitions between frames. Tracking and cropping information (e.g., size and position of the cropped region) may be embedded within an encoded bitstream as supplemental enhancement information (“SEI”), for eventual decoding by a receiver and for compositing a decoded face at a proper location in the applicable stream.
Information query