-
公开(公告)号:US20240221166A1
公开(公告)日:2024-07-04
申请号:US18395198
申请日:2023-12-22
Applicant: NVIDIA Corporation
Inventor: Zhiding Yu , Shuaiyi Huang , De-An Huang , Shiyi Lan , Subhashree Radhakrishnan , Jose M. Alvarez Lopez , Anima Anandkumar
IPC: G06T7/12 , G06V10/764 , G06V20/70
CPC classification number: G06T7/12 , G06V10/764 , G06V20/70 , G06T2207/20081
Abstract: Video instance segmentation is a computer vision task that aims to detect, segment, and track objects continuously in videos. It can be used in numerous real-world applications, such as video editing, three-dimensional (3D) reconstruction, 3D navigation (e.g. for autonomous driving and/or robotics), and view point estimation. However, current machine learning-based processes employed for video instance segmentation are lacking, particularly because the densely annotated videos needed for supervised training of high-quality models are not readily available and are not easily generated. To address the issues in the prior art, the present disclosure provides point-level supervision for video instance segmentation in a manner that allows the resulting machine learning model to handle any object category.