-
公开(公告)号:US20250111520A1
公开(公告)日:2025-04-03
申请号:US18478093
申请日:2023-09-29
Applicant: Adobe Inc.
Inventor: Silky Singh , Shripad Vilasrao Deshmukh , Mausoom Sarkar , Balaji Krishnamurthy
Abstract: The present disclosure is directed toward systems, methods, and non-transitory computer readable media that provide self-supervised object discovery systems that combine motion and appearance information to generate segmentation masks from a digital image or digital video and delineate one or more salient objects within the digital image/digital video. The disclosed systems utilize a neural network encoder to generate a fully connected graph based on image patches from the digital input, incorporating image patch feature and optical flow patch feature similarities to produce edge weights. The disclosed systems partition the generated graph to produce a segmentation mask. Furthermore, the disclosed systems iteratively train a segmentation network based on the segmentation mask as a pseudo-ground truth via a bootstrapped, self-training process. By utilizing both motion and appearance information to generate a bi-partitioned graph, the disclosed systems produce high-quality object segmentation masks that represent a foreground and background of digital inputs.
-
公开(公告)号:US20240362941A1
公开(公告)日:2024-10-31
申请号:US18140143
申请日:2023-04-27
Applicant: Adobe Inc.
Inventor: Silky Singh , Surgan Jandial , Shripad Vilasrao Deshmukh , Milan Aggarwal , Mausoom Sarkar , Balaji Krishnamurthy , Arneh Jain , Abhinav Java
IPC: G06V30/262 , G06V30/14 , G06V30/19 , G06V30/414
CPC classification number: G06V30/274 , G06V30/1444 , G06V30/19147 , G06V30/414
Abstract: A corrective noise system receives an electronic version of a fillable form generated by a segmentation network and receives a correction to a segmentation error in the electronic version of the fillable form. The corrective noise system is trained to generate noise that represents the correction and superimpose the noise on the fillable form. The corrective noise system is further trained to identify regions in a corpus of forms that are semantically similar to a region that was subject to the correction. The generated noise is propagated to the semantically similar regions in the corpus of forms and the noisy corpus of forms is provided as input to the segmentation network. The noise causes the segmentation network to accurately identify fillable regions in the corpus of forms and output a segmented version of the corpus of forms having improved fidelity without retraining or otherwise modifying the segmentation network.
-