-
Publication number: US12198224B2
Publication date: 2025-01-14
Application number: US17651075
Filing date: 2022-02-15
Applicant: ADOBE INC.
Inventor: Xin Yuan, Zhe Lin, Jason Wen Yong Kuen, Jianming Zhang, John Philip Collomosse
Abstract: Systems and methods for image generation are described. Embodiments of the present disclosure receive a text phrase that describes a target image to be generated; generate text features based on the text phrase; retrieve a search image based on the text phrase; and generate the target image using an image generation network based on the text features and the search image.
-
Publication number: US20230260164A1
Publication date: 2023-08-17
Application number: US17651075
Filing date: 2022-02-15
Applicant: ADOBE INC.
Inventor: Xin Yuan, Zhe Lin, Jason Wen Yong Kuen, Jianming Zhang, John Philip Collomosse
CPC classification number: G06T11/00, G06F16/53, G06N20/00, G06T2207/20081, G06T2207/20084
Abstract: Systems and methods for image generation are described. Embodiments of the present disclosure receive a text phrase that describes a target image to be generated; generate text features based on the text phrase; retrieve a search image based on the text phrase; and generate the target image using an image generation network based on the text features and the search image.
-
Publication number: US20220157054A1
Publication date: 2022-05-19
Application number: US17588516
Filing date: 2022-01-31
Applicant: Adobe Inc.
Inventor: Zhe Lin, Xiaohui Shen, Mingyang Ling, Jianming Zhang, Jason Wen Yong Kuen
Abstract: In implementations of object detection in images, object detectors are trained using heterogeneous training datasets. A first training dataset is used to train an image tagging network to determine an attention map of an input image for a target concept. A second training dataset is used to train a conditional detection network that accepts as conditional inputs the attention map and a word embedding of the target concept. Despite the conditional detection network being trained with a training dataset having a small number of seen classes (e.g., classes in a training dataset), it generalizes to novel, unseen classes by concept conditioning, since the target concept propagates through the conditional detection network via the conditional inputs, thus influencing classification and region proposal. Hence, classes of objects that can be detected are expanded, without the need to scale training databases to include additional classes.
-
Publication number: US11941884B2
Publication date: 2024-03-26
Application number: US17454740
Filing date: 2021-11-12
Applicant: ADOBE INC.
Inventor: Jason Wen Yong Kuen, Bo Sun, Zhe Lin, Simon Su Chen
CPC classification number: G06V20/41, G06F18/2163, G06N3/08, G06T3/4046, G06T9/002, G06V10/751
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure receive an image having a plurality of object instances; encode the image to obtain image features; decode the image features to obtain object features; generate object detection information based on the object features using an object detection branch, wherein the object detection branch is trained based on a first training set using a detection loss; generate semantic segmentation information based on the object features using a semantic segmentation branch, wherein the semantic segmentation branch is trained based on a second training set different from the first training set using a semantic segmentation loss; and combine the object detection information and the semantic segmentation information to obtain panoptic segmentation information that indicates which pixels of the image correspond to each of the plurality of object instances.
-
Publication number: US11868889B2
Publication date: 2024-01-09
Application number: US17588516
Filing date: 2022-01-31
Applicant: Adobe Inc.
Inventor: Zhe Lin, Xiaohui Shen, Mingyang Ling, Jianming Zhang, Jason Wen Yong Kuen
IPC: G06N3/08, G06N3/04, G06V20/20, G06V20/64, G06V10/82, G06V20/10, G06F18/214, G06V10/764, G06V10/44
CPC classification number: G06N3/08, G06F18/214, G06N3/04, G06V10/454, G06V10/764, G06V10/82, G06V20/10, G06V20/20, G06V20/64
Abstract: In implementations of object detection in images, object detectors are trained using heterogeneous training datasets. A first training dataset is used to train an image tagging network to determine an attention map of an input image for a target concept. A second training dataset is used to train a conditional detection network that accepts as conditional inputs the attention map and a word embedding of the target concept. Despite the conditional detection network being trained with a training dataset having a small number of seen classes (e.g., classes in a training dataset), it generalizes to novel, unseen classes by concept conditioning, since the target concept propagates through the conditional detection network via the conditional inputs, thus influencing classification and region proposal. Hence, classes of objects that can be detected are expanded, without the need to scale training databases to include additional classes.
-
Publication number: US20230401827A1
Publication date: 2023-12-14
Application number: US17806097
Filing date: 2022-06-09
Applicant: ADOBE INC.
Inventor: Jason Wen Yong Kuen, Dat Ba Huynh, Zhe Lin, Jiuxiang Gu
IPC: G06V10/774, G06V10/26, G06V10/75, G06V10/77, G06V10/776, G06V10/82
CPC classification number: G06V10/774, G06V10/26, G06V10/759, G06V10/7715, G06V10/776, G06V10/82
Abstract: Systems and methods for image segmentation are described. Embodiments of the present disclosure receive a training image and a caption for the training image, wherein the caption includes text describing an object in the training image; generate a pseudo mask for the object using a teacher network based on the text describing the object; generate a mask for the object using a student network; compute noise information for the training image using a noise estimation network; and update parameters of the student network based on the mask, the pseudo mask, and the noise information.
-
Publication number: US20230154185A1
Publication date: 2023-05-18
Application number: US17454740
Filing date: 2021-11-12
Applicant: ADOBE INC.
Inventor: Jason Wen Yong Kuen, Bo Sun, Zhe Lin, Simon Su Chen
CPC classification number: G06K9/00624, G06K9/6202, G06K9/6261, G06N3/08, G06T3/4046, G06T9/002
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure receive an image having a plurality of object instances; encode the image to obtain image features; decode the image features to obtain object features; generate object detection information based on the object features using an object detection branch, wherein the object detection branch is trained based on a first training set using a detection loss; generate semantic segmentation information based on the object features using a semantic segmentation branch, wherein the semantic segmentation branch is trained based on a second training set different from the first training set using a semantic segmentation loss; and combine the object detection information and the semantic segmentation information to obtain panoptic segmentation information that indicates which pixels of the image correspond to each of the plurality of object instances.
-
Publication number: US20230128792A1
Publication date: 2023-04-27
Application number: US17589114
Filing date: 2022-01-31
Applicant: Adobe Inc.
Inventor: Jason Wen Yong Kuen, Su Chen, Scott Cohen, Zhe Lin, Zijun Wei, Jianming Zhang
Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that generates object masks for digital objects portrayed in digital images utilizing a detection-masking neural network pipeline. In particular, in one or more embodiments, the disclosed systems utilize detection heads of a neural network to detect digital objects portrayed within a digital image. In some cases, each detection head is associated with one or more digital object classes that are not associated with the other detection heads. Further, in some cases, the detection heads implement multi-scale synchronized batch normalization to normalize feature maps across various feature levels. The disclosed systems further utilize a masking head of the neural network to generate one or more object masks for the detected digital objects. In some cases, the disclosed systems utilize post-processing techniques to filter out low-quality masks.
-
Publication number: US11610393B2
Publication date: 2023-03-21
Application number: US17062157
Filing date: 2020-10-02
Applicant: Adobe Inc.
Inventor: Jason Wen Yong Kuen, Zhe Lin, Jiuxiang Gu
IPC: G06V10/778, G06K9/62, G06N3/04, G06T3/60, G06T3/40, G06V10/774
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately and efficiently learning parameters of a distilled neural network from parameters of a source neural network utilizing multiple augmentation strategies. For example, the disclosed systems can generate lightly augmented digital images and heavily augmented digital images. The disclosed systems can further learn parameters for a source neural network from the lightly augmented digital images. Moreover, the disclosed systems can learn parameters for a distilled neural network from the parameters learned for the source neural network. For example, the disclosed systems can compare classifications of heavily augmented digital images generated by the source neural network and the distilled neural network to transfer learned parameters from the source neural network to the distilled neural network via a knowledge distillation loss function.
-
Publication number: US11256918B2
Publication date: 2022-02-22
Application number: US16874114
Filing date: 2020-05-14
Applicant: Adobe Inc.
Inventor: Zhe Lin, Xiaohui Shen, Mingyang Ling, Jianming Zhang, Jason Wen Yong Kuen
Abstract: In implementations of object detection in images, object detectors are trained using heterogeneous training datasets. A first training dataset is used to train an image tagging network to determine an attention map of an input image for a target concept. A second training dataset is used to train a conditional detection network that accepts as conditional inputs the attention map and a word embedding of the target concept. Despite the conditional detection network being trained with a training dataset having a small number of seen classes (e.g., classes in a training dataset), it generalizes to novel, unseen classes by concept conditioning, since the target concept propagates through the conditional detection network via the conditional inputs, thus influencing classification and region proposal. Hence, classes of objects that can be detected are expanded, without the need to scale training databases to include additional classes.