-
公开(公告)号:US11657230B2
公开(公告)日:2023-05-23
申请号:US16899994
申请日:2020-06-12
Applicant: ADOBE INC.
Inventor: Joon-Young Lee , Seonguk Seo
CPC classification number: G06F40/30 , G06F16/90332 , G06F17/16 , G06F18/25 , G06F40/20 , G06T7/10 , G06V20/70 , G06T2207/20081 , G06T2207/20084
Abstract: A method, apparatus, and non-transitory computer readable medium for referring image segmentation are described. Embodiments of the method, apparatus, and non-transitory computer readable medium may extract an image feature vector from an input image, extract a plurality of language feature vectors for a referral expression, wherein each of the plurality of language feature vectors comprises a different number of dimensions, combine each of the language feature vectors with the image feature vector using a fusion module to produce a plurality of self-attention vectors, combine the plurality of self-attention vectors to produce a multi-modal feature vector, and decode the multi-modal feature vector to produce an image mask indicating a portion of the input image corresponding to the referral expression.
-
公开(公告)号:US11526698B2
公开(公告)日:2022-12-13
申请号:US16893803
申请日:2020-06-05
Applicant: ADOBE INC.
Inventor: Joon-Young Lee , Seonguk Seo
Abstract: Systems and methods for video object segmentation are described. Embodiments of systems and methods may receive a referral expression and a video comprising a plurality of image frames, generate a first image mask based on the referral expression and a first image frame of the plurality of image frames, generate a second image mask based on the referral expression, the first image frame, the first image mask, and a second image frame of the plurality of image frames, and generate annotation information for the video including the first image mask overlaid on the first image frame and the second image mask overlaid on the second image frame.
-