DEBIASING VISION-LANGUAGE MODELS WITH ADDITIVE RESIDUALS

    公开(公告)号:US20240395024A1

    公开(公告)日:2024-11-28

    申请号:US18322253

    申请日:2023-05-23

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for debiasing vision-language models utilizing additive residual learning. In particular, in one or more embodiments, the disclosed systems generate an encoded image representation of a digital image utilizing an image encoder of a vision-language neural network. Additionally, in some embodiments, the disclosed systems extract a protected attribute encoding from the encoded image representation of the digital image utilizing an additive residual learner. Upon extracting the protected attribute encoding, in some implementations, the disclosed systems determine a debiased image encoding for the digital image by combining the protected attribute encoding and the encoded image representation.

Patent Agency Ranking