-
公开(公告)号:US20250061610A1
公开(公告)日:2025-02-20
申请号:US18449286
申请日:2023-08-14
Applicant: ADOBE INC.
Inventor: Nipun Jindal , Brent Getlin , Oliver Brdiczka
IPC: G06T11/00 , G06F40/103 , G06F40/126 , G06T3/40 , G06V30/10
Abstract: Systems and methods for generating images with legible scene text are described. Embodiments are configured to obtain a prompt describing a scene, where the prompt includes scene text indicating text that is intended to be shown in a generated image; encode, using a prompt encoder, the prompt to generate a prompt embedding; encode, using a character-level encoder, the scene text to generate a character-level embedding; and generate, using an image generation network, an image that includes the scene text based on the prompt embedding and the character-level embedding.