IMAGE GENERATION WITH LEGIBLE SCENE TEXT

    公开(公告)号:US20250061610A1

    公开(公告)日:2025-02-20

    申请号:US18449286

    申请日:2023-08-14

    Applicant: ADOBE INC.

    Abstract: Systems and methods for generating images with legible scene text are described. Embodiments are configured to obtain a prompt describing a scene, where the prompt includes scene text indicating text that is intended to be shown in a generated image; encode, using a prompt encoder, the prompt to generate a prompt embedding; encode, using a character-level encoder, the scene text to generate a character-level embedding; and generate, using an image generation network, an image that includes the scene text based on the prompt embedding and the character-level embedding.

Patent Agency Ranking