PROMPT-TO-PROMPT IMAGE EDITING WITH CROSS-ATTENTION CONTROL

    Publication No.: US20240037822A1

    Publication Date: 2024-02-01

    Application No.: US18228614

    Filing Date: 2023-07-31

    Applicant: GOOGLE LLC

    CPC classification number: G06T11/60 G06F3/04845 G06F40/40

    Abstract: Some implementations are directed to editing a source image that was generated by processing a source natural language (NL) prompt using a large-scale language-image (LLI) model. Those implementations edit the source image based on user interface input that indicates an edit to the source NL prompt, optionally independent of any user interface input that specifies a mask in the source image and/or independent of any other user interface input. Some implementations of the present disclosure are additionally or alternatively directed to applying prompt-to-prompt editing techniques to a source image that is generated based on a real image and that approximates the real image.
