Publication No.: US20240037822A1
Publication date: 2024-02-01
Application No.: US18228614
Filing date: 2023-07-31
Applicant: GOOGLE LLC
Inventor: Kfir Aberman, Amir Hertz, Yael Pritch Knaan, Ron Mokady, Jay Tenenbaum, Daniel Cohen-Or
IPC: G06T11/60, G06F3/04845, G06F40/40
CPC classification number: G06T11/60, G06F3/04845, G06F40/40
Abstract: Some implementations are directed to editing a source image, where the source image is generated based on processing a source natural language (NL) prompt using a large-scale language-image (LLI) model. Those implementations edit the source image based on user interface input that indicates an edit to the source NL prompt, optionally independent of any user interface input that specifies a mask in the source image and/or independent of any other user interface input. Some implementations of the present disclosure are additionally or alternatively directed to applying prompt-to-prompt editing techniques to a source image that is generated based on a real image and that approximates the real image.