GENERATION OF IMAGES CORRESPONDING TO INPUT TEXT USING MULTI-ALGORITHM DIFFUSION SAMPLING
Abstract:
Systems and methods are provided that include a processor executing a program to process an initial image through a first diffusion stage to generate a final first stage image, wherein the first diffusion stage includes using a diffusion model, a gradient estimator model smaller than the diffusion model, and a text-image match gradient calculator. The processor further executes the program to process the final first stage image through a second diffusion stage to generate a final second stage image. The second diffusion stage includes, for a second predetermined number of iterations, inputting the final first stage image to through the diffusion model, back-propagate the image through the text-image match gradient calculator to calculate a second stage gradient against the input text, and update the final first stage image by applying the second stage gradient to the final first stage image.
Information query
Patent Agency Ranking
0/0