GENERATION OF IMAGE CORRESPONDING TO INPUT TEXT USING DYNAMIC VALUE CLIPPING

    Publication No.: US20240153152A1

    Publication Date: 2024-05-09

    Application No.: US18052866

    Application Date: 2022-11-04

    Applicant: Lemon Inc.

    CPC classification number: G06T11/00 G06F40/40 G06T5/002

    Abstract: Systems and methods are provided that include a processor executing a program to receive input text from a user. The processor is further configured to, for a predetermined number of iterations, input an initial image into a diffusion process to generate a processed image, back-propagate the processed image through a text-image match gradient calculator to calculate a gradient against the input text, and update the initial image with an image generated by applying the calculated gradient to the processed image. The pixel values of the processed image during a first portion of the predetermined number of iterations are value clamped to a first range, and pixel values of the processed image during a second portion of the predetermined number of iterations are value clamped to a second range that is a subset of the first range.
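
Below is a minimal sketch of the iterative loop this abstract describes, in a PyTorch-style setting. The callables `diffusion_step` and `match_score`, the step size, and the clamp ranges and iteration split are illustrative assumptions, not the claimed implementation.

```python
import torch

def generate_with_dynamic_clipping(initial_image, text_embedding, diffusion_step,
                                   match_score, num_iters=100, lr=0.1):
    """Sketch: guided diffusion where pixel values are clamped to a wide
    range early on and to a narrower subset of that range later."""
    image = initial_image.clone()
    for i in range(num_iters):
        # Diffusion process produces a processed image from the current image.
        processed = diffusion_step(image, i)

        # Back-propagate a text-image match score to obtain a gradient
        # of the processed image against the input text.
        processed = processed.detach().requires_grad_(True)
        score = match_score(processed, text_embedding)
        grad = torch.autograd.grad(score, processed)[0]

        # Update the working image by applying the gradient to the processed image.
        image = (processed + lr * grad).detach()

        # Dynamic value clipping: first portion of iterations uses a wide range,
        # second portion uses a narrower subset of that range (values illustrative).
        if i < num_iters // 2:
            image = image.clamp(-1.0, 1.0)
        else:
            image = image.clamp(-0.8, 0.8)
    return image
```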

    VIDEO GENERATION METHOD, AND TRAINING METHOD FOR VIDEO GENERATION MODEL

    Publication No.: US20250131613A1

    Publication Date: 2025-04-24

    Application No.: US18834154

    Application Date: 2022-12-15

    Applicant: Lemon Inc.

    Abstract: Embodiments of the present disclosure provide a video generation method and a training method for a video generation model. The video generation method includes: acquiring a first video that includes a first object image; and inputting the first video into a pre-trained video generation model to obtain a second video. The video generation model is trained on the basis of a target image and a plurality of sample image pairs obtained from a plurality of first sample images; an object image in the second video is generated on the basis of a preset animal image in the target image and the first object image; and a background image of the second video is generated on the basis of a first background image of the first video.
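
A minimal sketch of the inference flow, assuming a hypothetical call signature for the pre-trained `video_generation_model` and illustrative tensor shapes; the claimed training procedure is not reproduced here.

```python
import torch

def generate_second_video(first_video, target_image, video_generation_model):
    """Sketch: the pre-trained model swaps the object in the first video for
    the preset animal image from the target image, while the output video's
    background follows the first video's background."""
    with torch.no_grad():
        # first_video: (frames, C, H, W); target_image: (C, H, W)  (assumed shapes)
        second_video = video_generation_model(first_video, target_image)
    return second_video
```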

    GENERATION OF IMAGE CORRESPONDING TO INPUT TEXT USING MULTI-TEXT GUIDED IMAGE CROPPING

    Publication No.: US20240153153A1

    Publication Date: 2024-05-09

    Application No.: US18052870

    Application Date: 2022-11-04

    Applicant: Lemon Inc.

    CPC classification number: G06T11/00 G06F40/40 G06T5/002

    Abstract: Systems and methods are provided that include a processor executing a program to receive an input from a user, where the input includes a first input text and a second input text. The processor is further configured to provide an initial image and, for a predetermined number of iterations, define first and second regions of the initial image associated with the first and second input texts, respectively, define a plurality of patches of the initial image, input the initial image into a diffusion process to generate a processed image, and back-propagate the processed image through a text-image match gradient calculator by generating an image embedding based on the processed image, generating a text embedding based on the region and the input text that are associated with a patch, and calculating a differential between the image embedding and the text embedding.
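
A minimal sketch of the per-patch gradient calculation, assuming CLIP-style `image_encoder` and `text_encoder` callables, a cosine-similarity differential, and a simple patch-to-text assignment format; all of these are assumptions rather than the claimed implementation.

```python
import torch
import torch.nn.functional as F

def multi_text_guidance_gradient(processed, patch_assignments, texts,
                                 image_encoder, text_encoder):
    """Sketch: each patch of the processed image is embedded and compared
    against the input text of the region it is associated with; the gradient
    of the summed differential is back-propagated to the image."""
    processed = processed.detach().requires_grad_(True)
    total = processed.new_zeros(())
    # patch_assignments: list of ((y0, y1, x0, x1), text_index) pairs (assumed format).
    for (y0, y1, x0, x1), text_idx in patch_assignments:
        patch = processed[..., y0:y1, x0:x1]
        image_emb = image_encoder(patch)          # image embedding from the patch
        text_emb = text_encoder(texts[text_idx])  # text embedding for the associated region
        # Differential between image and text embeddings (1 - cosine similarity).
        total = total + (1.0 - F.cosine_similarity(image_emb, text_emb, dim=-1)).mean()
    return torch.autograd.grad(total, processed)[0]
```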

    GENERATION OF CURATED TRAINING DATA FOR DIFFUSION MODELS

    Publication No.: US20240153194A1

    Publication Date: 2024-05-09

    Application No.: US18052865

    Application Date: 2022-11-04

    Applicant: Lemon Inc.

    CPC classification number: G06T15/02 G06F40/289 G06T5/002 G06T2207/20081

    Abstract: Systems and methods are provided that include a processor executing a program to match sentences from a sentence dataset with artistic phrases from an artistic phrase dataset to generate a plurality of safe phrases. The processor is further configured to, for each of the safe phrases, generate a safe image by, for a predetermined number of iterations, performing steps to input an initial image into a diffusion process to generate a processed image, wherein the diffusion process includes a first diffusion model, back-propagate the processed image through a text-image match gradient calculator to calculate a gradient against the safe phrase, and update the initial image by applying the gradient to the processed image. The processor is further configured to pair each of the generated safe images with its respective safe phrase to form a plurality of safe phrase-image pairs.
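
A minimal sketch of the curation pipeline, assuming hypothetical `match` and `generate_image` callables that stand in for the phrase-matching step and the guided-diffusion generation loop described above; the phrase composition format is also an assumption.

```python
def build_safe_phrase_image_pairs(sentences, artistic_phrases, match, generate_image):
    """Sketch: sentences are matched with artistic phrases to form safe
    phrases, a safe image is generated for each phrase, and each image is
    paired with its phrase to form safe phrase-image training pairs."""
    # Compose each sentence with its best-matching artistic phrase (assumed format).
    safe_phrases = [f"{sentence}, {match(sentence, artistic_phrases)}"
                    for sentence in sentences]
    pairs = []
    for phrase in safe_phrases:
        safe_image = generate_image(phrase)   # iterative diffusion + gradient guidance
        pairs.append((phrase, safe_image))    # safe phrase-image pair for training
    return pairs
```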

    GENERATION OF IMAGES CORRESPONDING TO INPUT TEXT USING MULTI-ALGORITHM DIFFUSION SAMPLING

    Publication No.: US20240153151A1

    Publication Date: 2024-05-09

    Application No.: US18052862

    Application Date: 2022-11-04

    Applicant: Lemon Inc.

    CPC classification number: G06T11/00 G06F40/40 G06T5/002

    Abstract: Systems and methods are provided that include a processor executing a program to process an initial image through a first diffusion stage to generate a final first stage image, wherein the first diffusion stage includes using a diffusion model, a gradient estimator model smaller than the diffusion model, and a text-image match gradient calculator. The processor further executes the program to process the final first stage image through a second diffusion stage to generate a final second stage image. The second diffusion stage includes, for a second predetermined number of iterations, inputting the final first stage image through the diffusion model, back-propagating the image through the text-image match gradient calculator to calculate a second stage gradient against the input text, and updating the final first stage image by applying the second stage gradient to the final first stage image.
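
A minimal sketch of the two-stage sampling flow, assuming the smaller `gradient_estimator` directly approximates the text-image match gradient in the first stage; the callables, stage lengths, and step size are assumptions, not the claimed design.

```python
import torch

def two_stage_sampling(initial_image, text_embedding,
                       diffusion_model, gradient_estimator, match_score,
                       n_first=50, n_second=50, lr=0.1):
    """Sketch: a cheap first stage guided by a small gradient estimator,
    followed by a second stage guided by back-propagating the text-image
    match score through the full diffusion output."""
    image = initial_image.clone()

    # First diffusion stage: the smaller gradient estimator approximates the
    # text-image match gradient without back-propagating through the full model.
    for i in range(n_first):
        processed = diffusion_model(image, i).detach()
        grad = gradient_estimator(processed, text_embedding)  # approximate gradient
        image = (processed + lr * grad).detach()
    final_first_stage_image = image

    # Second diffusion stage: back-propagate the processed image through the
    # text-image match gradient calculator to get a second stage gradient
    # against the input text, then apply it to update the image.
    image = final_first_stage_image
    for i in range(n_second):
        processed = diffusion_model(image, i).detach().requires_grad_(True)
        score = match_score(processed, text_embedding)
        second_stage_grad = torch.autograd.grad(score, processed)[0]
        image = (processed + lr * second_stage_grad).detach()
    return image
```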

    EXPRESSION DRIVING METHOD AND DEVICE, AND EXPRESSION DRIVING MODEL TRAINING METHOD AND DEVICE

    Publication No.: US20250078570A1

    Publication Date: 2025-03-06

    Application No.: US18726709

    Application Date: 2023-01-04

    Applicant: Lemon Inc.

    Abstract: The present disclosure provides an expression driving method and apparatus, and a training method and apparatus of an expression driving model. The expression driving method includes acquiring a first video; and inputting the first video into a pre-trained expression driving model to obtain a second video. The expression driving model is trained based on a target sample image and a plurality of first sample images. A facial image in the second video is generated based on the target sample image. A gesture expression feature of the facial image in the second video is the same as a gesture expression feature of a facial image in the first video.
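
A minimal sketch of the inference flow, assuming a hypothetical call signature for the pre-trained `expression_driving_model`; the training on the target sample image and first sample images is not reproduced here.

```python
import torch

def drive_expression(first_video, expression_driving_model):
    """Sketch: the pre-trained model outputs a second video whose face comes
    from the target sample image it was trained on, while the face's gesture
    and expression features follow the faces in the first (driving) video."""
    with torch.no_grad():
        second_video = expression_driving_model(first_video)
    return second_video
```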
