-
Publication No.: US20250086758A1
Publication date: 2025-03-13
Application No.: US18728042
Filing date: 2023-01-13
Applicant: Lemon Inc.
Inventor: Yichun SHI, Xiao YANG, Xiaohui SHEN
Abstract: The present disclosure provides an image processing method and device. The image processing method includes: performing, by an encoder and a first model, multiple iterations on an initial image to obtain a target image feature corresponding to the initial image; and performing, by a second model, image reconstruction based on the target image feature to obtain a reconstructed image of the initial image, both of the first model and the second model being neural networks for image reconstruction, wherein in the multiple iterations, an image feature extracted by the first model in the image reconstruction and an output image of the first model are feedback information for the encoder to assist the encoder in encoding the initial image.
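The feedback loop in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: `iterative_encode`, `toy_encoder`, `toy_first_model`, and `toy_second_model` are hypothetical names, images and features are plain lists of floats, and the iteration count is arbitrary.

```python
from typing import Callable, List, Optional, Tuple

Vector = List[float]
# Feedback = (image feature extracted by the first model, output image of the first model)
Feedback = Tuple[Vector, Vector]


def iterative_encode(
    image: Vector,
    encoder: Callable[[Vector, Optional[Feedback]], Vector],
    first_model: Callable[[Vector], Feedback],
    num_iterations: int = 3,
) -> Vector:
    """Encode `image` repeatedly, feeding the first model's outputs back to the encoder."""
    if num_iterations < 1:
        raise ValueError("num_iterations must be at least 1")
    feedback: Optional[Feedback] = None  # no feedback exists before the first pass
    feature: Vector = []
    for _ in range(num_iterations):
        feature = encoder(image, feedback)
        feedback = first_model(feature)
    return feature  # the target image feature


# Hypothetical toy stand-ins so the loop can be run end to end.
def toy_first_model(feature: Vector) -> Feedback:
    intermediate = [0.5 * f for f in feature]  # stands in for an internal feature
    output_image = list(feature)               # stands in for the reconstructed image
    return intermediate, output_image


def toy_encoder(image: Vector, feedback: Optional[Feedback]) -> Vector:
    if feedback is None:
        return [0.0 for _ in image]
    # A real encoder would also use the intermediate feature; this toy ignores it.
    _intermediate, previous_output = feedback
    # Move the estimate halfway from the previous output toward the input image.
    return [p + 0.5 * (x - p) for x, p in zip(image, previous_output)]


def toy_second_model(feature: Vector) -> Vector:
    return list(feature)


image = [1.0, 2.0, 4.0]
target_feature = iterative_encode(image, toy_encoder, toy_first_model, num_iterations=4)
reconstructed = toy_second_model(target_feature)
```

In this toy setup each pass halves the remaining error, so more iterations bring the reconstruction closer to the input. The first model only guides encoding, and the second model produces the final image.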
-
Publication No.: US20240265628A1
Publication date: 2024-08-08
Application No.: US18165619
Filing date: 2023-02-07
Applicant: Lemon Inc.
Inventor: Hongyi XU, Sizhe AN, Yichun SHI, Guoxian SONG, Linjie LUO
CPC classification number: G06T17/00, G06T3/4053, G06T7/194, G06T7/70, G06T11/00, G06T15/10, G06T19/20, G06T2207/20081, G06T2207/20084, G06T2207/30196, G06T2210/12, G06T2210/22, G06T2219/2004, G06T2219/2016
Abstract: A three-dimensional generative adversarial network includes a generator, a discriminator, and a renderer. The generator is configured to receive an intermediate latent code mapped from a latent code and a camera pose, generate two-dimensional backgrounds for a set of images, and generate, based on the intermediate latent code, multi-grid representation features. The renderer is configured to synthesize images based on the camera pose, a camera pose offset, and the multi-grid representation features; the camera pose offset being mapped from the latent code and the camera pose; and render a foreground mask. The discriminator is configured to supervise a training of the foreground mask with an up-sampled image and a super-resolved image.
-
Publication No.: US20250078392A1
Publication date: 2025-03-06
Application No.: US18238780
Filing date: 2023-08-28
Applicant: Lemon Inc.
Inventor: Yichun SHI, Peng WANG, Jianglong YE, Long MAI, Xiao YANG, Xiaohui SHEN
IPC: G06T15/20, G06N3/0455, G06N3/096
Abstract: An image generation system is described. The system comprises a neural network model configured to perform a diffusion process to generate a set of multi-view images from the same input prompt. The multi-view images show the same subject from different view orientations. The neural network model comprises a self-attention layer configured to relate pixels across the set of multi-view images.
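One way to picture a self-attention layer that relates pixels across views is to flatten the pixels of every view into a single token sequence before attending. The sketch below is a toy illustration under assumptions not stated in the abstract: `cross_view_self_attention` is a hypothetical name, the query/key/value projections are the identity, and there is a single head with no diffusion model around it.

```python
import math
from typing import List

Vector = List[float]


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_view_self_attention(views: List[List[Vector]]) -> List[List[Vector]]:
    """views[v][p] is the feature vector of pixel p in view v."""
    # Flatten all views into one sequence so every pixel can attend to
    # pixels in every other view, not only to pixels in its own view.
    tokens = [tok for view in views for tok in view]
    dim = len(tokens[0])
    scale = 1.0 / math.sqrt(dim)

    attended = []
    for query in tokens:
        # Assumption: identity Q/K/V projections (a real layer learns them).
        scores = [scale * sum(q * k for q, k in zip(query, key)) for key in tokens]
        weights = softmax(scores)
        attended.append(
            [sum(w * value[i] for w, value in zip(weights, tokens)) for i in range(dim)]
        )

    # Restore the original per-view layout.
    out, start = [], 0
    for view in views:
        out.append(attended[start:start + len(view)])
        start += len(view)
    return out


views = [
    [[1.0, 0.0], [0.0, 1.0]],  # view 0, two pixels
    [[1.0, 1.0], [0.5, 0.5]],  # view 1, two pixels
]
result = cross_view_self_attention(views)
```

Because the sequence spans all views, changing any pixel in one view changes the attended output of the others, which is the kind of coupling that can keep a subject consistent across views.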
-
Publication No.: US20250054271A1
Publication date: 2025-02-13
Application No.: US18723150
Filing date: 2022-12-22
Applicant: Lemon Inc.
Inventor: Yichun SHI, Xiao YANG, Xiaohui SHEN
IPC: G06V10/44, G06T3/4007, G06V10/74
Abstract: The present disclosure provides a video generation method and device. The video generation method includes: extracting a first image feature from a first image; obtaining a plurality of intermediate image features by means of nonlinear interpolation according to the first image feature and a second image feature, wherein the second image feature is an image feature of a second image; and performing image reconstruction by means of an image generation model based on the first image feature, the second image feature, and the plurality of intermediate image features, so as to generate a target video, wherein the target video is used for presenting a process of a gradual change from the first image to the second image.
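The abstract does not say which nonlinear interpolation is used. As one possible instance, the sketch below uses spherical linear interpolation (slerp) between two feature vectors. The function names, the choice of slerp, and the evenly spaced interpolation steps are all assumptions made for illustration.

```python
import math
from typing import List

Vector = List[float]


def slerp(a: Vector, b: Vector, t: float) -> Vector:
    """Spherical linear interpolation between feature vectors a and b, t in [0, 1]."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("slerp is undefined for zero-length vectors")
    dot = sum(x * y for x, y in zip(a, b))
    cos_omega = max(-1.0, min(1.0, dot / (norm_a * norm_b)))
    omega = math.acos(cos_omega)  # angle between the two features
    sin_omega = math.sin(omega)
    if abs(sin_omega) < 1e-8:
        # Nearly parallel or antiparallel: fall back to linear interpolation.
        return [(1.0 - t) * x + t * y for x, y in zip(a, b)]
    weight_a = math.sin((1.0 - t) * omega) / sin_omega
    weight_b = math.sin(t * omega) / sin_omega
    return [weight_a * x + weight_b * y for x, y in zip(a, b)]


def intermediate_features(first: Vector, second: Vector, count: int) -> List[Vector]:
    """Return `count` features strictly between `first` and `second`."""
    return [slerp(first, second, (i + 1) / (count + 1)) for i in range(count)]


first_feature = [1.0, 0.0]
second_feature = [0.0, 1.0]
middle = intermediate_features(first_feature, second_feature, count=3)

# The ordered sequence of features that an image generation model would decode
# into video frames, one frame per feature.
frame_features = [first_feature] + middle + [second_feature]
```

Unlike linear interpolation, slerp between two unit-length features keeps every intermediate feature at unit length. This is a common reason to prefer it for latent features, though the disclosure may use a different scheme.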
-