-
公开(公告)号:US20230252752A1
公开(公告)日:2023-08-10
申请号:US17666045
申请日:2022-02-07
Applicant: Lemon Inc.
Inventor: Shuo Cheng , Peng Wang
CPC classification number: G06V10/25 , G06N3/0454 , G06T7/50 , G06T7/70 , G06T2207/20076
Abstract: The present disclosure describes techniques for determining a bounding box. An image may be received. An X-frame, a Y-frame, and a normal frame may be estimated based on the image using a first neural network. At least one planar region may be detected from the image using a second neural network. A vanishing point detection may be performed on each of the at least one planar region. Output of the first neural network may be fused with results of the vanishing point detection. A depth value of each pixel in at least one plane corresponding to the at least one planar region may be determined based at least in part on a result of the fusing. A location of a bounding box may be determined based at least in part on the depth value of each pixel in the at least one plane.
-
2.
公开(公告)号:US20230237728A1
公开(公告)日:2023-07-27
申请号:US17586312
申请日:2022-01-27
Applicant: Lemon Inc.
Inventor: Peng Wang , Angtian Wang , Jian Sun
CPC classification number: G06T15/06 , G06T17/10 , G06T19/20 , G06T2219/2021
Abstract: The present disclosure describes techniques of rendering images using explicit object representation via rays tracing volume density aggregation. The techniques comprise reconstructing an object into a plurality of Gaussian ellipsoids; determining a volume density of each of the plurality of Gaussian ellipsoids along each of a plurality of viewing rays; determining a weight of each of the plurality of Gaussian ellipsoids based on the volume density; and synthesizing an image of the object using the determined weight on each pixel of the image to interpolate attributes of each of the plurality of Gaussian ellipsoids.
-
公开(公告)号:US12254631B2
公开(公告)日:2025-03-18
申请号:US17585301
申请日:2022-01-26
Applicant: Lemon Inc.
Inventor: Peng Wang , Xueqing Deng , Xiaochen Lian
Abstract: The techniques for dual-level semantic segmentation are provided. Data may be input to a first segmentation network. The input data comprises an image and label information associated with the image. The image may be captured at nighttime and may comprise a plurality of regions. At least one region among the plurality of regions may be determined based at least in part on output of the first segmentation network. The at least one region of the image may be cropped. The cropped at least one region may be input to a second segmentation network. A final output may be produced based on the output of the first segmentation network and output of the second segmentation network.
-
公开(公告)号:US11983239B2
公开(公告)日:2024-05-14
申请号:US17342483
申请日:2021-06-08
Applicant: Lemon Inc.
Inventor: Xiaochen Lian , Mingyu Ding , Linjie Yang , Peng Wang , Xiaojie Jin
CPC classification number: G06F18/213 , G06F18/24 , G06N3/04 , G06N3/08 , G06V10/82
Abstract: Systems and methods for obtaining attention features are described. Some examples may include: receiving, at a projector of a transformer, a plurality of tokens associated with image features of a first dimensional space; generating, at the projector of the transformer, projected features by concatenating the plurality of tokens with a positional map, the projected features having a second dimensional space that is less than the first dimensional space; receiving, at an encoder of the transformer, the projected features and generating encoded representations of the projected features using self-attention; decoding, at a decoder of the transformer, the encoded representations and obtaining a decoded output; and projecting the decoded output to the first dimensional space and adding the image features of the first dimensional space to obtain attention features associated with the image features.
-
公开(公告)号:US12190554B2
公开(公告)日:2025-01-07
申请号:US17666045
申请日:2022-02-07
Applicant: Lemon Inc.
Inventor: Shuo Cheng , Peng Wang
Abstract: The present disclosure describes techniques for determining a bounding box. An image may be received. An X-frame, a Y-frame, and a normal frame may be estimated based on the image using a first neural network. At least one planar region may be detected from the image using a second neural network. A vanishing point detection may be performed on each of the at least one planar region. Output of the first neural network may be fused with results of the vanishing point detection. A depth value of each pixel in at least one plane corresponding to the at least one planar region may be determined based at least in part on a result of the fusing. A location of a bounding box may be determined based at least in part on the depth value of each pixel in the at least one plane.
-
公开(公告)号:US20250139954A1
公开(公告)日:2025-05-01
申请号:US18926902
申请日:2024-10-25
Applicant: Lemon Inc.
Inventor: Xueqing Deng , Qi Fan , Peng Wang , Linjie Yang , Xiaojie Jin
IPC: G06V10/778 , G06V10/77 , G06V10/82
Abstract: The present application discloses a method and an apparatus for training a backbone network, an image processing method and apparatus, and a device. A weight selection cycle is set, where the weight selection cycle may include at least one backbone network training cycle. The backbone network is trained with sample data in the current weight selection cycle, and a cumulative weight adjustment amount for each weight in the backbone network in the current weight selection cycle is recorded. A target weight for which the cumulative weight adjustment amount meets a preset condition is selected from the backbone network based on the cumulative weight adjustment amount for each weight, and only the target weight in the backbone network is adjusted in a next weight selection cycle, to complete training of the backbone network in the next weight selection cycle based on the adjusted target weight.
-
公开(公告)号:US11871137B2
公开(公告)日:2024-01-09
申请号:US18089348
申请日:2022-12-27
Applicant: Lemon Inc.
Inventor: Shupeng Zhang , Boheng Qiu , Peng Wang , Jie Liao
CPC classification number: H04N5/265 , G06T5/002 , G06T5/003 , G06T7/194 , H04N5/2628 , G06T2207/10016 , G06T2207/20021 , G06T2207/20192 , G06T2207/20212
Abstract: A method and apparatus for converting a picture into a video, and a device and a storage medium. The method for converting a picture into a video includes: partitioning an original picture to obtain a foreground region and a background region; performing an iterative transformation for visual depth on the background region, and storing an image obtained through each transformation as a picture frame to obtain multiple frames of images; and splicing the multiple frames of images to obtain a target video. The iterative transformation includes at least two transformations for visual depth.
-
公开(公告)号:US20230237662A1
公开(公告)日:2023-07-27
申请号:US17585301
申请日:2022-01-26
Applicant: Lemon Inc.
Inventor: Peng Wang , Xueqing Deng , Xiaochen Lian
CPC classification number: G06T7/11 , G06T3/40 , G06T2207/20132 , G06T2207/20084
Abstract: The present disclosure describes techniques for dual-level semantic segmentation. Data may be input to a first segmentation network. The input data comprises an image and label information associated with the image. The image may be captured at nighttime and may comprise a plurality of regions. At least one region among the plurality of regions may be determined based at least in part on output of the first segmentation network. The at least one region of the image may be cropped. The cropped at least one region may be input to a second segmentation network. A final output may be produced based on the output of the first segmentation network and output of the second segmentation network.
-
公开(公告)号:US20230206067A1
公开(公告)日:2023-06-29
申请号:US18111756
申请日:2023-02-20
Applicant: Lemon Inc.
Inventor: Peng Wang , Heng Wang , Xianhang Li , Xinyu Li
IPC: G06N3/08 , G06N3/0455 , G06V10/75 , G06V10/771 , G06V10/77 , G06V10/82 , G06V20/40
CPC classification number: G06N3/08 , G06N3/0455 , G06V10/751 , G06V10/771 , G06V10/7715 , G06V10/82 , G06V20/46
Abstract: Systems and methods for performing temporal progressive learning for video processing are provided herein. Some examples include receiving a video that includes a plurality of frames, extracting a first subset of frames from the plurality of frames, and inputting the first subset of frames into a model that includes an encoder and a decoder. The examples further include comparing a first output of the model to the first subset of frames and updating the encoder, thereby training the encoder, and extracting a second subset of frames from the plurality of frames. The second subset of frames includes a number of frames that is larger than a number of frames in the first subset of frames. The examples further include inputting the second subset of frames into the model, comparing a second output of the model to the second subset of frames and updating the encoder, thereby further training the encoder.
-
公开(公告)号:US12045927B2
公开(公告)日:2024-07-23
申请号:US17586312
申请日:2022-01-27
Applicant: Lemon Inc.
Inventor: Peng Wang , Angtian Wang , Jian Sun
CPC classification number: G06T15/06 , G06T17/10 , G06T19/20 , G06T2219/2021
Abstract: The present disclosure describes techniques of rendering images using explicit object representation via rays tracing volume density aggregation. The techniques comprise reconstructing an object into a plurality of Gaussian ellipsoids; determining a volume density of each of the plurality of Gaussian ellipsoids along each of a plurality of viewing rays; determining a weight of each of the plurality of Gaussian ellipsoids based on the volume density; and synthesizing an image of the object using the determined weight on each pixel of the image to interpolate attributes of each of the plurality of Gaussian ellipsoids.
-
-
-
-
-
-
-
-
-