BOX DETECTION FOR OBJECT ATTACHMENT
    1.
    发明公开

    公开(公告)号:US20230252752A1

    公开(公告)日:2023-08-10

    申请号:US17666045

    申请日:2022-02-07

    Applicant: Lemon Inc.

    Abstract: The present disclosure describes techniques for determining a bounding box. An image may be received. An X-frame, a Y-frame, and a normal frame may be estimated based on the image using a first neural network. At least one planar region may be detected from the image using a second neural network. A vanishing point detection may be performed on each of the at least one planar region. Output of the first neural network may be fused with results of the vanishing point detection. A depth value of each pixel in at least one plane corresponding to the at least one planar region may be determined based at least in part on a result of the fusing. A location of a bounding box may be determined based at least in part on the depth value of each pixel in the at least one plane.

    Dual-level model for segmentation

    公开(公告)号:US12254631B2

    公开(公告)日:2025-03-18

    申请号:US17585301

    申请日:2022-01-26

    Applicant: Lemon Inc.

    Abstract: The techniques for dual-level semantic segmentation are provided. Data may be input to a first segmentation network. The input data comprises an image and label information associated with the image. The image may be captured at nighttime and may comprise a plurality of regions. At least one region among the plurality of regions may be determined based at least in part on output of the first segmentation network. The at least one region of the image may be cropped. The cropped at least one region may be input to a second segmentation network. A final output may be produced based on the output of the first segmentation network and output of the second segmentation network.

    Lightweight transformer for high resolution images

    公开(公告)号:US11983239B2

    公开(公告)日:2024-05-14

    申请号:US17342483

    申请日:2021-06-08

    Applicant: Lemon Inc.

    CPC classification number: G06F18/213 G06F18/24 G06N3/04 G06N3/08 G06V10/82

    Abstract: Systems and methods for obtaining attention features are described. Some examples may include: receiving, at a projector of a transformer, a plurality of tokens associated with image features of a first dimensional space; generating, at the projector of the transformer, projected features by concatenating the plurality of tokens with a positional map, the projected features having a second dimensional space that is less than the first dimensional space; receiving, at an encoder of the transformer, the projected features and generating encoded representations of the projected features using self-attention; decoding, at a decoder of the transformer, the encoded representations and obtaining a decoded output; and projecting the decoded output to the first dimensional space and adding the image features of the first dimensional space to obtain attention features associated with the image features.

    Box detection for object attachment

    公开(公告)号:US12190554B2

    公开(公告)日:2025-01-07

    申请号:US17666045

    申请日:2022-02-07

    Applicant: Lemon Inc.

    Abstract: The present disclosure describes techniques for determining a bounding box. An image may be received. An X-frame, a Y-frame, and a normal frame may be estimated based on the image using a first neural network. At least one planar region may be detected from the image using a second neural network. A vanishing point detection may be performed on each of the at least one planar region. Output of the first neural network may be fused with results of the vanishing point detection. A depth value of each pixel in at least one plane corresponding to the at least one planar region may be determined based at least in part on a result of the fusing. A location of a bounding box may be determined based at least in part on the depth value of each pixel in the at least one plane.

    METHOD AND APPARATUS FOR TRAINING BACKBONE NETWORK, IMAGE PROCESSING METHOD AND APPARATUS, AND DEVICE

    公开(公告)号:US20250139954A1

    公开(公告)日:2025-05-01

    申请号:US18926902

    申请日:2024-10-25

    Applicant: Lemon Inc.

    Abstract: The present application discloses a method and an apparatus for training a backbone network, an image processing method and apparatus, and a device. A weight selection cycle is set, where the weight selection cycle may include at least one backbone network training cycle. The backbone network is trained with sample data in the current weight selection cycle, and a cumulative weight adjustment amount for each weight in the backbone network in the current weight selection cycle is recorded. A target weight for which the cumulative weight adjustment amount meets a preset condition is selected from the backbone network based on the cumulative weight adjustment amount for each weight, and only the target weight in the backbone network is adjusted in a next weight selection cycle, to complete training of the backbone network in the next weight selection cycle based on the adjusted target weight.

    DUAL-LEVEL MODEL FOR SEGMENTATION
    8.
    发明公开

    公开(公告)号:US20230237662A1

    公开(公告)日:2023-07-27

    申请号:US17585301

    申请日:2022-01-26

    Applicant: Lemon Inc.

    CPC classification number: G06T7/11 G06T3/40 G06T2207/20132 G06T2207/20084

    Abstract: The present disclosure describes techniques for dual-level semantic segmentation. Data may be input to a first segmentation network. The input data comprises an image and label information associated with the image. The image may be captured at nighttime and may comprise a plurality of regions. At least one region among the plurality of regions may be determined based at least in part on output of the first segmentation network. The at least one region of the image may be cropped. The cropped at least one region may be input to a second segmentation network. A final output may be produced based on the output of the first segmentation network and output of the second segmentation network.

    EFFICIENT VIDEO PROCESSING VIA TEMPORAL PROGRESSIVE LEARNING

    公开(公告)号:US20230206067A1

    公开(公告)日:2023-06-29

    申请号:US18111756

    申请日:2023-02-20

    Applicant: Lemon Inc.

    Abstract: Systems and methods for performing temporal progressive learning for video processing are provided herein. Some examples include receiving a video that includes a plurality of frames, extracting a first subset of frames from the plurality of frames, and inputting the first subset of frames into a model that includes an encoder and a decoder. The examples further include comparing a first output of the model to the first subset of frames and updating the encoder, thereby training the encoder, and extracting a second subset of frames from the plurality of frames. The second subset of frames includes a number of frames that is larger than a number of frames in the first subset of frames. The examples further include inputting the second subset of frames into the model, comparing a second output of the model to the second subset of frames and updating the encoder, thereby further training the encoder.

Patent Agency Ranking