Cascaded domain bridging for image generation

    Publication No.: US12260485B2

    Publication Date: 2025-03-25

    Application No.: US18046077

    Filing Date: 2022-10-12

    Applicant: Lemon Inc.

    Abstract: A method of generating a style image is described. The method includes receiving an input image of a subject. The method further includes encoding the input image using a first encoder of a generative adversarial network (GAN) to obtain a first latent code. The method further includes decoding the first latent code using a first decoder of the GAN to obtain a normalized style image of the subject, wherein the GAN is trained using a loss function according to semantic regions of the input image and the normalized style image.

    VIDEO GENERATION METHOD, APPARATUS, DEVICE AND STORAGE MEDIUM

    Publication No.: US20250097545A1

    Publication Date: 2025-03-20

    Application No.: US18711530

    Filing Date: 2022-11-18

    Applicant: Lemon Inc.

    Abstract: The embodiments of the present disclosure provide a video generation method, an apparatus, a device, and a storage medium, the video generation method including: obtaining a plurality of images and music matched to the plurality of images; determining first feature information for the plurality of images and second feature information for the music; according to the first feature information, the second feature information and a plurality of pre-stored rendering effects, determining a target rendering effect combination; the rendering effects being animation, special effects or a transition; and generating a video according to the plurality of images, the music and the target rendering effect combination.

    Pre-training for scene text detection

    Publication No.: US12254707B2

    Publication Date: 2025-03-18

    Application No.: US17955285

    Filing Date: 2022-09-28

    Abstract: Embodiments of the present disclosure relate to a method, device and computer readable storage medium for scene text detection. In the method, a first visual representation of a first image is generated with an image encoding process. A first textual representation of a first text unit in the first image is generated with a text encoding process based on a first plurality of symbols obtained by masking a first symbol of a plurality of symbols in the first text unit. A first prediction of the masked first symbol is determined with a decoding process based on the first visual and textual representations. At least the image encoding process is updated according to at least a first training objective to increase at least similarity of the first prediction and the masked first symbol.
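
The masking step described in this abstract can be illustrated with a minimal Python sketch. It assumes a text unit is a string whose symbols are its characters; the function name `mask_symbol` and the `[MASK]` token are hypothetical choices made for illustration and do not come from the patent.

```python
import random

MASK_TOKEN = "[MASK]"  # hypothetical placeholder token, not from the source


def mask_symbol(text_unit, index=None, rng=None):
    """Mask one symbol of a text unit.

    Returns (masked_symbols, target_symbol, index). The target symbol is
    what a decoder would be trained to predict from the masked sequence.
    """
    symbols = list(text_unit)
    if not symbols:
        raise ValueError("text unit must contain at least one symbol")
    if index is None:
        # Pick the position to mask at random when none is given.
        rng = rng or random.Random()
        index = rng.randrange(len(symbols))
    if not 0 <= index < len(symbols):
        raise IndexError("mask index out of range")
    target = symbols[index]
    masked = symbols[:index] + [MASK_TOKEN] + symbols[index + 1:]
    return masked, target, index


masked, target, idx = mask_symbol("EXIT", index=2)
print(masked, target)
```

In a full training setup, the masked sequence would feed the text encoding process and the target symbol would serve as the label for the prediction; those components are outside the scope of this sketch.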

    Method and apparatus for knowledge graph construction, storage medium, and electronic device

    Publication No.: US12248884B2

    Publication Date: 2025-03-11

    Application No.: US18397227

    Filing Date: 2023-12-27

    Applicant: Lemon Inc.

    Abstract: The present disclosure relates to a method and apparatus for knowledge graph construction, a storage medium, and an electronic device. The method for knowledge graph construction comprises: identifying an entity concept from a title text of a target web page and at least one entity corresponding to the entity concept from a body text of the target web page; constructing a syntax parse tree of the title text based on syntax parse rules of a language to which the title text belongs, and determining, from the syntax parse tree, a modifier for modifying the entity concept; and generating a knowledge graph based on the entity concept, the modifier, and the at least one entity. Through the solution of the present disclosure, knowledge graphs with high accuracy and high recall rates are constructed without structured processing on target web pages.

    METHOD, APPARATUS, DEVICE, AND STORAGE MEDIUM FOR MODEL TRAINING

    Publication No.: US20250077980A1

    Publication Date: 2025-03-06

    Application No.: US18952687

    Filing Date: 2024-11-19

    Abstract: There are provided a method, an apparatus, a device, and a storage medium for model training. In a method, a target model is fine-tuned using a set of training data, each training data including a sample question and corresponding annotation information, the annotation information including policy information for solving the sample question and answer information of the sample question. At least one sample question in the set of training data is provided to the fine-tuned target model to determine a candidate answer to the at least one sample question. The fine-tuned target model is trained based at least on a comparison between the candidate answer and the answer information of the at least one sample question.

    FEATURE SPACE MANAGEMENT

    Publication No.: US20250077587A1

    Publication Date: 2025-03-06

    Application No.: US18458001

    Filing Date: 2023-08-29

    Applicant: Lemon Inc.

    Abstract: There are proposed methods, devices, and computer program products for extending a feature space of a data sample. In the method, a global representation is obtained for a feature in a plurality of features of the data sample. A local representation is obtained for the feature based on a classifying criterion for classifying the data sample into one of a plurality of predefined domains. A representation is generated for the feature of the data sample based on the global representation and the local representation. With these implementations, an exclusive feature space may be created for each domain identified by the classifying criterion, which is dedicated to capturing domain-specific knowledge and characteristics.
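
As a rough illustration of combining a global and a local representation, the Python sketch below looks up one shared vector and one domain-specific vector for a feature and concatenates them. The lookup tables, the `classify` callback standing in for the classifying criterion, and concatenation as the combination rule are all assumptions for illustration; the abstract does not specify them.

```python
from typing import Callable, Dict, List, Tuple

Vector = List[float]
FeatureKey = Tuple[str, str]  # (feature name, feature value)


def feature_representation(
    sample: Dict[str, str],
    feature: str,
    global_table: Dict[FeatureKey, Vector],
    local_tables: Dict[str, Dict[FeatureKey, Vector]],
    classify: Callable[[Dict[str, str]], str],
) -> Vector:
    """Build a representation from a global part and a domain-local part."""
    key = (feature, sample[feature])
    global_repr = global_table[key]          # shared across all domains
    domain = classify(sample)                # hypothetical classifying criterion
    local_repr = local_tables[domain][key]   # exclusive to the chosen domain
    return global_repr + local_repr          # concatenation (assumed rule)


# Toy data: one feature, two domains chosen by the sample's "region" field.
global_table = {("category", "music"): [0.1, 0.2]}
local_tables = {
    "us": {("category", "music"): [1.0]},
    "jp": {("category", "music"): [2.0]},
}
sample = {"category": "music", "region": "jp"}
rep = feature_representation(
    sample, "category", global_table, local_tables, lambda s: s["region"]
)
print(rep)
```

Keeping a separate local table per domain mirrors the abstract's idea of an exclusive feature space for each domain, while the global table carries what is shared.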

    MUSIC GENERATION METHOD, APPARATUS AND SYSTEM, AND STORAGE MEDIUM

    Publication No.: US20250069585A1

    Publication Date: 2025-02-27

    Application No.: US18725372

    Filing Date: 2023-04-27

    Applicant: Lemon Inc.

    Abstract: The present disclosure relates to a music generation method, apparatus and system, and storage medium. An embodiment of the present disclosure includes: obtaining text information, and converting the text information into a corresponding voice audio; obtaining an initial music audio, wherein the initial music audio comprises a music key point, and music characteristics of the initial music audio have a sudden change at the position of the music key point; and on the basis of the position of the music key point, synthesizing the voice audio and the initial music audio to obtain a target music audio. In the target music audio, the voice audio appears at the position of the music key point of the initial music audio. Thus, a music audio is generated from text information, and the user can customize the content of the text information and customize the initial music audio.

    IMAGE PROCESSING METHOD AND APPARATUS, AND ELECTRONIC DEVICE AND STORAGE MEDIUM

    Publication No.: US20250069279A1

    Publication Date: 2025-02-27

    Application No.: US18724491

    Filing Date: 2023-01-16

    Applicant: Lemon Inc.

    Abstract: Provided in the present disclosure are an image processing method and apparatus, and an electronic device and a storage medium. The method includes: obtaining configuration information matching effect editing in response to performing the effect editing on an initial image, wherein the configuration information includes a deep learning inference node for performing the effect editing on the initial image, and a pre-processing function node and a post-processing function node; calling processing logic of the pre-processing function node according to the configuration information to obtain input data; obtaining output data by means of an algorithm model corresponding to the deep learning inference node; and calling processing logic of the post-processing function node according to the configuration information to obtain a target image added with an effect.
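
The configuration-driven flow in this abstract (pre-processing node, deep learning inference node, post-processing node) can be sketched in Python as follows. The registry, the node names, the configuration keys, and the trivial "model" that inverts pixel values are hypothetical stand-ins chosen for illustration; the patent does not describe them in this form.

```python
from typing import Any, Callable, Dict, List

# Hypothetical registry mapping node names to their processing logic.
NODE_REGISTRY: Dict[str, Callable[[Any], Any]] = {
    # Pre-processing: scale 8-bit pixel values to the range [0, 1].
    "normalize": lambda pixels: [p / 255.0 for p in pixels],
    # Stand-in for an algorithm model: inverts each normalized value.
    "invert_model": lambda data: [1.0 - x for x in data],
    # Post-processing: map values back to 8-bit integers.
    "denormalize": lambda data: [round(x * 255) for x in data],
}


def apply_effect(initial_image: List[int], config: Dict[str, str]) -> List[int]:
    """Run the three nodes named in the configuration, in order."""
    pre = NODE_REGISTRY[config["pre_processing"]]
    infer = NODE_REGISTRY[config["inference"]]
    post = NODE_REGISTRY[config["post_processing"]]
    input_data = pre(initial_image)   # pre-processing yields the input data
    output_data = infer(input_data)   # model inference yields the output data
    return post(output_data)          # post-processing yields the target image


config = {
    "pre_processing": "normalize",
    "inference": "invert_model",
    "post_processing": "denormalize",
}
result = apply_effect([0, 51, 255], config)
print(result)
```

Selecting nodes by name from a configuration means a different effect can be described by swapping the configuration rather than changing the calling code.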
