METHOD OF TRAINING INFORMATION GENERATION MODEL, METHOD OF GENERATING INFORMATION, AND DEVICE

    公开(公告)号:US20230075339A1

    公开(公告)日:2023-03-09

    申请号:US18056137

    申请日:2022-11-16

    Abstract: The present disclosure provides a method of training an information generation model, a method of generating an information, an electronic device, and a storage medium. A specific implementation solution of the method of training the information generation model includes: splitting a description information for a target object in an information pair into at least one description word, so as to obtain a description word sequence, wherein the information pair further includes a first recommendation information; inputting the description word sequence into a dialog generation model to obtain a probability vector sequence for the target object, wherein each probability vector in the probability vector sequence includes probability values for a plurality of predetermined words; and training the dialog generation model according to the probability vector sequence and the first recommendation information, so as to obtain the information generation model.

    METHOD FOR EXTRACTING TEXT INFORMATION, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20230073550A1

    公开(公告)日:2023-03-09

    申请号:US17988065

    申请日:2022-11-16

    Abstract: A method for extracting text information includes: acquiring a text to be extracted and a target field name; extracting candidate text information matching the target field name from the text to be extracted based on the text to be extracted and the target field name; and acquiring target text information matching fusion semantics of the text to be extracted, the target field name and the candidate text information by filtering the candidate text information based on the fusion semantics. Therefore, when the candidate text information matching the target field name is extracted from the text to be extracted, the candidate text information is filtered based on the fusion semantics of the text to be extracted, the target field name and the candidate text information, which improves the accuracy of extracting text information.

    Method for processing multimodal images, apparatus, device and storage medium

    公开(公告)号:US11600057B2

    公开(公告)日:2023-03-07

    申请号:US17355368

    申请日:2021-06-23

    Inventor: Shengzhao Wen

    Abstract: Provided are a method for processing multimodal images, an apparatus, a device and a storage medium. Multiple types of vision sensors are disposed in first preset identity recognition scenario. The method includes: if it is determined that a first vision sensor detects a biometric part of a target object, controlling each vision sensor to separately perform image acquisition for the biometric part in accordance with a preset acquisition strategy to obtain a target visual image of corresponding type and acquisition time information of the target visual image; performing identity recognition for the target object according to first target visual image to determine object identification information corresponding to first target visual image; determining object identification information corresponding to a target visual image of other type other than first target visual image according to acquisition time information of each target visual image and object identification information corresponding to first target visual image.

    METHOD OF PROCESSING IMAGE, METHOD OF TRAINING MODEL, ELECTRONIC DEVICE AND MEDIUM

    公开(公告)号:US20230065675A1

    公开(公告)日:2023-03-02

    申请号:US17982616

    申请日:2022-11-08

    Abstract: A method of processing an image, a method of training a model, an electronic device and a medium, which relate to a field of artificial intelligence technology, in particular to deep learning, computer vision and other technical fields. A solution includes: generating a first face image, wherein a definition difference and an authenticity difference between the first face image and a reference face image are within a set range; adjusting, according to a target voice used to drive the first face image, a facial action information related to pronunciation in the first face image to generate a second face image with a facial tissue position conforming to a pronunciation rule of the target voice; and determining the second face image as a face image driven by the target voice.

    HUMAN-OBJECT INTERACTION DETECTION
    170.
    发明申请

    公开(公告)号:US20230051232A1

    公开(公告)日:2023-02-16

    申请号:US17976673

    申请日:2022-10-28

    Abstract: A human-object interaction detection method, a neural network and a training method therefor is provided. The human-object interaction detection method includes: performing first target feature extraction on an image feature of an image; performing first interaction feature extraction on the image feature; processing a plurality of first target features to obtain target information of a plurality of detected targets; processing one or more first interaction features to obtain motion information of a motion, human information of a human target corresponding to each motion, and object information of an object target corresponding to each motion; matching the plurality of detected targets with one or more motions; and updating human information of a corresponding human target based on target information of a detected target matching the corresponding human target, and updating object information of a corresponding object target based on target information of a detected target matching the corresponding object target.

Patent Agency Ranking