QUESTION ANSWERING METHOD, METHOD OF TRAINING A QUESTION ANSWERING MODEL, ELECTRONIC DEVICE, AND MEDIUM

    公开(公告)号:US20230153337A1

    公开(公告)日:2023-05-18

    申请号:US18157452

    申请日:2023-01-20

    CPC classification number: G06F16/3329 G06F40/30

    Abstract: A question answering method, a method of training a question answering model, a device, and a medium are provided, which relate to a field of artificial intelligence technology, in particular to fields of natural language processing technology, deep learning technology, and knowledge mapping technology. The question answering method includes: obtaining data to be processed, wherein the data to be processed includes a question and candidate answers; performing general semantic understanding on the data to be processed to obtain a general data feature; selecting a target question answering mode from candidate question answering modes based on the general data feature; and processing the general data feature by using the target question answering mode, to obtain a target answer for the question from the candidate answers.

    METHOD FOR TRAINING IMAGE-TEXT MATCHING MODEL, COMPUTING DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20230005284A1

    公开(公告)日:2023-01-05

    申请号:US17943458

    申请日:2022-09-13

    Abstract: A computer-implemented method is provided. The method includes: obtaining a sample text and a sample image corresponding to the sample text; labeling a true semantic tag for the sample text according to a first preset rule; obtaining a text feature representation of the sample text and a predicted semantic tag output by a text coding sub-model; obtaining an image feature representation of the sample image output by an image coding sub-model; calculating a first loss based on the true semantic tag and the predicted semantic tag; calculating a contrast loss based on the text feature representation of the sample text and the image feature representation of the sample image; adjusting parameters of the text coding sub-model based on the first loss and the contrast loss; and adjusting parameters of the image coding sub-model based on the contrast loss.

    INFORMATION EXTRACTION METHOD AND APPARATUS, ELECTRONIC DEVICE AND READABLE STORAGE MEDIUM

    公开(公告)号:US20230133717A1

    公开(公告)日:2023-05-04

    申请号:US17954900

    申请日:2022-09-28

    Abstract: Disclosed are an information extraction method, an electronic device and a readable storage medium, which relate to the field of artificial intelligence technologies, and particularly to the field of knowledge graph technologies. The information extraction method includes: acquiring to-be-processed text to obtain a semantic vector of each token in the to-be-processed text; generating a relationship prediction matrix, an entity prediction matrix and an alignment matrix according to each token in the to-be-processed text and the semantic vector of each token; and extracting a target triplet in the to-be-processed text using the relationship prediction matrix, the entity prediction matrix and the alignment matrix, and taking the target triplet as an information extraction result of the to-be-processed text.

    MULTIMODAL DATA PROCESSING
    8.
    发明申请

    公开(公告)号:US20230010160A1

    公开(公告)日:2023-01-12

    申请号:US17945415

    申请日:2022-09-15

    Abstract: Disclosed are a method for processing multimodal data using a neural network, a device, and a medium, and relates to the field of artificial intelligence and, in particular to multimodal data processing, video classification, and deep learning. The neural network includes: an input subnetwork configured to receive the multimodal data to output respective first features of a plurality of modalities; a plurality of cross-modal feature subnetworks, each of which is configured to receive respective first features of two corresponding modalities to output a cross-modal feature corresponding to the two modalities; a plurality of cross-modal fusion subnetworks, each of which is configured to receive at least one cross-modal feature corresponding to a corresponding target modality and other modalities to output a second feature of the target modality; and an output subnetwork configured to receive respective second features of the plurality of modalities to output a processing result of the multimodal data.

    METHOD FOR TRAINING CROSS-MODAL RETRIEVAL MODEL, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20220284246A1

    公开(公告)日:2022-09-08

    申请号:US17502385

    申请日:2021-10-15

    Abstract: The present disclosure discloses a method for training a cross-modal retrieval model, an electronic device and a storage medium, and relates to the field of computer technologies, and particularly to the field of artificial intelligence technologies, such as knowledge graph technologies, computer vision technologies, deep learning technologies, or the like. The method for training a cross-modal retrieval model includes: determining similarity of a cross-modal sample pair according to the cross-modal sample pair, the cross-modal sample pair including a sample of a first modal and a sample of a second modal, and the first modal being different from the second modal; determining a soft margin based on the similarity, and determining a soft margin loss function based on the soft margin; and determining a total loss function based on the soft margin loss function, and training a cross-modal retrieval model according to the total loss function.

Patent Agency Ranking