METHOD OF PROCESSING MULTIMEDIA DATA, DEVICE AND MEDIUM

    公开(公告)号:US20230115737A1

    公开(公告)日:2023-04-13

    申请号:US18080432

    申请日:2022-12-13

    Abstract: A method of processing multimedia data, a device, and a medium, which relates to a field of an artificial intelligence technology, in particular to fields of knowledge graph and deep learning. The method of processing the multimedia data includes: recognizing the multimedia data so as to obtain at least one key information of the multimedia data; querying a predetermined knowledge base according to the at least one key information, so as to determine a multimedia name associated with the at least one key information and an association degree between the multimedia name and the at least one key information; and determining, in the multimedia name, a name of the multimedia data based on a similarity between alternative multimedia data for the multimedia name and the multimedia data, in response to the association degree being less than a first threshold value.

    MULTIMODAL DATA PROCESSING
    3.
    发明申请

    公开(公告)号:US20230010160A1

    公开(公告)日:2023-01-12

    申请号:US17945415

    申请日:2022-09-15

    Abstract: Disclosed are a method for processing multimodal data using a neural network, a device, and a medium, and relates to the field of artificial intelligence and, in particular to multimodal data processing, video classification, and deep learning. The neural network includes: an input subnetwork configured to receive the multimodal data to output respective first features of a plurality of modalities; a plurality of cross-modal feature subnetworks, each of which is configured to receive respective first features of two corresponding modalities to output a cross-modal feature corresponding to the two modalities; a plurality of cross-modal fusion subnetworks, each of which is configured to receive at least one cross-modal feature corresponding to a corresponding target modality and other modalities to output a second feature of the target modality; and an output subnetwork configured to receive respective second features of the plurality of modalities to output a processing result of the multimodal data.

    QUESTION ANSWERING METHOD, METHOD OF TRAINING A QUESTION ANSWERING MODEL, ELECTRONIC DEVICE, AND MEDIUM

    公开(公告)号:US20230153337A1

    公开(公告)日:2023-05-18

    申请号:US18157452

    申请日:2023-01-20

    CPC classification number: G06F16/3329 G06F40/30

    Abstract: A question answering method, a method of training a question answering model, a device, and a medium are provided, which relate to a field of artificial intelligence technology, in particular to fields of natural language processing technology, deep learning technology, and knowledge mapping technology. The question answering method includes: obtaining data to be processed, wherein the data to be processed includes a question and candidate answers; performing general semantic understanding on the data to be processed to obtain a general data feature; selecting a target question answering mode from candidate question answering modes based on the general data feature; and processing the general data feature by using the target question answering mode, to obtain a target answer for the question from the candidate answers.

    METHOD OF PROCESSING DATA, ELECTRONIC DEVICE, AND MEDIUM

    公开(公告)号:US20230086145A1

    公开(公告)日:2023-03-23

    申请号:US17936761

    申请日:2022-09-29

    Abstract: A method of processing data, a device, and a medium are provided, which relate to a field of an artificial intelligence technology, in particular to fields of computer vision, natural language technology, speech technology, deep learning and knowledge graph. The method of processing data includes: generating a video feature, a question feature and an answer feature based on acquired video data, acquired question data and acquired candidate answer data; determining a link relationship between the video feature, the question feature and the answer feature; and determining a matching result for the video data, the question data and the candidate answer data based on the link relationship.

    METHOD FOR ACQUIRING STRUCTURED QUESTION-ANSWERING MODEL, QUESTION-ANSWERING METHOD AND CORRESPONDING APPARATUS

    公开(公告)号:US20230018489A1

    公开(公告)日:2023-01-19

    申请号:US17862519

    申请日:2022-07-12

    Abstract: The present disclosure discloses a method for acquiring a structured question-answering (QA) model, a QA method and corresponding apparatuses, and relates to knowledge graph and deep learning technologies in the field of artificial intelligence technologies. A specific implementation solution involves: acquiring training samples corresponding to N structured QA database types, the training samples including question samples, information of the structured QA database types and query instruction samples used by the question samples to query structured QA databases of the types, N being an integer greater than 1; and training a text generation model by using the training samples to obtain the structured QA model, wherein the question samples and the information of the structured QA database types are taken as input to the text generation model, and the query instruction samples are taken as target output of the text generation model.

Patent Agency Ranking