OPERATOR PROCESSING METHOD OF DEEP LEARNING FRAMEWORK, DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20250005446A1

    公开(公告)日:2025-01-02

    申请号:US18547090

    申请日:2022-11-02

    Abstract: An operator processing method of a deep learning framework an electronic device, and a storage medium are provided, which relate to a field of computer technology, especially in a field of artificial intelligence technology such as deep learning. The specific implementation scheme includes: acquiring an operator to be processed, where the operator to be processed includes a template parameter independent of the deep learning framework and an operator kernel function; parsing, in response to receiving an input information for the operator to be processed, the template parameter by using the input information to obtain a plurality of complete template parameters related to the deep learning framework; and processing the operator kernel function according to the plurality of complete template parameters, to obtain an available operator for the deep learning framework.

    DIALOGUE MODEL TRAINING METHOD
    32.
    发明申请

    公开(公告)号:US20240412002A1

    公开(公告)日:2024-12-12

    申请号:US18747641

    申请日:2024-06-19

    Abstract: A method is provided. The method includes: obtaining a first sample dataset; inputting at least one first question text corresponding to at least one piece of first sample data into a dialog model separately to obtain at least one first answer prediction result; inputting each second question text into the dialog model to obtain a second answer prediction result output by the dialog model; inputting the second answer prediction result into a reward model to obtain a score of the second answer prediction result output by the reward model; determining a comprehensive loss based on the at least one first answer prediction result, a first answer text of each of the at least one piece of first sample data, and a score corresponding to each of at least one piece of second sample data; and adjusting at least one parameter of the dialog model based on the comprehensive loss.

    METHOD OF PROCESSING DATA, ELECTRONIC DEVICE, AND MEDIUM

    公开(公告)号:US20230086145A1

    公开(公告)日:2023-03-23

    申请号:US17936761

    申请日:2022-09-29

    Abstract: A method of processing data, a device, and a medium are provided, which relate to a field of an artificial intelligence technology, in particular to fields of computer vision, natural language technology, speech technology, deep learning and knowledge graph. The method of processing data includes: generating a video feature, a question feature and an answer feature based on acquired video data, acquired question data and acquired candidate answer data; determining a link relationship between the video feature, the question feature and the answer feature; and determining a matching result for the video data, the question data and the candidate answer data based on the link relationship.

    METHOD FOR ACQUIRING STRUCTURED QUESTION-ANSWERING MODEL, QUESTION-ANSWERING METHOD AND CORRESPONDING APPARATUS

    公开(公告)号:US20230018489A1

    公开(公告)日:2023-01-19

    申请号:US17862519

    申请日:2022-07-12

    Abstract: The present disclosure discloses a method for acquiring a structured question-answering (QA) model, a QA method and corresponding apparatuses, and relates to knowledge graph and deep learning technologies in the field of artificial intelligence technologies. A specific implementation solution involves: acquiring training samples corresponding to N structured QA database types, the training samples including question samples, information of the structured QA database types and query instruction samples used by the question samples to query structured QA databases of the types, N being an integer greater than 1; and training a text generation model by using the training samples to obtain the structured QA model, wherein the question samples and the information of the structured QA database types are taken as input to the text generation model, and the query instruction samples are taken as target output of the text generation model.

Patent Agency Ranking