METHOD FOR GENERATING CROSS-LINGUAL TEXTUAL SEMANTIC MODEL, AND ELECTRONIC DEVICE

    公开(公告)号:US20230080904A1

    公开(公告)日:2023-03-16

    申请号:US18054608

    申请日:2022-11-11

    Abstract: A method for generating a cross-lingual textual semantic model includes: acquiring a set of training data that includes pieces of monolingual non-parallel text and pieces of bilingual parallel text; determining a semantic vector of each piece of text in the set of training data by inputting each piece of text into an initial textual semantic model; determining a distance between semantic vectors of each two pieces of text in the set of training data based on the semantic vector of each piece of text in the set of training data; determining a gradient modification based on a parallel relationship between each two pieces of text in the set of training data and the distance between the semantic vectors of each two pieces of text in the set of training data; and acquiring a modified textual semantic model by modifying the initial textual semantic model based on the gradient modification.

    TASK EXECUTION METHOD AND APPARATUS FOR LARGE MODEL, ELECTRONIC DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20250094534A1

    公开(公告)日:2025-03-20

    申请号:US18968798

    申请日:2024-12-04

    Abstract: A task execution method for a large model relates to fields of artificial intelligence, deep learning and large model technologies, and includes executing attention tasks in a task group to be fused using a target computing unit to obtain attention features, where the attention task corresponds to a weighted matrix to be fused, the weighted matrix to be fused is obtained by weighting a matrix to be fused using a weight; obtaining a processing result according to the attention features; determining a loss information according to the processing result; and weighting and fusing matrices to be fused using the target computing unit according to weights for the task group to be fused if the loss information converges, to obtain a fusion matrix for a target task group, where a target task in the target task group is executed by the target computing unit according to the fusion matrix.

    DEEP LEARNING MODEL BASED DATA GENERATION
    15.
    发明公开

    公开(公告)号:US20240028909A1

    公开(公告)日:2024-01-25

    申请号:US18478833

    申请日:2023-09-29

    CPC classification number: G06N3/096

    Abstract: A data generation method based on a deep learning model and a training method is provided. The data generation method includes: determining an initial input of the deep learning model based on input data; obtaining a first output of the model, where in response to the model determining that generating a reply based on the initial input requires calling a first functional component different from the deep learning model, the first output includes a first token for calling the first functional component and a first intermediate inquiry determined based on the initial input and recognizable by the first functional component; obtaining a first intermediate result determined by the first functional component based on the first intermediate inquiry; determining a second input for the model based on the initial input and the first intermediate result; and obtaining a second output of the model for generating a reply to the initial input.

Patent Agency Ranking