METHOD, APPARATUS, DEVICE, AND STORAGE MEDIUM FOR MODEL TRAINING

    公开(公告)号:US20250077980A1

    公开(公告)日:2025-03-06

    申请号:US18952687

    申请日:2024-11-19

    Abstract: There are provided a method, an apparatus, a device, and a storage medium for model training. In a method, a target model is fine-tuned using a set of training data, each training data including a sample question and corresponding annotation information, the annotation information including policy information for solving the sample question and answer information of the sample question. At least one sample question in the set of training data is provided to the fine-tuned target model to determine a candidate answer to the at least one sample question. The fine-tuned target model is trained based at least on a comparison between the candidate answer and the answer information of the at least one sample question.

    INFORMATION PROCESSING METHOD, TASK EXECUTION METHOD, APPARATUS, DEVICE AND MEDIUM

    公开(公告)号:US20250018567A1

    公开(公告)日:2025-01-16

    申请号:US18900502

    申请日:2024-09-27

    Abstract: The present application discloses an information processing method, a task execution method, an apparatus, a device and a medium. The method includes: processing, through a target visual encoding model in a target analysis model, obtained image information to be analyzed, to obtain a corresponding target sequence; fusing, through a target feature fusion model in the target analysis model, the target sequence and obtained text information to be analyzed, to obtain a target fusion result; processing the target fusion result through a target task analysis model in the target analysis model to obtain target task information; and controlling the action execution apparatus to perform an action corresponding to the target task information. The target analysis model is obtained by training an initial analysis model and the initial analysis model comprises an initial visual encoding model and an initial feature fusion model.

    SYSTEM AND METHOD FOR CONCEPT REMOVAL

    公开(公告)号:US20240378453A1

    公开(公告)日:2024-11-14

    申请号:US18658582

    申请日:2024-05-08

    Abstract: A system for removing a concept from a trained neural network for executing a classification task, the system comprising: the trained neural network, wherein the trained neural network comprises a hidden layer; and a classifier applied at a layer of the hidden layer, wherein: the classifier defines a representation vector at the layer of the hidden layer, wherein the representation vector classifies instances of the concept and non-instances of the concept at the layer; the classifier defines a concept activation vector, wherein the concept activation vector is a normal vector to the representation vector and the concept activation vector comprises an adversarial penalty objective to reduce the instances of the concept at the layer; and a loss function of the trained neural network is optimised based on a downstream loss of the classification task and the adversarial penalty objective.

    DIALOGUE UNDERSTANDING METHOD, APPARATUS, READABLE MEDIUM AND ELECTRONIC DEVICE

    公开(公告)号:US20240013007A1

    公开(公告)日:2024-01-11

    申请号:US18251874

    申请日:2021-11-02

    CPC classification number: G06F40/35 G06F40/205

    Abstract: A dialogue understanding method and apparatus, a readable medium, and an electronic device. The method acquires dialogue content and a preset dialogue parsing template, the preset dialogue parsing template comprising preset description and at least one c or at least one slot, where in the description information is used for describing a paraphrase of each candidate intention when the preset dialogue parsing template comprises at least one candidate intention, and describes a paraphrase of each slot when the preset dialogue parsing template at least one comprises slot; and uses the dialogue content and the dialogue parsing template as inputs of a pre-trained target dialogue understanding model to obtain a dialogue state corresponding to the dialogue content.

    RECOMMENDATION MODEL TRAINING METHOD, ARTICLE RECOMMENDATION METHOD AND SYSTEM, AND RELATED DEVICE

    公开(公告)号:US20240346343A1

    公开(公告)日:2024-10-17

    申请号:US18747287

    申请日:2024-06-18

    CPC classification number: G06N5/022 G06N5/04

    Abstract: The present disclosure relates to a recommendation model training method, an article recommendation method and system, and a related device. The recommendation model training method includes: processing data for training by using a recommendation model to obtain a category-independent representation and a category-dependent representation, wherein the data for training comprises a feature of a user and a feature of an article; processing the category-independent representation and the category-dependent representation respectively by using a discriminator to obtain discrimination results; determining a prediction result according to at least one of the category-independent representation or the category-dependent representation; and training the recommendation model and the discriminator according to training targets comprising the category-independent representation not corresponding to any one of the plurality of categories, the category-dependent representation corresponding to the pre-marked category, and the prediction result matching with the pre-marked recommendation information.

    METHOD FOR OBTAINING RECOMMENDED EXPLANATION, DEVICE, AND COMPUTER READABLE MEDIUM

    公开(公告)号:US20250061507A1

    公开(公告)日:2025-02-20

    申请号:US18722879

    申请日:2023-01-04

    Abstract: The present disclosure provides a method for obtaining a recommended explanation, a device, and a computer readable medium. The method includes: generating a recommended item by a recommendation model; calculating a similarity between a plurality of explanatory items and the recommended item; obtaining a predetermined number of explanatory items from the plurality of explanatory items, as a recommended explanation of the recommended item, wherein a similarity between the predetermined number of explanatory items and the recommended item is greater than a similarity between other explanatory items and the recommended item; and outputting identification information of the predetermined number of explanatory items.

Patent Agency Ranking