Model Training Method, Electronic Device, And Storage Medium

    公开(公告)号:US20230142217A1

    公开(公告)日:2023-05-11

    申请号:US17896690

    申请日:2022-08-26

    CPC classification number: G06F40/47 G06F40/166 G06F40/30 G06F40/295 G06F40/151

    Abstract: The present disclosure provides a model training method and apparatus, an electronic device, and a storage medium, and relates to the field of artificial intelligence, in particular, to the field of natural language processing and deep learning. A specific implementation solution includes: constructing initial training corpora; performing data enhancement on the initial training corpora based on an algorithm contained in a target algorithm set to obtain target training corpora, wherein the target algorithm set is determined from multiple algorithm sets, and different algorithm sets are used for performing data enhancement on corpora with different granularity in the initial training corpora; and performing training on a language model based on the target training corpora to obtain a sequence labeling model, herein the language model is pre-trained based on text corpora.

Patent Agency Ranking