Publication number: US20230142217A1
Publication date: 2023-05-11
Application number: US17896690
Filing date: 2022-08-26
Inventor: Huihui HE, Leyi WANG, Duohao QIN, Minghao LIU
IPC: G06F40/47, G06F40/166, G06F40/30, G06F40/295, G06F40/151
CPC classification number: G06F40/47, G06F40/166, G06F40/30, G06F40/295, G06F40/151
Abstract: The present disclosure provides a model training method and apparatus, an electronic device, and a storage medium, and relates to the field of artificial intelligence, in particular to natural language processing and deep learning. A specific implementation includes: constructing initial training corpora; performing data enhancement on the initial training corpora based on an algorithm contained in a target algorithm set to obtain target training corpora, wherein the target algorithm set is determined from multiple algorithm sets, and different algorithm sets are used for performing data enhancement on corpora of different granularities in the initial training corpora; and training a language model on the target training corpora to obtain a sequence labeling model, wherein the language model is pre-trained on text corpora.
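
Illustrative sketch (not taken from the patent text): the data enhancement step described in the abstract could look roughly like the Python below. The granularity names ("token", "sentence"), the two augmentation functions, the SYNONYMS table, and the augment helper are all hypothetical and chosen only for illustration; the abstract does not name specific algorithms. The final step, fine-tuning a pre-trained language model on the resulting corpora, is omitted.

```python
import random
from typing import Callable, Dict, List, Tuple

# One sentence as (token, label) pairs, as used in sequence labeling.
Example = List[Tuple[str, str]]
Augmenter = Callable[[Example, random.Random], Example]

# Hypothetical synonym table; an assumption made for this sketch only.
SYNONYMS: Dict[str, List[str]] = {"big": ["large"], "city": ["town"]}


def replace_synonym(example: Example, rng: random.Random) -> Example:
    """Token granularity: swap unlabeled ("O") tokens for a synonym."""
    out = []
    for token, label in example:
        if label == "O" and token in SYNONYMS:
            token = rng.choice(SYNONYMS[token])
        out.append((token, label))
    return out


def swap_clauses(example: Example, rng: random.Random) -> Example:
    """Sentence granularity: swap the two clauses around the first comma."""
    for i, (token, _) in enumerate(example):
        if token == ",":
            return example[i + 1:] + [example[i]] + example[:i]
    return list(example)  # no comma found: return an unchanged copy


# Multiple algorithm sets, one per granularity (hypothetical grouping).
ALGORITHM_SETS: Dict[str, List[Augmenter]] = {
    "token": [replace_synonym],
    "sentence": [swap_clauses],
}


def augment(corpora: List[Example], granularity: str, seed: int = 0) -> List[Example]:
    """Return the initial corpora plus every changed variant produced by
    the algorithm set chosen for the given granularity."""
    rng = random.Random(seed)
    target_set = ALGORITHM_SETS[granularity]
    target_corpora = list(corpora)
    for example in corpora:
        for algorithm in target_set:
            candidate = algorithm(example, rng)
            if candidate != example:  # keep only variants that differ
                target_corpora.append(candidate)
    return target_corpora


initial = [[("Paris", "B-LOC"), ("is", "O"), ("a", "O"), ("big", "O"), ("city", "O")]]
token_level = augment(initial, "token")
```

In this sketch the labels travel with the tokens, so an augmented sentence remains a valid sequence labeling example; whether the patented method preserves labels in this way is an assumption.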