-
公开(公告)号:US20250013876A1
公开(公告)日:2025-01-09
申请号:US18889928
申请日:2024-09-19
Inventor: Xianwei XUE , Qiutong PAN , Jinchang LUO , Bolei HE , Wei HE
IPC: G06N3/0985 , G06F40/30 , G06F40/40 , G06N3/0475
Abstract: An apparatus for training a large language model includes: at least one sample text instruction is input into a target large language model to obtain at least one standard response text, and the at least one sample text instruction is input into a large language model to be trained to obtain at least one predicted response text. A first sample response text is determined from the at least one standard response text according to the score difference between a first quality score of a standard response text and a second quality score of a predicted response text. A first target training sample is generated according to the first sample response text and a sample text instruction corresponding to the first sample response text, and a training dataset is constructed according to the first target training sample.