-
公开(公告)号:US20250013876A1
公开(公告)日:2025-01-09
申请号:US18889928
申请日:2024-09-19
Inventor: Xianwei XUE , Qiutong PAN , Jinchang LUO , Bolei HE , Wei HE
IPC: G06N3/0985 , G06F40/30 , G06F40/40 , G06N3/0475
Abstract: An apparatus for training a large language model includes: at least one sample text instruction is input into a target large language model to obtain at least one standard response text, and the at least one sample text instruction is input into a large language model to be trained to obtain at least one predicted response text. A first sample response text is determined from the at least one standard response text according to the score difference between a first quality score of a standard response text and a second quality score of a predicted response text. A first target training sample is generated according to the first sample response text and a sample text instruction corresponding to the first sample response text, and a training dataset is constructed according to the first target training sample.
-
公开(公告)号:US20230132618A1
公开(公告)日:2023-05-04
申请号:US18147823
申请日:2022-12-29
Inventor: Wei XU , Xiaoling XIA , Junxiang JIANG , Chengtai CAO , Bolei HE , Kunbin CHEN , Wei HE
IPC: G06F16/23 , G06F18/241
Abstract: A method for denoising click data includes: acquiring a set of click data including pieces of first click data and a real label corresponding to each piece of first click data; extracting feature vectors of each piece of first click data with a graph model; dividing the feature vectors into sets of feature vectors; obtaining trained binary classification models by training binary classification models with the sets of feature vectors; for each of the feature vectors, obtaining prediction values corresponding to the feature vector by predicting the feature vector with the trained binary classification models, and calculating a prediction label of the feature vector based on the prediction values of the feature vector; and removing noise data in the pieces of first click data, based on the pieces of first click data, the real label and the prediction label of each piece of first click data.
-
公开(公告)号:US20250013676A1
公开(公告)日:2025-01-09
申请号:US18889497
申请日:2024-09-19
Inventor: Jinchang LUO , Bolei HE , Kunbin CHEN , Wei HE
IPC: G06F16/332 , G06F16/33
Abstract: A computer-implemented method for information processing based on a large language model is provided. The method includes obtaining query information provided by a user. The method further includes determining memory information related to the query information. The method further includes determining, based on the query information and the memory information, a tool for processing the query information. The method further includes invoking the tool to obtain auxiliary information. The method further includes generating, based on the query information and the auxiliary information, a result of processing the query information.
-
公开(公告)号:US20220121668A1
公开(公告)日:2022-04-21
申请号:US17564374
申请日:2021-12-29
Inventor: Wei XU , Xiaoling XIA , Bolei HE , Kunbin CHEN , Zhun LIU , Wei HE
IPC: G06F16/2455 , G06F16/14 , G06F16/335
Abstract: The present disclosure provides a method of recommending a document, an electronic device, and a storage medium, relating to fields of intelligent recommendation, deep learning etc. The method of recommending a document includes: acquiring a document operated by a user, as a reference document; determining, from a plurality of initial documents, at least one candidate document for the reference document, wherein a document content of each candidate document is associated with a document content of the reference document, based on preset knowledge system data; and recommending a target document in the at least one candidate document to the user, the target document including a document that the user is currently interested in and a document that the user is interested in after a preset time period.
-
-
-