-
公开(公告)号:US20230132618A1
公开(公告)日:2023-05-04
申请号:US18147823
申请日:2022-12-29
Inventor: Wei XU , Xiaoling XIA , Junxiang JIANG , Chengtai CAO , Bolei HE , Kunbin CHEN , Wei HE
IPC: G06F16/23 , G06F18/241
Abstract: A method for denoising click data includes: acquiring a set of click data including pieces of first click data and a real label corresponding to each piece of first click data; extracting feature vectors of each piece of first click data with a graph model; dividing the feature vectors into sets of feature vectors; obtaining trained binary classification models by training binary classification models with the sets of feature vectors; for each of the feature vectors, obtaining prediction values corresponding to the feature vector by predicting the feature vector with the trained binary classification models, and calculating a prediction label of the feature vector based on the prediction values of the feature vector; and removing noise data in the pieces of first click data, based on the pieces of first click data, the real label and the prediction label of each piece of first click data.
-
公开(公告)号:US20220406034A1
公开(公告)日:2022-12-22
申请号:US17822898
申请日:2022-08-29
Inventor: Jingru GAN , Haiwei WANG , Jinchang LUO , Kunbin CHEN , Wei HE , Shuhui WANG
IPC: G06V10/74 , G06F40/295 , G06V10/80
Abstract: A method for extracting information, includes: obtaining an information stream comprising text and an image; generating, according to the text, embedded representations of textual entity mentions and a textual similarity matrix of the textual entity mentions and candidate textual entities; generating, according to the image, embedded representations of image entity mentions and an image similarity matrix of the image entity mentions and candidate image entities; and determining, based on an optimal transport, target textual entities of the textual entity mentions and target image entities of the image entity mentions according to the embedded representations of the textual entity mentions, the embedded representations of the image entity mentions, the textual similarity matrix and the image similarity matrix.
-
公开(公告)号:US20250013876A1
公开(公告)日:2025-01-09
申请号:US18889928
申请日:2024-09-19
Inventor: Xianwei XUE , Qiutong PAN , Jinchang LUO , Bolei HE , Wei HE
IPC: G06N3/0985 , G06F40/30 , G06F40/40 , G06N3/0475
Abstract: An apparatus for training a large language model includes: at least one sample text instruction is input into a target large language model to obtain at least one standard response text, and the at least one sample text instruction is input into a large language model to be trained to obtain at least one predicted response text. A first sample response text is determined from the at least one standard response text according to the score difference between a first quality score of a standard response text and a second quality score of a predicted response text. A first target training sample is generated according to the first sample response text and a sample text instruction corresponding to the first sample response text, and a training dataset is constructed according to the first target training sample.
-
4.
公开(公告)号:US20210397980A1
公开(公告)日:2021-12-23
申请号:US17036160
申请日:2020-09-29
IPC: G06N5/02 , G06N5/04 , G06F40/279 , G06K9/62
Abstract: The present disclosure provides an information recommendation method, which relates to a field of knowledge graph. The method includes: acquiring request information; extracting a request entity word representing an entity from the request information; determining recommendation information based on the request entity word and a pre-constructed knowledge graph; and pushing the recommendation information, wherein the knowledge graph is constructed based on a text, and the knowledge graph indicates a first word representing a source of the text. The present disclosure further provides an information recommendation apparatus, an electronic device and a computer-readable storage medium.
-
公开(公告)号:US20250013676A1
公开(公告)日:2025-01-09
申请号:US18889497
申请日:2024-09-19
Inventor: Jinchang LUO , Bolei HE , Kunbin CHEN , Wei HE
IPC: G06F16/332 , G06F16/33
Abstract: A computer-implemented method for information processing based on a large language model is provided. The method includes obtaining query information provided by a user. The method further includes determining memory information related to the query information. The method further includes determining, based on the query information and the memory information, a tool for processing the query information. The method further includes invoking the tool to obtain auxiliary information. The method further includes generating, based on the query information and the auxiliary information, a result of processing the query information.
-
6.
公开(公告)号:US20230085599A1
公开(公告)日:2023-03-16
申请号:US18057560
申请日:2022-11-21
Inventor: Jinchang LUO , Haiwei WANG , Junzhao BU , Kunbin CHEN , Wei HE
IPC: G06N3/04
Abstract: The disclosure provides a method for training a tag recommendation model. The method includes: collecting training materials that comprise interest tags in response to receiving an instruction for collecting training materials; obtaining training semantic vectors that comprise the interest tags by representing features of the training materials using a semantic enhanced representation frame; obtaining training encoding vectors by aggregating social networks into the training semantic vectors; and obtaining a tag recommendation model by training a double-layer neural network structure using the training encoding vectors as inputs and the interest tags as outputs. Therefore, the interest tags obtained in the disclosure are more accurate.
-
公开(公告)号:US20220121668A1
公开(公告)日:2022-04-21
申请号:US17564374
申请日:2021-12-29
Inventor: Wei XU , Xiaoling XIA , Bolei HE , Kunbin CHEN , Zhun LIU , Wei HE
IPC: G06F16/2455 , G06F16/14 , G06F16/335
Abstract: The present disclosure provides a method of recommending a document, an electronic device, and a storage medium, relating to fields of intelligent recommendation, deep learning etc. The method of recommending a document includes: acquiring a document operated by a user, as a reference document; determining, from a plurality of initial documents, at least one candidate document for the reference document, wherein a document content of each candidate document is associated with a document content of the reference document, based on preset knowledge system data; and recommending a target document in the at least one candidate document to the user, the target document including a document that the user is currently interested in and a document that the user is interested in after a preset time period.
-
-
-
-
-
-