-
公开(公告)号:US12300012B2
公开(公告)日:2025-05-13
申请号:US17984034
申请日:2022-11-09
Inventor: Shangwen Lyu , Hongyu Li , Jing Liu , Hua Wu , Haifeng Wang
IPC: G06V30/19 , G06F40/205 , G06V30/194 , G06V30/412
Abstract: A method for training a document reading comprehension model includes: acquiring a question sample and a rich-text document sample, in which the rich-text document sample includes a real answer of the question sample; acquiring text information and layout information of the rich-text document sample by performing OCR processing on image information of the rich-text document sample; acquiring a predicted answer of the question sample by inputting the text information, the layout information and the image information of the rich-text document sample into a preset reading comprehension model; and training the reading comprehension model based on the real answer and the predicted answer. The method may enhance comprehension ability of the reading comprehension model to the long rich-text document, and save labor cost.
-
公开(公告)号:US12299789B2
公开(公告)日:2025-05-13
申请号:US18087639
申请日:2022-12-22
IPC: G06T11/60
Abstract: A method for fusing map data, includes: obtaining two-dimensional map data and three-dimensional map data to be fused; classifying the three-dimensional map data according to map data types, the map data types including line data, surface data, and traffic body data; performing elevation conversion for each type of three-dimensional map data according to an elevation conversion algorithm preset for each type of three-dimensional map data, to obtain a relative elevation of each type of three-dimensional map data; and fusing the two-dimensional map data and three-dimensional map data to be fused based on the relative elevation of each type of three-dimensional map data, to obtain fused data.
-
公开(公告)号:US12299407B2
公开(公告)日:2025-05-13
申请号:US17896690
申请日:2022-08-26
Inventor: Huihui He , Leyi Wang , Duohao Qin , Minghao Liu
IPC: G06F40/47 , G06F40/151 , G06F40/166 , G06F40/295 , G06F40/30
Abstract: The present disclosure provides a model training method and apparatus, an electronic device, and a storage medium, and relates to the field of artificial intelligence, in particular, to the field of natural language processing and deep learning. A specific implementation solution includes: constructing initial training corpora; performing data enhancement on the initial training corpora based on an algorithm contained in a target algorithm set to obtain target training corpora, wherein the target algorithm set is determined from multiple algorithm sets, and different algorithm sets are used for performing data enhancement on corpora with different granularity in the initial training corpora; and performing training on a language model based on the target training corpora to obtain a sequence labeling model, herein the language model is pre-trained based on text corpora.
-
公开(公告)号:US12298839B2
公开(公告)日:2025-05-13
申请号:US18041035
申请日:2022-06-07
Inventor: Shuaijian Wang , Shiyong Li , Henghua Zhang , Panpan Li , Zaibin Hu , Baotong Luo
Abstract: A method for controlling a distributed operation system includes: for a first container carrying a first process, determining a current fault type of a failure in the first container in response to detecting that the first process is triggered to terminate based on the failure in the first container; and reconstructing the first container and restarting the first process based on the first container reconstructed in response to determining that the current fault type is consistent with a target fault type. The target fault type is a fault type suitable for reconstruction of each container in the distributed operation system to which the first container belongs.
-
公开(公告)号:US20250139327A1
公开(公告)日:2025-05-01
申请号:US18895722
申请日:2024-09-25
Inventor: Liang Shen , Jinle Zeng , Hongxiang Hao , Weibao Gong , Dianhai Yu , Haifeng Wang
IPC: G06F30/20
Abstract: A method for processing a model operator includes: determining an operator set for model networking, wherein the operator set comprises a plurality of operators; determining a storage amount occupied by an output tensor of each operator in the operator set and a computation time period consumed in a forward computation of each operator in the operator set; and determining a first operator participating in recomputation in a model from the operator set, based on the storage amounts and the computation time periods of the plurality of operators.
-
公开(公告)号:US12283124B2
公开(公告)日:2025-04-22
申请号:US17995283
申请日:2022-03-22
Inventor: Heng Wang , Zhenlei Tian , Tianbao Yu
Abstract: A method of training a text quality assessment model, a method of determining text quality, an electronic device, and a storage medium are provided. The method of training the text quality assessment model includes: determining a first text satisfying a condition of being a negative sample and a second text satisfying a condition of being a positive sample from a plurality of texts based on indicators for the texts; for any text of the first text and the second text, adding a label to the text based on the condition satisfied by the text, wherein the label indicates a category of the text, and the category includes a low-quality category for the negative sample and a non-low-quality category for the positive sample; and constituting a training set by the first text having a label and the second text having a label, to train the text quality assessment model.
-
公开(公告)号:US12282746B2
公开(公告)日:2025-04-22
申请号:US17820765
申请日:2022-08-18
Inventor: Haifeng Wang , Zhanyi Liu , Zhongjun He , Hua Wu , Zhi Li , Xing Wan , Jingxuan Zhao , Ruiqing Zhang , Chuanqiang Zhang , Fengtao Huang , Hanbing Song , Wei Di , Shuangshuang Cui , Yongzheng Xin
IPC: G06F40/58
Abstract: A display method, an electronic device, and a storage medium, which relate to a field of natural language processing and a field of display. The display method includes: acquiring a content to be displayed; extracting a target term from the content using a term extraction rule; acquiring an annotation information for at least one target term, responsive to an extraction of the at least one target term; and displaying the annotation information for the at least one target term and the content.
-
公开(公告)号:US20250123812A1
公开(公告)日:2025-04-17
申请号:US18988135
申请日:2024-12-19
Inventor: Yuehao Zhao , Wenjie Li , Chuqing Wang , Yunpeng Peng , Sai Gao , Hui Li , Junwei Xing , Wanpeng Niu , Bingfei Zhang
Abstract: The disclosure provides a code completion method based on a big model. The method includes: determining a first code element where a position to be completed is located in a first code file to be completed; determining a second code file having a dependency relationship with the first code file from a development project to which the first code file belongs; determining, according to the first code element, a second code element whose correlation with the first code element meets a preset condition, in which the second code element belongs to at least one of the first code file or the second code file; and generating a target code corresponding to the position to be completed through a big model based on a signature of the second code element.
-
公开(公告)号:US12277397B2
公开(公告)日:2025-04-15
申请号:US17564369
申请日:2021-12-29
Inventor: Chao Ma , Jingshuai Zhang , Qifan Huang , Kaichun Yao , Peng Wang , Hengshu Zhu
Abstract: A method of training a model, a method of determining a word vector, a device, a medium, and a product are provided, which may be applied to fields of natural language processing, information processing, etc. The method includes: acquiring a first word vector set corresponding to a first word set; and generating a reduced-dimensional word vector for each word vector in the first word vector set based on a word embedding model, generating, for other word vector in the first word vector set, a first probability distribution in the first word vector set based on the reduced-dimensional word vector, and adjusting a parameter of the word embedding model so as to minimize a difference between the first probability distribution and a second probability distribution for the other word vector determined by a number of word vector in the first word vector set.
-
公开(公告)号:US20250117601A1
公开(公告)日:2025-04-10
申请号:US18987693
申请日:2024-12-19
Inventor: Huilin Li , Yi Li , Lunan Zhao , Jie Liu , Li Ma , Tao Li , Tianzhong Hu , Yu Chen , Yongjuan Che , Aowei Li , Hanmeng Liu , Shouke Qin
IPC: G06F40/40 , G06F40/284 , G06F40/30
Abstract: A computer-implemented method for information processing includes: obtaining text information, in which the text information includes first text information of a resource to be commented on and second text information of a candidate prompt; selecting a target prompt from the candidate prompts based on the text information; and generating comment information of the resource to be commented on, based on the resource to be commented on and the target prompt.
-
-
-
-
-
-
-
-
-