METHOD FOR INDUSTRY TEXT INCREMENT AND ELECTRONIC DEVICE

    公开(公告)号:US20220027766A1

    公开(公告)日:2022-01-27

    申请号:US17493365

    申请日:2021-10-04

    Abstract: A method for an industry text increment, as well as an electronic device and a computer readable storage medium for the same are provided. The method may include: acquiring an original industry text in a target industry field, an order of magnitude of a number of the original industry text being smaller than a preset first order of magnitude; and performing a sample incremental processing on the original industry text by using a distant supervision method, to obtain increased industry texts, an order of magnitude of a number of the increased industry texts is greater than a preset second order of magnitude, wherein the preset second order of magnitude is not smaller than the preset first order of magnitude.

    METHOD AND APPARATUS FOR VERIFYING MEDICAL FACT

    公开(公告)号:US20210217504A1

    公开(公告)日:2021-07-15

    申请号:US17023998

    申请日:2020-09-17

    Abstract: The present disclosure relates to the field of medical data processing based on natural language processing. Embodiments of the present disclosure disclose a method and apparatus for verifying a medical fact. The method may include: acquiring a description text of the medical fact; selecting a relevant paragraph related to the description text of the medical fact from a medical document; and inputting the description text of the medical fact and the corresponding relevant paragraph into a trained discrimination model for authenticity judgment, to obtain a verification result of the medical fact, the discrimination model being pre-trained based on a medical text paragraph pair extracted from the medical document, and being iteratively adjusted using a medical fact sample set including authenticity labeling information after the pre-training.

    METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM FOR EXTRACTING SPO TRIPLES

    公开(公告)号:US20210216819A1

    公开(公告)日:2021-07-15

    申请号:US17149267

    申请日:2021-01-14

    Abstract: A method and an apparatus for extracting SPO triples, an electronic device, and a storage medium are related to the field of artificial intelligence technologies. The solution may include: inputting annotated training data into each of multiple extraction models; predicting SPO triples satisfying defined relations in the annotated training data through each of multiple extraction models; combining the predicted SPO triples corresponding to each of multiple extraction models; extracting SPO triples satisfying screening conditions from the combined SPO triples; mining SPO triples with missing annotations from the annotated training data based on the SPO triples satisfying screening conditions, in response to that the SPO triples satisfying screening conditions do not satisfy output conditions; supplementing the SPO triples with missing annotations into the annotated training data; repeating the inputting, predicting, combining, extracting, mining and supplementing until the SPO triples satisfying screening conditions satisfy the output conditions.

    METHOD AND APPARATUS FOR PROCESSING INFORMATION

    公开(公告)号:US20210216725A1

    公开(公告)日:2021-07-15

    申请号:US17147881

    申请日:2021-01-13

    Abstract: A method and an apparatus for processing information are provided. The method can include: acquiring a word sequence obtained by performing word segmentation on two paragraphs in a text; inputting the word sequence into a to-be-trained natural language processing model to generate a word vector corresponding to a word in the word sequence; inputting the word vector into a preset processing layer of the to-be-trained natural language processing model; predicting whether the two paragraphs are adjacent, and a replaced word in the two paragraphs; and acquiring reference information of the two paragraphs, and training the to-be-trained natural language processing model to obtain a trained natural language processing model, based on the prediction result and the reference information.

Patent Agency Ranking