AUTOMATED INFORMATION EXTRACTION AND ENRICHMENT IN PATHOLOGY REPORT USING NATURAL LANGUAGE PROCESSING

    公开(公告)号:US20220301670A1

    公开(公告)日:2022-09-22

    申请号:US17639441

    申请日:2020-09-08

    摘要: In one example, a method being performed by a computer system comprises: receiving an image file containing a pathology report; performing an image recognition operation on the image file to extract input text strings; detecting, using a natural language processing (NLP) model, entities from the input text strings, each entity including a label and a value; extracting, using the NLP model, the values of the entities from the input text strings; converting, based on a mapping table that maps entities and values to pre-determined terminologies, the values of at least some of the entities to the corresponding pre-determined terminologies; and generating a post-processed pathology report including the entities detected from the input text strings and the corresponding pre-determined terminologies.