- 专利标题: AUTOMATED INFORMATION EXTRACTION AND ENRICHMENT IN PATHOLOGY REPORT USING NATURAL LANGUAGE PROCESSING
-
申请号: US17639441申请日: 2020-09-08
-
公开(公告)号: US20220301670A1公开(公告)日: 2022-09-22
- 发明人: Vishakha SHARMA , Yogesh PANDIT , Ram BALASUBRAMANIAN
- 申请人: Roche Molecular Systems, Inc.
- 申请人地址: US CA Pleasanton
- 专利权人: Roche Molecular Systems, Inc.
- 当前专利权人: Roche Molecular Systems, Inc.
- 当前专利权人地址: US CA Pleasanton
- 国际申请: PCT/US2020/049738 WO 20200908
- 主分类号: G16H15/00
- IPC分类号: G16H15/00 ; G16H30/20 ; G06F40/20 ; G16H10/60 ; G06V30/30
摘要:
In one example, a method being performed by a computer system comprises: receiving an image file containing a pathology report; performing an image recognition operation on the image file to extract input text strings; detecting, using a natural language processing (NLP) model, entities from the input text strings, each entity including a label and a value; extracting, using the NLP model, the values of the entities from the input text strings; converting, based on a mapping table that maps entities and values to pre-determined terminologies, the values of at least some of the entities to the corresponding pre-determined terminologies; and generating a post-processed pathology report including the entities detected from the input text strings and the corresponding pre-determined terminologies.
信息查询