- 专利标题: Systems and methods for structure and header extraction
-
申请号: US17156546申请日: 2021-01-23
-
公开(公告)号: US11763079B2公开(公告)日: 2023-09-19
- 发明人: Richard Anthony Pito
- 申请人: Thomson Reuters Enterprise Centre GmbH
- 申请人地址: CH Zug
- 专利权人: THOMSON REUTERS ENTERPRISE CENTRE GMBH
- 当前专利权人: THOMSON REUTERS ENTERPRISE CENTRE GMBH
- 当前专利权人地址: CH Zug
- 代理机构: Alston & Bird LLP
- 主分类号: G06F40/279
- IPC分类号: G06F40/279 ; G06F40/242 ; G06F3/0481 ; G06F40/166 ; G06F40/289 ; G06F40/258 ; G06F40/284 ; G06F40/109 ; G06F40/137 ; G06F40/232 ; G06V30/416
摘要:
The present disclosure is directed towards systems and methods for extracting structure and headers from a body of text. This computational extraction is based on the visual and logical similarities between portions of text. Structure is derived from a programmatic and methodic computation of similarities between header pairs.
公开/授权文献
- US20210319177A1 SYSTEMS AND METHODS FOR STRUCTURE AND HEADER EXTRACTION 公开/授权日:2021-10-14
信息查询