Publication No.: US12072861B2
Publication Date: 2024-08-27
Application No.: US17722077
Filing Date: 2022-04-15
Inventors: Todd Morrill, Eric Roma, Nicolas Kuzak, Neelam Sharma, Andrew Runge, Jayvardhan Rathi, Waqar Sarguroh, Wenting Zhao
IPC Classification: G06F16/22, G06F16/242, G06F40/205
CPC Classification: G06F16/2246, G06F16/221, G06F16/243, G06F40/205
Abstract: Described herein is a regulatory parser that downloads and efficiently processes regulatory documents. The regulatory documents may come from different sources and may have different formats. The regulatory parser parses all of the text in the regulatory documents and converts it into a single, predetermined format for downstream applications. The text is stored in a structured tree that is organized into one or more hierarchies, with nodes storing segments of text from a regulatory document. In some embodiments, each node in the regulatory tree may represent a segment of text. Partitioning the text of a regulatory document into segments may make storing and querying the regulatory documents more manageable. The organization and structure of the structured tree may reduce the time and resources needed to access and search for a regulatory citation. The structured tree may also allow a user to manipulate a regulatory document or its text.
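The structured tree described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class name `RegNode`, its methods, and the placeholder labels (`part-1`, `sec-2`, `a`) are all hypothetical, and the sketch assumes that a citation can be expressed as a list of labels read from the root downward. Each node stores one segment of text, so looking up a citation only walks one path through the hierarchy instead of scanning the whole document.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class RegNode:
    """Hypothetical tree node: one hierarchy label plus its segment of text."""
    label: str
    text: str = ""
    children: Dict[str, "RegNode"] = field(default_factory=dict)

    def insert(self, path: List[str], text: str) -> None:
        """Store a text segment at the given path, creating missing nodes."""
        node = self
        for part in path:
            node = node.children.setdefault(part, RegNode(label=part))
        node.text = text

    def find(self, path: List[str]) -> Optional["RegNode"]:
        """Follow a citation path from this node; return None if it is absent."""
        node: Optional["RegNode"] = self
        for part in path:
            node = node.children.get(part)
            if node is None:
                return None
        return node

    def full_text(self) -> str:
        """Reassemble this node's text and all descendant segments, in order."""
        parts = [self.text] if self.text else []
        for child in self.children.values():
            parts.append(child.full_text())
        return "\n".join(p for p in parts if p)


# Usage with placeholder labels and placeholder text (not real regulations).
root = RegNode(label="root")
root.insert(["part-1", "sec-2", "a"], "alpha")
root.insert(["part-1", "sec-2", "b"], "beta")

segment = root.find(["part-1", "sec-2", "a"])   # direct lookup by citation path
section = root.find(["part-1", "sec-2"])        # an interior node of the hierarchy
print(segment.text if segment else "not found")
print(section.full_text() if section else "not found")
```

A dictionary of children keyed by label keeps each lookup step constant time on average, which is one plausible way to get the reduced search cost the abstract mentions; an actual system might instead persist the nodes in a database.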