- 专利标题: Phrase based unstructured content parsing
-
申请号: US17179614申请日: 2021-02-19
-
公开(公告)号: US12061627B2公开(公告)日: 2024-08-13
- 发明人: Craig M. Trim , Mary Rudden , Anthony Stevens , Martin G. Keen
- 申请人: International Business Machines Corporation
- 申请人地址: US NY Armonk
- 专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人地址: US NY Armonk
- 代理机构: Garg Law Firm, PLLC
- 代理商 Rakesh Garg; Michael O'Keefe
- 主分类号: G06F16/28
- IPC分类号: G06F16/28 ; G06F16/901 ; G06F40/205 ; G06F40/44 ; G06N5/04
摘要:
From an unstructured content using an ontology, a forward materialization graph is generated. The forward materialization graph is converted to a set of vector representations comprising multidimensional numbers representing elements of the forward materialization graph. A set of inference paths is computed for the set of vector representations. An inference path in the set of inference paths connecting a first vector representation with a second vector representation. Based on a set of features, the set of vector representations is formed into clusters, a feature in the set of features comprising a relevance probability, the relevance probability corresponding to a relevance of a portion of the unstructured content according to a relevance metric. A structured representation of the unstructured content is placed at an edge location of a content delivery network determined using the set of clusters.
公开/授权文献
- US20220269698A1 PHRASE BASED UNSTRUCTURED CONTENT PARSING 公开/授权日:2022-08-25
信息查询