- 专利标题: Hierarchical data classification using frequency analysis
-
申请号: US14716554申请日: 2015-05-19
-
公开(公告)号: US10262061B2公开(公告)日: 2019-04-16
- 发明人: Gerhard Brugger , John Eric Baum , Filippo Ferdinando Paolo Beghelli , Charles Wilson
- 申请人: Oracle International Corporation
- 申请人地址: US CA Redwood Shores
- 专利权人: ORACLE INTERNATIONAL CORPORATION
- 当前专利权人: ORACLE INTERNATIONAL CORPORATION
- 当前专利权人地址: US CA Redwood Shores
- 代理机构: Kraguljac Law Group, LLC
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
A method of classifying individual documents in a document collection according to a hierarchy may include selecting an object from the hierarchy, generating one or more variants for the object, and for each of the one or more variants, determining a frequency threshold based at least in part on how frequently the one or more variants occurs in the document collection. The method may also include selecting a first document in the document collection, where the first document includes one or more objects that match at least one of the one or more variants. The method may additionally include determining that the number of the one or more objects exceeds the frequency threshold and classifying the first document with the object in the hierarchy.
公开/授权文献
信息查询