Invention application
- Patent title: Method and system for determining the focus of a document
- Patent title (Chinese): 确定文档焦点的方法和系统
- Application number: US11165527
- Filing date: 2005-06-23
- Publication number: US20060004752A1
- Publication date: 2006-01-05
- Inventors: Nadav Harel, Einat Amitay, Ron Sivan
- Applicants: Nadav Harel, Einat Amitay, Ron Sivan
- Assignee: International Business Machines Corporation
- Current assignee: International Business Machines Corporation
- Priority: GB0414623.9, 2004-06-30
- Main classification: G06F7/00
- IPC classification: G06F7/00
Abstract:
A method and system for determining the focus of a document are provided. Candidate topics, in the form of topic nodes in a hierarchy of topics, are input into a focus-determining algorithm. For each candidate topic node, a score is allocated to the topic at each level of the hierarchy of the topic node, the scores for each topic are summed, and one or more topics are determined to be the focus of the document based on the summed scores. The scores allocated are progressively lower for the topic at each successive parent level of the hierarchy. The candidate topics may be provided by identifying occurrences of references to a topic in a document, providing a plurality of possible topics in the form of topic nodes in a hierarchy of topics, and, for each identified occurrence of a reference to a topic, determining the appropriate topic node and adding the topic node to the candidate topics.
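The scoring step described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the representation of a topic node as a root-to-node tuple, the function names `score_topics` and `determine_focus`, the base score of 1.0, the halving `decay` factor, and the place names in the usage example are all assumptions chosen for illustration. The abstract only states that parent levels receive progressively lower scores and that the scores are summed per topic.

```python
from collections import defaultdict


def score_topics(candidates, base=1.0, decay=0.5):
    """Sum scores per topic over all candidate topic nodes.

    Each candidate is a tuple giving the path from the root of the
    hierarchy down to the topic node (an assumed representation).
    The node itself receives `base`; each parent level receives the
    previous level's score multiplied by `decay` (assumed values).
    """
    totals = defaultdict(float)
    for path in candidates:
        score = base
        # Walk from the candidate node up to the root of the hierarchy.
        for depth in range(len(path), 0, -1):
            totals[path[:depth]] += score
            score *= decay  # progressively lower score for each parent
    return dict(totals)


def determine_focus(candidates, top_n=1, base=1.0, decay=0.5):
    """Return the `top_n` highest-scoring topics with their summed scores."""
    totals = score_topics(candidates, base=base, decay=decay)
    # Sort by descending score; break ties by path for a stable result.
    ranked = sorted(totals.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_n]


# Hypothetical candidate topic nodes found in one document.
candidates = [
    ("Europe", "France", "Paris"),
    ("Europe", "France", "Lyon"),
    ("Europe", "France", "Marseille"),
    ("Europe", "Germany", "Berlin"),
]
focus = determine_focus(candidates)
```

With these inputs each city scores 1.0, `("Europe", "France")` accumulates 0.5 from each of its three cities for a total of 1.5, and `("Europe",)` accumulates 0.25 from each of the four candidates for a total of 1.0, so the sketch selects France as the focus. The choice of decay factor controls how readily a shared parent outranks its individual children.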