- 专利标题: Summarizing collections of documents
-
申请号: US15206703申请日: 2016-07-11
-
公开(公告)号: US09767165B1公开(公告)日: 2017-09-19
- 发明人: Ruggero Altair Tacchi , Wojciech Musial
- 申请人: Quid, Inc.
- 申请人地址: US CA San Francisco
- 专利权人: Quid, Inc.
- 当前专利权人: Quid, Inc.
- 当前专利权人地址: US CA San Francisco
- 代理机构: Pillsbury Winthrop Shaw Pittman LLP
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
Provided is a process including: obtaining a corpus having a plurality of document collections, each of which is associated with features; for a given document collection, computing a pertinence score for each feature; ranking the features based on the features' pertinence scores; selecting a first set of features based on a first coverage score thereof and a threshold; re-ranking the first set of features based on the features' relevance to the document collection; and selecting a second set of features from the first set of features based on a second coverage score thereof and the threshold, the second set of features being used for summarizing the document collection.
信息查询