-
公开(公告)号:US08612447B2
公开(公告)日:2013-12-17
申请号:US13293190
申请日:2011-11-10
IPC分类号: G06F17/30
CPC分类号: G06F17/3053 , G06F17/30864
摘要: Document cluster ranking systems and methods of ranking document clusters are described. In some example embodiments, the method comprises: obtaining, at a document cluster ranking system, a value associated with a first feature for each of a plurality of document clusters; based on the values associated with the first feature, automatically generating, at the document cluster ranking system, a plurality of first feature bins, each first feature bin defining a range of values and a bin identifier; and obtaining a score for one of the document clusters, by: i) identifying the first feature bin having a range of values which includes the obtained value associated with the first feature for that one of the document clusters; and ii) determining a score for that document cluster based on the first feature bin identifier for the identified first feature bin.
摘要翻译: 描述文档集群排名系统和排序文档集群的方法。 在一些示例性实施例中,该方法包括:在文档集群排名系统处获取与多个文档簇中的每一个的第一特征相关联的值; 基于与所述第一特征相关联的值,在所述文档簇排序系统处自动生成多个第一特征区块,每个第一特征区段定义值范围和区块标识符; 以及通过以下方式获得所述文档簇中的一个的分数:i)识别具有包括与所述文档簇中的所述一个的所述第一特征相关联的所获得的值的值范围的所述第一特征块; 以及ii)基于所识别的第一特征仓的第一特征箱标识符来确定该文档簇的得分。