Patent application
- Patent title: ASSIGNING INTO ONE SET OF CATEGORIES INFORMATION THAT HAS BEEN ASSIGNED TO OTHER SETS OF CATEGORIES
- Patent title (Chinese): 将其分配给一组已被分配给其他类别的信息
- Application number: US12980162
- Filing date: 2010-12-28
- Publication number: US20110137908A1
- Publication date: 2011-06-09
- Inventors: Byron Edward Dom, Hui Han, Ramnath Balasubramanyan, Dmitry Yurievich Pavlov
- Applicants: Byron Edward Dom, Hui Han, Ramnath Balasubramanyan, Dmitry Yurievich Pavlov
- Main classification: G06F17/30
- IPC classification: G06F17/30
Abstract:
Techniques are described for assigning, to target categories of a target scheme, items that have been obtained from a plurality of sources. In situations in which one or more of the sources have organized their information according to a source scheme that differs from the target scheme, the assignment may be based, in part, on an estimate of the probability that items from a particular source category should be assigned to a particular target category. Such probability estimates may be based on how many training set items associated with the particular source category have been assigned to the particular target category. Source categories may be grouped into clusters. The probability estimates may also be based on how many training set items, within the cluster to which the particular source category has been mapped, have been assigned to the particular target category.
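
The two count-based estimates mentioned in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the per-category counts and the per-cluster counts are combined, so the blending rule below (cluster proportions used as a prior, weighted by a hypothetical `pseudo_count`) is an assumption made purely for illustration. The function name `estimate_target_probs`, its parameters, and the sample data are likewise hypothetical.

```python
from collections import Counter, defaultdict


def estimate_target_probs(training, cluster_of, source_cat, pseudo_count=4.0):
    """Estimate P(target category | source category) from training counts.

    training:     iterable of (source_category, target_category) pairs
    cluster_of:   dict mapping each source category to its cluster id
    pseudo_count: assumed weight given to the cluster-level estimate
                  (not specified in the abstract)
    """
    cat_counts = defaultdict(Counter)      # per source category
    cluster_counts = defaultdict(Counter)  # per cluster of source categories
    for src, tgt in training:
        cat_counts[src][tgt] += 1
        cluster_counts[cluster_of[src]][tgt] += 1

    cat = cat_counts[source_cat]
    clu = cluster_counts[cluster_of[source_cat]]
    n_cat = sum(cat.values())
    n_clu = sum(clu.values())
    if n_clu == 0:
        return {}  # no training evidence for this cluster at all

    probs = {}
    for tgt in clu:
        cluster_p = clu[tgt] / n_clu
        # Category counts dominate when n_cat is large; the cluster
        # proportion fills in when the category has few or no items.
        probs[tgt] = (cat[tgt] + pseudo_count * cluster_p) / (n_cat + pseudo_count)
    return probs


# Hypothetical training set: source categories s1 and s2 share cluster "c";
# s3 belongs to the same cluster but has no training items of its own.
training = [("s1", "A")] * 3 + [("s1", "B")] + [("s2", "B")] * 4
cluster_of = {"s1": "c", "s2": "c", "s3": "c"}

p_s1 = estimate_target_probs(training, cluster_of, "s1")
p_s3 = estimate_target_probs(training, cluster_of, "s3")
```

In this sketch, a source category with no training items of its own (such as `s3`) falls back entirely on the proportions observed in its cluster, while a well-populated category is driven mostly by its own counts.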