发明授权
- 专利标题: Classification rule generation device, classification rule generation method, classification rule generation program, and recording medium
- 专利标题(中): 分类规则生成装置,分类规则生成方法,分类规则生成程序和记录介质
-
申请号: US13996040申请日: 2011-01-13
-
公开(公告)号: US09323839B2公开(公告)日: 2016-04-26
- 发明人: Hideya Shibata , Mamoru Kato , Mitsunori Kori
- 申请人: Hideya Shibata , Mamoru Kato , Mitsunori Kori
- 申请人地址: JP Tokyo
- 专利权人: Mitsubishi Electric Corporation
- 当前专利权人: Mitsubishi Electric Corporation
- 当前专利权人地址: JP Tokyo
- 代理机构: Oblon, McClelland, Maier & Neustadt, L.L.P.
- 国际申请: PCT/JP2011/050384 WO 20110113
- 国际公布: WO2012/095971 WO 20120719
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
In a document classification device 100, a sample document extraction condition storage unit 160 stores sample document extraction conditions 160-1 set for each of classification categories for extracting partial text according to the classification categories from an input document 301 input by a document input unit 110. A document matching unit 120 matches the input document 301 against the sample document extraction conditions 160-1. Based on a result of matching by the document matching unit 120, a document extraction unit 130 extracts the partial text from the input document 301 according to the classification categories. A learning unit 140 performs predetermined machine learning using as a sample document the partial text extracted by the document extraction unit 120, and thereby generates classification rules 150-1.
公开/授权文献
信息查询