发明授权
- 专利标题: Detection of attributes in unstructured data
- 专利标题(中): 检测非结构化数据中的属性
-
申请号: US11809167申请日: 2007-05-31
-
公开(公告)号: US07711736B2公开(公告)日: 2010-05-04
- 发明人: Boris I. Levin
- 申请人: Boris I. Levin
- 申请人地址: NL Amsterdam
- 专利权人: Microsoft International Holdings B.V.
- 当前专利权人: Microsoft International Holdings B.V.
- 当前专利权人地址: NL Amsterdam
- 代理机构: Wolf, Greenfield & Sacks, P.C.
- 主分类号: G06F7/00
- IPC分类号: G06F7/00 ; G06F17/30
摘要:
A method for processing information includes receiving a set of records, which include a plurality of fields containing data regarding respective items, and selecting a field that occurs in all of the records and contains multiple terms in each of the records. At least first and second terms that occur among the terms in the selected field in the records are identified, such that the records are partitioned into at least first and second respective subsets by occurrences of the at least first and second terms in the selected field. Responsively to partitioning of the records by the occurrences, it is determined that the at least first and second terms correspond to at least first and second different values of an attribute of the items. The data are classified according to the values of the attribute.
公开/授权文献
- US20070299855A1 Detection of attributes in unstructured data 公开/授权日:2007-12-27