- 专利标题: Method and apparatus for identifying semantically related records
-
申请号: US14954664申请日: 2015-11-30
-
公开(公告)号: US11227002B2公开(公告)日: 2022-01-18
- 发明人: Oktie Hassanzadeh , Anastasios Kementsietsidis
- 申请人: International Business Machines Corporation
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理机构: McGinn I.P. Law Group, PLLC
- 代理商 Peter Edwards, Esq.
- 主分类号: G06F16/35
- IPC分类号: G06F16/35 ; G06F16/215
摘要:
An apparatus and method of identifying semantically related records, including receiving input data from an input device, splitting the input data into a plurality of clusters according to semantic relationship, each of the clusters including a plurality of source terms and a plurality of target terms, transforming each of the plurality of clusters based on the transformation which includes tokenization of the plurality of clusters, for each of the plurality of clusters that are transformed, finding relatedness scores of a plurality of semantic relatedness measures with the plurality of target terms, building a vector of similarity scores for each of the plurality of target terms, and for each of the plurality of source terms, selecting a predetermined number of the plurality of target terms according to the similarity scores.
公开/授权文献
信息查询