-
公开(公告)号:US10078667B2
公开(公告)日:2018-09-18
申请号:US14967314
申请日:2015-12-13
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Chang Sheng Li , Fan Jing Meng , Edith Helen Stern , Han Wang , Jing Min Xu , Lin Yang , Xuejun Zhuo
CPC classification number: G06F16/2455 , G06F17/3053 , G06F17/30554 , H04L41/0806
Abstract: Embodiments include method, computer program products and apparatuses for normalizing non-numeric features of files and corresponding apparatus. Aspects include segmenting at least one pair of positive instances of a non-numeric feature of a file into a number of tokens and -comparing the tokens in the at least one pair of positive instances to obtain matching tokens. Aspects also include calculating weights of their matching the file, for the matching tokens, and storing the tokens and their weights in a token base.
-
公开(公告)号:US10078666B2
公开(公告)日:2018-09-18
申请号:US14933382
申请日:2015-11-05
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Chang Sheng Li , Fan Jing Meng , Edith Helen Stern , Han Wang , Jing Min Xu , Lin Yang , Xuejun Zhuo
CPC classification number: G06F16/2455 , G06F17/3053 , G06F17/30554 , H04L41/0806
Abstract: Embodiments include method, computer program products and apparatuses for normalizing non-numeric features of files and corresponding apparatus. Aspects include segmenting at least one pair of positive instances of a non-numeric feature of a file into a number of tokens and comparing the tokens in the at least one pair of positive instances to obtain matching tokens. Aspects also include calculating weights of their matching the file, for the matching tokens, and storing the tokens and their weights in a token base.
-