System and method for performing a similarity measure of anonymized data
    1.
    发明申请
    System and method for performing a similarity measure of anonymized data 有权
    用于执行匿名数据的相似性度量的系统和方法

    公开(公告)号:US20070239705A1

    公开(公告)日:2007-10-11

    申请号:US11394271

    申请日:2006-03-29

    IPC分类号: G06F17/30

    摘要: A similarity measure system selects a first value and a first context related to the first value, divides the first value into a first set of substrings in an order preserving way, and processes each of these substrings through an obfuscation function to produce a first set of obfuscated substrings. The system selects a second value and a second context related to the second value, and processes the second value to produce a second set of obfuscated substrings. The system calculates a context similarity measure for the first context and the second context. The system determines a value similarity measure from the first and second set of order preserved obfuscated substrings. The system determines a closeness degree between the first value and the second value and a closeness degree based on the context similarity measure.

    摘要翻译: 相似性度量系统选择与第一值相关的第一值和第一上下文,以顺序保持方式将第一值分割为第一组子串,并通过混淆函数处理这些子串中的每一个,以产生第一组 混淆子串。 系统选择与第二值相关的第二值和第二上下文,并处理第二值以产生第二组混淆子串。 系统计算第一上下文和第二上下文的上下文相似性度量。 系统从第一组和第二组订单保存的模糊子串确定值相似性度量。 系统确定第一值和第二值之间的接近程度,以及基于上下文相似性度量的接近程度。