CUSTODIAN DISAMBIGUATION AND DATA MATCHING
    1.
    发明申请
    CUSTODIAN DISAMBIGUATION AND DATA MATCHING 审中-公开
    CUSTODIAN DISAMBIGUATION和DATA MATCHING

    公开(公告)号:US20160314186A1

    公开(公告)日:2016-10-27

    申请号:US15068129

    申请日:2016-03-11

    IPC分类号: G06F17/30

    摘要: Provided is a technique for matching different user representations of a person in a plurality of computer systems may be provided. The technique includes collecting information sets about user representations from a plurality of computer systems; normalizing the information sets to a unified format; grouping the information sets in the unified format into indexing buckets based on a user name using a non-phonetic algorithm; determining a similarity score for each pair of information sets in each of the indexing buckets; classifying each information set pair into a set of classes based on the similarity scores, wherein the set of classes comprise at least matches and non-matches; and using a data structure for merging information of information set pairs classified as matches.

    摘要翻译: 提供了一种用于匹配多个计算机系统中的人的不同用户表示的技术。 该技术包括从多个计算机系统收集关于用户表示的信息集; 将信息集归一化为统一格式; 基于使用非语音算法的用户名将统一格式的信息集分组为索引桶; 确定每个索引桶中的每对信息集合的相似性得分; 基于所述相似度得分将每个信息集对分类成一组类,其中所述类的集合至少包括匹配和非匹配; 并且使用用于合并分类为匹配的信息集对的信息的数据结构。

    MAINTAINING A CUSTODIAN DIRECTORY BY ANALYZING DOCUMENTS
    2.
    发明申请
    MAINTAINING A CUSTODIAN DIRECTORY BY ANALYZING DOCUMENTS 有权
    通过分析文件维护一个CUSTODIAN DIRECTORY

    公开(公告)号:US20170024696A1

    公开(公告)日:2017-01-26

    申请号:US14805631

    申请日:2015-07-22

    IPC分类号: G06Q10/10

    摘要: A computer processor may extract identity information from a document. The identity information may include at least one custodian identity attribute. After extracting the identity information, the computer processor may determine that the identity information is associated with a specific custodian. The computer processor may then search for the custodian identity attribute in a custodian directory to determine whether the custodian directory contains an entry for the custodian. If the custodian is not in the custodian directory, the computer processor may create a new entry in the custodian directory for the custodian and store the extracted identity information in the new entry.

    摘要翻译: 计算机处理器可以从文档中提取身份信息。 身份信息可以包括至少一个保管人身份属性。 在提取身份信息之后,计算机处理器可以确定身份信息与特定保管人相关联。 然后,计算机处理器可以在保管人目录中搜索保管人身份属性,以确定保管人目录是否包含保管人的条目。 如果保管人不在保管人目录中,计算机处理器可以在保管人的保管人目录中创建一个新条目,并将提取的身份信息存储在新条目中。