-
公开(公告)号:US20100023511A1
公开(公告)日:2010-01-28
申请号:US12572757
申请日:2009-10-02
IPC分类号: G06F17/30
CPC分类号: G06F16/334
摘要: A method for correlating data from a data source representing a single data file to a data target containing a plurality of data files is provided. The method includes normalizing the data from the data source, such as by removing white space and replacing data strings. One or more data strings are selected for use as preliminary selection criteria. The preliminary selection criteria are then used to search for one or more matches in the normalized data from the data source. If no match is found, one or more data strings are selected for use as secondary selection criteria. A correlation score is calculated if at least one match is found using the preliminary selection criteria.
摘要翻译: 提供了一种用于将表示单个数据文件的数据源的数据与包含多个数据文件的数据目标相关联的方法。 该方法包括从数据源标准化数据,例如通过删除空格和替换数据字符串。 选择一个或多个数据串用作初步选择标准。 然后使用初步选择标准来搜索来自数据源的归一化数据中的一个或多个匹配。 如果没有找到匹配,则选择一个或多个数据字符串作为二级选择标准。 如果使用初步选择标准找到至少一个匹配,则计算相关得分。