Abstract:
A method and system are disclosed for formatting address strings into a recognizable sequence of token types for database processing. The system includes a token rule processor and a token sequence processor. The method first assigns token types to the components of an address string, forming a sequence of token types. It then determines whether that sequence is contained in an adjustable, predetermined rule table. If the sequence is contained in the rule table, it is processed into a recognizable sequence format. If the sequence is not contained in the rule table, it is processed into a recognizable sequence format in accordance with a predetermined interpretation procedure.
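The flow described in this abstract (assign token types, look the sequence up in a rule table, fall back to an interpretation procedure) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the token types (`NUM`, `DIR`, `WORD`, `SUFFIX`), the contents of `RULE_TABLE`, the output field names, and the fallback logic in `fallback_interpretation` are hypothetical and are not taken from the disclosure.

```python
import re

# Hypothetical token types; the abstract does not enumerate them.
NUM, DIR, WORD, SUFFIX = "NUM", "DIR", "WORD", "SUFFIX"
DIRECTIONS = {"N", "S", "E", "W"}
SUFFIXES = {"ST", "AVE", "RD", "BLVD"}

# Adjustable rule table (assumed contents): token-type sequence -> field names.
RULE_TABLE = {
    (NUM, WORD, SUFFIX): ("house_number", "street_name", "street_suffix"),
    (NUM, DIR, WORD, SUFFIX): (
        "house_number", "pre_direction", "street_name", "street_suffix",
    ),
}


def assign_token_types(address):
    """Split an address string into components and tag each with a token type."""
    typed = []
    for tok in re.findall(r"[A-Za-z0-9]+", address.upper()):
        if tok.isdigit():
            typed.append((tok, NUM))
        elif tok in DIRECTIONS:
            typed.append((tok, DIR))
        elif tok in SUFFIXES:
            typed.append((tok, SUFFIX))
        else:
            typed.append((tok, WORD))
    return typed


def fallback_interpretation(typed):
    """Assumed default procedure: first number is the house number,
    all remaining components are joined into a single street field."""
    fields, rest = {}, []
    for tok, ttype in typed:
        if ttype == NUM and "house_number" not in fields:
            fields["house_number"] = tok
        else:
            rest.append(tok)
    fields["street"] = " ".join(rest)
    return fields


def format_address(address, rule_table=RULE_TABLE):
    """Use the rule table when the sequence is known, else the fallback."""
    typed = assign_token_types(address)
    sequence = tuple(ttype for _, ttype in typed)
    field_names = rule_table.get(sequence)
    if field_names is not None:
        return {name: tok for name, (tok, _) in zip(field_names, typed)}
    return fallback_interpretation(typed)
```

Because the rule table in this sketch is an ordinary dictionary passed as a parameter, a caller could add or remove sequences without touching the lookup logic, which is one plausible reading of "adjustable".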
Abstract:
A method and system are disclosed for matching textual strings that represent customer names and addresses. The textual strings are first transformed by a plurality of predefined filters. The two transformed textual strings are then compared using a plurality of predefined comparators. A scoring procedure determines a score from that comparison. Based on the score and a matching procedure, it is determined whether the textual strings match.
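The filter, comparator, scoring, and matching stages can be illustrated with a minimal Python sketch. The specific filters, the three comparators, their weights, and the `threshold` value are all assumptions chosen for illustration; the abstract names none of them. The weighted sum stands in for the unspecified scoring procedure, and the threshold test stands in for the unspecified matching procedure.

```python
import re
from difflib import SequenceMatcher


# --- Filters (assumed): each maps a string to a normalized string. ---
def to_upper(s):
    return s.upper()


def strip_punctuation(s):
    return re.sub(r"[^\w\s]", "", s)


def collapse_whitespace(s):
    return " ".join(s.split())


FILTERS = [to_upper, strip_punctuation, collapse_whitespace]


# --- Comparators (assumed): each returns a similarity in [0, 1]. ---
def exact_comparator(a, b):
    return 1.0 if a == b else 0.0


def similarity_comparator(a, b):
    return SequenceMatcher(None, a, b).ratio()


def token_overlap_comparator(a, b):
    ta, tb = set(a.split()), set(b.split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


# Hypothetical weights for the scoring procedure; they sum to 1.
COMPARATORS = [
    (exact_comparator, 0.2),
    (similarity_comparator, 0.5),
    (token_overlap_comparator, 0.3),
]


def apply_filters(s, filters=FILTERS):
    """Run the string through every filter in order."""
    for f in filters:
        s = f(s)
    return s


def score(a, b):
    """Weighted sum of comparator results on the filtered strings."""
    fa, fb = apply_filters(a), apply_filters(b)
    return sum(weight * cmp(fa, fb) for cmp, weight in COMPARATORS)


def is_match(a, b, threshold=0.8):
    """Assumed matching procedure: declare a match at or above the threshold."""
    return score(a, b) >= threshold
```

For example, `is_match("John Smith", "john  smith.")` is true in this sketch because the two strings become identical once the filters have run, while `is_match("John Smith", "Mary Jones")` is false.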