PROBLEM QUESTION: What is the probability that two different people, each from a different source system (database), will be wrongly matched as the same person in my target system (database)?
NOTE: The goal is to match the same person existing in different source systems (databases) in the target system (database).
MATCH CRITERIA:
> Match on Social Security Number, First Name, Last Name
+ SSN is not null
+ SSN is valid (using SSA rules)
+ First Name and Last Name are not case sensitive
+ First Name and Last Name have all spaces, dashes, etc. removed to create a string, which is then used for matching.
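To make the criteria above concrete, here is a minimal sketch of how the match key might be built. The function names are hypothetical. It assumes "SSA rules" means the commonly cited structural checks (area not 000, 666 or 900-999; group not 00; serial not 0000), that "spaces, dashes, etc." means every non-alphanumeric character, and that first and last name are normalized separately. The real systems may differ on all three points.

```python
import re
from typing import Optional, Tuple


def normalize_name(name: str) -> str:
    """Lower-case the name and drop everything that is not a letter or digit."""
    return "".join(ch for ch in name.casefold() if ch.isalnum())


def is_valid_ssn(ssn: Optional[str]) -> bool:
    """Structural SSN check (assumed reading of the SSA rules)."""
    if ssn is None:
        return False
    digits = re.sub(r"[\s-]", "", ssn)  # tolerate 123-45-6789 and 123 45 6789
    if not re.fullmatch(r"[0-9]{9}", digits):
        return False
    area, group, serial = digits[:3], digits[3:5], digits[5:]
    if area in ("000", "666") or area.startswith("9"):
        return False
    return group != "00" and serial != "0000"


def match_key(ssn: Optional[str], first: str, last: str) -> Optional[Tuple[str, str, str]]:
    """Return the key two records must share to match, or None if the SSN is unusable."""
    if not is_valid_ssn(ssn):
        return None
    digits = re.sub(r"[\s-]", "", ssn)
    return (digits, normalize_name(first), normalize_name(last))
```

Under these assumptions, `match_key("123-45-6789", "Mary-Ann", "O'Neil")` and `match_key("123456789", "MARY ANN", "ONEIL")` produce the same key, while a record with a null or structurally invalid SSN produces no key and is never matched.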
SOURCE SYSTEMS:
> System A: 3 million customers.
+ Duplicate SSNs allowed. Current query reveals 4 duplicates.
> System B: 1.5 million customers.
+ No duplicate SSNs allowed.
> System C: 180,000 customers.
+ Duplicate SSNs allowed. Current query reveals no duplicates.
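As a rough way to frame the question with the system sizes above, the sketch below counts how many cross-system record pairs exist and turns a per-pair false-match probability into an upper bound. The names `cross_system_pairs` and `false_match_bound` are hypothetical, and `p_pair` (the chance that two different people share a valid SSN and the same normalized name) is an input that has to be estimated separately. Nothing here supplies a value for it. Counting every pair as two different people overstates the number of candidates, so the result is only a bound.

```python
from itertools import combinations

# Customer counts taken from the list of source systems above.
SYSTEM_SIZES = {"A": 3_000_000, "B": 1_500_000, "C": 180_000}


def cross_system_pairs(sizes: dict) -> int:
    """Number of record pairs whose two members come from different systems."""
    return sum(sizes[x] * sizes[y] for x, y in combinations(sizes, 2))


def false_match_bound(sizes: dict, p_pair: float) -> float:
    """Union bound on P(at least one false match), given a per-pair probability.

    p_pair is an assumed input, not something derived here.
    """
    return min(1.0, cross_system_pairs(sizes) * p_pair)


if __name__ == "__main__":
    print(cross_system_pairs(SYSTEM_SIZES))  # A*B + A*C + B*C
```

With these sizes there are 5.31 trillion cross-system pairs, so even a very small per-pair probability can add up. The bound is only as good as the estimate of `p_pair`.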
TARGET SYSTEM (database):
> Stores all matched records from every source system in which the client appears.
Asked by jabar in Mathematics · 2007-01-03 09:11:00 · 1 answer