Channel: Questions in topic: "ssis"

Fuzzy Grouping SSIS - How to Run Either/Or Logic Against Same Data Set

I have a data set (approximately 4 million records) with the following fields:

- Id
- First Name
- Last Name
- Street
- City
- State
- Zip
- Email

The total confidence score needs to be greater than or equal to 75%, and the Last Name field must have a minimum confidence of 80%. The following rules also need to be applied to the match process:

- If First Name has a length less than or equal to 7 characters, then the minimum confidence must be 90%; if First Name has a length greater than 7 characters, then the minimum confidence can be 80%.
- Street and City must each have a minimum confidence of 90% OR Email must be an exact match (100%).

I have seen numerous SSIS examples where a Conditional Split was used to break up a data set by State, for example, but the separate Fuzzy Grouping tasks then all used the same overall match logic. Breaking up by State is straightforward, but the two rules above amount to a more complicated either/or logic (Street and City greater than or equal to 90% OR Email equals 100%).

What would be the best way in Fuzzy Grouping to process this kind of either/or logic against the same data set? Any advice would be greatly appreciated!
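
To make the intended logic unambiguous, here is a minimal Python sketch of the rules as a single yes/no test on one candidate pair. It is not SSIS code and does not show how Fuzzy Grouping should be configured. It assumes a similarity score between 0 and 1 is already available for the overall match and for each field, and that the First Name length is taken from the incoming record. The names `Scores` and `is_match` are hypothetical, made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Scores:
    """Similarity scores in the range 0.0-1.0 (hypothetical container)."""
    overall: float
    first_name: float
    last_name: float
    street: float
    city: float
    email: float


def is_match(first_name: str, s: Scores) -> bool:
    """Return True if a candidate pair satisfies all of the stated rules."""
    # Assumption: the length test uses the incoming record's First Name.
    # Short first names (<= 7 characters) need 90%, longer ones 80%.
    first_min = 0.90 if len(first_name.strip()) <= 7 else 0.80

    # Either/or rule: Street AND City at 90% or better, OR Email exact.
    address_ok = s.street >= 0.90 and s.city >= 0.90
    email_ok = s.email >= 1.0

    return (
        s.overall >= 0.75
        and s.last_name >= 0.80
        and s.first_name >= first_min
        and (address_ok or email_ok)
    )
```

For example, a pair with an exact Email match passes even when Street and City score poorly, provided the overall, Last Name, and First Name thresholds are also met.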
