Diagnosis Precedes Cure

Blog Administrator | Analyzing Data Quality | , , , , , ,

By Elliot King

Face it. Almost all data has problems. Most organizations have multiple data sources and too often depend on potentially unreliable data flows from customers, data entry clerks, third-party providers and different processing systems for it to be otherwise.

So what can you do to identify data quality problems before bad data interferes with critical business processes? Actually, that is something of a trick question.… Read More

Record Linkage and Fuzzy Matching Part 2

Blog Administrator | Uncategorized | , , , ,

 

This blog series will address overall the steps necessary for efficient data/record processing that include a record linkage or fuzzy matching step.  In part 1, we covered the overall approach.

 

Today, we will cover the following steps:

 

1. Categorize

2. Split records

 

They are defined in academia as creating a “Blocking Index.” (We will cover cleansing next; I am jumping ahead, because I like to start with the end in mind, and the end in this case is the fastest possible matching process.)Read More