Communication is a Key to Data Quality

By Elliot King

Too often, data quality is seen as a strictly technical issue. Data quality problems must be identified, assessed and then rectified, and that process is best managed by experts using the right tools. But communication may be the most important element in a data quality program. A data quality program can only succeed if all the stakeholders…

Address Quality – Take 2

By David Loshin

We have dealt with some of our core address quality concepts, but not this one: The intended recipient must be associated with the deliverable address. The problem here is no longer address quality but rather address correctness. The address may be complete, all the elements may be valid, the ZIP+4 is the right one, and all values…

Characterizing the Quality of Address Data

By David Loshin

My company is currently working on a couple of projects associated with address quality and location master data. We are reviewing a lot of the existing documentation that has been collected from a number of different operational systems, as well as reviewing the business processes to see where location data is either created, modified, or read. And…

Standardizing Your Approach to Monitoring the Quality of Data

By David Loshin

In my last post, I suggested three techniques for maturing your organizational approach to data quality management. The first recommendation was defining processes for evaluating errors when they are identified. These types of processes actually involve a few key techniques: 1) An approach to specifying data validity rules that can be used to determine whether a data…

Approximate Matching

By David Loshin

Actually, my first name is not David - that is really my middle name, but it is the given name my parents used when talking to me. This has actually led to a lot of confusion over the years, especially when confronted with a form asking for my "first name" and my "last name." For official forms…

The Challenge of Identifying Information

By David Loshin

In my last post, I introduced the question of determining which characteristics are used to uniquely differentiate between any pair of records within a data set. The same question is relevant when attempting to match a pair of records as well, once they are determined to represent the same entity. I like to call these "identifying attributes,"…

Distributed Data and Distributed Information

By David Loshin

You might not realize how broad your electronic footprint really is. Do you have any idea how many data sets contain information about any specific individual? These days, any interaction you have with any organization is likely to be documented electronically. And, for those curious enough to read the fine print of the "privacy" policies, you might…
