By David Loshin
Hierarchical classification schemes are great for scanning through unstructured text for identifying critical pieces of information that can be mapped to an organized analytical profile. To enable this scanning capability, you will need two pieces of technology.
The first involves a text analysis methodology for scanning text and determining which character strings and phrases are meaningful and which ones are largely noise.… Read More
In this case study, the Northern Ontario School of Medicine (NOSM) was able to cleanse and incorporate their data from about 30 distinct source systems, by integrating Melissa Data’s Total Data Quality Integration Toolkit (TDQ-IT). TDQ-IT works within the SSIS data flow to deliver a wide range of data integration, transformation, and cleansing functionality including: profiling, parsing, cleansing, matching, and monitoring.… Read More