Archive for April, 2009

Meet Zealand-New

Friday, April 17th, 2009

babies

In a recent post in data quality PRO the validation of foreign given names was discussed. After reading the excellent comment of my colleague Michael Grigat, I stumbled across an article in the “Bay of Plenty Times”, a local New Zealand newspaper.

The article described the rather patriotic name choice of a couple with regard to the newest addition to their family. They named their daughter “Zealand-New”.

Zealand-New’s mother Tomicina Davoren, and father Mana Te Moni, said they preferred a name that was different to any that featured in the country’s top baby names for 2008.  “I’ve never heard anybody being called that name. You hear New Zealand every day but not as someone’s name.” She said the top names for 2008 may be popular but were “not the greatest”.  “They are too plain,” she said.
Zealand-New is Ms Davoren’s and Mr Te Moni’s fourth child. Their other children are named Rlexuz Toara Chantz Te Moni, Mikaere Morgan Te Moni, and Korizma-Lake Vonnita Manaaki Te Moni.

This is what you might call a very substantial illustration of what Michael Grigat said in his post: “We have more than 20 years of experience with international given names and we know that validation of given names is quite a tricky thing.”  Indeed we do ……

Data Cleansing with intelligent identification

Thursday, April 16th, 2009

clean-data

In many cases an inductive method of data cleansing is the way to go. With the right tools and expertise you can inspect, transform  and cleanse entities in a database and reach high levels of data quality without the need to use external reference data. In some cases, however, only working with the internal data and inductively identifying and fixing data patterns is not sufficient. Let’s take a practical example: a bank needs to report on a particular segment of its clients to German bank supervisor BaFin – the Federal Financial Supervisory Authority aka Bundesanstalt für Finanzdienstleistungsaufsicht. The bank apparently has done its homework and has created a central database containing all entities needed for the compliance check. Moreover, the bank has worked out a rather complex set of rules how data must be processed and corrected. One of the most important anchor points in this specific framework is the separation between B2C and B2B entities and for the latter the exact identification of the correct legal form. But what if you cannot trust this identification? (more…)