How many people do you know that enjoy cleaning dishes after a party or family get together? Not many. To me, cleaning a customer database is very similar. It needs to be done, but not many people like doing it. At one company where I worked, the executive management team didn’t believe in cleaning customer data. They would just continue to buy lists without de-duping. They would throw tens of thousands of dollars away every time they mailed a flier, brochure or catalog. Keeping a database clean isn’t cheap, but well worth the investment. As I suggested in my article “Clean Customer and Product Data – Your Pot of Gold”, your customer data is one of the most valuable assets that you have and you must protect it.
Here’s a process that has worked well for me that helped keep our customer database clean. This is by no means the only database cleansing method, but it works well.
- Create a “Key” using zip code (3 or 5 digits) and primary address (sometimes all, sometimes the first 14 characters or so).
- The Key record will look something like “505011234MAINSTSW” while the zipcode field still reads “50501-1578” and the primary address record still reads “1234 Main Street S.W.”. (We could still use the address information when we generated the address labels, since we know that the list will get CASS and NCOA (National Change of Address) processing by the mailer if we wanted to get any sort of postage discount.)
- If the list hasn’t been CASS certified (Coding Accuracy Support System) yet, we would standardize the Key by doing a global search and replace on things like “Road”, “Street”, and “North”, and change them to standard postal abbreviations like “Rd”, “St”, and “N”. Then we would strip special characters from the Key: dashes, periods, commas, pound signs, and spaces.
- When you sort by the Key, then by contact name, a formula can be written in Excel to compare addresses, and use segments of the contact name and/or company name to identify duplicates.
- By doing visual checks of a few hundred records, you can usually tell if the formulas need to be tweaked and if additional processing of the records needs to be done.
- Concerning demographics and customer value, we use SIC (Standard Industrial Classification) and NAICS (North American Industry Classification System) to help target customers, and tallying sales activity of defined time periods help classify the accounts. Because we had the luxury of a SQL database, that information was stored within the database and updated as needed. We could identify the SIC or NAICS hot spots in the customer database for target marketing.
- Phone or e-mail contact with the customer helped keep the contact list up-to-date. Because the customer service reps would associate an order with a caller by leveraging a SQL database, we could use the data to help identify active contacts, (who placed orders, how often were orders placed, and the date of their last activity). This activity helped pinpoint when a contact went cold, and helped us identify who to ask for when calling to clean up the list.
After cleaning up the database and before we mailed an expensive piece like a 1,000 page catalog, we would do a smaller mailing to that same list to see what got returned. We would clean up the list using that information and then we would be ready for our mailing of a more expensive piece.
If you want to learn more about me, please visit my LinkedIn profile, my website and my blog.