Data Quality checks on the user data

Hi, I have a requirement to run a data quality check on data entered by users. The check should make sure the data contains no leading or trailing spaces and no special characters, only numbers and letters. The volume of data ranges from a few thousand to 100 million, so the check needs to perform well. Any ideas would be much appreciated.

I have an idea for checking the leading/trailing spaces alone, and it performed well:

select sum(char_length(name)) - sum(char_length(trim(both from name))) from X

If the query above returns more than 0, I consider the data to contain some spaces and fail the process.

Similarly, does anyone have any data quality ideas for special characters, spaces, etc.?

Re: Data Quality checks on the user data

If it is for a small, one-off situation, you can develop scripts. Teradata has Miner's Profiler and DQRM; if you have them, you can check those. Many advanced features are also available in Ataccama's solutions for DQ, including MDM.
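
For the script route, here is a minimal sketch of a single-pass check, assuming your Teradata release provides the REGEXP_SIMILAR function and that name is a VARCHAR column in table X (both names taken from the question). The alias bad_rows is made up for illustration. Because the pattern allows only letters and digits, it should also catch leading and trailing spaces, so one scan could cover both checks. Treat it as a starting point to test on your own data, not a tuned solution.

```sql
-- Count rows whose name contains anything other than letters and digits.
-- Assumption: REGEXP_SIMILAR is available; it returns 1 when the whole
-- string matches the pattern and 0 when it does not.
-- Assumption: name is VARCHAR; a fixed-width CHAR column is space-padded,
-- so the padding would have to be trimmed before this test.
SELECT COUNT(*) AS bad_rows
FROM X
WHERE REGEXP_SIMILAR(name, '^[A-Za-z0-9]+$', 'c') = 0;
```

If bad_rows is greater than 0, fail the process, the same way as with the space check. Note that rows where name is NULL are not counted, because the function returns NULL for them; add an explicit "OR name IS NULL" condition if those should fail too. Empty strings do not match the pattern, so they are counted as bad.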