New Data Masking filters to help you identify personally identifiable information.

 

 Linkedin, 13-12-2011, Huw Price (Grid-tools)

New In Datamaker Release 2.7.6, there is a new set of sampling filters to help you identify PII data. Within Datamaker there is a data sampling facility that scan your existing data. A set of specific data analysis routines that parse the columns and look for different types of common PII data.

One of the problems we have found is that there are so many columns it is impossible to carefully check them all. Using these filters we are looking for patterns in the data that could indicate that it is data that should be private.

The list of pattern matching algorithms is increasing all the time and of course you can build your own as well.
Unique columns – picks up columns that are found to be unique according to the data sample
GUIDcolumns – columns that have GUID values, any combination of braces/no braces; dashes/no dashes.
Empty columns – columns that have no non-null values but whose tables do have rows in them
Numeric character columns – columns of a char data type which have only numeric values in them.
Formatted numeric character columns – columns of a char data type which have formatted numeric data, for example 11-22-33, useful for picking up phone numbers