GDPR compliance for unstructured data
2018-06-20T13:40:29+00:00

Find sensitive information in files

GDPR Compliance tool for unstructured data

Here you will find sets of auto-tagging rules and tags that you can use to data-mine information: Confidential scans your drives and matches the content of your files against patterns (regular expressions). Use this as a GDPR compliance tool for your unstructured data (files and documents). The auto-tagging rules can be used to tag files on local drives, shared drives, or cloud folders (Dropbox, OneDrive, or similar). Read more about this feature in our blog post.
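
As a rough illustration of the idea (this is not Confidential's implementation), the sketch below matches file contents against a small set of regular expressions and reports which tags apply. The rule names, the simplified patterns, and the functions `tags_for_text` and `scan_folder` are hypothetical.

```python
import re
from pathlib import Path

# Hypothetical rule set: tag name -> compiled pattern.
# These are simplified patterns, not the rules shipped in the downloadable files.
RULES = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "us-ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

def tags_for_text(text, rules=RULES):
    """Return the sorted tag names whose pattern matches somewhere in text."""
    return sorted(tag for tag, pattern in rules.items() if pattern.search(text))

def scan_folder(root, rules=RULES):
    """Walk root recursively and map each matching file path to its tags."""
    results = {}
    for path in Path(root).rglob("*"):
        if not path.is_file():
            continue
        try:
            text = path.read_text(encoding="utf-8", errors="ignore")
        except OSError:
            continue  # unreadable file: skip it
        tags = tags_for_text(text, rules)
        if tags:
            results[str(path)] = tags
    return results
```

Calling `scan_folder` on a folder path would return a dictionary of file paths and the tags that matched. This sketch only reads files as plain text; binary formats such as PDF or DOCX would need a text-extraction step first.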


The files listed here can be imported into your Confidential database from the menu File > Confidential databases > Import data from XML zipped. There is no undo, so make sure you know what you are importing! 🙂

You can of course create your own rules and edit or delete the existing ones. If you import the file twice, the rules and tags will not be duplicated (unless you have renamed them).
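
The no-duplicates behaviour can be pictured as a merge keyed by rule name. This is a minimal sketch under that assumption only; `import_rules` and the dictionary layout are hypothetical and say nothing about how Confidential actually stores its rules.

```python
def import_rules(database, imported):
    """Add imported rules to database, skipping names that already exist.

    Returns the list of rule names that were actually added.
    """
    added = []
    for name, pattern in imported.items():
        if name not in database:
            database[name] = pattern
            added.append(name)
    return added

db = {}
rules_file = {"Danish CPR": r"\d{6}-?\d{4}"}  # illustrative rule, not the shipped one
first = import_rules(db, rules_file)   # adds "Danish CPR"
second = import_rules(db, rules_file)  # adds nothing: the name already exists
```

Because the merge is keyed by name, renaming a rule and importing again would add the original rule a second time under its old name, which matches the caveat above.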

Import auto-tagging rules for GDPR compliance


To start scraping a drive, right-click the drive in Confidential, open the Folder menu, and then click Apply auto-tagging rules.

Scrape a drive for GDPR Compliance

Feedback/Requests/Suggestions: in the forum topic.


Sensitive Info #1

(Updated 16.05.18) This is a first attempt at a list of auto-tagging rules for scraping and auto-tagging files. Most of the regexes were found on RegExLib. The list contains rules to match:

  • Names of all Danish cities with more than 2,500 inhabitants (around 1,000 cities) [FIXED: this rule previously matched any character]
  • Credit card numbers from the major credit card companies (Visa, Mastercard, Amex…)
  • Danish CPR numbers (see Wikipedia)
  • Email addresses (RFC 5322 Official Standard)
  • Italian Codice Fiscale (see Wikipedia)
  • US Social Security numbers (this regex gives a lot of false positives, but since the pattern is so simple, that is to be expected)
GDPR search unstructured data
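
To give a feel for what such rules look like, here is a small sketch with simplified patterns for three of the items above. These are illustrative only and are not the RegExLib expressions in the downloadable file. The Luhn checksum and the exclusion of never-issued Social Security ranges (area 000, 666, or 900-999; group 00; serial 0000) are additions in this sketch that show one way to cut false positives; `find_candidates` is a hypothetical helper.

```python
import re

# 13 to 19 digits, optionally separated by single spaces or dashes.
CARD_RE = re.compile(r"\b\d(?:[ -]?\d){12,18}\b")
# Danish CPR shape: DDMMYY, optional dash, four digits.
CPR_RE = re.compile(r"\b(?:0[1-9]|[12]\d|3[01])(?:0[1-9]|1[0-2])\d{2}-?\d{4}\b")
# US SSN shape AAA-GG-SSSS, excluding ranges that are never issued.
SSN_RE = re.compile(r"\b(?!000|666|9\d\d)\d{3}-(?!00)\d{2}-(?!0000)\d{4}\b")

def luhn_valid(number):
    """Return True if the digits in number pass the Luhn checksum."""
    digits = [int(c) for c in number if c.isdigit()]
    total = 0
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:       # double every second digit from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return len(digits) >= 13 and total % 10 == 0

def find_candidates(text):
    """Return candidate matches per tag; card numbers must also pass Luhn."""
    return {
        "credit-card": [m.group() for m in CARD_RE.finditer(text)
                        if luhn_valid(m.group())],
        "danish-cpr": CPR_RE.findall(text),
        "us-ssn": SSN_RE.findall(text),
    }
```

A pattern match is only a candidate: a ten-digit order number can look exactly like a CPR number, so results still need human review.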

UK Passport, National Insurance Number and Postcodes

(Updated 20.06.18) At a user's request, we put together a database of rules for UK documents:

  • UK Passport based on this regex
  • UK National Insurance number based on this regex [Updated: less permissive regex, fewer false positives]
  • UK Postcodes
Data mine sensitive data UK GDPR
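
As an illustration of what a "less permissive" rule can look like, the sketch below uses a commonly cited National Insurance number pattern (two prefix letters with the unused letters and prefixes excluded, six digits, suffix A to D) and a simplified postcode shape. These are assumptions for illustration, not the regexes linked above, and `find_uk_identifiers` is a hypothetical helper.

```python
import re

# National Insurance number: D, F, I, Q, U, V are not used in the prefix,
# O is not used as the second letter, and a few prefixes are never allocated.
NINO_RE = re.compile(
    r"\b(?!BG|GB|NK|KN|TN|NT|ZZ)"
    r"[A-CEGHJ-PR-TW-Z][A-CEGHJ-NPR-TW-Z]"
    r"\s?\d{2}\s?\d{2}\s?\d{2}\s?[A-D]\b"
)
# Simplified postcode shape (upper case only): outward code, optional space, inward code.
POSTCODE_RE = re.compile(r"\b[A-Z]{1,2}\d[A-Z\d]?\s?\d[A-Z]{2}\b")

def find_uk_identifiers(text):
    """Return candidate NI numbers and postcodes found in text."""
    return {
        "uk-nino": NINO_RE.findall(text),
        "uk-postcode": POSTCODE_RE.findall(text),
    }
```

The postcode pattern checks only the general shape, so it will also accept strings that are not real postcodes.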

German Zip Code, tax code and more

Various German codes, including:

  • German Zip Code (Postleitzahl – PLZ), matches the numeric part
  • German personal tax number (Steuernummer)
  • German health insurance number (Versicherungsnummer)
  • German army ID number (Personenkennziffer)
  • German VAT Number
Data mine sensitive information German
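
Two of these are simple enough to sketch. The patterns below are simplified assumptions, not the rules in the downloadable file: a PLZ is taken to be five digits that do not start with 00, and a German VAT number is taken to be the prefix DE followed by nine digits. `find_german_codes` is a hypothetical helper.

```python
import re

# Five-digit postal code; codes starting with 00 are excluded.
PLZ_RE = re.compile(r"\b(?:0[1-9]|[1-9]\d)\d{3}\b")
# VAT identification number: "DE" plus nine digits, optional space after the prefix.
VAT_RE = re.compile(r"\bDE\s?\d{9}\b")

def find_german_codes(text):
    """Return candidate postal codes and VAT numbers found in text."""
    return {
        "de-plz": PLZ_RE.findall(text),
        "de-vat": VAT_RE.findall(text),
    }
```

Any standalone five-digit number matches the PLZ pattern, so this rule is expected to produce many false positives on its own.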

JMBG (former Yugoslavia) and OIB (Croatia)

Various personal identification codes, including:

JMBG and OIB regex
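
Both codes are plain digit strings (13 digits for JMBG, 11 for OIB), so a regex alone matches many unrelated numbers. The sketch below adds the check-digit tests as they are commonly described: a weighted modulo-11 sum for JMBG and ISO 7064 MOD 11,10 for OIB. Treat the details as assumptions; some implementations handle the JMBG edge case where the intermediate value is 10 differently. The function names are hypothetical, and this is not taken from the downloadable rule file.

```python
import re

def jmbg_valid(value):
    """Check the 13th digit of a JMBG against the weighted modulo-11 sum."""
    if not re.fullmatch(r"\d{13}", value):
        return False
    d = [int(c) for c in value]
    weights = [7, 6, 5, 4, 3, 2]
    # Digits 1-6 and 7-12 share the same weights.
    total = sum(w * (d[i] + d[i + 6]) for i, w in enumerate(weights))
    m = 11 - total % 11
    check = m if m <= 9 else 0  # assumption: 10 and 11 both map to 0
    return check == d[12]

def oib_valid(value):
    """Check the 11th digit of an OIB using ISO 7064 MOD 11,10."""
    if not re.fullmatch(r"\d{11}", value):
        return False
    a = 10
    for c in value[:10]:
        a = (a + int(c)) % 10
        if a == 0:
            a = 10
        a = (a * 2) % 11
    check = (11 - a) % 10  # a result of 10 becomes 0
    return check == int(value[10])
```

Running a checksum after the regex match is a cheap way to discard most random 11-digit or 13-digit numbers before tagging a file.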
