Data Sets

This page lists sample data sets in tabular form. Click the link in the Download Link column to access the corresponding data set.

Name | Description | File Format | Download Link
US Names and Addresses | This data set includes both name and address fields. It is suitable for running a simple geocode using the Universal Addressing and Enterprise Geocoding Modules. | TXT (Tab Delimited) | Download
Bank Account Information | This data set features variable fields such as Name, Account Number, Account Type, Party ID, Branch ID, Risk Description, Risk Code, and more. It can be used to assess account activities and for Anti-Money Laundering use cases. | TXT (Pipe Delimited) | Download
Sanction Screening Party Information | This data set features variable fields such as Name, Address, Annual Income, Passport Numbers, Pension Numbers, Risk Status, and more, which can be used to form a 360-degree customer view or to perform sanction screening and risk assessment. | TXT (Pipe Delimited) | Download
Poor Data Quality Sample | This is a mocked-up, intentionally poor-quality data set, built for learning how to standardize names and phone numbers, handle mis-fielded data, validate addresses, and perform matches. | TXT (Tab Delimited) | Download
Canada Addresses | This data set contains complete formatted addresses within Canada. Variable fields include street address, city, province, and postal code. | TXT (Comma Delimited) | Download
UK Addresses | This data set contains complete formatted addresses within the UK. Variable fields include thoroughfare (street address), post town, and post code. | TXT (Comma Delimited) | Download
Spain Addresses | This data set contains complete formatted addresses within Spain. Variable fields include street name, building number, post code, town name, and province. | TXT (Comma Delimited) | Download
Mexico Addresses | This data set contains complete formatted addresses within Mexico. Variable fields include street address, colonia (neighborhood), city, and codigo postal (postal code). | TXT (Comma Delimited) | Download
FCC Data Set | This is a mocked data set, built for the purpose of demonstrating how to standardize data and perform analysis for Financial Crimes and Compliance. | XLSX | Download
Data Quality Assessment | This is a mocked data set, built for the purpose of demonstrating the capabilities of Metadata Insights, specifically Data Discovery and Profiling. | CSV | Download
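
The text-based data sets above use three different delimiters (tab, pipe, and comma), so a quick way to inspect a download outside the product is to read it with the matching delimiter. The following is a minimal Python sketch, not part of the product. The helper name read_delimited and the sample field names are hypothetical, and the sketch assumes the first line of each file is a header row, which this page does not confirm.

```python
import csv
import io

# Delimiters for the text formats named in the File Format column.
DELIMITERS = {
    "tab": "\t",
    "pipe": "|",
    "comma": ",",
}


def read_delimited(stream, file_format):
    """Read a delimited text stream into a list of dicts.

    Assumes the first line is a header row (an assumption; check the
    downloaded file before relying on it).
    """
    reader = csv.DictReader(stream, delimiter=DELIMITERS[file_format])
    return list(reader)


# Hypothetical in-memory sample standing in for a pipe-delimited download.
sample = io.StringIO("Name|AccountType\nJane Doe|Savings\n")
rows = read_delimited(sample, "pipe")
print(rows[0]["Name"])
```

For a real download, pass a file object opened with newline="" (as the csv module recommends) in place of the in-memory sample, and choose "tab", "pipe", or "comma" according to the File Format column.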