Data profiling is the statistical analysis of a data set and is the first step in any successful data initiative. Frequently an organisation will not know the contents of data tables and columns, data profiling allows detailed information to be produced to inform the organisation of the data and its quality; whether it is fir for purpose.
Data profiling can also identify statistical outliers, nulls and defaulting of values, these values may not be accurate or the most up to data in the organisation and thus will identify areas that need more analysis, and possible remediation.
- Format Distributions
- Format Analysis
- Length Analysis
- Data Type Analysis
- Uniqueness Analysis
Compiled by experienced data quality analysts, the bespoke report will be tailored toward your requirements but will typically comprise, a table report, column report and an exceptions report.
Approximately 4 pages long, this will give you an overview of all the data in the table, including:
Total number of rows
How many of the rows have a value in them
Minimum and Maximum values etc.
The majority of the data in this overall report is included in the Column Reports (described below) but having it all in one large table has the advantage that you can scan through everything as soon as you receive your report and get a quick overview. It also enables you to easily compare columns with one another.
This will provide you with a detailed profile of your data, column by column. Data found within the Column report include;
- Data Type
- Frequency Distribution of Values
- Null Counts
- Unique Count
- Minimum & Maximum Values for the Column
- Lengths of Values
- Speace (separated from Null Counts)
Our trained data quality analyst’s will also drilldown on any unusual data, and will highlight any concerns for you in red in the ‘Exceptions Report’ .