Skip to main content
Skip table of contents

Analyze

Once all the connection data has been configured and profiles have been defined, we can move on to the main functionality of Analyze; the Analyze tab. In this part of the application the user can specify analyze functions on the columns of the imported metadata and Analyze the data in the database.

At the left side of the window you see the imported tables or views. The right part shows the details and statistics of this table or view.

DATPROF Analyze 1.PNG

To analyze one or more tables or views, click the Run Analyze Statistics button located at the bottom left of the window.

DATPROF Analyze 2.png

There are three icons giving you 3 options.

From left to right:

  • Run Statistics: The Analyze run will start.

  • Pause Statistics: The run will pause, a pop-up will appear. To continue press "OK".

  • Abort Statistics: The analyse run will abort.

To run statistics a connection to the database is required.

Saving your project

After collecting statistics for your tables, it is recommended to save your project. Since gathering statistics can take some time, saving the project allows you to skip this step if you close and reopen the project—unless you wish to gather statistics again.


Tables and views at the left side

On the left side of the screen the imported tables and views are shown. 

DATPROF Analyze 3.png

From top to bottom:

  • A search box. This enables you to filter the tables or views in this screen. You can use % as a wildcard.

  • A drop-down menu containing all available schemas. This only applies if more then one schema is imported.

  • The Tables and views.  These can be identified by the following icons: 

    Analyze Tables icon.png

    for tables, 

    Analyze Views icon.png

    for views.

You can select one or more tables simultaneously to perform tasks on them at the same time. 

Right-click on a table or view to open a menu with the following options:

  • Count rows: The amount will appear right from the table or view name.

  • Clear statistics: Clear statistics , including count, from the selected tables or views.

  • Clear profiles: Clear the matching profiles for the selected tables or views.

  • Sort by: Sort the tables by Object name, Object type and Row count.


Statistics at the right side

The right part of the screen contains 4 Tabs. 

DATPROF Analyze 4.png

Tab Statistics

The Statistics tab provides statistical data for selected tables or views. Only the tables or views you select will be analyzed.

Certain column characteristics are displayed without requiring a full analysis. These are collected from the metadata imported during the Import meta data process and reflect the available data at that stage.

DATPROF Analyze 5.PNG

After an analyze run the other columns are filled.

DATPROF Analyze 6.PNG

Statistics are gathered on the data in the columns.

  • # Nulls: The amount of NULL values in this column.

  • # Distinct: The number of unique values in this column

  • Min: Show the lowest value in this column ("A" is before "a", "-1" is before "1")

  • Max: Show the highest value in this column.

  • # Shortest: Show the shortest value in this column. For numeric also the shortest length is given where the the value 11 has the length 2 and the value -2 also has the length 2.

  • # Longest: Show the longest value in this column. For numeric also the shortest length is given where the the value 11 has the length 2 and the value -2 also has the length 2.

  • Least frequent: Show the least often occurring value in this column.

  • Most frequent: Show the most frequently occurring value in this column.

Filtering

To analyze the entire content of a table, simply select the table and click Run Statistics.

If you want to generate statistics for a specific subset of data, you can apply a filter. In the central part of the window, navigate to the Filter tab, where a text box will appear. Here, you can enter an SQL WHERE clause to define the desired data subset for analysis.

  • When running statistics with a filter applied, only the filtered data will be analyzed.

  • The resulting count is displayed in the Summary section under Data Filing.

DATPROF Analyze Data Filling.png
  • A filter can be given for every table or view.

  • Filters are saved in the project.

  • Filters are NOT removed when clearing Statistic data.

  • The Number of records in the left panel still shows the total number of unfiltered records in this table.

Tab Summary

The summary tab shows either one or two diagrams depending on whether the table was filtered. The data filling diagram shows the filling of this column. The other diagram is only visible for Numeric columns. It shows the number of positive, negative and "0" records in this column.

Below the diagram part you find three blocks of information: Ordering, Length, Frequency

For all three blocks you can see the most extreme results. The other tabs elaborate upon these results.

Below is an example of an alphanumeric and a numeric column, both filtered with the condition WHERE "LAST_NAME" LIKE 'A%' (meaning the last name value must start with the letter A):

DATPROF Analyze Data Filling where filter salary year.png

DATPROF Analyze Data Filling where filter last name.png

Tab Ordening

DATPROF Analyze Tab Ordening.png

This tab displays two panels: one for the minimum values and one for the maximum values. It shows the values (sorted from low to high and vice versa), along with their length and the number of occurrences.

Double-click on a record to view the corresponding result(s). You can modify the filter and re-run the query if you'd like to further refine the data you've found.

DATPROF Analyze Tab Ordening Filter.png

Tab Length

DATPROF Analyze Tab Length2.png

This shows two panels. One for the shortest and one for the longest values. It shows also the number of occurrences.

Double-click on the record to show the found record(s). You can adjust this filter and run the query if you like to filter the data you've found further. 

DATPROF Analyze Tab Length Filter.png

Tab Frequency

In the frequency you can see which values occur most often and least often within a given table. In the Value column the actual value is shown, and in the Occurrences column the amount of times that value was encountered is shown.

DATPROF Analyze Tab Frequency.png

Double-click on the record to show the found record(s). You can adjust this filter and run the query if you like to filter the data you've found further. 

DATPROF Analyze Tab Frequency Filter.png

Tab Profiling

This screen shows an oversight of which tables conform to the profiles we've configured earlier. 

In the example below, the columns EMAIL and GENDER have matching profiles. This is entirely expected, as we can reasonably assume that a column named "EMAIL" will contain email addresses.

However, when dealing with abstract column names like ZGKLZ125 or ZGKLA124, the use of profiles becomes particularly valuable. Profiles help remove the abstraction, offering a clear understanding of what data is actually contained within these columns.

DATPROF Analyze 7.png

Tab Data

This screen features a SQL window where you can enter a WHERE clause to filter the data in the selected table. The results are displayed in a grid, and you can sort the grid by clicking on the column headers.

By default, only 100 records are retrieved. Since all data is transferred to the client, increasing this value may lead to performance issues.

DATPROF Analyze 8.PNG

Tab Foreign Keys

The Foreign Keys tab displays the existing foreign keys retrieved through the Import and Sync Metadata options.

DATPROF Analyze Tab Foreign Keys.png

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.