Checking column quality, distribution, and profiles

As a final check before loading our data, the Power Query Editor includes powerful tools that allow us to understand the quality of our data. These tools can be found on the View tab of the ribbon in the Data Preview section.

To use these features, we will perform the following steps:

  1. Click on the Hours query in the Active group in the Queries pane.
  2. Click the View tab of the ribbon and check the box next to Column quality in the Data Preview section:

Figure 23: Column quality

Note that, under the column headers, information is displayed regarding the quality of the data in each column. This information includes the percentage of row values that are Valid, Error, or Empty.
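To make the three percentages concrete, the following is a minimal Python sketch of how such a breakdown could be computed for one column. It is not how Power Query is implemented; the function name column_quality, the use of None or an empty string for empty cells, and the use of exception objects to stand in for cell-level errors are all assumptions made for illustration.

```python
def column_quality(values):
    """Return the percentage of valid, error, and empty entries in a column.

    Assumptions (illustrative only): empty cells are None or "", and
    cell-level errors are represented by Exception instances.
    """
    total = len(values)
    if total == 0:
        return {"valid": 0.0, "error": 0.0, "empty": 0.0}

    empty = sum(1 for v in values if v is None or v == "")
    error = sum(1 for v in values if isinstance(v, Exception))
    valid = total - empty - error

    return {
        "valid": 100.0 * valid / total,
        "error": 100.0 * error / total,
        "empty": 100.0 * empty / total,
    }


# Hypothetical sample column: two numbers, one empty cell, one error.
sample_hours = [8, None, ValueError("not a number"), 4]
print(column_quality(sample_hours))
```

The three percentages always add up to 100, which mirrors the way each row value falls into exactly one of the three categories.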
  3. Click the checkbox next to Column distribution in the ribbon. This same area now displays information regarding the distinct and unique values that were found in the rows of each column. This information is based on the first 1,000 rows returned by the query:

Figure 24: Column distribution
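The difference between distinct and unique is easy to confuse: distinct counts how many different values appear, while unique counts only the values that appear exactly once. The following Python sketch illustrates that difference on a sample limited to the first 1,000 rows. The function name column_distribution and the sample data are hypothetical, and the code is an illustration rather than Power Query's own logic.

```python
from collections import Counter
from itertools import islice


def column_distribution(values, sample_size=1000):
    """Count distinct and unique values in the first `sample_size` rows."""
    # Only look at the first rows, mirroring the 1,000-row preview.
    sample = list(islice(values, sample_size))
    counts = Counter(sample)

    distinct = len(counts)                              # different values seen
    unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

    return {"distinct": distinct, "unique": unique, "counts": counts}


# Hypothetical column: "A" appears twice, "B" and "C" once each.
result = column_distribution(["A", "B", "A", "C"])
print(result["distinct"], result["unique"])
```

In this example, there are three distinct values (A, B, and C) but only two unique values (B and C), because A occurs more than once.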

Finally, we can also view additional statistical information about the values in each column.

  4. Click the checkbox next to Column profile in the ribbon. Note that two additional areas are displayed, that is, Column statistics and Value distribution:

Figure 25: Column profiling
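As a rough illustration of the kind of summary that a statistics area provides for a numeric column, the following Python sketch computes a handful of common measures. The function name column_statistics and the choice of measures are assumptions for illustration; they are not an exact list of what the Power Query Editor displays.

```python
import statistics


def column_statistics(values):
    """Summarize the numeric entries of a column (illustrative sketch)."""
    # Keep only real numbers; booleans are excluded even though
    # Python treats them as a subtype of int.
    numbers = [
        v for v in values
        if isinstance(v, (int, float)) and not isinstance(v, bool)
    ]
    if not numbers:
        return {"count": len(values), "numeric": 0}

    return {
        "count": len(values),        # all rows, including empty ones
        "numeric": len(numbers),     # rows that hold a number
        "min": min(numbers),
        "max": max(numbers),
        "average": statistics.fmean(numbers),
        # Sample standard deviation needs at least two values.
        "stdev": statistics.stdev(numbers) if len(numbers) > 1 else 0.0,
    }


# Hypothetical column of hours with one empty cell.
print(column_statistics([8, 6, None, 10]))
```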

This completes the transformation process for our data. Our final step is to load the data into the data model.
