Tableau Prep Builder is an exciting new platform to develop data pipelines to transform your data for reporting and analytics purposes.
In this chapter, you will come to understand how we think about data preparation from the perspective of Tableau Prep Builder. You will learn about the different use cases you may employ Tableau Prep for, be it ad hoc data analysis, creating a dataset for a BI tool, or specifically for Tableau Desktop.
In this chapter, we will cover the following recipes:
To follow along with the recipes in this chapter, you need to have Tableau Prep Builder installed, and Tableau Desktop. In the first recipe, we'll walk through the details of installing Tableau Prep Builder.
In this recipe, you'll install Tableau Prep Builder. We'll download the software, perform the installation, and open Tableau Prep Builder for the first time.
To enjoy the many benefits of Tableau Prep Builder, you need a license key. Typically, this would be issued by your administrator. Alternatively, you may have purchased a license yourself at https://buy.tableau.com/.
If you do not have a license key, Tableau offers a free trial so you can start right away.
As with all recipes in this book, the installation is performed on an Apple MacBook running macOS Big Sur. The steps are nearly identical on Windows machines and you can follow along on either operating system.
Ensure you're connected to the internet and have your favorite browser open:
You're now ready to start using Tableau Prep Builder with the recipes in this book.
Tableau Prep Builder is updated frequently, and you may expect new features, enhancements, and bug fixes at a regular cadence. Once you have installed a version, as in this recipe, Tableau Prep Builder will always notify you upon startup if a more recent version is available, along with a link to the download page.
Tableau has taken great care in creating an interface that is intuitive and easy to understand. Perhaps best of all, it has quite a few similarities to the manner in which things are laid out in Tableau Desktop. So, if you are familiar with Tableau Desktop, you should feel right at home.
In this recipe, we will take a brief tour of the Tableau Prep user interface.
Tableau Prep provides what we need right out of the box. That includes data connectors, sample flows, training resources, and community updates. We'll walk through these step by step. This knowledge is foundational to all recipes.
Open Tableau Prep:
In Tableau Prep, a flow is what we call a data pipeline. If you've used other software in the past, you may have referred to a pipeline as an Extract, Transform, and Load (ETL) process, workflow, or data pipeline.
It's easy to start a new flow, simply by creating a data connection. To get started, click the blue Connect to Data button to expand the data connection options:
From here, select your connection type, and that will complete the creation of a new flow:
In Chapter 2, Extract and Load Processes, we'll cover the configuration of various data connections in detail.
Both these flows use the sample Superstore and WorldIndicators data that is delivered with the Tableau Desktop application as well, so you may be familiar with this data already.
These example flows can be opened with one click and run locally. They're excellent for testing out quick actions and recipes learned in this book, without the need for you to create a new flow from scratch. Personally, I've become so accustomed to this, I typically try something out in an example workflow quickly, and then move on to my own flow and implement the action there when I'm confident it'll work.
The Discover pane has two sections that are always visible, Training and Resources. Training includes links to training materials authored by the Tableau team, while Resources includes links to Tableau blogs and user community forums.
These links update over time, so it's great to glance at this pane every time you open Tableau Prep, to make sure you're up to date on the latest developments.
The second, one-click approach is to select a flow from the Recent Flows section. This section will automatically update based on your activity, with the latest flow accessed being the first one listed:
You're now familiar with the foundational elements that make up the Tableau Prep Builder user interface and can start building flows using your data.
Simply put, Tableau Prep Builder works by ingesting data from a source to your local machine and processes it there whenever you make updates to a flow, in real time. To stay performant, Tableau Prep Builder automatically takes a sample of your data inputs only during this process. When you execute an entire flow, only then will the full data input be processed, and so this may take longer than previewing data in real time. In Chapter 2, Extract and Load Processes, there is a recipe to manage the sampling size and method used by Tableau Prep.
In this recipe, you'll learn how to leverage Tableau Prep Builder to perform ad hoc data analysis. In most scenarios, getting insights from your data would involve the creation of a data pipeline and then connecting a data visualization tool to the output of that pipeline to perform your analysis. However, with Tableau Prep Builder, you can perform basic ad hoc analysis on your data from within the tool itself.
Open the Tableau Prep Superstore flow to follow along with the steps outlined.
Ad hoc analysis typically starts with a business question to be answered with the use of your data. Let's assume the question posed for the Superstore data is: Which is the top category of products that consumers order with same-day delivery?
Following these steps, you'll be able to use Tableau Prep to answer this question without the need for additional reporting tools:
Tableau Prep will instantly apply the filter and show you the data preview excluding any segments that are not Consumer.
With these steps, you've quickly performed ad hoc analysis on the Superstore data and identified the top product category for consumers who placed orders with the same-day shipping delivery mode.
Using Tableau Prep Builder, we've quickly performed exploratory data analysis without the need to run our flow or create new outputs. Doing so provides great value not only in terms of a fast turnaround but also in keeping your data landscape clean by avoiding the creation of new data sources (outputs) for simple analysis.
When you perform analysis in this fashion, Tableau Prep instantly runs the required actions in the background to give you the results. In the Superstore example flow, this is fairly quick. However, on large datasets, this may take more time. Tableau Prep will show a progress indicator in the top-right corner when performing such background actions:
In this recipe, you've learned how to quickly perform ad hoc data analysis in Tableau Prep without the need to export your data for analysis in a downstream application.
In this recipe, you'll learn how to use Tableau Prep to generate outputs for consumption by a variety of Business Intelligence (BI) tools. Specifically, we'll write a single output, from a flow with multiple outputs, to a CSV file. At the time of writing, output to CSV is the only non-Tableau proprietary format supported by Tableau Prep. Future releases of Tableau Prep will see the introduction of output to databases.
Open the Tableau Prep Superstore flow to follow along with the steps outlined.
In the steps that follow, we'll create an output that is suitable for consumption by data visualization and BI tools other than Tableau Desktop:
You've now successfully completed this recipe and are able to export data to CSV so that you may utilize the transformed dataset in a BI application.
Using the preceding steps, Tableau Prep Builder generates a generic CSV file that can be opened by most popular BI and spreadsheet applications. The benefit of this is that you are free to leverage any tool of your choice and are not locked into any particular ecosystem.
Using Tableau Prep to run distinct outputs separately is a great feature for ad hoc analysis, as you do not need to refresh your entire output every time.
In this recipe, you'll learn how to use Tableau Prep to generate a Tableau Desktop workbook at any point in your flow, to perform further analysis on that data.
Similar to the third recipe, Using Tableau Prep for ad hoc data analysis, we'll find the answer to the question posed for the Superstore data: Which is the top category of products that consumers order with same-day delivery? However, this time, we'll get the answer in Tableau Desktop instead.
Open the Tableau Prep Superstore flow to follow along with the steps outlined. Ensure that Tableau Desktop is installed.
In the steps that follow, we'll produce a temporary hyper extract that will allow us to perform quick ad hoc analysis in Tableau Desktop:
And with this, we've found the answer to our question. Office Supplies is the product category for which consumers most often leverage same-day shipping.
When leveraging the Preview in Tableau Desktop functionality, Tableau Prep automatically creates a temporary .hyper extract and connects that to a new Tableau workbook. This is ideal for quickly visualizing your data at any point in the data flow, whether that is for exploratory analysis or simply verifying that your flow works as expected up to the selected step.
3.22.51.241