Technical requirements

The code for this chapter can found inside the GitHub repository shared with this book inside the Chapter 3 folder. This dataset consists of email data taken from my personal Gmail account. Due to privacy issues, the dataset cannot be shared with you. However, in this chapter, we will guide you on how you can download your own emails from Gmail to perform initial data analysis.

Here are the steps to follow:

  1. Log in to your personal Gmail account. 
  2. Go to the following link: https://takeout.google.com/settings/takeout.
  3. Deselect all the items but Gmail, as shown in the following screenshot:

  1. Select the archive format, as shown in the following screenshot:

Note that I selected Send download link by email, One-time archive, .zip, and the maximum allowed size. You can customize the format. Once done, hit Create archive

You will get an email archive that is ready for download. You can use the path to the mbox file for further analysis, which will be discussed in this chapter. 

Now let's load the dataset.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.116.36.194