10 Data Frames

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

10
Data Frames

This chapter introduces data frame values, which are the primary two-dimensional data storage type used in R. In many ways, data frames are similar to the row-and-column table layout that you may be familiar with from spreadsheet programs like Microsoft Excel. Rather than interact with this data structure through a user interface (UI), you will learn how to programmatically and reproducibly perform operations on this data type. This chapter covers ways of creating, describing, and accessing data from data frames in R.

10.1 What Is a Data Frame?

At a practical level, data frames act like tables, where data is organized into rows and columns. For example, reconsider the table of names, weights, and heights from Chapter 9, shown in Figure 10.1. In R, you can use data frames to represent these kinds of tables.

An example of a data frame in R language. — Figure 10.1 A table of data (of people’s weights and heights) when viewed as a data frame in RStudio.

The table has three columns with headers: name, height, and weight, and it has five rows. It infers the following data (row-wise): Ada, 64, and 135; Bob, 74, and 156; Chris, 69, and 139; Diya, 69, and 144; Emma, 71, and 152.

Data frames are really just lists (see Chapter 8) in which each element is a vector of the same length. Each vector represents a column, not a row. The elements at corresponding indices in the vectors are considered part of the same row (record). This structure makes sense because each row may have different types of data—such as a person’s name (string) and height (number)—and vector elements must all be of the same type.

For example, you can think of the data shown in Figure 10.1 as a list of three vectors: name, height, and weight. The name, height, and weight of the first person measured are represented by the first elements of the name, height, and weight vectors, respectively.

You can work with data frames as if they were lists, but data frames have additional properties that make them particularly well suited for handling tables of data.

10.2 Working with Data Frames

Many data science questions can be answered by honing in on the desired subset of your data. In this section, you will learn how to create, describe, and access data from data frames.

10.2.1 Creating Data Frames

Typically you will load data sets from some external source (see Section 10.3), rather than writing out the data by hand. However, it is also possible to construct a data frame by combining multiple vectors. To accomplish this, you can use the data.frame() function, which accepts vectors as arguments, and creates a table with a column for each vector. For example:

Syntax	Description	Example
`my_df[row_name, col_name]`	Element(s) by row and column names	`people["Ada", "height"]` (element in row named `Ada` and column named `height`)
`my_df[row_num, col_num]`	Element(s) by row and column indices	`people[2, 3]` (element in the second row, third column)
`my_df[row, col]`	Element(s) by row and column; can mix names and indices	`people[2, "height"]` (second element in the `height` column)
`my_df[row, ]`	All elements (columns) in row name or index	`people[2, ]` (all columns in the second row)
`my_df[, col]`	All elements (rows) in a column name or index	`people[, "height"]` (all rows in the `height` column; equivalent to list notations)

Table of Contents for 10 Data Frames

Create new playlist

Sign In

Sign Up

10Data Frames

10.1 What Is a Data Frame?

10.2 Working with Data Frames

10.2.1 Creating Data Frames

10.2.2 Describing the Structure of Data Frames

10.2.3 Accessing Data Frames

10.3 Working with CSV Data

10.3.1 Working Directory

10.3.2 Factor Variables

Table of Contents for
10 Data Frames

10
Data Frames