2 Using the Command Line


The command line is an interface to a computer—a way for you (the human) to communicate with the machine. Unlike common graphical interfaces that use “windows, icons, menus, and pointers” (i.e., WIMP), the command line is text-based, meaning you type commands instead of clicking on icons. The command line lets you do everything you would normally do by clicking with a mouse, but by typing in a manner similar to programming! As a data scientist, you will mostly use the command line to manage your files and keep track of your code using a version control system (see Chapter 3).

While the command line is not as friendly or intuitive as a graphical interface, it has the advantage of being both more powerful and more efficient (it’s faster to type than to move a mouse, and you can do lots of “clicks” with a single command). The command line is also necessary when working on remote servers (other computers that often do not have graphical interfaces enabled). Thus, the command line is an essential tool for data scientists, particularly when working with large amounts of data or files.

This chapter provides a brief introduction to basic tasks using the command line—enough to get you comfortable navigating the interface and to enable you to interpret commands.

2.1 Accessing the Command Line

To use the command line, you will need to open a command shell (also known as a command prompt or terminal). This program provides the interface you type commands into. You should already have installed a command shell as detailed in Chapter 1; this book also refers to it as “the terminal” or “the command line.”

Once you open up the command shell (the Terminal program on Mac, or Git Bash on Windows), you should see something like the screen shown in Figure 2.1.

Figure 2.1 Newly opened command shells: Terminal on a Mac (top) and Git Bash on Windows (bottom). Red notes are added.

Top (Terminal on a Mac): the screen reads work-laptop1:~/Documents mikefree$, where work-laptop1 is labeled "Machine", ~/Documents is labeled "Directory", mikefree is labeled "User", and the command area is labeled "Prompt".

Bottom (Git Bash on Windows): the screen reads joelross@is-joelrossm13 MINGW64 ~/Desktop, where joelross is labeled "User", is-joelrossm13 is labeled "Machine", MINGW64 is labeled "Environment", ~/Desktop is labeled "Directory", and the command area is labeled "Prompt".

A command shell is the textual equivalent of having opened up Finder or File Explorer and having it display the user’s “Home” folder. While every command shell program has a slightly different interface, most will display at least the following information:

The machine you are currently interfacing with (you can use the command line to control different computers across a network or the internet). In Figure 2.1 the Mac machine (top) is work-laptop1, and the Windows machine (bottom) is is-joelrossm13.
The directory (folder) you are currently looking at. In Figure 2.1 the Mac directory is ~/Documents, while the Windows directory is ~/Desktop. The ~ is a shorthand for the “home directory”: /Users/CURRENT_USER/ on a Mac, or C:/Users/CURRENT_USER/ on Windows.
The user you are logged in as. In Figure 2.1 the users are mikefree (Mac) and joelross (Windows).
The command prompt (typically denoted as the $ symbol), which is where you will type in your commands.

Remember

Lines of code that begin with a pound symbol (#) are comments: They are included to explain the code to human readers (they will be ignored by your computer!).
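
Most of the information shown in the prompt can also be requested from the shell directly. The following is a minimal sketch, assuming a Bash-style shell such as Terminal on a Mac or Git Bash on Windows. The `whoami`, `hostname`, and `echo` commands are not introduced in this part of the chapter and are used here only for illustration. Output is omitted because it depends on your machine and account.

```shell
# Print the user you are logged in as
whoami

# Print the name of the machine you are currently interfacing with
hostname

# Print what the ~ shorthand expands to (your home directory)
echo ~
```

Each command prints a single line, which should match the corresponding part of your prompt.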

2.2 Navigating the File System

Although the command prompt gives you the name of the folder you are in, you might like more detail about where that folder is. Time to send your first command! At the prompt, type the pwd command:
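
As a minimal sketch, the command is typed on its own line at the prompt and run by pressing Enter. The result is the full path of the folder you are currently in. The exact path depends on your machine and user name, so no output is shown here.

```shell
# Print the working directory (the folder the shell is currently "in")
pwd
```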

Command	Behavior
`mkdir`	make a directory
`rm`	remove a file or folder
`cp`	copy a file from one location to another
`open`	open a file or folder (Mac only)
`start`	open a file or folder (Windows only)
`cat`	concatenate (combine) file contents and display the results
`history`	show previous commands executed
`!!`	repeat the previous command
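
Several of the commands in this table can be tried together in a short practice session. The following is a sketch only: the folder name `cli-practice` and the file names `notes.txt` and `backup.txt` are made up for this example, and the `cd` command, the `echo` command, and the `>` symbol are not part of the table. It assumes that no folder named `cli-practice` already exists in the current directory.

```shell
# Make a practice folder and move into it
# ("cli-practice" is a hypothetical name chosen for this example)
mkdir cli-practice
cd cli-practice

# Create a small file to experiment with
# (echo and > are used here only to produce some sample content)
echo "hello command line" > notes.txt

# Copy the file to a new location, then display the copy's contents
cp notes.txt backup.txt
cat backup.txt

# Remove both files, then move back up one level
rm notes.txt backup.txt
cd ..

# Remove the practice folder itself; the -r option is needed for folders
rm -r cli-practice
```

The `history` and `!!` commands are left out of the sketch because they are meant for interactive use: try them at the prompt after running a few commands.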

Command	Behavior
`head`	Output first n lines of an input (specified as an argument)
`grep`	Search the list of inputs for a pattern and output the matches (globally search regular expression and print)
`cut`	Select portions from input and write them as output
`uniq`	Collapse adjacent repeated input lines into a single output line (and use the `-c` option to count how many times each line occurs!)
`sed`	“Find and replace” content in input (stream editor)
`sort`	Sort input lines (ascending or descending)
`wc`	Output word count information
`curl`	Download content/webpage at a URL (“see URL”—get it?)
`say`	Have the computer speak the argument (Mac only)
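
These commands are most useful when applied to a file or chained together. The following is a minimal sketch under a few assumptions: the file `fruit.csv` and its contents are invented for this example, and the `printf` command, the `>` symbol, and the `|` symbol (which passes the output of one command to the next) are not described in the table. Output is not shown, since its exact spacing can differ between systems.

```shell
# Build a tiny sample file (the name and contents are made up for this example)
printf 'apple,red\nbanana,yellow\ncherry,red\nlime,green\n' > fruit.csv

# Output only the first 2 lines
head -n 2 fruit.csv

# Output only the lines that contain "red"
grep "red" fruit.csv

# Select the second comma-separated column, sort it,
# and count how many times each value occurs
cut -d ',' -f 2 fruit.csv | sort | uniq -c

# Replace "red" with "crimson" in the output (the file itself is not changed)
sed 's/red/crimson/' fruit.csv

# Count the number of lines in the file
wc -l < fruit.csv

# When you are done experimenting, remove the sample file with: rm fruit.csv
```

Sorting before `uniq -c` matters here, because `uniq` only compares lines that are next to each other.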
