Chapter 8. Creating Command Pipelines

The designers of Unix created an operating system with a philosophy that remains valid to this day. They established the following:

  • Everything is a file. Devices are represented as special files, as are networking connections and plain old normal files.

  • Each process runs in an environment. This environment includes standard files for input, output, and errors.

  • Unix has many small commands, each of which was designed to perform one task and to do that task well. This saves on memory and processor usage. It also leads to a more elegant system.

  • These small commands were designed to accept input from the standard input file and send output to the standard output file.

  • You can combine these small commands into more complex commands by creating command pipelines.

This chapter delves into these concepts from the perspective of shell scripts. Because shell scripts were designed to call commands, the ability to create command pipelines, thereby making new, complex commands from the simple primitive commands, provides you with extraordinary power. (Be sure to laugh like a mad scientist here.)

This chapter covers how you can combine commands and redirect the standard input, output, and errors, as well as pipe commands together.

Working with Standard Input and Output

Every process on Unix or a Unix-like system is provided with three open files, each referred to by a number called a file descriptor. These files are the standard input, output, and error files. By default:

  • Standard input is the keyboard, abstracted as a file to make it easier to write scripts and programs.

  • Standard output is the shell window or terminal from which the script runs, abstracted as a file to again make writing scripts and programs easier.

  • Standard error is the same as standard output: the shell window or terminal from which the script runs.

When your script calls the read command, for example, it reads data from the standard input file. When your script calls the echo command, it sends data to the standard output file.
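
For example, a two-line script makes both calls visible. Here is a minimal sketch (the script name greet.sh is just for illustration):

#!/bin/sh
# greet.sh: read one line from standard input,
# then write a message to standard output.
read name
echo "Hello, $name"

Run the script, type a name, and it echoes a greeting:

$ sh greet.sh
Alice
Hello, Alice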

A file descriptor is simply a number that refers to an open file. By default, file descriptor 0 (zero) refers to standard input, often abbreviated as stdin. File descriptor 1 refers to standard output (stdout), and file descriptor 2 refers to standard error (stderr). These numbers become important when you need to access a particular file, especially when you want to redirect these files to other locations. File descriptor numbers go up from zero.

Redirecting Standard Input and Output

Because the keyboard and shell window are treated as files, redirecting a script's input or output is easy. That is, you can send the output of a script or a command to a file instead of to the shell window. Similarly, you can change the input of a script or command to come from a file instead of the keyboard. To do this, you create commands with a special > or < syntax.

To review, the basic syntax for a command is:

command options_and_arguments

The options are items such as -l for a long file listing (for the ls command). Arguments are items such as file names.

To redirect the output of a command to a file, use the following syntax:

command options_and_arguments > output_file

To redirect the input of a command to come from a file, use the following syntax:

command options_and_arguments < input_file

You can combine both redirections with the following syntax:

command options_and_arguments < input_file > output_file

You can use this syntax within your scripts or at the command line.
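
For example, the following commands save a directory listing to a file, feed that file to the wc command as input, and then combine both redirections to write a sorted copy (the file names here are arbitrary):

$ ls /usr/bin > listing.txt
$ wc -l < listing.txt
$ sort < listing.txt > sorted.txt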

Redirecting Standard Error

In addition to redirecting the standard input and output for a script or command, you can redirect standard error. Even though standard error goes, by default, to the same place as standard output (the shell window or terminal), there are good reasons why stdout and stderr are treated separately. The main reason is that when you redirect the output of a command to a file, you would otherwise have no way of knowing whether an error occurred: any error messages would be buried in the file. Keeping stderr separate from stdout allows the error messages to appear on your screen while the output still goes to the file.

To redirect stderr from a command to a file, use the following syntax:

command options_and_arguments 2> output_file

The 2 in 2> refers to file descriptor 2, the descriptor number for stderr.
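
For example, assuming your system has no directory named /fred, the following command captures the resulting error message in a file instead of displaying it in the shell window:

$ ls /fred 2> errors.txt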

The C shell uses a different syntax for redirecting standard error. See the next section for more on this.

Redirecting Both Standard Output and Standard Error

In the Bourne shell (as well as Bourne-shell derivatives such as bash and ksh), you can redirect stderr to the same location as stdout in a number of ways. You can also redirect standard error to a separate file. As part of this, you need to remember that the file descriptors for the standard files are 0 for stdin, 1 for stdout, and 2 for stderr.
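
For example, the following lines sketch the most common forms. The first sends stdout to a file and then sends stderr to the same place as stdout (note that 2>&1 must appear after the output redirection); the second is a bash shorthand for the first; the third sends stderr to its own file. The last line shows the C shell form mentioned previously, which redirects stdout and stderr together:

$ command > results.txt 2>&1
$ command &> results.txt
$ command > results.txt 2> errors.txt
% command >& results.txt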

Appending to Files

The > operator can be quite destructive. Each time you run a command redirecting stdout to a file with >, the file will be truncated and replaced by any new output. In many cases, you'll want this behavior because the file will contain just the output of the command. But if you write a script that outputs to a log file, you typically don't want to destroy the log each time. This defeats the whole purpose of creating a log.

To get around this problem, you can use the >> operator to redirect the output of a command, but append to the file, if it exists. The syntax follows:

command >> file_to_append

The shell will create the file if it does not exist.
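
For example, run twice, the following command creates script.log on the first run and adds a second line on the second run:

$ date >> script.log
$ date >> script.log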

Truncating Files

You can use a shorthand syntax for truncating files by omitting the command before the > operator. The syntax follows:

> filename

You can also use an alternate format with a colon:

: > filename

Note that : > predates the use of smiley faces in email messages.

Both of these command-less commands will create the file if it does not exist and truncate the file to zero bytes if the file does exist.

Sending Output to Nowhere Fast

On occasion, you not only want to redirect the output of a command, you want to throw the output away. This is most useful if:

  • A command creates a lot of unnecessary output.

  • You want to see error messages only, if there are any.

  • You are interested only in whether the command succeeded or failed. You do not need to see the command's output. This is most useful if you are using the command as a condition in an if or while statement, as in the example at the end of this section.

Continuing in the Unix tradition of treating everything as a file, you can redirect a command's output to the null file, /dev/null. The null file consumes all output sent to it, as if it were a black hole.

The file /dev/null is often called a bit bucket.

To use this handy file, simply redirect the output of a command to the file. For example:

$ ls /usr/bin > /dev/null
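
The same trick works when a command serves as a condition, as mentioned previously. For example, the following test cares only whether grep found a match, so the matching line itself goes to /dev/null:

if grep root /etc/passwd > /dev/null
then
    echo "Found the root account."
fi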

The Cygwin environment for Windows includes a /dev/null to better support Unix shell scripts.

Redirecting input and output is merely the first step. The next step is to combine commands into command pipelines.

Piping Commands

Command pipelines extend the idea of redirecting the input and output for a program. If you can redirect the output of one command and also redirect the input of another, why not connect the output of one command as the input of another? That's exactly what command pipelines do.

The basic syntax is:

command options_and_arguments | command2 options_and_arguments

The pipe character, |, acts to connect the two commands. The shell redirects the output of the first command to the input of the second command.

Note that command pipelines often overlap with normal redirection. For example, you can pass a file as input to the wc command, and the wc command will count the lines, words, and characters in the file:

$ wc < filename

You can also pass the name of the file as a command-line argument to the wc command:

$ wc filename

Or you can pipe the output of the cat command to the wc command:

$ cat filename | wc

Not all commands accept file names as arguments, so you still need pipes or input redirection. In addition, you can place as many commands as needed on the pipeline. For example:

command1 options_and_arguments | command2 | command3 | command4 > output.txt

Each of the commands in the pipeline can have as many arguments and options as needed. Because of this, you will often need to use the shell line-continuation marker, \, at the end of a line. For example:

command1 options_and_arguments | \
    command2 | \
    command3 | \
    command4 > output.txt

You can use the line-continuation marker, \, with any long command, but it is especially useful when you pipe together a number of commands.

Note that when a line ends with the pipe character, the shell assumes the command continues on the next line, so the line-continuation marker is not strictly required after a pipe. Using it anyway makes the continuation explicit.

Piping with Unix Commands

Unix commands were designed with pipes in mind, as each command performs one task. The designers of Unix expected you to pipe commands together to get any useful work done.

For example, the spell command outputs all the words it does not recognize from a given file. (This is sort of a backward way to check the spelling of words in a file.) The sort command sorts text files, line by line. The uniq command removes duplicate lines. You can combine these commands into a primitive spell-checking command.
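
A first cut at such a command might look like the following sketch (report.txt is just an example file name; on systems without spell, a similar checker such as aspell can stand in):

$ spell report.txt | sort | uniq > mistakes.txt

The sort step matters because uniq removes only adjacent duplicate lines.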

Creating Pipelines

Creating command pipelines can be difficult. It's best to approach this step by step, making sure each part of the pipeline works before going on to the next part.

For example, you can create a series of commands to determine which of many user accounts on a Unix or Linux system are for real users. Many background services, such as database servers, are given user accounts. This is mostly for the sake of file permissions. The postgres user can then own the files associated with the Postgres database service, for example. So the task is to separate these pseudo user accounts from real live people who have accounts on a system.

On Unix and Linux, user accounts are traditionally stored in /etc/passwd, a specially formatted text file with one line per user account.

Mac OS X supports a /etc/passwd file, but in most cases, user accounts are accessed from DirectoryServices or lookupd. You can still experiment with the following commands to process formatted text in the /etc/passwd file, however. In addition, many systems do not use /etc/passwd to store all user accounts. Again, you can run the examples to see how to process formatted text.

An /etc/passwd file from a Linux system follows:

$ more /etc/passwd
root:x:0:0:root:/root:/bin/bash
bin:x:1:1:bin:/bin:/sbin/nologin
daemon:x:2:2:daemon:/sbin:/sbin/nologin
adm:x:3:4:adm:/var/adm:/sbin/nologin
lp:x:4:7:lp:/var/spool/lpd:/sbin/nologin
sync:x:5:0:sync:/sbin:/bin/sync
shutdown:x:6:0:shutdown:/sbin:/sbin/shutdown
halt:x:7:0:halt:/sbin:/sbin/halt
mail:x:8:12:mail:/var/spool/mail:/sbin/nologin
news:x:9:13:news:/etc/news:
uucp:x:10:14:uucp:/var/spool/uucp:/sbin/nologin
operator:x:11:0:operator:/root:/sbin/nologin
games:x:12:100:games:/usr/games:/sbin/nologin
gopher:x:13:30:gopher:/var/gopher:/sbin/nologin
ftp:x:14:50:FTP User:/var/ftp:/sbin/nologin
nobody:x:99:99:Nobody:/:/sbin/nologin
rpm:x:37:37::/var/lib/rpm:/sbin/nologin
vcsa:x:69:69:virtual console memory owner:/dev:/sbin/nologin
nscd:x:28:28:NSCD Daemon:/:/sbin/nologin
sshd:x:74:74:Privilege-separated SSH:/var/empty/sshd:/sbin/nologin
rpc:x:32:32:Portmapper RPC user:/:/sbin/nologin
rpcuser:x:29:29:RPC Service User:/var/lib/nfs:/sbin/nologin
nfsnobody:x:65534:65534:Anonymous NFS User:/var/lib/nfs:/sbin/nologin
pcap:x:77:77::/var/arpwatch:/sbin/nologin
mailnull:x:47:47::/var/spool/mqueue:/sbin/nologin
smmsp:x:51:51::/var/spool/mqueue:/sbin/nologin
apache:x:48:48:Apache:/var/www:/sbin/nologin
squid:x:23:23::/var/spool/squid:/sbin/nologin
webalizer:x:67:67:Webalizer:/var/www/usage:/sbin/nologin
dbus:x:81:81:System message bus:/:/sbin/nologin
xfs:x:43:43:X Font Server:/etc/X11/fs:/sbin/nologin
named:x:25:25:Named:/var/named:/sbin/nologin
ntp:x:38:38::/etc/ntp:/sbin/nologin
gdm:x:42:42::/var/gdm:/sbin/nologin
postgres:x:26:26:PostgreSQL Server:/var/lib/pgsql:/bin/bash
ericfj:x:500:500:Eric Foster-Johnson:/home2/ericfj:/bin/bash
bobmarley:x:501:501:Bob Marley:/home/bobmarley:/bin/bash

The /etc/passwd file uses the following format for each user account:

username:password:userID:groupID:Real Name:home_directory:starting_shell

Each field is separated by a colon. So you can parse the information for an individual user:

bobmarley:x:501:501:Bob Marley:/home/bobmarley:/bin/bash

In this case, the user name is bobmarley. The password, x, is a placeholder; it commonly means that the real (encrypted) password is stored elsewhere, such as in the /etc/shadow file, or that another system handles login authentication. The user ID is 501. So is the user's default group ID. (Linux systems often create a group for each user, a group of one, for security reasons.) The user's real name is Bob Marley. His home directory is /home/bobmarley. His starting shell is bash. (Good choice.)

As with the ancient spell command used previously, this example makes broad assumptions, which is fun, although not always accurate. For this example, a real user account is one that runs a shell (or what the script thinks is a shell) on login and does not run a program in /sbin or /usr/sbin, the locations for system administration commands. As with the spell command, this is not fully accurate, but it is good enough to start processing the /etc/passwd file.

You can combine all this information and start extracting data from the /etc/passwd file one step at a time.
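
The following is a minimal sketch of one such pipeline (the exercises later refer to a listusers script built along these lines; treat this version as an approximation). The cut command extracts the user name, real name, and shell fields; grep -v sbin drops accounts whose login program lives in /sbin or /usr/sbin; grep 'sh$' keeps lines whose login program looks like a shell; the final cut and sort produce a sorted list of names:

cut -d: -f1,5,7 /etc/passwd | \
    grep -v sbin | \
    grep 'sh$' | \
    cut -d: -f1,2 | \
    sort

Note that this sketch still counts the postgres account as a real user, because that account runs /bin/bash; the exercises return to this problem.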

In addition to piping between commands, you can pipe data to and from your shell scripts, as in the following example.
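
Here is a minimal sketch of a script that reads from a pipe (the name countshells.sh is just for illustration). Because commands inside a script inherit the script's standard input, the grep here reads whatever you pipe into the script:

#!/bin/sh
# countshells.sh: count input lines that end in sh,
# reading from this script's standard input.
grep 'sh$' | wc -l

You might run it by piping the shell field of /etc/passwd into it:

$ cut -d: -f7 /etc/passwd | sh countshells.sh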

Using tee to Send the Output to More Than One Process

The tee command sends output to two locations: a file as well as stdout. The tee command copies all input to both locations. This proves useful, for example, if you need to redirect the output of a command to a file and yet still want to see it on the screen. The basic syntax is:

original_command | tee filename.txt | next_command

In this example, the tee command sends all the output of the original_command to both the next_command and to the file filename.txt. This allows you to extract data from the command without modifying the result. You get a copy of the data, written to a file, as well as the normal command pipeline.
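
For example, the following command saves the full listing of /usr/bin to listing.txt while wc counts the entries on the screen:

$ ls -1 /usr/bin | tee listing.txt | wc -l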

Summary

You can get a lot of work done by combining simple commands. Unix systems (and Unix-like systems) are packed full of these types of commands. Many in the programming community liken scripting to the glue that ties commands together. You can think of the operating system as a toolbox and the shell as a way to access these tools. This philosophy will make it a lot easier to write shell scripts that you can use again and again.

This chapter covered redirecting input, output, and errors, as well as creating command pipelines.

  • You can redirect the output of commands to files using the > operator. The > operator will truncate a file if it already exists. Use >> in place of > if you want to append to the file.

  • You can redirect the error output of commands using 2>. To send the error output to the same location as the normal output, use 2>&1 (or, in bash, &>).

  • You can redirect the input of commands to come from files using the < operator.

  • Redirect the output of one command to the input of another using the pipe character, |. You can pipe together as many commands as you need.

  • The tee command will copy its input to both stdout and to any files listed on the command line.

The next chapter shows how to control processes, capture the output of commands into variables, and mercilessly kill processes.

Exercises

  1. Discuss the ways commands can generate output. Focus particularly on commands called from shell scripts.

  2. Use pipes or redirection to create an infinite feedback loop, where the final output becomes the input again to the command line. Be sure to stop this command before it fills your hard disk. (If you are having trouble, look at the documentation for the tail command.)

  3. Modify the listusers script so that it does not generate a false positive for the postgres user and other, similar accounts that are for background processes, not users. You may want to go back to the original data, /etc/passwd, to come up with a way to filter out the postgres account.
