Chapter 13. perf

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 13

perf

perf(1) is the official Linux profiler and is in the Linux kernel source under tools/perf.¹ It is a multi-tool that has profiling, tracing, and scripting capabilities, and is the front-end to the kernel perf_events observability subsystem. perf_events is also known as Performance Counters for Linux (PCL) or Linux Performance Events (LPE). perf_events and the perf(1) front-end began with performance monitoring counter (PMC) capabilities, but have since grown to support event-based tracing sources as well: tracepoints, kprobes, uprobes, and USDT.

¹perf(1) is unusual in that it is a large, complex user-level program that is in the Linux kernel source tree. Maintainer Arnaldo Carvalho de Melo described this situation to me as an “experiment.” While this has been beneficial to perf(1) and Linux as they have been developed in lockstep, some are uncomfortable with its inclusion, and it may remain the only complex user software ever to be included in the Linux source.

This chapter, along with Chapter 14, Ftrace, and Chapter 15, BPF, are optional reading for those who wish to learn one or more system tracers in more detail.

Compared with other tracers, perf(1) is especially suited for CPU analysis: profiling (sampling) CPU stack traces, tracing CPU scheduler behavior, and examining PMCs to understand micro-architectural level CPU performance including cycle behavior. Its tracing capabilities allow it to analyze other targets as well, including disk I/O and software functions.

perf(1) can be used to answer questions such as:

Which code paths are consuming CPU resources?
Are the CPUs stalled on memory loads/stores?
For what reasons are threads leaving the CPU?
What is the pattern of disk I/O?

The following sections are structured to introduce perf(1), show event sources, and then show the subcommands that use them. The sections are:

Prior chapters show how to use perf(1) for the analysis of specific targets. This chapter focuses on perf(1) itself.

13.1 Subcommands Overview

perf(1)’s capabilities are invoked via subcommands. As a common usage example, the following uses two subcommands: record to instrument events and save them to a file, and then report to summarize the contents of the file. These subcommands are explained in Section 13.9, perf record, and Section 13.10, perf report.

Section	Command	Description
-	`annotate`	Read perf.data (created by perf record) and display annotated code.
-	`archive`	Create a portable perf.data file containing debug and symbol info.
-	`bench`	System microbenchmarks.
-	`buildid-cache`	Manage build-id cache (used by USDT probes).
-	`c2c`	Cache line analysis tools.
-	`diff`	Read two perf.data files and display the differential profile.
-	`evlist`	List the event names in a perf.data file.
14.12	`ftrace`	A perf(1) interface to the Ftrace tracer.
-	`inject`	Filter to augment the events stream with additional information.
-	`kmem`	Trace/measure kernel memory (slab) properties.
11.3.3	`kvm`	Trace/measure kvm guest instances.
13.3	`list`	List event types.
-	`lock`	Analyze lock events.
-	`mem`	Profile memory access.
13.7	`probe`	Define new dynamic tracepoints.
13.9	`record`	Run a command and record its profile into perf.data.
13.10	`report`	Read perf.data (created by `perf record`) and display the profile.
6.6.13	`sched`	Trace/measure scheduler properties (latencies).
5.5.1	`script`	Read perf.data (created by `perf record`) and display trace output.
13.8	`stat`	Run a command and gather performance counter statistics.
-	`timechart`	Visualize total system behavior during a workload.
-	`top`	System profiling tool with real-time screen updates.
13.12	`trace`	A live tracer (system calls by default).

Table of Contents for Chapter 13. perf

Create new playlist

Sign In

Sign Up

Chapter 13

13.1 Subcommands Overview

13.2 One-Liners

Listing Events

Counting Events

Profiling

Static Tracing

Dynamic Tracing

Reporting

13.3 perf Events

13.4 Hardware Events

13.4.1 Frequency Sampling

13.5 Software Events

13.6 Tracepoint Events

13.7 Probe Events

13.7.1 kprobes

kprobe Arguments

13.7.2 uprobes

uprobe Arguments

13.7.3 USDT

13.8 perf stat

13.8.1 Options

13.8.2 Interval Statistics

13.8.3 Per-CPU Balance

13.8.4 Event Filters

13.8.5 Shadow Statistics

13.9 perf record

13.9.1 Options

13.9.2 CPU Profiling

13.9.3 Stack Walking

13.10 perf report

13.10.1 TUI

13.10.2 STDIO

13.11 perf script

13.11.1 Flame Graphs

13.11.2 Trace Scripts

13.12 perf trace

13.12.1 Kernel Versions

13.13 Other Commands

13.14 perf Documentation

13.15 References

Table of Contents for
Chapter 13. perf