Chapter 5. Integer Security

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 5. Integer Security

with Douglas A. Gwyn, David Keaton, and David Svoboda¹

1. Douglas A. Gwyn is retired from the U.S. Army and is an Emeritus Member of INCITS PL22.11. David Keaton is a senior member of the technical staff in the CERT Program of Carnegie Mellon’s Software Engineering Institute (SEI) and chair of INCITS PL22.11. David Svoboda is a member of the technical staff for the SEI’s CERT.

Everything good is the transmutation of something evil: every god has a devil for a father.

—Friedrich Nietzsche, Sämtliche Werke: Kritische Studienausgabe

5.1. Introduction to Integer Security

The integers are formed by the natural numbers including 0 (0, 1, 2, 3, . . .) together with the negatives of the nonzero natural numbers (–1, –2, –3, . . .). Viewed as a subset of the real numbers, they are numbers that can be written without a fractional or decimal component and fall within the set {. . . –2, –1, 0, 1, 2, . . .}. For example, 65, 7, and –756 are integers; 1.6 and 1½ are not integers.

Integers represent a growing and underestimated source of vulnerabilities in C programs, primarily because boundary conditions for integers, unlike other boundary conditions in software engineering, have been intentionally ignored. Most programmers emerging from colleges and universities understand that integers have fixed limits. However, because these limits were deemed sufficient or because testing the results of each arithmetic operation was considered prohibitively expensive, violations of integer boundary conditions have gone unchecked for the most part in commercial software.

When developing secure systems, we cannot assume that a program will operate normally, given a range of expected inputs, because attackers are looking for input values that produce an abnormal effect. Digital integer representations are, of course, imperfect. A software vulnerability may result when a program evaluates an integer to an unexpected value (that is, a value other than the one obtained with pencil and paper) and then uses the value as an array index, size, or loop counter.

Because integer range checking has not been systematically applied in the development of most C software systems, security flaws involving integers will definitely exist, and some of them will likely cause vulnerabilities.

5.2. Integer Data Types

An integer type provides a model of a finite subset of the mathematical set of integers. The value of an object having integer type is the mathematical value attached to the object. The representation of a value for an object having integer type is the particular encoding of the value in the bit pattern contained in the storage allocated for the object.

C provides a variety of standard integer types (with keyword-specified names) and allows implementations to define other extended integer types (with non-keyword reserved identifier names); either can be included in type definitions in standard headers.

The standard integer types include all the well-known integer types that have existed from the early days of Kernighan and Ritchie C (K&R C). These integer types allow a close correspondence with the underlying machine architecture. Extended integer types are defined in the C Standard to specify integer types with fixed constraints.

Each integer-type object in C requires a fixed number of bytes of storage. The constant expression CHAR_BIT from the <limits.h> header gives the number of bits in a byte, which must be at least 8 but might be greater depending on the specific implementation. With the exception of the unsigned char type, not all of the bits are necessarily available to represent the value; unused bits are called padding. Padding is allowed so that implementations can accommodate hardware quirks, such as skipping over a sign bit in the middle of a multiple-word representation.

The number of nonpadding bits used to represent a value of a given type is called the width of that type, which we denote by w(type) or sometimes just N. The precision of an integer type is the number of bits it uses to represent values, excluding any sign and padding bits.

For example, on architectures such as x86-32 where no padding bits are used, the precision of signed types is w(type) – 1, while, for unsigned types, the precision equals w(type).

There are other ways to represent integers, such as arbitrary-precision or bignum arithmetic. Those methods dynamically allocate storage as required to accommodate the widths necessary to correctly represent the values. However, the C Standard does not specify any such scheme, and, unlike C++, built-in operators such as + and / cannot be overloaded and used in expressions containing such abstract data types. Applications such as public-key encryption generally use such a scheme to get around the limitations of C’s fixed sizes.

The standard integer types consist of a set of signed integer types and corresponding unsigned integer types.

Unsigned Integer Types

C requires that unsigned integer types represent values using a pure binary system with no offset. This means that the value of the binary number is . The rightmost bit has the weight 2⁰, the next bit to the left has the weight 2¹, and so forth. The value of the binary number is the sum of all the set bits. This means that all-zero value bits always represent the value 0, and the value 1 is represented by all zeros except for a single 1 bit, which is the least significant bit. Unsigned integer types represent values from 0 through an upper limit of 2^w(type) − 1.

All bitwise operators (|, &, ^, ~) treat the bits as pure binary, as shown in Example 5.1.

Example 5.1. Bitwise Operators: 13 ^ 6 = 11

  1 1 0 1 = 13
^ 0 1 1 0 =  6
--------------
  1 0 1 1 = 11

Unsigned integers are the natural choice for counting things. The standard unsigned integer types (in nondecreasing length order) are

1. unsigned char

2. unsigned short int

3. unsigned int

4. unsigned long int

5. unsigned long long int

The keyword int can be omitted unless it is the only integer-type keyword present.

Nondecreasing length order means that, for example, unsigned char cannot be longer than unsigned long long int (but can be the same size). The many different widths reflect existing hardware; as time progressed, registers also became larger, so longer and longer types were introduced as needed.

Compiler- and platform-specific integral limits are documented in the <limits.h> header file. Familiarize yourself with these limits, but remember that these values are platform specific. For portability, use the named constants and not the actual values in your code. The “Minimum Magnitudes” column in Table 5.1 identifies the guaranteed portable range for each unsigned integer type, that is, the smallest maximum value allowed by an implementation. These magnitudes are replaced by implementation-defined magnitudes with the same sign, such as those shown for the x86-32 architecture.

Table 5.1. Compiler- and Platform-Specific Integral Limits

Because these are unsigned values, the minimum magnitude is always 0, and no constants are defined for it.

Minimum widths for the standard unsigned types are unsigned char (8), unsigned short (16), unsigned int (16), unsigned long (32), and unsigned long long (64).

C added a first-class Boolean type. An object declared type _Bool is large enough to store the values 0 and 1 and acts as unsigned. When any scalar value is converted to _Bool, the result is 0 if the value compares equal to 0; otherwise, the result is 1.

Wraparound

A computation involving unsigned operands can never overflow, because a result that cannot be represented by the resulting unsigned integer type is reduced modulo the number that is 1 greater than the largest value that can be represented by the resulting type. For addition and multiplication, this is the same as pretending that there are additional high-order (most significant) bits appended to make sufficient room for the representation and then discarding these bits.

You can visualize wraparound using the 4-bit unsigned integers wheel shown in Figure 5.1.

Figure 5.1. Four-bit unsigned integer representation

Incrementing a value on the wheel produces the value immediately clockwise from it. Note that incrementing an unsigned integer at its maximum value (15) results in the minimum value for that type (0). This is an example of wraparound, shown in Example 5.2.

Example 5.2. Wraparound

Table of Contents for Chapter 5. Integer Security

Create new playlist

Sign In

Sign Up

Chapter 5. Integer Security

5.1. Introduction to Integer Security

5.2. Integer Data Types

Unsigned Integer Types

Wraparound

Signed Integer Types

Sign and Magnitude

One’s Complement

Two’s Complement

Integer Representations Compared

Signed Integer Ranges

Integer Overflow

Character Types

Data Models

Other Integer Types

size_t

ptrdiff_t

intmax_t and uintmax_t

intptr_t and uintptr_t

Platform-Independent Integer Types for Controlling Width

Platform-Specific Integer Types

5.3. Integer Conversions

Converting Integers

Integer Conversion Rank

Integer Promotions

Usual Arithmetic Conversions

Conversions from Unsigned Integer Types

Unsigned, Loss of Precision

Unsigned to Signed

Conversions from Signed Integer Types

Signed, Loss of Precision

Signed to Unsigned

Conversion Implications

5.4. Integer Operations

Assignment

Addition

Avoiding or Detecting Signed Overflow Resulting from Addition

Postcondition Test Using Status Flags

Precondition Test, Two’s Complement

Precondition Test, General

Downcast from a Larger Type

Avoiding or Detecting Wraparound Resulting from Addition

Postcondition Test Using Status Flags

Precondition Test

Postcondition Test

Subtraction

Postcondition Test Using Status Flags

Avoiding or Detecting Signed Overflow Resulting from Subtraction

Precondition Test

Avoiding or Detecting Wraparound Resulting from Subtraction

Postcondition Test Using Status Flags

Precondition Test

Postcondition Test

Multiplication

Postcondition Test Using Status Flags

Downcast from a Larger Type

Precondition Test, General

Division and Remainder

Error Detection

Precondition

Postcondition

Unary Negation (–)

Shifts

Left Shift

Right Shift

5.5. Integer Vulnerabilities

Vulnerabilities

Wraparound

Conversion and Truncation Errors

Conversion Errors

Truncation Errors

Nonexceptional Integer Logic Errors

5.6. Mitigation Strategies

Integer Type Selection

Abstract Data Types

Arbitrary-Precision Arithmetic

Table of Contents for
Chapter 5. Integer Security