Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 11. Rijndael: A Successor to the Data Encryption Standard

I don't know if we have any real chance. He can multiply and all we can do is add. He represents progress and I just drag my feet.

——Sten Nadolny (translated by Breon Mitchell), God of Impertinence

The american national institute of Standards and Technology (NIST) launched a competition in 1997 under the aegis of an Advanced Encryption Standard (AES) with the goal of creating a new national standard (federal information processing standard, or FIPS) for encryption with a symmetric algorithm. Although we have concentrated our attention in this book on asymmetric cryptography, this development is important enough that we should give it some attention, if only cursorily. Through the new standard FIPS 197 [F197], an encryption algorithm will be established that satisfies all of today's security requirements and that in all of its design and implementation aspects will be freely available without cost throughout the world. Finally, it replaces the dated data encryption standard (DES), which, however, as triple DES remains available for use in government agencies. However, the AES represents the cryptographic basis of the American administration for the protection of sensitive data.

The AES competition received a great deal of attention abroad as well as in the USA, not only because whatever happens in the United States in the area of cryptography produces great effects worldwide, but because international participation was specifically encouraged in the development of the new block encryption procedure.

From an original field of fifteen candidates who entered the contest in 1998, by 1999 ten had been eliminated, a process with involvement of an international group of experts. There then remained in competition the algorithms MARS, of IBM; RC6, of RSA Laboratories; Rijndael, of Joan Daemen and Vincent Rijmen; Serpent, of Ross Anderson, Eli Biham, and Lars Knudson; and Twofish, of Bruce Schneier et al. Finally, in October 2000 the winner of the selection process was announced. The algorithm with the name "Rijndael," by Joan Daemen and Vincent Rijmen, of Belgium, was named as the future advanced encryption standard (cf. [NIST]).^[32] Rijndael is a successor of the block cipher "Square," published earlier by the same authors (cf. [Squa]), which, however, had proved to be not as powerful. Rijndael was especially strengthened to attack the weaknesses of Square. The AES report of NIST gives the following basis for its decision.

Security
All candidates fulfill the requirements of the AES with respect to security against all known attacks. In comparison to the other candidates, the implementations of Serpent and Rijndael can at the least cost be protected against attacks that are based on measurements of the time behavior of the hardware (so-called timing attacks) or changes in electrical current use (so-called power or differential power analysis attacks).^[33] The degradation in performance associated with such protective measures is least for Rijndael, Serpent, and Twofish, with a greater advantage to Rijndael.
Speed
Rijndael is among the candidates that permit the most rapid implementation, and it is distinguished by equally good performance across all platforms considered, such as 32-bit processors, 8-bit microcontrollers, smart cards, and implementations in hardware (see below). Of all the candidates Rijndael allows the most rapid calculation of round keys.
Memory requirement
Rijndael makes use of very limited resources of RAM and ROM memory and is thus an excellent candidate for use in restricted-resource environments. In particular, the algorithm offers the possibility to calculate round keys separately "on the fly" for each round. These properties have great significance for applications on microcontrollers such as used in smart cards. Due to the structure of the algorithm, the requirements on ROM storage are least when only one direction, that is, either encryption or decryption, is realized, and they increase when both functions are needed. Nonetheless, with respect to resource requirements Rijndael is not beaten by any of the other four contestants.
Implementation in hardware
Rijndael and Serpent are the candidates with the best performance in hardware implementations, with a slight advantage going to Rijndael due to its better performance in output and cipher feedback modes.

The report offers further criteria that contributed to the decision in favor of Rijndael, which are collected into a closing summary (see [NIST], Section 7):

There are many unknowns regarding future computing platforms and the wide range of environments in which the AES will be implemented. However, when considered together, Rijndael's combination of security, performance, efficiency, implementability, and flexibility make it an appropriate selection for the AES for use in the technology of today and in the future.

Given the openness of the selection process and the politically interesting fact that with Rijndael an algorithm of European vintage was selected, one might expect future speculation about secret properties, hidden trap doors, and deliberately built-in weaknesses to be silenced, which never quite succeeded with DES.

Before we get involved with the functionality of Rijndael, we would like as preparation to go on a brief excursion into the arithmetic of polynomials over finite fields, which leans heavily on the presentation in [DaRi], Section 2.

Arithmetic with Polynomials

We start by looking at arithmetic in the field • _2ⁿ, the finite field with 2ⁿ elements, where an element of • _2ⁿ is represented as a polynomial f(x) = a_{n− 1}xⁿ⁻¹ + a_n−2xⁿ⁻² + ... + a₁x+ a₀ with coefficients a_i in • ₂ (which is isomorphic to •₂). Equivalently, an element of •_2ⁿ can be represented simply as an n-tuple of polynomial coefficients, each representation offering its own advantages. The polynomial representation is well suited for manual calculation, while the representation as a tuple of coefficients corresponds well to a computer's binary representation of numbers. To demonstrate this, we notate •_2³ as a sequence of eight polynomials and again as eight 3-tuples with their associated numerical values (see Table 11-1).

Addition of polynomials proceeds by adding the coefficients in •₂: If f(x) := x² + x and g(x) := x² + x + 1, then f(x) + g(x) = 2x² + 2x + 1 = 1, since 1+1 = 0 in •₂. We can carry out addition of 3-tuples in •_2³ column by column. We see, then, for example, that the sum of (1 1 0) and (1 1 1) is (0 0 1):

Equation 11.1.

Table 11-1. Elements of •₂₃

`Polynomials in` •_2³	`3-Tuples in` •_2³	`Numerical Value`
0	0	0	0	'00'
1	0	0	1	'01'
x	0	1	0	'02'
x + 1	0	1	1	'03'
x²	1	0	0	'04'
x² + 1	1	0	1	'05'
x² + x	1	1	0	'06'
x² + x + 1	1	1	1	'07'

The addition of digits takes place in •₂ and is not to be confused with binary addition, which can involve a carry. This process is reminiscent of our XOR function in Section 7.2, which executes the same operation in •_n for large n.

Multiplication in •_2³ is accomplished by multiplying each term of the first polynomial by each term of the second and then summing the partial products. The sum is then reduced by an irreducible polynomial of degree 3 (in our example modulo m(x) := x³ + x + 1):^[34]

Equation 11.2.

This corresponds to the product of 3-tuples (1 1 0) • (1 1 1) = (1 0 0), or, expressed numerically, '06' • '07' = '04'.

The abelian group laws hold in •_2³ with respect to addition and in •_2³ {0} with respect to multiplication (cf. Chapter 5). The distributive law holds as well.

The structure and arithmetic of •_2³ can be carried over directly to the field •_2⁸, which is the field that is actually of interest in studying Rijndael. Addition and multiplication are carried out as in our above example, the only differences being that •_2⁸ has 256 elements and that an irreducible polynomial of degree 8 will be used for reduction. For Rijndael this polynomial is m(x) := x⁸ + x⁴ + x³ + x + 1, which in tuple representation is (1 0 0 0 1 1 0 1 1), corresponding to the hexadecimal number '011B'.

Multiplication of a polynomial

Equation 11.3.

by x (corresponding to a multiplication • '02') is particularly simple:

Equation 11.4.

where the reduction modulo m(x) is required only in the case a₇ ≠ 0, and then it can be carried out by subtracting m(x), that is, by a simple XOR of the coefficients.

For programming one therefore regards the coefficients of a polynomial as binary digits of integers and executes a multiplication by x by a left shift of one bit, followed by, if a₇ = 1, a reduction by an XOR operation with the eight least-significant digits '1B' of the number '011B' corresponding to m(x) (whereby a₇ is simply "forgotten"). The operation a • '02' for a polynomial f, or its numerical value a, is denoted by Daemen and Rijmen by b = xtime(a). Multiplication by powers of x can be executed by successive applications of xtime().

For example, multiplication of f(x) by x + 1 (or '03') is carried out by shifting the binary digits of the numerical value a of f one place to the left and XOR-ing the result with a. Reduction modulo m(x) proceeds exactly as with xtime. Two lines of C code demonstrate the procedure:

f ^= f << 1;    /* multiplication of f by (x + 1) */
if (f & 0x100) f ^= 0x11B;    /* reduction modulo m(x) */

Multiplication of two polynomials f and h in •_2⁸ {0} can be speeded up by using logarithms: Let g(x) be a generating polynomial^[35] of •_2⁸ {0}. Then there exist m and n such that f ≡ g^m and h ≡ gⁿ. Thus f • h ≡ g^m+n mod m(x).

From a programming point of view this can be transposed with the help of two tables, into one of which we place the 255 powers of the generator polynomial g(x) := x + 1 and into the other the logarithms to the base g(x) (see Tables 11-2 and 11-3). The product f • h is now determined by three accesses to these tables: From the logarithm table are taken values m and n for which g^m = f and gⁿ = h. From the table of powers the value g^{((n+m)mod255)} is taken (note that g^ord(g) = 1) Table 11-2 contains the powers of g twice in succession, and so one can avoid having to reduce the exponent of g in f • h = g^n+m.

With the help of this mechanism we can also carry out polynomial division in •_2⁸. Thus for f, g € •_2⁸ {0},

Equation 11.5.

This procedure for polynomial multiplication in •_2⁸ is illustrated in the function polymul():

Table 11-2. Powers of g(x) = x + 1, ascending left to right

01	03	05	0F	11	33	55	FF	1A	2E	72	96	A1	F8	13	35
5F	E1	38	48	D8	73	95	A4	F7	02	06	0A	1E	22	66	AA
E5	34	5C	E4	37	59	EB	26	6A	BE	D9	70	90	AB	E6	31
53	F5	04	0C	14	3C	44	CC	4F	D1	68	B8	D3	6E	B2	CD
4C	D4	67	A9	E0	3B	4D	D7	62	A6	F1	08	18	28	78	88
83	9E	B9	D0	6B	BD	DC	7F	81	98	B3	CE	49	DB	76	9A
B5	C4	57	F9	10	30	50	F0	0B	1D	27	69	BB	D6	61	A3
FE	19	2B	7D	87	92	AD	EC	2F	71	93	AE	E9	20	60	A0
FB	16	3A	4E	D2	6D	B7	C2	5D	E7	32	56	FA	15	3F	41
C3	5E	E2	3D	47	C9	40	C0	5B	ED	2C	74	9C	BF	DA	75
9F	BA	D5	64	AC	EF	2A	7E	82	9D	BC	DF	7A	8E	89	80
9B	B6	C1	58	E8	23	65	AF	EA	25	6F	B1	C8	43	C5	54
FC	1F	21	63	A5	F4	07	09	1B	2D	77	99	B0	CB	46	CA
45	CF	4A	DE	79	8B	86	91	A8	E3	3E	42	C6	51	F3	0E
12	36	5A	EE	29	7B	8D	8C	8F	8A	85	94	A7	F2	0D	17
39	4B	DD	7C	84	97	A2	FD	1C	24	6C	B4	C7	52	F6	01
03	05	0F	11	33	55	FF	1A	2E	72	...	...	...	F6

Function:	multiplication of polynomials in •_2⁸
Syntax:	`UCHAR polymul (unsigned int f, unsigned int h);`
Input:	`unsigned int f` (summand), `unsigned int h` (summand)
Return:	the product `f` • `h`

UCHAR
polymul (unsigned int f, unsigned int h)
{
  if ((f != 0) && (h != 0))
    {

return (AntiLogTable[LogTable[f] + LogTable[h]]);
  }
else
  {
    return 0;
  }
}

Table 11-3. Logarithms to base g(x) = x + 1 (e.g., log_g(x) 2 = 25 = 19 in hexadecimal, log_g(x) 255 = 7).

We now ratchet the complexity level up one notch and consider arithmetic with polynomials of the form f(x) = f₃x³ + f₂x² + f₁x + f₀ with coefficients f_i in •_2⁸, that is, coefficients that are themselves polynomials. The coefficients of such polynomials can be represented as fields of four bytes each. Now things begin to get interesting: While addition of such polynomials f(x) and g(x) again takes place by means of a bitwise XOR of the coefficients, the product h(x) = f(x)g(x) is calculated to be

Equation 11.6.

with coefficients

After reduction of h(x) by a polynomial of degree 4, one again obtains a polynomial of degree 3 over •_2⁸.

For this Rijndael uses the polynomial m(x) := x⁴ + 1. Usefully, x^j mod M(x) = x^jmod4, so that h(x) mod m(x) can be easily computed as

Equation 11.7.

with

Equation 11.8.

Logarithms to base g(x) = x + 1 (e.g., logg(x) 2 = 25 = 19 in hexadecimal, logg(x) 255 = 7).

From this one concludes that the coefficients d_i can be computed by matrix multiplication over •_2⁸:

Equation 11.9.

It is precisely this operation with the constant, invertible modulo M(x), polynomial a(x) := a₃x³ + a₂x² + a₁x + a₀ over •_2⁸, with coefficients a₀(x) = x, a₁(x) = 1, a₂(x) = 1, and a₃(x) = x + 1, that is executed in the so-called MixColumns transformation, which constitutes a principal component of the round transformations of Rijndael.

The Rijndael Algorithm

Rijndael is a symmetric block encryption algorithm with variable block and key lengths. It can process blocks of 128, 192, and 256 bits and keys of the same lengths, where all combinations of block and key lengths are possible. The accepted key lengths correspond to the guidelines for AES, though the "official" block length is only 128 bits. Each block of plain text is encrypted several times with a repeating sequence of various functions, in so-called rounds. The number of rounds is dependent on the block and key lengths (see Table 11-4).

Rijndael is not a Feistel algorithm, whose essential characteristic is that blocks are divided into left and right halves, the round transformations applied to one half, and the result XOR-ed with the other half, after which the two halves are exchanged. DES is the best-known block algorithm built along these lines. Rijndael, on the other hand, is built up of separate layers, which successively apply various effects to an entire block. For the encryption of a block the following transformations are sequentially applied:

The first round key is XOR-ed with the block.
L_r - 1 regular rounds are executed.
A terminal round is executed, in which the MixColumns transformation of the regular rounds is omitted.

Table 11-4. Number of Rijndael rounds as a function of block and key length

Each regular round of step 2 consists of four individual steps, which we shall now examine:

Substitution:Each byte of a block is replaced by application of an S-box.
Permutation:The bytes of the block are permuted in a ShiftRows transformation.
Diffusion:The MixColumns transformation is executed.
Round key addition:The current round key is XOR-ed with the block.

The layering of transformations within a round is shown schematically in Figure 11-1.

Each layer exercises a particular effect within a round and thus on each block of plain text:

Influence of the key
XOR-ing with the round key before the first round and as the last step within each round has an effect on every bit of the round result. In the course of encryption of a block there is no step whose result is not dependent in every bit on the key.
Nonlinear layer
The substitution effected via the S-box is a nonlinear operation. The construction of the S-box provides almost ideal protection against differential and linear cryptanalysis (see [BiSh] and [NIST]).
Linear layer
The ShiftRows and MixColumns transformations ensure an optimal mixing up of the bits of a block.

In the following description of the internal Rijndael functions L_b will denote the block length in 4-byte words, L_k the length of the user key in 4-byte words (that is, L_b, L_k € {4, 6, 8})), and L_r the number of rounds as indicated in Table 11-4.

Plain text and encrypted text are input, respectively output, as fields of bytes. A block of plain text, passed as a field m₀,...,m₄L_b − 1, will be regarded in the following as a two-dimensional structure

Table 11-5. Representation of message blocks

b_0,0	b_0,1	b_0,2	b_0,3	b_0,4	...	b_{0,L_b−1}
b_1,0	b_1,1	b_1,2	b_1,3	b_1,4	...	b_{1,L_b−1}
b_2,0	b_2,1	b_2,2	b_2,3	b_2,4	...	b_{2,L_b−1}
b_3,0	b_3,1	b_3,2	b_3,3	b_3,4	...	b_{3,L_b−1}

Figure 11-1. Layering of transformations in the Rijndael rounds

where the bytes of plain text are sorted according to the following ordering:

Equation 11.10.

with i = n mod 4 and j =

Access to

Calculating the Round Key

Encryption and decryption each require the generation of L_r round keys, called collectively the key schedule. This occurs through expansion of the secret user key by attaching recursively derived 4-byte words k_i = (k_0,i, k_1,i, k_2,i, k_3,i)) to the user key.

The first L_k words k₀, ..., k_{L_k−1} of the key schedule are formed from the secret user key itself. For L_k € {4,6} the next 4-byte word k_i is determined by XOR-ing the preceding word k_i−1 with k_{i-L_k}. If i ≡ 0 mod L_k, then a function F_{L_k} (k,i) is applied before the XOR operation, which is composed of a cyclic left shift (left rotation) r(k) of k bytes, a substitution S(r(k)) from the Rijndael S-box (we shall return to this later), and an XOR with a constant c (

The constants c(j) are defined by c(j) := (rc(j)0, 0, 0), where rc(j) are recursively determined elements from •_2⁸: rc(1) := 1, rc(j) := rc(j − 1) • x = x^j−1 . Expressed in numerical values, this is equivalent to rc(1) := '01', rc(j) := rc(j − 1) • '02'. From the standpoint of programming, rc(j) is computed by a (j − 1)-fold execution of the function xtime described above, beginning with the argument 1, or more rapidly by access to a table (Tables 11-6 and 11-7).

Table 11-6. rc(j) constants (hexadecimal)

'01'	'02'	'04'	'08'	'10'	'20'	'40'	'80'	'1B'	'36'
'6C	'D8'	'AB'	'4D'	'9A	'2F'	'5E'	'BC	'63'	'C6'
'97'	'35'	'6A'	'D4'	'B3'	'7D'	'FA	'EF'	'C5'	'91'

For keys of length 256 bits (that is, L_k = 8) an additional S-box operation is inserted: If i ≡ 4 mod L_k, then before the XOR operation k_i−1 is replaced by S (k_i−1).

Table 11-7. rc(j) constants (binary)

00000001	00000010	00000100	00001000	00010000
00100000	01000000	10000000	00011011	00110110
01101100	11011000	10101011	01001101	10011010
00101111	01011110	10111100	01100011	11000110
10010111	00110101	01101010	11010100	10110011
01111101	11111010	11101111	11000101	10010001

Thus a key schedule is built up of L_b • (L_r + 1) 4-byte words, including the secret user key. At each round i = 0, ..., L_r - 1 the next L_b 4-byte words k_{L_b•i} through kL_b•(i+1) are taken as round keys from the key schedule. The round keys are conceptualized, in analogy to the structuring of the message blocks, as a two-dimensional structure of the form depicted in Table 11-8.

Table 11-8. Representation of the round keys

k_0,0	k_0,1	k_0,2	k_0,3	k_0,4	...	k_{0,L_b−1}
k_1,0	k_1,1	k_1,2	k_1,3	k_1,4	...	k_{1,L_b−1}
k_2,0	k_2,1	k_2,2	k_2,3	k_2,4	...	k_{2,L_b−1}
k_3,0	k_3,1	k_3,2	k_3,3	k_3,4	...	k_{3,L_b−1}

For key lengths of 128 bits key generation can be understood from an examination of Figure 11-2.

Figure 11-2. Diagram for round keys for L_k = 4

There are no weak keys known, those whose use would weaken the procedure.

The S-Box

The substitution box, or S-box, of the Rijndael algorithm specifies how in each round each byte of a block is to be replaced by another value.

The S-box has the task of minimizing the susceptibility of the algorithm to methods of linear and differential cryptanalysis and to algebraic attacks. To accomplish this, the S-box operation should possess a high algebraic complexity in •_2⁸ and thus create a good extension to the ShiftRows and MixColumns operations. Not having such a function would support attacks within •_2⁸ and thereby decisively weaken the procedure.

In addition to the requirement of complexity the S-box function must of course be invertible; it must have no fixed points S(a) = a or complementary fixed points S(a) = ā; and it must also execute rapidly and be easy to implement.

All these desiderata were achieved through a combination of multiplicative inversion in •_2⁸ and the previously mentioned affine mapping from •_2⁸ to itself. The S-box consists of a list of 256 bytes, which are constructed by first thinking of each nonzero byte as a representative of •_2⁸ and replacing it with its multiplicative inverse (zero remains unchanged). Then an affine transformation over •₂ is calculated as a matrix multiplication and addition of (1 1 0 0 0 1 1 0):

Equation 11.11.

In this representation x₀ and y₀ denote the least-significant, and x₇ and y₇ the most-significant, bits of a byte, where the 8-tuple (1 1 0 0 0 1 1 0) corresponds to the hexadecimal value '63'.

Through this construction, all of the requisite design criteria were satisfied. The substitution is thereby an ideal strengthening of the algorithm. Successive application of the construction plan to the values 0 to 255 leads to Table 11-9 (in hexadecimal form; read horizontally from left to right).

For decryption the S-box must be used backwards: The affine inverse transformation is used, followed by multiplicative inversion in •_2⁸. The inverted S-box appears in Table 11-10.

The ShiftRows Transformation

The next step in the cycle of a round consists in the permutation of a block at the byte level. To this end the bytes are exchanged within the individual lines (b_i,0, b_i,1, b_i,2, . . ., b_{i, L_b − 1}) of a block according to the schemata depicted in Tables 11-11 through 11-13.

Table 11-9. The values of the S-box

In each first row (row index i = 0) no exchange takes place. In lines i = 1, 2, 3 the bytes are rotated left by c_{L_b,i} positions, from position j to position j - c_{L_b,i} mod L_b, where c_{L_b,i} is taken from Table 11-14.

For inverting this step, positions j in rows i = 1, 2, 3 are shifted to positions j + c_{L_b,i} mod L_b.

The MixColumns Transformation

After the rowwise permutation in the last step, in this step each column (b_i,j), i = 0 ,..., 3, j = 0 ,..., L_b of a block is taken to be a polynomial over •_2⁸ and multiplied by the constant polynomial a(x) := a₃x³ + a₂x² + a₁x + a₀, with coefficients a₀(x) = x, a₁(x) = 1, a₂(x) = 1, a₃(x) = x + 1, and reduced modulo M(x) := x⁴ + 1. Each byte of a column thus interacts with every other byte of the column. The rowwise operating ShiftRows transformation has the effect that in each round, other bytes are mixed with one another, resulting in strong diffusion.

Table 11-10. The values of the inverted S-box

Table 11-11. ShiftRows for blocks oflength 128 bits (L_b = 4)

`Before ShiftRows`	`After ShiftRows`
0	4	8	12	0	4	8	12
1	5	9	13	5	9	13	1
2	6	10	14	10	14	2	6
3	7	11	15	15	3	7	11

We have already seen (see page 244) how this step can be reduced to a matrix multiplication

Equation 11.12.

ShiftRows for blocks oflength 128 bits (Lb = 4)

with multiplication and addition carried out over •_2⁸. For multiplication by '02' (respectively x) the function xtime() has already been defined; multiplication by '03' (respectively x +1) has already been handled similarly (cf. page 247).

For inverting the MixColumns transformation every column (b_i,j) of a block is multiplied by the polynomial r (x) := r₃x³ + r₂x² + r₁x + r₀ with coefficients

Table 11-12. ShiftRows for blocks oflength 192 bits (L_b = 6)

Table 11-13. ShiftRows for blocks of length 256 bits (L_b =8)

Table 11-14. Distances of line rotations in ShiftRows

L_b	c_{L_b, 1}	c_{L_b, 2}	c_{L_b, 3}
4	1	2	3
6	1	2	3
8	1	3	4

r₀(x) = x³+x²+x, r₁(x) = x³+1, r₂(x) = x³+x²+1, and r₃(x) = x³+x+1 and reduced modulo M(x) := x⁴+1. The corresponding matrix is

Equation 11.13.

Distances of line rotations in ShiftRows

The AddRoundKey Step

The last step of a round carries out an XOR of the round key with the block:

Equation 11.14.

for j = 0,..., L_b − 1. In this way, every bit of the result of a round is made dependent on every key bit.

Encryption as a Complete Process

Encryption with Rijndael is encapsulated in the following pseudocode according to [DaRi], Section 4.2–4.4. The arguments are passed as pointers to fields of bytes or 4-byte words. The interpretation of the fields, variables, and functions employed is provided in Tables 11-15 through 11-17.

Table 11-15. Interpretation of variables

`Variable`	`Interpretation`
`Nk`	length L_k of the secret user key in 4-byte words
`Nb`	block length L_b in 4-byte words
`Nr`	round number L_r according to the table above

Table 11-16. Interpretation of fields

`Variables`	`Size in bytes`	`Interpretation`
`CipherKey`	*`4Nk`**	secret user key
`ExpandedKey`	*`4Nb * (Nr+1)`**	field of 4-byte words to hold the round key
`Rcon`	`⌈4Nb (Nr+1)/Nk⌉`	field of 4-byte words as constant c(j) := (rc(j),0,0,0)^[a]
`State`	*`4Nb`**	field for input and output of plain text and encrypted blocks
`RoundKey`	*`4Nb`**	round key, segment of `ExpandedKey`
^[a]It suffices to store the constants rc(j) in a field of size `⌈Nb * (Nr+1)/Nk⌉ ⩽` 30 bytes. If the field begins with 0, this byte is unoccupied, since the index j begins with 1. It then is then 31 bytes long.

Table 11-17. Interpretation of functions

`Function`	`Interpretation`
`KeyExpansion`	generation of round key
`RotBytes`	left rotation of a 4-byte word by 1 byte: `(abcd) → (bcda)`
`SubBytes`	S-box substitution `S` of all bytes of the passed field
`Round`	regular round
`FinalRound`	last round without `MixColumns`
`ShiftRows`	`ShiftRows` transformation
`MixColumns`	`MixColumns` transformation
`AddRoundKey`	addition of a round key

Key generation:

KeyExpansion (byte CipherKey, word ExpandedKey)
{
  for (i = 0; i < Nk; i++)
    ExpandedKey[i] = (CipherKey[4*i], CipherKey[4*i + 1],
      CipherKey[4*i + 2], CipherKey[4*i + 3]);
  for (i = Nk; i < Nb * (Nr + 1); i++)
  {
    temp = ExpandedKey[i - 1];
    if (i % Nk == 0)
      temp = SubBytes (RotBytes (temp)) ^ Rcon[i/Nk];
    else if ((Nk == 8) && (i % Nk == 4))
       temp = SubBytes (temp);
    ExpandedKey[i] = ExpandedKey[i - Nk] ^ temp;
  }
}

Round functions:

Round (word State, word RoundKey)
{
  SubBytes (State);
  ShiftRows (State);
  MixColumns (State);
  AddRoundKey (State, RoundKey)
}
FinalRound (word State, word RoundKey)
{
  SubBytes (State);
  ShiftRows (State);
  AddRoundKey (State, RoundKey)
}

Entire operation for encrypting a block:

Rijndael (byte State, byte CipherKey)
{
  KeyExpansion (CipherKey, ExpandedKey);
  AddRoundKey (State, ExpandedKey);
  for (i = 1; i < Nr; i++)
    Round (State, ExpandedKey + Nb*i);
  FinalRound (State, ExpandedKey + Nb*Nr);
}

There exists the possibility of preparing the round key outside of the function Rijndael and to pass the key schedule ExpandedKey instead of the user key CipherKey. This is advantageous when it is necessary in the encryption of texts that are longer than a block to make several calls to Rijndael with the same user key.

Rijndael (byte State, byte ExpandedKey)
{
  AddRoundKey (State, ExpandedKey);
  for (i = 1; i < Nr; i++)
    Round (State, ExpandedKey + Nb*i);
  FinalRound (State, ExpandedKey + Nb*Nr);
}

Especially for 32-bit processors it is advantageous to precompute the round transformation and to store the results in tables. By replacing the permutation and matrix operations by accesses to tables, a great deal of CPU time is saved, yielding improved results for encryption, and, as we shall see, for decryption as well. With the help of four tables each of 256 4-byte words of the form

Equation 11.15.

(for w = 0, ..., 255, S(w) denotes, as above, the S-box replacement), the transformation of a block b = (b_0,j,b_1,j,b_2,j,b_3,j), j = 0,...,L_b − 1, can be determined quickly for each round by the substitution

Equation 11.16.

with d(i,j) := j + c_{L_b, i} mod L_b (cf. ShiftRows, Table 11-14) and k_j = (k_0,j, k_1,j, k_2,j, k_3,j) as the jth column of the round key.

For the derivation of this result, see [DaRi], Section 5.2.1. In the last round the MixColumns transformation is omitted, and thus the result is determined by

Equation 11.17.

Clearly, it is also possible to use a table of 256 4-byte words, in which

Equation 11.18.

with a right rotation r(a, b, c, d) = (d, a, b, c) by one byte. For environments with limited memory this can be a useful compromise, the price being only a slightly increased calculation time for the three rotations.

Decryption

For Rijndael decryption one runs the encryption process in reverse order with the inverse transformations. We have already considered the inverses of the transformations SubBytes, ShiftRows, and MixColumns, which in the following are represented in pseudocode by the functions InvSubBytes, InvShiftRows, and InvMixColumns. The inverted S-box, the distances for inversion, the ShiftRows transformation, and the inverted matrix for the inversion of the MixColumns transformation are given on pages 251-252. The inverse round functions are the following:

InvFinalRound (word State, word RoundKey)
{
  AddRoundKey (State, RoundKey);
  InvShiftRows (State);
  InvSubBytes (State);
}

InvRound (word State, word RoundKey)
{
  AddRoundKey (State, RoundKey);
  InvMixColumns (State);
  InvShiftRows (State);
  InvSubBytes (State);
}

The entire operation for decryption of a block is as follows:

InvRijndael (byte State, byte CipherKey)
{
  KeyExpansion (CipherKey, ExpandedKey);
  InvFinalRound (State, ExpandedKey + Nb*Nr);
  for (i = Nr - 1; i > 0; i--)
    InvRound (State, ExpandedKey + Nb*i);
  AddRoundKey (State, ExpandedKey);
}

The algebraic structure of Rijndael makes it possible to arrange the transformations for encryption in such a way that here, too, tables can be employed. Here one must note that the substitution S and the InvShiftRows transformation commute, so that within a round their order can be switched. Because of the homomorphism property f(x + y) = f(x) + f(y) of linear transformations the InvMixColumns transformation and addition of the round key can be exchanged when InvMixColumns was used previously on the round key. Within a round the following course is taken:

InvFinalRound (word State, word RoundKey)
{
  AddRoundKey (State, RoundKey);
  InvSubBytes (State);
  InvShiftRows (State);
}

InvRound (word State, word RoundKey)
{
  InvMixColumns (State);
  AddRoundKey (State, InvMixColumns (RoundKey));
  InvSubBytes (State);
  InvShiftRows (State);
}

Without changing the sequence of transformations over both functions ordered one after the other, they can be redefined as follows:

AddRoundKey (State, RoundKey);

InvRound (word State, word RoundKey)
{
  InvSubBytes (State);
  InvShiftRows (State);
  InvMixColumns (State);
  AddRoundKey (State, InvMixColumns (RoundKey));
}

InvFinalRound (word State, word RoundKey)
{
  InvSubBytes (State);
  InvShiftRows (State);
  AddRoundKey (State, RoundKey);
}

With this is created the analogous structure to that for encryption. For reasons of efficiency the application of InvMixColumns to the round key in InvRound() is postponed until the key expansion, where the first and last round keys of InvMixColumns are left untouched. The "inverse" round keys are generated with

InvKeyExpansion (byte CipherKey, word InvEpandedKey)
{
  KeyExpansion (CipherKey, InvExpandedKey);
  for (i = 1; i < Nr; i++)
    InvMixColumns (InvExpandedKey + Nb*i);
}

The entire decryption operation of a block is now as follows:

InvRijndael (byte State, byte CipherKey)
{
  InvKeyExpansion (CipherKey, InvExpandedKey);
  AddRoundKey (State, InvExpandedKey + Nb*Nr);
  for (i = Nr - 1; i > 0; i--)
    InvRound (State, InvExpandedKey + Nb*i);
  InvFinalRound (State, InvExpandedKey);
}

In analogy to encryption, tables can be precomputed for this form of decryption. With

Equation 11.19.

(for w = 0, ...,255, S⁻¹(w) denotes the inverse S-box replacement) the result of an inverse round operation on a block b = (b_0,j,b_1,j,b_2,j,b_3,j), j = 0, ..., L_b − 1, can be determined by

Equation 11.20.

for j = 0, ..., L_b − 1 with d⁻¹ (i, j) := j − ^cL_b,i mod L_b (cf. page 250) and the jth column

Again in the last round the MixColumns transformation is omitted, and thus the result of the last round is given by

Equation 11.21.

To save memory one can also make do in decryption with a table of only 256 4-byte words, in which

Equation 11.22.

with a right rotation r(a, b, c, d) = (d, a, b, c) of one byte.

Performance

Implementations for various platforms have verified the superior performance of Rijndael. The bandwidth suffices for realizations for small 8-bit controllers with small amounts of memory and key generation on the fly up through current 32-bit processors. For purposes of comparison, Table 11-18 provides encryption rates for the candidates RC6, Rijndael, and Twofish, as well as for the older 8051 controller and the Advanced Risc Machine (ARM) as a modern 32-bit chip card controller.

Table 11-18. Comparative Rijndael performance in bytes per second, after [Koeu]

	`8051 (3.57 MHz)`	`ARM (28.56 MHz)`
RC6	165	151260
Rijndael	3005	311492
Twofish	525	56 289

Because of the more complex InvMixColumns operation, the times for decryption and encryption can diverge, depending on the implementation, though this effect can be completely compensated by using the tables described previously. Of course, the times depend on, in addition to the key length, the block length and the number of rounds (see Table 11-4). For comparison, on a Pentium III/200 MHz, throughput of about 8 MByte per second for a key of length 128 bits, about 7 Mbyte per second for 192-bit keys, and about 6 MByte per second for 256-bit keys for blocks of length 128 bits is achievable in both directions. On the same platform, the DES in C can encrypt and decrypt about 3.8 MByte per second (see [Gldm], http://fp.gladman.plus.com).

Modes of Operation

The classical operating modes Electronic Code Book (ECB), Cipher Block Chaining (CBC), Cipher Feedback (CFB), and Output Feedback (OFB) for block ciphers were updated by NIST for use with AES and provided with appropriate test vectors (see [FI81, N38A]). Consideration of additional operating modes, which had begun already in the framework of standardization of AES and which relates to the use of modes of operation in Internet communication, has resulted in the following operating modes:

Counter Mode (CTR): A block keystream is generated and joined to the plain text blocks using XOR.
CCM Mode: To ensure the reliability and integrity of a message, the counter mode is combined with a message authentication code (MAC) based on cipher block chaining (see [N38C]).
RMAC: Using a randomized message authentication code, which is still in development, the validity of a message can be checked with respect to both its content and its source (see [N38B]).

For further details, investigations into security and cryptanalysis, computational times, and current information on AES and Rijndael the reader is referred to the literature cited above as well as the Internet sites of NIST and Vincent Rijmen, which in turn contain many links to further sources of information:

http://csrc.nist.gov/CryptoToolkit/tkencryption.html

http://csrc.nist.gov/CryptoToolkit/modes

http://www.esat.kuleuven.ac.be/~rijmen/rijndael

In the downloadable source code to this book there is an implementation of AES in the file aes.c, which can be used to deepen an understanding of the procedure and to do some experimentation.

^[32]The name "Rijndael" is a portmanteau word derived from the names of the authors. Sources tell me that the correct pronunciation is somewhere between "rain doll" and "Rhine dahl." Per haps NIST should include in the standard a pronunciation key in the international phonetic alphabet.

^[33]Power analysis attacks (simple PA/differential PA) are based on correlations between individual bits or groups of bits of a secret cryptographic key and the average consumption of electricity for the execution of individual instructions or code sequences depending on the key (see, for example, [KoJJ], [CJRR], [GoPa]).

^[34]A polynomial is said to be irreducible if it divisible (without remainder) only by itself and 1.

^[35]g generates •_2⁸ {0} if g has order 255. That is, the powers of g run through all the elements of •_2⁸ {0}.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 11. Rijndael: A Successor to the Data Encryption Standard

Create new playlist

Sign In

Sign Up