Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

CHAPTER 2 GENERATING FUNCTIONS

Generating functions are algebraic objects that provide a powerful tool for analyzing recurrence relations. In this chapter, we will cover the basic theory of generating functions and examine many specific examples.

2.1 Rational generating functions

Given any sequence a₀, a₁, a₂,…,the ordinary generating function is

(2.1) equation

The generating function f(x) contains all of the information about the sequence {a_n}, and, being an algebraic entity, it is often easier to manipulate than the sequence itself. The term a_n is recovered by finding the coefficient of xⁿ in f(x).

EXAMPLE 2.1 Generating function for the Fibonacci sequence

Find the ordinary generating function for the Fibonacci sequence {F₀, F₁, F₂,…}.

Solution: Let . We obtain

equation

Through mass cancellation, the recurrence relation for the Fibonacci numbers yields

and

(2.2)

The function f(x) contains complete information about the Fibonacci numbers and can be used to evaluate related infinite sums such as . The computation of this sum is called for in the exercises.

For what values of x is the generating function valid?

Similarly, you can show that the generating function for the sequence of Lucas numbers {L_n} is

Notice that the generating function for the Fibonacci sequence is a rational function. Recall that a sequence {a_n} satisfies a linear recurrence relation with constant coefficients c₁,…,c_k if

equation

for all n ≥ k. A sequence satisfies a linear recurrence relation with constant coefficients if and only if it has a rational ordinary generating function.

Also, notice that the denominator of the generating function for the Fibonacci numbers, 1 – x – x², takes its form from the Fibonacci recurrence, while the numerator comes from multiplying the generating function by the denominator and keeping only those terms of degree less than 2.

Linear recurrence relation theorem. Let {a_n} be a sequence and c₁,…,c_k be arbitrary numbers. Then the following three assertions are equivalent.

(1) The sequence {a_n} satisfies a linear recurrence relation with constant coefficients c₁,…,c_k, i.e.,

equation

(2) The sequence {a_n} has a rational ordinary generating function of the form

equation

where g is a polynomial of degree at most k – 1.

(3) If

equation

with then r_i distinct, then the terms of {a_n} are given by the formula

where α₁,…,α_k are constants.

More generally, if

equation

where the roots r₁,…,r₁ occur with multiplicities m₁,…,m₁ then

where the p_i are polynomials with deg p_i < m_i for 1 ≤ i ≤ l.

Proof. (1) ⇒ (2) Assume that (a_n) satisfies a recurrence relation of the type specified in (1). Let f(x) be the ordinary generating function for (a_n). Then

equation

Hence

equation

so that

equation

where deg g ≤ k – 1.

(2) ⇒ (3) First consider the case where

equation

and then r_i are distinct. Expanding by partial fractions, we obtain

equation

where α₁,…,α_k are constants. Since this is just another formula for the ordinary generating function for {a_n}, we have

More generally, assume that 1 – has repeated roots. Suppose that r is a root with multiplicity m. Then, in the partial fraction decomposition of

equation

we have terms

where β₁, β₂,…,β_m are constants. By the formula

equation

the contribution to xⁱ in the power series for these fractions is p(n)xⁿ, where p is a polynomial of degree less than m. The rest of the proof that (2) ⇒ (3) follows as in the special case.

Each step in the proof is reversible.

The factorization of 1 – called for in the proof (and in practice) can be accomplished using the change of variables y = 1/x. Then

equation

The problem is reduced to factoring the polynomial

equation

This polynomial is the characteristic polynomial of the recurrence relation.

EXAMPLE 2.2

Find the generating function for the sequence defined by the recurrence relation a_n = 6a_n–1 – 9a_n–2 for n ≥ 2, and a₀ = 1, a₁ = 1. (This comes from Example 1.31.) Use the generating function to find a direct formula for a_n.

Solution: The form of the recurrence relation tells us that the denominator of the generating function is 1 – 6x + 9x². To get the numerator, we calculate

The only terms of degree less than 2 are 1 – 5x, so the numerator is 1 – 5x. Hence, the generating function is

To find a direct formula for a_n, we write the generating function as

Thus, we have a binomial series with a negative exponent. Its expansion is

equation

Therefore

equation

This is the same solution we saw before.

EXAMPLE 2.3 Change for a dollar

How many ways can you make change for $1.00 using units of 0.01, 0.05, 0.10, 0.25, 0.50, and 1.00? Here are some examples:

equation

Solution: For n ≥ 0, let a_n be the number of ways to make change for an amount n. We set a₀ = 1. The generating function for {a_n} is

This generating function has the rational form

(2.3)

Using a computer algebra system, we find that the coefficient of x¹⁰⁰ of the generating function is 293, i.e., there are 293 ways to make change for a dollar.

The factors in the denominator of the generating function give rise to geometric series. For example, the second factor gives

equation

Each term in the product corresponds to a way to make change for a dollar. For example, the term corresponding to the sum 5 + 10 + 10 + 25 + 25 + 25 is shown in boldface:

equation

Since the denominator of the generating function is a polynomial of degree 191, the sequence {a_n} satisfies a linear recurrence relation of order 191. The explicit formula for a_n involves a sum of exponential functions which are powers of 100th roots of unity.

EXAMPLE 2.4 Alcuin’s sequence

How many incongruent triangles have integer side lengths and perimeter 10¹⁰⁰?

Solution: Let t(n) be the number of triangles with integer side lengths and perimeter n.

We set t(0) = 0. It is easy to find the values in the following table:

The sequence {t(n)} is known as Alcuin’s sequence, named after Alcuin of York (735–804), who wrote a problem solving book containing some allocation problems equivalent to finding integer triangles.

The generating function for {t(n)} is rational:

(2.4) equation

We prove this by showing that we can obtain any integer triangle from (1,1,1) by adding nonnegative integer multiples of (0,1,1), (1,1,1), and (1,1,2). These triples satisfy the weak triangle inequality, which is sufficient since we start with a genuine triangle. The unique solution to the equation

in nonnegative integers α, β, and γ is α = b – a, β = a + b – c – 1, γ = c – b. Since 2α + 3β + 4γ = a + b + c – 3 = n – 3, we see that t(n) is equal to the number of ways of writing n – 3 as a sum of 2′s, 3′s, and 4′s (order of terms is unimportant). This is equivalent to the number of ways to make change for n – 3 using any combination of 2′s, 3′s, and 4′s.

Since the denominator of the (rational) generating function is

the sequence {t(n)} satisfies the order nine linear recurrence relation

The initial values, for 0 ≤ n ≤ 8, are given in our table. The recurrence relation allows us to compute moderately large values of t(n) easily by computer.

The generating function has the partial fraction decomposition

From this expansion we will find a simple formula for t(n).

By the binomial series theorem, the first four terms can be written as

equation

From the identity (_n^−k) = (–1)ⁿ (^n+k−1_n), the coefficient of xⁿ is

equation

The final two terms yield coefficients of xⁿ that follow a pattern modulo 12, i.e., c/72 where c is given in the table:

Thus, we obtain the formula t(n) = n²/48 + (c − 7)/72 for n even, and t(n) = (n + 3)²/48 + (c – 7)/72 for n odd. This can be represented as

(2.5) equation

where {x} is the nearest integer to x. For example, to find the number of integer triangles with perimeter 10¹⁰⁰, we compute {10²⁰⁰/48} by long division, and obtain the answer 2083 … 3, where there are one hundred ninety-six 3′s.

EXAMPLE 2.5

We can prove Lucas’ theorem (see Section 1.10) using generating functions. Let’s examine the case k = 2.

From the binomial theorem modulo p, we have

equation

From the uniqueness of base b expansion, we conclude that

EXERCISES

2.1 Evaluate the infinite series Σ^∞_n=1 nF_n/3ⁿ.

2.2 Find a rational generating function for the sequence {a_n} given by the recurrence formula

2.3 Evaluate the infinite series

equation

where {a_n} is the sequence of the previous exercise.

2.4 Find a rational generating function for the sequence of perfect squares, {n²}, for n ≥ 0.

2.5 Find a rational generating function for the sequence {a_n} defined by a₀ = 1, a₁ = 3, a_n = a_n–1 + a_n–2 + 2^n–2, for n ≥ 2. See Exercise 1.75.

2.6 (a) Suppose that f(x) is the generating function for a sequence {a_n). Show that

(b) Suppose that f(x, y) is the generating function (in two variables) for a sequence {a_m,n}. Show that

2.7 Use a computer and an appropriate generating function to determine the number of ways of making change for $1 using an even number of coins.

2.8 Use a generating function to determine the number of solutions in nonnegative integers to the equation

This problem duplicates Exercise 1.53.

2.9 Determine the number of solutions in nonnegative integers to the equation

2.10 Determine the number of solutions in nonnegative integers to the equation

2.11 (a) Show that the generating function (in two variables) for binomial coefficients is

(b) Show that the generating function (in three variables) for multinomial coefficients of the form (k₁,k₂,k₃ⁿ) is

2.12 Find a recurrence formula for the coefficient of xⁿ in the series expansion of (1 + 3x + x²)⁻¹.

2.13 Find a recurrence formula for the coefficient of xⁿ in the series expansion of (1 + x + 2x²)⁻¹.

2.14 (Series multisection) Let f(x) = Σ^∞_n=0 a_nxⁿ. Show that, for any positive integers p and q with 0 ≤ p < q, we have

equation

where ω is a primitive qth root of unity.

2.15 Prove that, for each positive integer k, there exists a monk polynomial p(n) of degree k + 1 with integer coefficients such that

equation

2.16 Prove that Alcuin’s sequence is a zigzag sequence (its values alternately rise and fall) for n ≥ 6.

2.17 Prove that Alcuin’s sequence {t(n)} satisfies the recurrence relation

2.18 Let t_n be the number of triples (a, b, c), where a, b, c are nonnegative integers satisfying a ≤ b ≤ c, a + b ≥ c, and a + b + c = n. Find the generating function for {t_n} and use the generating function to find t_n for 0 ≤ n ≤ 6.

2.19 Given any integer m > 1, prove that Alcuin’s sequence {t(n)} is periodic modulo m with period 12m.

2.2 Special generating functions

In this section we investigate certain interesting sequences and their generating functions.

Given any sequence a₀, a₁, a₂,…,we define the exponential generating function

(2.6) equation

Let d(x) be the exponential generating function for the sequence {d_n}, where d_n is the number of derangements of n elements:

equation

We set d₀ = 1. From the recurrence relation (1.31), it follows that

for

equation

Separating variables, we obtain

and hence

The value d₀ = 1 implies that C = 1. Therefore

(2.7)

As we mentioned earlier, the coefficients of the generating function give us a formula for the general term of the underlying sequence. In this case, we obtain the explicit formula

(2.8) equation

We now consider a famous sequence of numbers called Catalan numbers, named after the mathematician Eugène Charles Catalan (1814–1894).

The Catalan number C_n is the number of sequences of n A’s and n B’s which have the property that, reading the sequence from left to right, at each symbol the number of As seen thus far is always greater than or equal to the number of B’s seen thus far.

The five valid sequences for n = 3 are

equation

Hence C₃ = 5.

With a little work, we find the following values of C_n:

We set C₀ = 1.

Let f(x) = be the ordinary generating function for the Catalan numbers. The strategy is to find an equation satisfied by f(x), solve the equation, and read off the coefficients. The Catalan numbers satisfy the recurrence relation

(2.9) equation

(Every valid sequence can be written uniquely in the form As₁Bs₂, where s₁ is a valid sequence with k A’s and B’s, and s₂ is a valid sequence with n – 1 – k A’s and B’s.) This recurrence relation implies

From the quadratic formula and the fact that f(0) = 1, it follows that

(2.10)

From the binomial series for , we obtain

(2.11)

It is easy to show from (2.11) that

(2.12)

The Catalan numbers occur in many settings, such as the following:

C_n is the number of lattice paths in the first quadrant of the plane which start at (0, 0), end at (2n, 0), and proceed at each step by (Δx, Δy) (+1, +1) or (+1, −1).

C_n−1 is the number of binary search trees on n vertices.

C_n–2 is the number of ways to parenthesize a product of n terms.

C_n–2 is the number of triangulations of a convex n-gon with n – 3 nonintersecting diagonals.

Next, we determine and work with generating functions for the Stirling numbers of the first and second kinds and for the Bell numbers.

We observe a pattern in the generating function for the Stirling numbers of the first kind. For instance.

equation

Define the failing factorial function x_(n) as

(2.13)

and the rising factorial function x⁽ⁿ⁾ as

(2.14)

We set x⁽⁰⁾ = x₍₀₎ = 1.

The polynomials x_(n) and x⁽ⁿ⁾ are related by

(2.15) equation

The following theorem states that x⁽ⁿ⁾ is the ordinary generating function for the Stirling numbers of the first kind.

Generating function for the Stirling numbers of the first kind.

(2.16) equation

Proof. The proof is by induction on n. The case n = 1 is trivial: .

Assuming the result for n, it follows that

equation

which is the correct formula for the n + 1 case.

EXAMPLE 2.6 Expected number of cycles in a permutation

What is the expected number of cycles in a randomly chosen permutation of {1,2,3,…,n}?

Solution: Differentiating the generating function for [ⁿ_k] with respect to x, we obtain

equation

Evaluating the second identity at x = 1, we obtain

equation

Dividing by n!, we find that the expected number of cycles is

an expression asymptotic to ln n. For instance, in a permutation on 1000 elements, we expect to find about seven cycles.

The polynomial x_(n) is the generating function for the signed Stirling numbers of the first kind, (−1)^n+k [ⁿ_k].

Generating function for the signed Stirling numbers of the first kind.

(2.17) equation

Proof. We have

equation

To test this generating function, we compute x₍₃₎ = x(x – 1)(x – 2) = x³ – 3x² + 2x and note that the coefficients 1, –3, 2 are the signed Stirling numbers of the first kind with n = 3.

Generating function for the Stirling numbers of the second kind.

(2.18) equation

Proof. We have

equation

The vector space of polynomials with real coefficients has as bases the two sets ₁ = {xⁿ : n ≥ 0} and ₂ = {x_(n) : n ≥ 0}. Recall that we have set and for n ≥ 1. Then is the change of basis matrix from ₂ to ₁, while S₂ = [] is the change of basis matrix from ₁ to ₂. Therefore, the two matrices S₁ and S₂ are inverses, so that S₁S₂ = S₂S₁ = I, where I is the infinite-dimensional identity matrix. In summation form, this assertion is written as

(2.19) equation

Recall that δ(n, j) = 1 if n = j and 0 if n ≠ j. This identity leads (as per Exercise 1.59) to the following wonderful inversion formula.

Inversion formula for summations with Stirling numbers as weights. For any two real-valued functions f and g, we have

equation

if and only if

equation

Proof Assume that . Then

equation

The reverse implication is proved similarly.

Let t_n be the number of transitive and reflexive relations on an arbitrary n-set X. It can be shown that t_n is the number of topologies on an n-set. Let p_n be the number of partial orders on X. The reader may wish to verify that p₁ = 1, p₂ = 3, p₃ = 19 and t₁ = 1, t₂ = 4, t₃ = 29. Although there are no known formulas for t_n and p_n, the two functions are related by our inversion formula.

Suppose X = {1,…,n} and R is a transitive and reflexive relation on X. We define a new relation R’ on X as follows: (a, b) R’ if and only if (a, b) R and (b, a) R. We claim that R’ is an equivalence relation on X. Certainly R’ is reflexive, as (a, a) R implies (a, a) R’. If (a, b) R’, then (b, a) R’ (by definition), so R’ is symmetric. If (a, b) R’ and (b, c) R’, then (a, b), (b, c), (b, a), (c, b) R, which implies that (a, c), (c, a) R (because R is transitive); hence, (a, c) R’, so R’ is transitive.

Therefore, R’ is an equivalence relation with equivalence classes [a] = {b X : (a, b) R and (b, a) R}. This means that in order to construct a transitive, reflexive relation on X, we must first partition X into equivalence classes. As X may be partitioned into k equivalence classes in ways, the question is, how are the equivalence classes pieced together? Suppose [a] and [b] are two different equivalence classes under R’ and (a, b) R. By transitivity, (a, c) R for all c [b]. Again, by transitivity, (d, c) R for all d [a]. To paraphrase: “Everything in [a] is arrowed to everything in [b].” Therefore we can think of the equivalence classes as points that are either joined or not joined by an arrow. This defines a partial order on the set of equivalence classes. In other words, the k equivalence classes are partially ordered. Because this may be done in p_k ways, summing over all possible values of k, we obtain

(2.20) equation

We test this equation by putting in the values , and calculating t₃ = 29.

Our equation would provide a formula for t_n if only a formula for p_n were known (as we already have a formula for ). However, using the inversion formula for Stirling numbers we can write p_n in terms of t_n:

(2.21) equation

For example, putting in the values , we obtain p₃ = 19.

Although no explicit formulas are known for the general terms of the sequences {p_n} and {t_n}, it is known that p_n ~ t_n and log₂ p_n = n²/4 + o(n²). We let p_n^* and t_n^* be the unlabeled set versions of p_n and t_n. There are no known formulas for these sequences, although it is known that p_n^* ~ p_n/n!. One might think that t_n^* = Σⁿ_k=1 p(n, k)p_k^*, but this is false. Why?

Open problem. Find explicit formulas for p_n, p_n^*, t_n, and t_n^*.

Table 2.1 presents the first few terms of the four sequences. See [24] or the Online Encyclopedia of Integer Sequences.

Table 2.1 Some terms of four important sequences.

Now we give the exponential generating function for the Bell numbers B(n).

Generating function for the Bell numbers. Σ^∞_n=0 B(n)xⁿ/n! = e^{e^x–1}.

Proof. We have

equation

What an interesting-looking generating function! The reader may wish to compute the first four terms of the generating function and compare them to the known values B(0) = 1, B(1) = 1, B(2) = 2, B(3) = 5.

EXAMPLE 2.7 Rook walks

A chess Rook can move any number of squares horizontally or vertically on a chess board. How many different walks can a Rook travel in moving from the lower-left corner (al) to the upper-right corner (h8) on the board? Assume that the Rook moves right or up at every step. For example, Figure 2.1 shows the Rook walk al-cl-dl-d3-d5-f5-f7-h7-h8.

Figure 2.1 A Rook walk from al to h8.

Note that if the Rook could only move one square at a time, this problem would be equivalent to the problem of counting paths along city streets (see Example 1.14). Here the number of such paths is simply = 3432. We expect that the number of Rook walks will be much larger.

Solution: We generalize the problem by considering Rook walks to any square on the board (with the Rook starting on al and moving toward the goal square at every step). We make a table displaying the number of walks. The bottom-left entry is the number of Rook walks from al to al, which is 1. Each other entry is the sum of all the entries below or to the left of the given entry. The reason is that the Rook’s last move must come from one of the squares represented by these entries. For example, the entry corresponding to the c4 square is 4 + 12 + 2 + 5 + 14 = 37. It’s possible to complete the table by hand in a few minutes. The number of Rook walks from a1 to h8 is 470,010. (The dots in the table indicate that we can generalize the problem to arbitrarily large chess boards.)

equation

Let a(m, n) be the number of Rook walks from (0, 0) to (m, n), and set a(m, n) = 0 if m < 0 or n < 0. The generating function for the doubly infinite sequence {a(m, n)} is the rational function

(2.22) equation

Can you explain why?

It follows that the sequence a(m, n) satisfies a recurrence relation (with initial values):

equation

Can you explain why this recurrence relation holds by counting Rook walks?

Let a_n = a(n, n), for n ≥ 0. The sequence {a_n}, called the diagonal sequence, is

The eighth term, 470,010, is the number of Rook walks in the original problem.

Let f(x) be the generating function for the diagonal sequence, i.e.,

We will show that

(2.23)

In order to get the generating function for the diagonal sequence, we make the change of variables t = x/s. To include terms such as s³t⁵, we allow arbitrary integer exponents for s. For instance, we represent s³t⁵ as s⁻²x⁵. The diagonal generating function is the coefficient of s⁰. For more about this method, see [25].

We obtain the generating function

We now consider the function

Using the quadratic formula, we write this as

where

We put our function into partial fraction form,

For –1/9 < x < 1/9, we expand the function as a Laurent series in the annulus |α| < |s| < |β| in powers of α/s and s/β, obtaining

equation

The coefficient of s⁰ in this series is

This establishes the generating function for the diagonal sequence.

We can use the generating function for the diagonal sequence to obtain a recurrence relation for it. By inspection,

We can read off a recurrence relation for {a_n} by looking at the coefficient of xⁿ in the above relation:

(2.24) equation

A counting argument for this relation is provided by E. Y. Jin and M. E. Nebel in “A combinatorial proof of the recurrence for rook paths,” in Electronic Journal of Combinatorics, 19, no. 1 (2012).

We can also use the generating function to determine the asymptotic growth rate of a_n. We have

(2.25)

See Exercises.

Can you find the generating function for the number of King walks from the lower-left corner to the upper-right corner of an arbitrary-size chess board? At each step, the King moves one square to the right, one square up, or both. Such walks are called Delannoy walks.

EXERCISES

2.20 Prove that

equation

2.21 Prove that the Catalan number C_n is odd if and only if n = 2^k – 1 for some positive integer k.

2.22 Prove that for any positive integer n we have

2.23 Prove that if n > 3, then the Catalan number C_n is not a prime.

2.24 Define the n × n matrix A = [a_ij] by the rule a_ij = C_i+j–2. Prove that det A = 1.

2.25 (Ising problem, 2 × n case) Let O(a, b) be a box consisting of a + b cells, each 1 × 1, arranged in a row of length a sitting on top of a row of length b (the leftmost cells of the two rows line up). Let f(a, b) be the number of ways of covering O(a, b) with 1 × 1 and 1 × 2 tiles. Set up a recurrence relation for f (a, b). Generate some data using a computer and make a conjecture about a formula for f(a, b).

2.26 Prove that

2.27 Prove that

equation

and

equation

2.28 Let X be the number of cycles of a randomly chosen permutation of {1,…,n}. We have shown that E(X) ~ In n. Show that Var(X) ~ In n.

Hint: From the relation

obtain

Use logarithmic differentiation.

2.29 Let a_n be the number of permutations in S_n which alternately rise, fall, rise, fall, etc. For example, 142635 is such a permutation. Find

and use this information to find a₆.

2.30 Let {a_n} be a sequence with the exponential generating function

equation

Evaluate the sum

equation

Interpret the sum combinatorially. Using Exercise 1.58, find a combinatorial interpretation of a_n. See Exercises 1.37, 2.45, and 2.69.

Hint: Multiply both sides by e^−x.

2.31 A lone King is on a chessboard. How many paths may the King take from the lower-left corner (al) of the board to the lower-right corner (h8), moving one square right, up, or up-right at every step?

2.32 Find a recurrence relation for the number of Queen walks from the lower-left corner of an arbitrary-size chess board to the square (n, n). (A Queen can move any number of squares horizontally, vertically, or diagonally. Assume that the Queen always moves to the right, up, or up-right.) Find the corresponding generating function.

2.33 Suppose that a ChildRook can move like a chess Rook but only at most two squares horizontally or vertically. Let a(m, n) be the number of walks a ChildRook can take from square (1, 1) to square (m, n) on an arbitrary-size chess board. (Assume that the RookPlus always moves toward the goal square.) Find a finite-order recurrence relation for a(m, n).

2.34 Suppose that a RookPlus can move like a chess Rook with the additional possibility of moving one square diagonally. Let b(m, n) be the number of walks a RookPlus can take from square (1, 1) to square (m, n) on an arbitrary-size chess board. (Assume that the RookPlus always moves toward the goal square.) Find a finite-order recurrence relation for b(m, n).

2.35 Show that

where a_n is the number of Rook walks from (0, 0) to (n, n).

Hint: We know that a_n, for n ≥ 1, is the coefficient of xⁿ in the generating function

. We examine this generating function, which has a singularity at x = 1/9. Let y = 9x and

. Then

. Suppose that (1 – y)^−1/2 = s₀ + s₁x + s₂x² + · · ·. Then

and, using Stirling’s approximation

, we have

2.3 Partition numbers

In Chapter 1, we defined the partition number p(n, k) to be the number of ways to write n as a sum of k positive integers. We can also think of p(n, k) as the number of onto functions f : X → Y, |X| = n, |Y| = k, where X and Y are unlabeled sets. Also, the partition number p(n) has been defined as p(n) = Σⁿ_k=1 p(n, k). In this section we develop generating functions for partition numbers.

EXAMPLE 2.8

Determine p(4, k), for k = 1, 2, 3, 4, and p(4).

Solution: We have

equation

and

equation

Suppose that X and Y are unlabeled and f : X → Y is an onto function (|X| = n, |Y| = k). Suppose that the inverse images of the elements of Y have cardinalities λ₁, …, λ_k. Because the inverse images account for all the elements of X, it follows that

Furthermore, we may assume that the λ_i are ordered from largest to smallest:

A partition λ₁ + · · · + λ_k = n may be pictured with a Ferrers diagram consisting of k rows of dots with λ_i dots in row i, where 1 ≤ i ≤ k. The Ferrers diagram for the partition 7 + 3 + 1 + 1 = 12 is shown in Figure 2.2.

Figure 2.2 The Ferrers diagram of a partition of 12.

The transpose of a Ferrers diagram is created by writing each row of dots as a column. The resulting partition is called the conjugate of the original partition. For example, the partition 12 = 7 + 3 + 1 + 1 of Figure 2.2 is transposed to create the conjugate partition 12 = 4 + 2 + 2 + 1 + 1 + 1 + 1 of Figure 2.3.

Figure 2.3 A transpose Ferrers diagram.

The reader may enjoy matching each partition of 4 on the previous page with its conjugate. (One partition is self-conjugate.)

We now give the ordinary generating function for the partition numbers p(n). For convenience, we set p(0) = 1.

Generating function for integer partitions.

equation

Proof. We need to show that the coefficients of xⁿ on the two sides of the equation are equal. The coefficient of xⁿ on the left side is patently p(n). On the right side, the product may be written as

equation

To find the contribution to xⁿ from this product, suppose the term x^m(k)k is selected from the kth factor and these terms are multiplied to yield x^{m(1)+m(2)2+···}. If this expression equals xⁿ, then

(2.26)

Contributions to xⁿ correspond to solutions of (2.26). These solutions may be envisioned as Ferrers diagrams for partitions of n. With t as the greatest integer for which m(t) is nonzero, we create the Ferrers diagram with m(t) rows of t dots, followed by m(t – 1) rows of t – 1 dots, etc. This correspondence between solutions of (2.26) and partitions of n completes the proof.

Determining the ordinary generating function for p(n, k) with k fixed is not difficult. We start by making an elementary observation from the transposes of Ferrers diagrams.

Theorem. The number of partitions of n into exactly k parts is equal to the number of partitions of n where the size of the greatest summand is k.

Now we can establish the generating function for p(n, k).

Theorem.

equation

Proof We define p(n, ≤ k) to be the number of partitions of n into at most k parts. From the previous theorem, p(n, ≤ k) is also the number of partitions of n into parts of size at most k. Clearly, p(n, k) = p(n, ≤ k) – p(n, ≤ k – 1). We obtain

equation

It follows that

equation

Let p(n, O) be the number of partitions of n into summands each of which is an odd number. Let p(n, D) be the number of partitions of n into distinct summands. As a further illustration of the use of generating functions, we prove that p(n, O) = p(n, D).

Euler’s theorem (1748).

Proof. We have

equation

The desired equality follows by comparing coefficients of the two generating functions.

EXERCISES

2.36 Show that the number of partitions of n into summands none of which occurs exactly once is the same as the number of partitions of n into summands none of which is congruent to 1 or 5 modulo 6.

2.37 Find formulas for p(n, 1), p(n, 2), and p(n, 3). Find an asymptotic estimate for p(n, k) (with k fixed).

2.38 Prove Jacobi’s identity:

equation

2.39 We say that two permutations σ and τ of S_n are in the same conjugacy class if there exists a permutation ρ

S_n such that τ = ρσρ¹. Prove that two permutations are in the same conjugacy class if and only if they have the same cycle structure. How many conjugacy classes of S_n are there?

2.40

(a) How many nonisomorphic abelian groups of order 2700 are there?

(b) How many ways may one make $2.23 postage using 1 cent, 2 cent, 3 cent, 10 cent, 20 cent, $1, and $2 stamps, and not more than three of any one denomination?

Notice that the answers to parts (a) and (b) are the same. Why is this?

2.4 Labeled and unlabeled sets

Many enumeration problems can be solved by representing the objects to be counted as functions f : X → Y, where X and Y are chosen appropriately. The conditions imposed in the enumeration problem usually amount to putting restrictions on the functions (e.g., requiring them to be one-to-one) and/or making rules as to when two functions are considered equivalent (e.g., when they are equivalent up to a permutation of the elements of X). For example, the binomial coefficient is the number of 1–1 functions from an m-set X into an n-set Y, where the elements of X are considered to be indistinct.

Suppose that X and Y are two finite nonempty sets and Y^x is the collection of functions f : X → Y. We wish to define some equivalence relations on Y^x and, in each case, count the number of equivalence classes. When we say that two functions f and g(f, g Y^X) are equivalent, we will mean one of four things:

(1) f = g.

(2) f = gh for some bijection h : X → X.

(3) f = ig for some bijection i : Y → Y.

(4) f = igh for some bijections h : X → X and i : Y → Y.

(Note that the functions here are applied from right to left.)

In definitions (2) and (4) we say that h delabels X, and we speak of X as an unlabeled (or delabeled) set. Likewise, in definitions (3) and (4) we say that i delabels Y, and we speak of Y as an unlabeled or delabeled set.

EXAMPLE 2.9

Consider the functions f and g of Figure 2.4. Are they equivalent?

Figure 2.4 Two functions. Are they equivalent?

Solution: The functions fail to be equivalent under definition (1) because, after all, they are different functions. However, f = gh if h : X → X is the bijection given by h(a) = a, h(b) = c, and h(c) = b; therefore f and g satisfy definition (2). The bijection h rearranges the set {a, b, c}, eliminating the discrepancy between the two functions due to the labeling of the elements of the domain. If i is the bijection given by i(x) = x, i(y) = z, and i(z) = y, then f = ig; therefore f and g satisfy definition (3). It is now clear that the functions f and g of Figure 2.4 are equivalent according to definitions (2), (3), and (4).

EXAMPLE 2.10

Are the functions f and g of Figure 2.5 equivalent?

Figure 2.5 Equivalent functions when domain and codomain are unlabeled.

Solution: The functions f and g are equivalent only when both X and Y are unlabeled, i.e., according to definition (4). Define h: X → X by h(a) = c, h(b) = b, and h(c) = a, and define i: Y → Y by i(x) = z, i(y) = x, and i(z) = y. Then f = igh.

The domain and codomain of a function may be labeled or unlabeled sets, leading to four types of functions to be counted. Furthermore, functions may be classified according to whether they are one-to-one, onto, both, or not necessarily either. Altogether there are 16 cases, as shown in Table 2.2.

Table 2.2 The number of functions from X to Y, where |X| = m and |Y| = n.

We assume that f : X → Y is a function from a set X with m elements to a set Y with n elements. Let us consider the entries in the table, beginning with the X labeled, Y labeled box.

X labeled, Y labeled

We have already noted that the total number of such functions is n^m.

As for the second entry, every one-to-one (1–1) function corresponds to an ordered selection of m objects from the set Y. There are n choices for the first object selected, n – 1 choices for the second, and so on, leading to the formula

If m > n, there are no one-to-one functions so we set P (n, m) = 0.

In the case of bijections, we either have two sets of the same cardinality or we don’t. If m ≠ n, then no bijection is possible. However, if m = n, then there are m! ways to match up the two sets. We define δ(m, n) to be 0 if m ≠ n and 1 if m = n. Thus, the number of bijections is m!δ(m, n). If either X or Y (or both) is unlabeled, then all bijections look the same, so the total number is δ(m, n).

X unlabeled, Y labeled

A one-to-one function f : X → Y is equivalent to a selection of m elements of Y without regard to order. The number of such selections is the value of the binomial coefficient

A function f : X → Y is equivalent to a distribution of m identical, elements (the elements of X) into n different boxes (the elements of Y). A box may receive any number of objects, including zero. Suppose we represent the m identical objects with m copies of the symbol *. The n boxes are represented by a set of n – 1 vertical lines|. The placement of the objects in the boxes is indicated by a linear ordering of the *’s and |’s. For example, * * * | * || * means that the first box (to the left of the first |) contains three objects, the second box contains one object, the third contains no objects, and the fourth contains one object. The total number of items in the linear ordering is m + n – 1, and n of these are |’s. Therefore, the total number of functions is

If f is onto, then each box must contain at least one object, and there are m – n objects to distribute freely. Hence, the number of onto functions is

Can you furnish a more direct combinatorial proof of this formula?

X labeled, Y unlabeled

If X is labeled and Y unlabeled, then the total number of functions from X to Y is the number of ways X may be partitioned into unlabeled parts (corresponding to the images under f). We prefer to define a formula only when m ≤ n, in which case the magnitude of n becomes unimportant (X can’t be divided into more than m parts). The Bell number B(m) is the number of such functions.

If m > n, then one-to-one functions do not exist. However, if m ≤ n, then all one-to-one functions look alike when X is unlabeled. This observation accounts for the 1–1 entries of the X unlabeled, Y labeled and X unlabeled, Y unlabeled boxes.

The number of onto functions when X is labeled and Y is unlabeled is , a Stirling number of the second kind. Clearly,

as dividing by n! permutes the labels of the n sets into which X has been divided.

X unlabeled, Y unlabeled

The total number of functions when X and Y are unlabeled and m ≤ n is denoted p(m), and the values of p(m) are the partition numbers. The number of onto functions is the partition number p(m, n).

Four relations are obtained by comparing the total number of functions in each box of Table 2.2 to the number of onto functions. Thus

(2.27) equation

(2.28) equation

(2.29) equation

(2.30) equation

The relation (2.27) follows from the fact that every function f : X → Y is onto some subset Y′ Y of cardinality j, where 1 < j ≤ m. The binomial coefficient “chooses” Y′ and T(m, j) counts the number of functions from X onto Y′. Relations (2.28) through (2.30) are proved similarly.

EXERCISE

2.41 Verify the formulas for the X unlabeled, Y labeled case in Table 2.2.

2.42 Let |X| = m, |Y| = n.

(a) How many possible relations are there from X to Y?

(b) How many relations are there if X is unlabeled and Y is labeled?

(c) How many relations are there if X is labeled and Y is unlabeled?

(d) How many relations are there if both X and Y are unlabeled?

Hint: In each case, the relation R: X → Y may be viewed as a function f : X → P(Y), defined by f(x) = R(x). Now use the techniques associated with the fundamental counting problem for functions. For example, the answer for (b) is

2.43 Let |X| = m. An algebra on X is a subset S of P(X) with the following properties:

1. X

2. A

S implies X – A

3. A

S and B

S imply A ∪ B

(a) How many algebras on X are possible if X is labeled?

(b) How many if X is unlabeled?

2.44 Let f be a random function from {1,…,n} to {1,…,n} and let r(n) be the expected number of elements in the range of f. Find lim_n→∞ r(n)/n.

2.45 How many onto functions f : X → X, |X| = m (X labeled), have the property that f(f(x)) = x for all x

X? Find a recurrence relation or an explicit formula. See Exercises 2.30 and 2.69.

2.46 Give a combinatorial proof of the formula for the number of onto functions from an unlabeled set to a labeled set.

2.5 Counting with symmetry

We have classified functions f : X → Y (|X| = m, |Y| = n), where X and Y are labeled or unlabeled sets, and we enumerated these functions. Now we generalize the notion of labeled and unlabeled sets. For instance, recall that two functions f: X → Y and g : X → Y are equivalent in the X unlabeled, Y labeled sense if there exists a bijection h: X → X such that f = gh. The bijection h can be viewed as a permutation of X (in fact, any permutation in the symmetric group S_m.). What happens if we restrict the permutations to a specified subgroup G of S_m? If G is the identity group (e), for example, then we obtain the X labeled case; while if G = S_m, we obtain the X unlabeled case. Nontrivial subgroups G give rise to interesting intermediate cases. In these cases, Pólya’s theorem for the number of inequivalent functions allows us to count quite complicated configurations, including nonisomorphic graphs. (See Section 3.3 for definitions about graphs.) More generally, if one group G acts on X and another group H acts on Y, then the number of inequivalent functions is given by de Bruijn’s formula, which enumerates more intricate structures such as self-complementary graphs.

A group G is a nonempty set on which is defined a binary operation * satisfying the following three laws:

1. For all x, y, z

G, x * (y * z) = (x * y) * z (associativity).

2. There exists an element e

G with the property that, for all x

G, x * e = e * x = x.

3. For every x

G, there exists an element x⁻¹

G with the property that x * x⁻¹ = x⁻¹ * x = e.

The element e is called the identity of G. The element x⁻¹ is called the inverse of x. One can easily prove that the identity element of a group is unique and that the inverse x⁻¹ of each element x is unique.

We sometimes indicate a group by an ordered pair, e.g., (G, *), to emphasize both the set and the operation. We usually suppress the group operation sign and write xy for x * y. We abbreviate xx by x², x⁻¹x⁻¹ by x⁻², etc. For any x G, we set x⁰ = e.

EXAMPLE 2.11

The set Z is a group with respect to addition.

A finite group is a group with a finite number of elements, and the order of a finite group is the number of elements in it.

The cyclic group Z_n, of order n, is the set {0,…,n – 1} under the clock addition operation * defined by x * y = x + y if x + y < n and x*y = x + y – n if x + y ≥ n.

A group G is abelian if xy = yx for all x, y G. Otherwise, G is nonabelian.

EXAMPLE 2.12

The group Z_n, is abelian.

The order of an element x G is the least positive integer n for which xⁿ = e. If there is no such integer, then we say that x has infinite order.

EXAMPLE 2.13

In Z₄, the elements 0, 1, 2, 3 have orders 1, 4, 2, 4, respectively.

The symmetric group S_n consists of the n! permutations of an n-set (e.g., N_n). The group operation * is composition of permutations. The elements of S_n are conveniently written in cycle notation. Thus,

is the element of S₉ which maps 1 to 2 to 3 to 1, transposes 4 and 8, maps 3 to 6 to 7 to 3, and fixes 5 and 9. To multiply two permutations together, we just compute the result of the composition of the two bijections (reading left to right). For example,

Since (1 2)(1 3) ≠ (1 3)(1 2), the symmetric group S_n, is nonabelian for all n ≥ 3.

A homomorphism from one group (G₁, *₁) to another group (G₂, *₂) is a map f : G₁ G₂ which preserves multiplicative structure: f(x *₁ y) = f(x) *₂ f(y) for all x, y G₁. If the homomorphism is a bijection we call it an isomorphism and say that the two groups G₁ and G₂ are isomorphic; we write G₁ G₂. A one-to-one homomorphism is called a monomorphism and an onto homomorphism is called an epimorphism. An isomorphism from a group G to itself is called an automorphism of G.

Suppose that G₁ and G₂ are two groups. The product of G₁ and G₂, denoted G₁ × G₂, is the set of ordered pairs {(g₁, g₂) : g₁ G₁, g₂ G₂} subject to the multiplication rule (g₁, g₂) * (g′₁, g′₂) = (g₁g′₁, g₂g′₂).

EXAMPLE 2.14

The product Z₂ × Z₂ is a four-element group. It is not isomorphic to Z₄, for Z₂ × Z₂ has three elements of order 2 and Z₄ has only one.

We say that H is a subgroup of G if H is a subset of G and H is a group with respect to the group operation of G. We say that H is a normal subgroup of G if it is invariant under conjugation by elements of G, that is, xHx⁻¹ = H for all x G.

EXAMPLE 2.15

The two-element group {(1 2)(3), (1)(2)(3)} is a subgroup of the six-element group S₃. It is not a normal subgroup. Why not?

The symmetric group S_n is especially important because every finite group is isomorphic to a subgroup of some S_n. This is Cayley’s theorem.

Cayley’s theorem (1854). If G is a finite group of order n, then G is isomorphic to a subgroup of S_n.

Proof. For each element g G, we define a function f_g: G → G by the rule f_g(a) = ag (right multiplication by g). Note that f_g is a bijection because it has an inverse, namely, f_g⁻¹. We check: (f_g f_g⁻¹)(a) = ag⁻¹g = a and (f_g⁻¹ f_g)(a) = agg⁻¹ = a. Since each f_g is a permutation of the n-set G, we can define a map ϕ: G → S_n by ϕ(g) = f_g. We claim that ϕ is a monomorphism. First we check that ϕ is an isomorphism: ϕ(gh)(a) = f_gh(a) = a(gh) = (ag)h = f_h(f_g(a)) = (f_gf_h)(a) = (ϕ(g)ϕ(h))(a). Now we check that ϕ is one-to-one: If ϕ(g)(a) = ϕ(h)(a), then f_g(a) = f_h(a), which implies that ag = ah, and g = h.

The dihedral group D_n, of order 2n, consists of the set of symmetries of a regular n-gon. If we number the vertices of the n-gon 1,…,n, then we see that D_n is a subgroup of S_n. Specifically, the subgroup is generated by two permutations: the rotation r = (1 2 3 4 … n) and a flip f along an axis of symmetry of the n-gon. If n is odd we take the flip to be f = (n)(1 n – 1)(2 n – 2)(3 n – 3) … . If n is even we choose . Each element of D_n can be written in the form r^a f^b, where a {1, n – 1} and b {0, 1}. Two such elements are multiplied using the basic rules rⁿ = e, f² = e, and rf = fr⁻¹. From these basic rules it is possible to compute all other products. We say that D_n has the presentation r, f : rⁿ = 1, f² = 1, rf = fr⁻¹). For information about group presentations the reader should consult [17].

We have noted that every element of S_n can be expressed as a product of cycles. A cycle of length 2 is called a transposition and a cycle of length 1 is called a fixed point. Cycles of length greater than 2 can be written as products of transpositions. For example, (1 2 3) = (1 2)(1 3). A permutation may be written as a product of transpositions and fixed points in more than one way. However, the number of transpositions is always even or always odd. A permutation is accordingly called an even permutation or an odd permutation. It follows from the first homomorphism theorem for groups, to be discussed shortly, that S_n contains n!/2 even permutations and n!/2 odd permutations. (This fact follows more simply from the observation that f(σ) = (12)σ is a bijection between the set of even permutations of S_n and the set of odd permutations of S_n.) As the identity permutation is even and the even permutations are closed under multiplication and taking inverses, the even permutations are a group.

The alternating group A_n is the group of even permutations of the set { 1,…,n}. The group has order .

Let G be a group (with identity element e) and X a nonempty set. An action of G on X is a function θ: G × X → X which satisfies the following two conditions:

1. For every x

X, we have θ(e, x) = x.

2. For every g, h

G and x

X, we have θ(g, θ(h, x)) = θ(gh, x).

For convenience, we write θ(g, x) as gx, so that the two conditions become:

1. ex = x.

2. g(hx) = (gh)x.

Remember, however, that g and h are group elements and x is a set element.

We note that each element g G yields a permutation of the set X (defined by sending x to gx). For if gx = gy, then x = y (hence the map is one-to-one) and gg⁻¹ x = x (hence the map is onto).

EXAMPLE 2.16

The symmetric group S_n, acts on N_n, by the natural action gx = g(x), where g(x) is the image of x under the function g: N_n → N_n.

EXAMPLE 2.17

The cyclic group Z_n, acts on N_n by

equation

Here g denotes the equivalence class representative of [g] that lies between 1 and n.

If x X, then the orbit of x (under the action θ) is orb(x) = {gx: g G}. Orbits constitute equivalence classes of X; that is, X is partitioned into orbits. If there is only one orbit, then the action θ is transitive. Both examples above are transitive actions.

To find the size of orb(x), note that gx = hx if and only if h⁻¹ gx = x. Let G_x = {g G : gx = x}. We call G_x the stabilizer of x. We leave it to the reader to check that G_x is a normal subgroup G. Now gx = gy if and only if g hG_x, that is, if g and h are elements of the same coset of G_x. Therefore, the number of distinct values of gx is the number of cosets of Gx:

(2.31)

Let G be a finite group. For each g G, define f_g: G → G by f_g(x) = gxg⁻¹. We say that G acts on itself by conjugation. The stabilizer of an element x G is called the centralizer of x and is denoted C(x). We call orb(x) the conjugacy class of x, and denote it by ccl(x). For the conjugacy action, we have

(2.32)

Two permutations are in the same conjugacy class if and only if they have the same cycle structure. Therefore, the number of conjugacy classes equals the partition number p(n), where n = |G|.

Burnside’s lemma. Let G be a finite group acting on a finite nonempty set X. For each g G, let f_g be the number of elements of X fixed by g. Then number of orbits n is given by

equation

Proof. The proof is a nice example of the technique of enumerating a set two different ways and comparing the results. The set is

On the one hand, by definition of f_g, we have |S| = Σ_gG f_g. On the other hand, counting from the perspective of the elements of X and letting x′ denote orbit representatives, we have

equation

Equating the two expressions for |S|, we obtain , from which the desired relation follows instantly.

EXAMPLE 2.18 Average number of fixed points of a permutation

How many fixed points has the average permutation of {1, 2, 3,…,n}? Recall that we solved this problem using probability in Example 1.27.

Solution: Applying Burnside’s lemma to the natural action of S_n, on N_n (which is transitive), we obtain

equation

This is the average number of fixed points.

We now assume that f : X → Y is a function from a set X of m elements to a set Y of n elements. In the terminology of Pólya’s theory of counting, the elements of Y are labels, and f is a labeling of X. Suppose that a finite group G acts on X. We picture this situation with the following diagram:

equation

This action of G on X induces an action of G on the set of labelings Y^X as follows: (gf)(x) = f(g⁻¹ x) for all x X. We check the two axioms for an action:

1. (ef)(x) = f(e⁻¹ x) = f(x) for all x

2. (g(hf)(x) = (hf)(g⁻¹ x) = f(h⁻¹ x) = f((gh)⁻¹ x) = ((gh) f)(x) for x

The set Y^X of functions is partitioned into equivalence classes by this action. Functions in different equivalence classes are called G-inequivalent functions. By definition, the number of G-inequivalent functions is the number of orbits of the action of G on Y^X.

Number of orbits theorem. When G (finite) acts on the set of functions Y^X (|X| = m, |Y| = n), the number of orbits is

(2.33) equation

where c(i) is the number of cycles of length i of g (regarded as a permutation of X).

Proof. Suppose that g, when regarded as a permutation of X, has cycle structure c = (c(1), c(2),…,c(m)). The functions fixed by g are precisely those which are constant on each cycle. As there are c(1) + · · · + c(m) cycles, each of which may be assigned one of n images, the number of functions fixed by g is f_g = n^{c(1) + · · · + c(m)}. Our conclusion now follows directly from Burnside’s lemma.

EXAMPLE 2.19 Stacks of chips

How many stacks of 11 poker chips are possible with two colors of poker chips (red and white)?

Solution: Let X be the set of 11 positions in the stack and Y = {red, white}. The group S₂ = {e, τ} acts on X, the identity e leaving the stack alone and τ turning the stack upside down. We need to know the cycle structures of e and τ. Certainly, e consists of 11 fixed points, so c(1) = 11 and c(i) = 0 for 2 ≤ i ≤ 11. And τ consists of one fixed point (the middle poker chip) and five transpositions, so c(1) = 1, c(2) = 5, and c(i) = 0 for 3 ≤ i ≤ 11. Therefore, the number of S₂-inequivalent stacks is

EXAMPLE 2.20 Circular necklaces

How many 10-bead circular necklaces may be made using black or white beads?

Solution: Symmetries of the circular arrangement of beads are of six types: identity, rotations with one cycle, rotations with two cycles, rotations with five cycles, reflections through two opposite beads, and reflections through two opposite “sides?’ The number of necklaces is

While it is possible at this point to enumerate some rather complicated structures (nonisomorphic graphs, for instance), we prefer to do so only after developing some additional machinery—cycle indexes—in the next section.

EXERCISE

2.47 (a) How many conjugacy classes has S₅?

(b) Let x = (1 2 3)(4)(5). Find |ccl|(x)| and C(x).

2.48 Let Z₄ rotate a cube around an axis passing through the centers of opposite faces. Verify Burnside’s lemma for

X₁ = the set of vertices of the cube.

X₂ = the set of faces of the cube, and

X₃ = the set of edges of the cube.

2.49 (a) What is the average number of 1-cycles in the group A_n?

(b) What about in D_n?

2.6 Cycle indexes

As in the previous section, we let c = (c(1),…,c(m)) be the cycle structure of g G when G acts on X (G finite, |X| = m). We assign to g the monomial

where the x_i are variables in a commutative ring containing the rational numbers. The cycle index Z(G) is the average of these monomials:

(2.34) equation

The cycle index Z(G) contains complete information about the cycle lengths of the various permutations g of the group action. George Pólya (1887–1985) chose the letter Z to stand for the German word Zycklenzeiger (“cycle indicator”). As Pólya said, “The cycle index knows many things.” For instance, letting each x_i = n (a substitution denoted Z(G)[x_i ← n]), we obtain the formula for the number of G-inequivalent functions in Y^x |X| = m, |Y| = n):

(2.35) equation

We next determine the cycle indexes of the most important group actions: E_n, Z_n, D_n, S_n, A_n. The set acted upon is always X = {1,…n}.

The identity group E_n. The identity group consists of only the identity element e, which fixes every element of X. There are n 1-cycles, and the cycle index is

(2.36)

The cyclic group Z_n. Given g G, x X, the length of the cycle containing x is the minimum positive k for which gk + x x (mod n), or gk 0 (mod n). Because k is independent of x, all cycles of g have length k. If g contains j cycles, then jk = n, from which it follows that k is a divisor of n and j = n/k. To determine the number of elements g corresponding to each value of k, observe that k′n/k has order k whenever gcd(k′, k) = 1. There are ϕ(k) such values of k′, by definition of Euler’s ϕ-function. As Σ_k|n ϕ(k) = n, there are exactly ϕ(k) values of g for each k/n. Therefore

(2.37) equation

The dihedral group D_n. As D_n contains Z_n as a subgroup, the cycle index of D_n will contain all the terms in the formula (2.37). The other elements of D_n are “flips” (reflections). If n is odd, each flip fixes one element of X and contains (n – 1)/2 transpositions. If n is even, half of the flips contain n/2 transpositions and half contain two fixed points and (n – 2)/2 transpositions. Putting these facts together, we obtain the formulas

(2.38) equation

The symmetric group S_n. A permutation g S_n can have any cycle structure c = (c(1),…,c(n)), where

(2.39)

The number of solutions to this equation is a good counting puzzle in its own right. Solutions may be generated by ordering the elements of X and partitioning the elements from left to right into cycles of the appropriate lengths. Each of the n! orderings of X gives rise to repeated solutions due to interchanging cycles and writing cycles down in more than one way. Because there are c(k)! ways to list cycles of length k and each cycle may be written k ways, the number of solutions is

(2.40)

The cycle index of the symmetric group is

(2.41)

The alternating group A_n. The alternating group consists of the n! even permutations of S_n. When a permutation is written as a disjoint product of cycles, it is easy to tell whether it is even or odd. Because each odd cycle is equal to the product of an even number of transpositions, the number of odd cycles has no bearing on whether a permutation is even. However, each even cycle is equal to the product of an odd number of cycles. Hence, for a permutation to be even, it must be composed of an even number of disjoint cycles of even length. Therefore,

counts 1 for every even permutation and 0 for every odd permutation in S_n. This establishes the cycle index for the alternating group:

(2.42)

EXAMPLE 2.21 Circular necklaces (again)

How many 11-bead circular necklaces can be made with two types of beads?

Solution: The appropriate group is D₁₁ acting on the set X = {1,…,11}. Hence

equation

The only divisors of 11 are 1 and 11, for which ϕ(1) = 1 and ϕ(11) = 10. Thus

The number of necklaces that can be made with two types of beads is obtained by making the substitution

equation

Recall that in Example 2.19 we determined that there are 1056 different stacks of 11 poker chips using two types of chips. The number of necklaces with 11 beads of two types is smaller because a necklace has more symmetries than a stack of chips.

EXERCISE

2.50 Calculate Z(Z₃), Z(D₃), Z(A₃), and Z(S₃).

2.51 How many 10-bead circular necklaces may be made using three types of beads?

2.52 Calculate the number of 100-bead circular necklaces that may be made using two types of beads.

2.53 How many 100-bead circular necklaces can be made with 50 white beads and 50 black beads?

Hint: If you don’t see how to solve this now, consult the next section.

2.54 How many ways may the eight vertices of a cube be colored with three colors (up to symmetry of the cube)? How many ways may the six faces be colored? How many ways may the 12 edges be colored?

2.7 Pólya’s theorem

Pólya’s theorem. Let G act on X and therefore on the set of functions f : X → Y (G finite, |X| = m, |Y| = n). Suppose that F(y(1),…,y(n)) is the set of G-inequivalent functions for which |f⁻¹(y_i)| = y(i), where 1 ≤ i ≤ n. Then

(2.43) equation

where the right-hand sum is taken over all n-tuples (y(1),…,y(n)) with

equation

Proof. We need to show that the coefficient of y^y(1)₁ · · · y^y(n)_n on the left side of the relation (2.43) is |F(y(1),…,y(n))|. By Burnside’s lemma,

(2.44) equation

where f_g is the number of functions in F(y(1),…,y(n)) fixed by g.

Suppose that g has cycle structure c = (c(1),…,c(n)). If the function

is fixed by g, then f is constant on each cycle of g; that is, each cycle of g lies entirely within one of the inverse images f⁻¹(y_i). Picture an m × 1 box (the elements of X) partitioned into n sections of sizes y(1),…,y(n) (the inverse images of f). Then f_g is the number of possible packings of this box with c(i) blocks of sizes i, where 1 ≤ i ≤ m (the cycles of g). The polynomial on the left side of the relation (2.43) is

equation

Let us consider how the terms are formed in the relation above. Suppose that each term is expanded as a product of c(i) factors. Then each term in the product of these multiplicands is obtained by choosing a yⁱ_i from each factor. The contribution to is the number of ways the exponents (c(i) units of size i) may be arranged to equal y(i) for each 1 ≤ i ≤ n. These arrangements are clearly equivalent to the box packings described above. Therefore, the coefficient of is

equation

as we needed to show.

EXAMPLE 2.22 Circular necklaces (yet again)

Recall Example 2.20, which counted the number of circular necklaces with 10 beads, using black or white beads. How many such necklaces are made of seven white beads and three black beads?

Solution: The cycle index is

equation

Making the substitution x_i ← + yⁱ₁ + yⁱ₂ yields

We see that there are eight necklaces with seven white beads and three black beads. Can you sketch these eight necklaces?

EXAMPLE 2.23 Circular necklaces (one more time)

Making the substitution x_i ← aⁱ + 1 into Z(D₁₁), we obtain the polynomial

(We use the term aⁱ + 1 instead of yⁱ₁ + yⁱ₂ because it is simpler.) This polynomial tells us the number of 11-bead circular necklaces made of two types of beads, according to the number of beads of each type. For instance, there are 20 necklaces with seven white beads and four black beads.

EXERCISE

2.55 How many stacks of 10 poker chips contain four black chips and six white chips (up to inversion of the stack)?

2.56 How many circular necklaces of 10 beads may be made from two types of beads with five of each type?

2.57 How many 11-bead circular necklaces are there with three red beads, three white beads, and five blue beads?

2.58 How many ways may eight identical markers be placed on an 8 × 8 square grid (up to a rotation of the grid)?

2.59 How many ways may the vertices of an octagon be colored with three colors (up to symmetry of the octagon)?

2.60 How many ways may the faces of a cube be colored using three colors (up to symmetry of the cube)?

2.61 (a) How many ways may the 12 faces of a regular dodecahedron be colored with two colors?

(b) How many ways with six faces of each color?

2.62 How many ways can the faces of a regular icosahedron be colored with three colors (up to symmetry of the icosahedron)?

2.8 The number of graphs

In order to use Pólya’s theorem to count nonisomorphic graphs of order n, we need to determine the cycle index of the appropriate group action. A graph G = (V, E) may be identified with a function f : [V]² → {0, 1}, where f({x, s}) = 1 or 0 according to whether or not {x, y} is a member of E. We say that two graphs G₁ and G₂ are isomorphic if there is a bijection between their vertex sets V₁ to V₂ that preserves adjacency. We normally regard two isomorphic graphs as the same graph. Two graphs are isomorphic if the corresponding functions are equal up to a permutation of V. As any permutation of V is allowed, the group acting on V is S_n, where n = |V|. This group induces an action on [V]² by the rule {x, y} {gx, gy}. We call this action [S_n]². Our goal is to calculate the cycle index Z([S_n]²).

Assume that g S_n has cycle structure c = (c(1),…,c(n)). We determine the cycle structure of g as a permutation of [V]² by assuming that {x, y} [V]² and examining four cases. We call the cycles in [V]² pair-cycles.

Case 1. Suppose that x and y are in cycles of different lengths a and b. There are c(a) choices for which cycle contains x and c(b) choices for which cycle contains y. Given these choices, the ab ordered pairs in the Cartesian product of the two cycles are partitioned into pair-cycles of length lcm(a, b). The number of such cycles is ab/lcm(a, b) = gcd(a, b). The contribution to Z([S_n]²) is

(2.45)

Case 2. Suppose that x and y are in different cycles of the same length a. There are choices for the cycles. The pair-cycle has length a, and the number of pair-cycles is a. The contribution to Z([S_n]²) is

(2.46) equation

Case 3. Suppose that x and y are in the same cycle of odd length a. There are c(a) choices for the cycle containing x and y and (a – 1)/2 choices for the gap between x and y. The pair-cycles all have length a. The contribution to Z([S_n]²) is

(2.47)

Case 4. Suppose that x and y are in the same cycle of even length a. This case is the same as Case 3 except for one important difference. There are still c(a) choices for the cycle containing x and y. But now, although the typical pair-cycle has length a, and there are (a – 2)/2 choices for the gap between x and y, there is also the possibility that x and y are a/2 units apart in the cycle, so that the pair-cycle has length a/2. The contribution to Z([S_n]²) is

(2.48)

Combining the formulas (2.45), (2.45), (2.45), and (2.45), we obtain a formula for the cycle index of [S_n]²:

(2.49) equation

The cycle indexes for 2 ≤ n ≤ 4 are easily calculated by hand:

(2.50)

(2.51)

(2.52)

Let’s examine the polynomial Z([S₄]²). The 24 permutations of the set {1, 2, 3, 4} induce permutations of the six unordered pairs of elements. The identity permutation results in six fixed pairs. The six permutations of type 2 + 1 + 1 result in two fixed points and two 2-cycles in the set unordered pairs. The three permutations of type 2 + 2 result in two fixed points and two 2-cycles. The eight permutations of type 3 + 1 result in two 3-cycles. The six 4-cycles result in one 2-cycle and one 4-cycle.

We apply Pólya’s theorem to the cycle index Z([S₄]²) to enumerate nonisomorphic graphs of order 4 by number of edges:

(2.53)

The coefficient of y^k₁y^6–k₂ in this enumerating polynomial is the number of nonisomorphic graphs of order 4 with k edges and 6 – k non-edges. The symmetry in the polynomial comes from the fact that G and H are isomorphic if and only if their complements and are isomorphic.

We verify formula (2.53) by presenting in Figure 2.6 all nonisomorphic graphs of order 4 arranged by number of edges.

Figure 2.6 Graphs of order 4 and their enumerating monomials.

The total number of nonisomorphic graphs of order 4 is

equation

The number of nonisomorphic graphs. The number of nonisomorphic graphs of order n is

Let g(n) be the number of nonisomorphic graphs on n vertices. Table 2.3 gives the value of g(n) for small n. We will show that

Table 2.3 The number of nonisomorphic graphs.

order	number of graphs
1	1
2	2
3	4
4	11
5	34
6	156
7	1044
8	12346
9	274668
10	12005168

equation

Because is the number of labeled graphs, this means that almost all graphs have no nontrivial automorphisms. In fact, g(n) ~ c(n), where c(n) is the number of nonisomorphic connected graphs of order n. See [14].

Open problem. Find a formula for the number of nonisomorphic graphs of order n that contain a triangle.

Open problem. Find a formula for the number of nonisomorphic planar graphs of order n.

EXERCISE

2.63 Use formula (2.49) to find the number of graphs of order 5 (up to isomorphism).

2.64 How many ways are there to color the edges of K₆ with three colors (up to isomorphism)?

2.65 How many ways can the edges of the graph K_{3, 3} be colored with nine colors (up to isomorphism of the graph)?

2.66 Write the generating function for nonisomorphic multigraphs in terms of Z(S_n). (A multigraph is a graph in which pairs of vertices can be joined by more than one edge.)

2.67 How many nonisomorphic multigraphs of order 4 have at most five edges?

2.68 Prove the identity

equation

2.69 (a) Let Z(S₀) = 1. Prove the identity

(b) Let a_n be the number of onto functions f : X → X, where |X| = n, with the property that f(f(x)) = x for all x

X. Show that the exponential generating function for the sequence

Note that a_n is the number of involutions of an n-set and also the number of n × n symmetric permutation matrices. See Exercise 2.45.

2.9 Symmetries in domain and range

Suppose that X and Y are finite nonempty sets (|X| = m, |Y| = n) acted upon by finite groups G and H, respectively. We picture this situation with the following diagram:

We want to define what it means for two functions to be the same under these group actions. To this end, we define a new action H^G on the set of functions Y^X = {f: X → Y} as follows: if g G, h H, and f Y^X, then (h^g f)(x) = hf (g⁻¹ x). (The reader should check that the two axioms for a group action are satisfied.) We say that two functions are equivalent if they are in the same orbit of the action and inequivalent otherwise. This definition is a generalization of the labeled/unlabeled set paradigm. If G is the symmetric group S_m, then X is unlabeled, and if G is the identity group E_m, then X is labeled. Likewise, if H is S_n, then Y is unlabeled, and if H is E_n, then Y is labeled. For any groups G and H, Burnside’s lemma gives the number of inequivalent functions:

(2.54) equation

where Ω(g, h) is the number of functions fixed by h^g.

If f is fixed by h^g, then f(g⁻¹ x) = hf(x) for all x X. Thus f(x) = y implies that f(g⁻¹ x) = hf(x) = hy, which in turn implies that f(g⁻² x) = h²y. In general, f(g⁻ⁱ x) = hⁱ y. It follows that if x is in a cycle of length i in g and y is in a cycle of length j in h, then j divides i. Furthermore, we have shown that the correspondence between the two cycles is completely determined by the relation f(x) = y. There are j choices for the image of x. Suppose that g has cycle structure (c(1),…,c(m)) and h has cycle structure (d(1),…,d(n)). Then, given i and a particular cycle of length i in g, the number of fixed functions is

(2.55) equation

and the total number of fixed functions is

(2.56)

Formulas (2.54) through (2.56) combine to yield the following formula of N. G. de Bruijn (1918–2012).

De Bruijn’s formula. If finite groups G and H act on finite nonempty sets X and Y, respectively, then the number of inequivalent functions is given by

(2.57) equation

We apply de Bruijn’s formula to the problem of counting self-symmetric graphs of order n, that is, graphs G for which G is isomorphic to its complement . In the previous section we determined the cycle index Z([S_n]²) and the number of nonisomorphic graphs of order n, i.e., g(n) = Z([S_n])[x_i ← 2]. If [S_n]² acts on the set X = {1,…,n} and S₂ acts on Y = 11, then functions correspond to nonisomorphic graphs where G and its complement are regarded as the same. Hence, the number of such functions is

(2.58) equation

Let g^* (n) be the number of self-complementary graphs of order n. As 2N(n) counts nonisomorphic graphs with each self-complementary graph counted twice, we obtain the formula

(2.59)

For example, g^*(4) = 2N(4) – g(4) = 2 · 6 – 11 = 1, and the unique self-complementary graph of order 4 is the path P₄.

EXERCISE

2.70 How many self-complementary graphs are there of orders 5 and 6?

2.71 Use a computer to calculate the number of self-complementary graphs of orders 7 through 12.

2.10 Asymmetric graphs

Our formula for the number of nonisomorphic graphs of order n is

(2.60) equation

where c = (c₁,…,c_n) is the cycle type of a permutation of {1,…,n}, the number of permutations of such a cycle type is

(2.61)

and

(2.62) equation

We will prove that

(2.63)

This result means that the typical unlabeled graph has no nontrivial symmetries and hence is an asymmetric graph.

The lower bound is trivial:

Now we prove the upper bound. Assume that the permutation has n – j fixed points, where 0 ≤ j ≤ n. The case j = 0 gives the term !. We will show that the other terms are bounded above by expressions which, upon division by !, tend to 0 as n → ∞.

We have

equation

The number of permutations with n – j fixed points is bounded above by

equation

The case j = 2 yields the exponent

equation

Upon dividing by , the contribution is bounded by 2^{−n+2+2 log₂ n}, which tends to 0 as n → ∞.

Now let’s look at the case j ≥ 3, so that 1 – j/2 < O. We obtain

equation

and

Upon dividing by !, the contribution is bounded by 2^{n(1–j/2)+j log₂ n}, which tends to 0 as n → ∞.

Erds and Rényi theorem (1963). Almost all unlabeled graphs are asymmetric:

equation

EXERCISE

2.72 Find an asymmetric graph with six vertices. Show that no graph with five vertices is asymmetric.

2.73 Use a computer to compare g(n) and

! for 2 ≤ n ≤ 20.

2.74 Estimate g(100).

Notes

The notation for falling factorial and rising factorial varies considerably. The falling factorial function is often written (x)_n.

James Sylvester (1814–1897) introduced the notion of Ferrers diagrams in 1853. Apparently they were discovered by Norman Ferrers (1829–1903).

G. H. Hardy and Srinivasa Ramanujan (1887–1920) were the first mathematicians to find an explicit formula for p(n). The most elementary asymptotic estimate is log_e p(n) ~ π(2n/3)^1/2. See [12] for details.

In 1937 George Pólya (1887–1985) published the formula for enumerating graphs in connection with a problem concerning the number of chemical isomers. The language of his counting theory was quite descriptive. A function from X to Y is called a configuration. The elements of X are places and the elements of Y are figures. The store inventory is and the pattern inventory is . Thus the basic problem is to find the number of G-inequivalent patterns. An alternative approach to the Pólya theory of counting was undertaken independently by John H. Redfield (1879–1944) in 1927.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Chapter 2: Generating Functions

Create new playlist

Sign In

Sign Up

CHAPTER 2

GENERATING FUNCTIONS

2.1 Rational generating functions

EXAMPLE 2.1 Generating function for the Fibonacci sequence

EXAMPLE 2.2

EXAMPLE 2.3 Change for a dollar

EXAMPLE 2.4 Alcuin’s sequence

EXAMPLE 2.5

EXERCISES

2.2 Special generating functions

EXAMPLE 2.6 Expected number of cycles in a permutation

EXAMPLE 2.7 Rook walks

EXERCISES

2.3 Partition numbers

EXAMPLE 2.8

EXERCISES

2.4 Labeled and unlabeled sets

EXAMPLE 2.9

EXAMPLE 2.10

X labeled, Y labeled

X unlabeled, Y labeled

X labeled, Y unlabeled

X unlabeled, Y unlabeled

EXERCISE

2.5 Counting with symmetry

EXAMPLE 2.11

EXAMPLE 2.12

EXAMPLE 2.13

EXAMPLE 2.14

EXAMPLE 2.15

EXAMPLE 2.16

EXAMPLE 2.17

EXAMPLE 2.18 Average number of fixed points of a permutation

EXAMPLE 2.19 Stacks of chips

EXAMPLE 2.20 Circular necklaces

EXERCISE

2.6 Cycle indexes

EXAMPLE 2.21 Circular necklaces (again)

EXERCISE

2.7 Pólya’s theorem

EXAMPLE 2.22 Circular necklaces (yet again)

EXAMPLE 2.23 Circular necklaces (one more time)

EXERCISE

2.8 The number of graphs

EXERCISE

2.9 Symmetries in domain and range

EXERCISE

2.10 Asymmetric graphs

EXERCISE

Notes

Table of Contents for
Chapter 2: Generating Functions