Once you have mathematical certainty, there is nothing left to do or to understand.
—Fyodor Dostoevsky
We develop, in this chapter, the theory of randomized algorithms. A randomized algorithm uses a random number generator to determine some parameters of the algorithm. It is considered efficient if it computes correct solutions in polynomial time with high probability, with respect to the probability distribution of the random number generator. We introduce the probabilistic Turing machine (PTM) as a model for randomized algorithms and study the relation between the class of languages solvable by polynomial-time randomized algorithms and the complexity classes P and NP.
A randomized algorithm is a deterministic algorithm with the extra ability of making random choices during the computation that are independent of the input values. Using randomization, some worst-case scenarios may be hidden so that they occur only with small probability, and so the expected runtime is better than the worst-case runtime. We illustrate this idea by examples.
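A classic instance of this idea is quicksort with a randomly chosen pivot. The sketch below is illustrative (the function name is ours, not the text's): randomizing the pivot index turns every fixed input ordering into an average-case input, so no adversarially sorted input can reliably force the quadratic worst case.

```python
import random

def randomized_quicksort(a):
    """Quicksort with a randomly chosen pivot.

    The pivot index is the randomized parameter: its random choice
    hides the worst case, so the expected runtime is O(n log n) on
    every input, even though the worst-case runtime is O(n^2).
    """
    if len(a) <= 1:
        return a
    pivot = random.choice(a)  # the randomized parameter
    less = [x for x in a if x < pivot]
    equal = [x for x in a if x == pivot]
    greater = [x for x in a if x > pivot]
    return randomized_quicksort(less) + equal + randomized_quicksort(greater)
```

Note that this algorithm never errs; only its running time is a random variable.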
In the above, we have seen an example in which the randomized algorithm does not make mistakes and has an expected runtime lower than the worst-case runtime. We achieved this by changing a deterministic parameter i into a randomized parameter that transforms a worst-case input into an average-case input. (Indeed, all inputs are average-case inputs after the transformation.) In the following, we show another example of a randomized algorithm, one that runs faster than the brute-force deterministic algorithm but might make mistakes with a small probability. In this example, we avoid the brute-force exhaustive computation by a random search for witnesses (for the answer NO). A technical lemma shows that such witnesses are abundant, and so the random search algorithm works efficiently.
The claim (1) is obvious because the algorithm outputs NO only when a witness is found such that . The claim (2) follows from the following lemma. Let .
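A standard instance of this kind of random witness search is testing whether two polynomials are identical by evaluating them at random points: a point where they disagree is a witness for NO, and if the polynomials differ, such witnesses are abundant because a nonzero polynomial of degree d has at most d roots. The sketch below (all names are illustrative, not the text's) shows the pattern:

```python
import random

def probably_equal(p, q, degree_bound, trials=20):
    """Test whether polynomial functions p and q agree as polynomials.

    If p != q, then p - q is a nonzero polynomial of degree at most
    degree_bound, so it has at most degree_bound roots.  A random
    point from a set of size 2*degree_bound is therefore a witness
    (p(r) != q(r)) with probability at least 1/2.
    """
    for _ in range(trials):
        r = random.randrange(2 * degree_bound)
        if p(r) != q(r):
            return False   # witness found: definitely not equal
    return True            # probably equal; error probability <= 2**(-trials)

# (x+1)(x-1) versus x^2 - 1: always reported equal.
same = probably_equal(lambda x: (x + 1) * (x - 1),
                      lambda x: x * x - 1,
                      degree_bound=2)
```

The one-sided error mirrors the text's claims: a NO answer is always correct (it exhibits a witness), while a YES answer is wrong only with small probability.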
Our third example is the problem of testing whether a given integer is a prime number.
PRIMALITY TESTING (PRIME): Given a positive integer N in the binary form, determine whether N is a prime number.
PRIME is known to be in P, but all known deterministic algorithms are very slow. The following randomized algorithm gives a simple, faster solution, with an arbitrarily small error probability ε. The algorithm searches for a witness that asserts that the given input number is a composite number. A technical lemma shows that if the given number is a composite number, then there are abundant witnesses and, hence, the error probability is small. The technical lemmas are purely number-theoretic results, and we omit the proofs.
Lemma 8.5 above shows that the Legendre–Jacobi symbols can be computed by a polynomial-time algorithm similar to the Euclidean algorithm for the greatest common divisors.
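Such a Euclidean-style computation of the Jacobi symbol can be sketched as follows. The reduction rules used (factoring out 2, and quadratic reciprocity for the swap step) are the standard ones; this is a sketch in the spirit of Lemma 8.5, not necessarily the text's exact procedure.

```python
def jacobi(a, n):
    """Compute the Jacobi symbol (a/n) for odd n > 0.

    Like the Euclidean algorithm for gcd, each iteration replaces
    (a, n) by a strictly smaller pair, so the runtime is polynomial
    in the length of the inputs.
    """
    a %= n
    result = 1
    while a != 0:
        while a % 2 == 0:          # factor out 2: (2/n) = -1 iff n = 3, 5 (mod 8)
            a //= 2
            if n % 8 in (3, 5):
                result = -result
        a, n = n, a                # quadratic reciprocity
        if a % 4 == 3 and n % 4 == 3:
            result = -result
        a %= n
    return result if n == 1 else 0  # (a/n) = 0 when gcd(a, n) > 1
```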
Now we are ready for the randomized algorithm for primality testing.
Randomized Algorithm for PRIME
Input: n, ε;
let k := ⌈log₂(1/ε)⌉;
for i := 1 to k do
begin
- choose a random x in the range 1 ≤ x ≤ n − 1;
- if gcd(x, n) > 1 then output NO and halt;
- if x^((n−1)/2) ≢ (x/n) (mod n) then output NO and halt
end;
Output YES and halt.
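The algorithm above can be sketched in code as follows (this is the Solovay–Strassen scheme; the function names and the exact choice of k are ours):

```python
import math
import random

def jacobi(a, n):
    # Jacobi symbol (a/n) for odd n > 0, by Euclidean-style reduction
    # (cf. Lemma 8.5): factor out 2, then apply quadratic reciprocity.
    a %= n
    result = 1
    while a != 0:
        while a % 2 == 0:
            a //= 2
            if n % 8 in (3, 5):
                result = -result
        a, n = n, a
        if a % 4 == 3 and n % 4 == 3:
            result = -result
        a %= n
    return result if n == 1 else 0

def solovay_strassen(n, eps):
    """Randomized primality test searching for a compositeness witness.

    Always outputs True for an odd prime n (by Euler's criterion);
    for composite n it outputs True with probability at most eps.
    """
    if n == 2:
        return True
    if n < 2 or n % 2 == 0:
        return False
    k = max(1, math.ceil(math.log2(1 / eps)))  # (1/2)**k <= eps
    for _ in range(k):
        x = random.randrange(1, n)             # random x, 1 <= x <= n-1
        if math.gcd(x, n) > 1:
            return False                       # witness: a proper factor
        # Euler's criterion: for prime n, x^((n-1)/2) = (x/n) (mod n)
        if pow(x, (n - 1) // 2, n) != jacobi(x, n) % n:
            return False                       # witness: criterion fails
    return True                                # probably prime
```

Each round catches a composite with probability at least 1/2 (by the lemma on the abundance of witnesses), so k independent rounds drive the error below ε.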
From the polynomial-time computability of the Legendre–Jacobi symbols, the above algorithm always halts in polynomial time. In addition, by Euler's criterion, if n is a prime, then the above algorithm always outputs YES. Finally, the following lemma establishes the property that if n is a composite number, then the above algorithm outputs NO with probability at least 1 − ε.
The above examples demonstrate the power of randomization in the design and analysis of algorithms. In the following sections, we present a formal computational model for randomized computation and discuss the related complexity problems.
Our formal model for randomized computation is the PTM. A multi-tape probabilistic Turing machine (PTM) M consists of two components: a regular multi-tape DTM and a random-bit generator ϕ. In addition to the input, output, and work tapes, the machine M has a special random-bit tape. The computation of the machine M on an input x can be informally described as follows: At each move, the generator ϕ first writes a random bit b = 0 or 1 on the square currently scanned by the tape head of the random-bit tape. Then the DTM makes the next move according to the current state and the current symbols scanned by the tape heads of all tapes (except the output tape), including the random-bit tape. The tape head of the random-bit tape always moves to the right after each move.
For the sake of simplicity, we only define formally the computation of a 2-tape PTM, where the first tape is the input/output/work tape and the second tape is the random-bit tape. Also, we will use a fixed random-bit generator ϕ that, when asked, outputs a bit 0 or 1 with equal probability. This generator, thus, defines a uniform probability distribution U over infinite binary sequences in {0, 1}^∞.
With this fixed uniform random-bit generator ϕ, a 2-tape PTM M is formally defined by five components that define, as in Section 1.2, a 2-tape DTM, subject to the following extra conditions:
From the above conditions, we define a configuration of M to be an element in . Informally, a configuration indicates that the machine is currently in the state q, the current tape symbols in the first tape are , its tape head is currently scanning the first symbol of t, the random-bit tape contains symbols α that were generated by ϕ so far, and its tape head is scanning the cell to the right of the last bit of α. (Note that the random-bit tape symbols α do not affect the future configurations. We include them in the configuration only for the convenience of the analysis of the machine M.) Recall that is the next-configuration function on the configurations of the machine . Let and be two configurations of M. We write and say that γ2 is a successor of γ1, if there exists a bit such that
(Informally, b is the random bit generated by ϕ at this move. Thus the configuration of the DTM shows, in addition to the configuration of the PTM M, the new random bit b to be read by the underlying DTM.) We let the starred relation denote the reflexive transitive closure of the successor relation.
For each input x, let the initial configuration be γx, that is, . We say that a configuration γ1 of M is a halting configuration if there does not exist a configuration γ2 such that . In the following, we write to mean that α is an initial segment of β.
The above definition involves the probability on the space of infinite sequences and is not convenient in the analysis of practical randomized algorithms. Note, however, that in each halting computation, the machine M only uses finitely many random bits. Therefore, there is an equivalent definition that involves only probabilities over finite binary strings. To be more precise, we treat the underlying DTM as a 2-input DTM that satisfies the conditions (1) and (2) above. We assume that initially the first tape holds the first input x, with all other squares having blank symbols B, and its tape head scans the first symbol of x. Also assume that the second tape initially contains the second input α, with all other squares having the symbol 0, and the tape head scans the first symbol of α.
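Under this finite-string view, the acceptance probability of a machine that always halts within t coin tosses can be computed exactly by averaging the deterministic outcome over all 2^t random strings of length t. The toy sketch below (our illustration, not the formal PTM) makes this concrete:

```python
from itertools import product

def accept_probability(machine, x, t):
    """Exact Pr[machine accepts x], assuming every computation path
    halts within t coin tosses.

    machine(x, alpha) is a *deterministic* function of the input x
    and the finite random string alpha, mirroring the 2-input DTM
    view of a PTM.
    """
    accepted = 0
    for bits in product('01', repeat=t):
        if machine(x, ''.join(bits)):
            accepted += 1
    return accepted / 2 ** t

# A toy "machine": accept iff the majority of 3 coin tosses is 1.
prob = accept_probability(lambda x, alpha: alpha.count('1') >= 2, "anything", 3)
# prob == 0.5  (4 of the 8 length-3 strings have a majority of 1s)
```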
For notational purposes, we write M(x) = 1 to denote that M accepts x and M(x) = 0 to denote that M rejects x. That is, M(x) is a random variable whose distribution is given by the probabilities, over the random bits, that M accepts or rejects x.
Our definition above appears arbitrary when, for instance, the accepting probability of M(x) is exactly 1/2. An alternative, stronger definition for the set L(M) requires that if x ∉ L(M), then the rejecting probability of M(x) is greater than 1/2, and if both the accepting probability and the rejecting probability of M(x) are at most 1/2, then M(x) is undefined. For time-bounded probabilistic computation, however, as will be shown in Proposition 8.15, these two definitions are essentially equivalent.
Finally, we point out some discrepancies between our model of PTMs and practical randomized algorithms. The first difference is that we have simplified the model so that the random-bit generator ϕ can only generate a bit 0 or 1. In practice, one might wish to get a random number i in any given range. For instance, our model cannot directly generate a number uniformly distributed over a range whose size is not a power of 2. We might solve this problem by extending the model of PTMs to allow more general random number generators. Alternatively, we could just approximate the desired random number by our fixed random-bit generator ϕ (cf. Exercise 8.5).
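One standard way to approximate a general random number with fair bits is rejection sampling (a standard construction, not necessarily the one in Exercise 8.5): draw enough bits to cover the next power of two and retry whenever the result falls outside the desired range.

```python
import random

def random_in_range(m, coin=lambda: random.getrandbits(1)):
    """Uniform random number in {0, ..., m-1} using only a fair coin.

    Each round draws ceil(log2(m)) bits; the result is below m with
    probability > 1/2, so the *expected* number of rounds is < 2,
    although the worst-case number of rounds is unbounded.  This is
    why such a simulation is an approximation in worst-case time.
    """
    bits = max(1, (m - 1).bit_length())
    while True:
        r = 0
        for _ in range(bits):
            r = (r << 1) | coin()  # one call to the random-bit generator
        if r < m:
            return r               # accept: uniform over {0, ..., m-1}
        # otherwise reject and retry
```

The unbounded worst case is unavoidable: no fixed number of fair coin tosses yields an exactly uniform distribution over, say, three outcomes.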
The second difference is that in our formal model of PTMs, we defined the random-bit generator ϕ to generate a random bit at every step, whether or not the deterministic machine needs it. In practice, we only randomize a few parameters when it is necessary. We could modify our definition of PTMs to reflect this property. Exercise 8.4 shows that these two models are equivalent. We note, however, that in many applications the model of Exercise 8.4 is more practical, because generating a random bit could be much costlier than a move of the deterministic machine.
The time complexity of a PTM can be defined in two ways: the worst-case time complexity and the expected time complexity. The worst-case time complexity of a PTM M on an input x measures the time used by the longest computation paths of M(x). The expected time complexity of M on x is the average number of moves for M(x) to halt with respect to the uniform distribution of the random bits used. In both cases, if M(x) halts with probability less than 1, then the time complexity of M on x is ∞.
The expected time complexity is a natural complexity measure for randomized algorithms. People often use this complexity measure in the analysis of randomized algorithms. In the theory of polynomial-time computability, however, we will mostly work with the worst-case time complexity of PTMs. The main reason is that the worst-case time complexity is consistent with the time complexity measures of deterministic and nondeterministic machines, allowing us to study them in a unified model. In addition, for time- and error-bounded probabilistic computations, the worst-case time complexity is very close to the expected time complexity, as shown in the following proposition. We say two PTMs, M1 and M2, are equivalent if L(M1) = L(M2), and they are strongly equivalent if, for all inputs x, the random variables M1(x) and M2(x) have the same distribution.
As we are going to work mostly with the worst-case time complexity, we restrict ourselves to PTMs with a uniform time complexity. A PTM M has a uniform time complexity t(n) if, for all inputs x and all random-bit sequences α, the computation of M on x with random bits α halts in exactly t(|x|) moves. The following proposition shows that we do not lose much by considering only PTMs with uniform time complexity.
We are ready to define the complexity classes of probabilistic computation.
In Definition 8.9, the set L(M) was defined to be the set of all strings x with accepting probability greater than 1/2, and all other strings were defined to be in the complement of L(M). A slightly stronger definition requires that for all x ∉ L(M), the rejecting probability of M(x) is greater than 1/2. The following proposition shows that the class PP is robust with respect to these two different definitions.
We defined the class PP as the class of sets accepted by some polynomial-time PTMs. Our goal is to define the class of sets feasibly computable by randomized algorithms. Does the class PP really consist of probabilistically feasible sets? The answer is NO; the class PP appears too general, as demonstrated by the following relations.