3.2. Complexity Issues

Given an algorithm (or an implementation of it), the time and space required to execute the algorithm on a machine depend heavily on the machine’s architecture and on the compiler. But this does not mean that we cannot make some general theoretical estimates. The so-called asymptotic estimates that we are going to introduce now approach the real situation more and more closely as the input size tends to infinity. For finite input sizes (which is always the case in practice), these theoretical predictions turn out to provide valuable guidelines.

3.2.1. Order Notations

We start with the following important definitions.

Definition 3.1.

Let f and g be positive real-valued functions of natural numbers.

  1. f is said to be bounded above by g or of the order of g, denoted f = O(g), if there exist an integer n_0 and a positive real constant c such that f(n) ≤ cg(n) for all n ≥ n_0. In this case, we also say that g is bounded below by f and denote this by g = Ω(f).

  2. If f = O(g) and g = O(f), we say that f and g are of the same order and denote this by f = Θ(g) (or by g = Θ(f)). Equivalently, f = Θ(g) if and only if f = O(g) and f = Ω(g); that is, if and only if there exist an integer n_0 and real positive constants c_1, c_2 such that c_1 g(n) ≤ f(n) ≤ c_2 g(n) for all n ≥ n_0.

  3. f is said to be of strictly lower order than g, denoted f = o(g), if f(n)/g(n) tends to 0 as n tends to infinity. In other words, f = o(g) if and only if for every real positive constant c (however small it may be) there exists an integer n_c such that f(n) < cg(n) for all n ≥ n_c. If f = o(g), we also say that g is of strictly higher order than f and denote this by g = ω(f). Thus g = ω(f) if and only if for every real positive constant c (however large it may be) there exists an integer n_c such that g(n) > cf(n) for all n ≥ n_c.
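For instance, the function f(n) = 3n^2 + 5n + 7 satisfies f(n) ≤ 4n^2 for all n ≥ 7, so f = O(n^2) with the witnesses c = 4 and n_0 = 7 in part 1 of the definition. A quick numeric check in Python (our illustration, not part of the text):

    # f(n) = 3n^2 + 5n + 7 is bounded above by c*g(n) = 4n^2 from n0 = 7 onwards
    f = lambda n: 3*n**2 + 5*n + 7
    assert all(f(n) <= 4 * n**2 for n in range(7, 10**5))
    assert f(6) > 4 * 6**2   # the bound indeed fails just below n0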

Example 3.1.
  1. Let f(n) := a_d n^d + · · · + a_1 n + a_0 with d ≥ 0, real coefficients a_0, . . . , a_d, and a_d > 0. Then f = Θ(n^d). This heuristically means that as n becomes sufficiently large, the leading term a_d n^d dominates over the other terms, and apart from the constant of proportionality a_d the function f(n) grows with n as n^d does. If f = Θ(n^d) for some integer d > 0, we say that f is of polynomial order in n.[1] A Θ(1) function is often called a constant function.

    [1] This is not the complete truth. Functions like n^2.3 or n^3 (log n)^2 would be better included in the polynomial family. Thus, we may define f to be of polynomial order (in n) if f = O(n^d) and f = Ω(n^d′) for some positive real constants d, d′. Similar comments hold for poly-logarithmic and exponential orders.

  2. If f = Θ((log n)^a) for some real a > 0, we say that f is of poly-logarithmic order in n. By Exercise 3.2(b), any function of poly-logarithmic order grows asymptotically slower than any function of polynomial order.

  3. If f = Θ(a^n) for some real a > 1, f is said to be of exponential order in n. Again by Exercise 3.2(b), any function of exponential order grows asymptotically faster than any function of polynomial order.

  4. Now, consider a function of the form

    Equation 3.1

    f(n) = exp(c n^α (ln n)^(1−α))

    for real c > 0 and for 0 ≤ α ≤ 1. For α = 0, we have f = Θ(n^c); that is, f is of polynomial order. On the other extreme, if α = 1, then f = Θ(a^n), where a := exp(c); that is, f is of exponential order. If 0 < α < 1, we say that f is of subexponential order in n, since the order of f is somewhere in between polynomial and exponential. We will come across functions of subexponential orders quite frequently in the rest of the book. Note that as α increases from 0 to 1, the order of f also increases monotonically from polynomial to exponential. (A numerical illustration of this behaviour follows this example.)

  5. A function f = O(n^a (log n)^b) with a > 0 and b ≥ 0 is often denoted using the soft O-notation: f = O~(n^a). This means that, up to multiplication by a polynomial in log n, the function f is of the order of n^a. Similarly, if f = O(a^n g(n)) for a > 1 and for some g(n) of polynomial order, we say that f = O~(a^n). Intuitively speaking, the O-notation hides constant multipliers, whereas the soft O-notation additionally suppresses multipliers of strictly lower order (poly-logarithmic ones in the first case, polynomial ones in the second).

  6. The notion of order can be readily extended to functions of two or more input variables. For example, for positive real-valued functions f, g of two positive integer variables m, n, one says f = O(g) if for some m_0, n_0 and for some positive real constant c one has f(m, n) ≤ cg(m, n) for all m ≥ m_0 and n ≥ n_0. The function f(m, n) = m^3 2^n is of polynomial order in m, but of exponential order in n.
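Equation (3.1) can be explored numerically. The following Python sketch (our illustration; the function name L and the sample values are ours) shows how the growth climbs from polynomial at α = 0 to exponential at α = 1:

    from math import exp, log

    def L(n, alpha, c=1.0):
        # f(n) = exp(c * n^alpha * (ln n)^(1 - alpha)); see Equation (3.1)
        return exp(c * n**alpha * log(n)**(1 - alpha))

    for alpha in (0.0, 0.5, 1.0):
        print(alpha, [f"{L(n, alpha):.3g}" for n in (10, 100, 500)])
    # alpha = 0.0 gives n^c (polynomial), alpha = 1.0 gives exp(cn) (exponential);
    # intermediate values of alpha give subexponential growth between the two.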

The order notation is used to analyse algorithms in the following way. For an algorithm, the input size is defined as the total number of bits needed to represent the input of the algorithm. We find asymptotic estimates of the running time and the memory requirement of the algorithm in terms of its input size. Let f(n) denote the running time[2] of an algorithm A for an input of size n. If f(n) = Θ(n^a) (or, more generally, if f = O(n^a)) for some a > 0, A is called a polynomial-time algorithm. If a = 1 (resp. 2, 3, . . .), then A is specifically called a linear-time (resp. quadratic-time, cubic-time, . . .) algorithm. A Θ(1) algorithm is often called a constant-time algorithm. If f = Θ(b^n) for some b > 1, A is called an exponential-time algorithm. Similarly, if f satisfies Equation (3.1) with 0 < α < 1, A is called a subexponential-time algorithm.

[2] The practical running time of an algorithm may vary widely depending on its implementation and also on the processor, the compiler and even on run-time conditions. Since we are talking about the order of growth of running times in relation to the input size, we neglect the constants of proportionality, and so these variations are usually not a problem. To be more concrete, one may measure the running time by the number of bit operations performed by the algorithm.

One has similar classifications of an algorithm in terms of its space requirements, namely, polynomial-space, linear-space, exponential-space, and so on. We can afford to be lazy and drop “-time” from the adjectives introduced in the previous paragraph. Thus, an exponential algorithm is an exponential-time algorithm, not an exponential-space algorithm.

It is expedient to note here that the running time of an algorithm may depend on the particular instance of the input, even when the input size is kept fixed. For an example, see Exercise 3.3. We should, therefore, be prepared to distinguish, for a given algorithm and for a given input size n, between the best (that is, shortest) running time f_b(n), the worst (that is, longest) running time f_w(n), the average running time f_a(n) over all possible inputs (of size n) and the expected running time f_e(n) for a randomly chosen input (of size n). In typical situations, f_w(n), f_a(n) and f_e(n) are of the same order, in which case we simply denote, by running time, any one of these functions. If this is not the case, an unqualified use of the phrase running time denotes the worst running time f_w(n).

The order notation, though apparently attractive and useful, has certain drawbacks. First, it depicts the behaviour of functions (like running times) as the input size tends to infinity. In practice, one always has finite input sizes. One can check that if f(n) = n^100 and g(n) = (1.01)^n are the running times of two algorithms A and B respectively (for solving the same problem), then f(n) ≤ g(n) if and only if n = 1 or n ≥ 117,309. But then if the input size is only 1,000, one would prefer the exponential-time algorithm B over the polynomial-time algorithm A. Thus asymptotic estimates need not give correct suggestions in practical ranges of interest. On the other hand, an algorithm which is a product of human intellect does not tend to have such extreme values for the parameters; that is, in a polynomial-time algorithm the degree is usually ≤ 10, and the base for an exponential-time algorithm is usually not as close to 1 as 1.01 is. If we have f(n) = n^5 and g(n) = 2^n as the respective running times of the algorithms A and B, then A outperforms B (in terms of speed) for all n ≥ 23.
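The crossover point quoted above is easy to verify with a short computation (a quick numerical sketch in Python; the logarithmic comparison is our rephrasing of n^100 ≤ 1.01^n):

    from math import log

    # smallest n >= 2 with n^100 <= 1.01^n, i.e. 100*ln(n) <= n*ln(1.01)
    n = 2
    while 100 * log(n) > n * log(1.01):
        n += 1
    print(n)  # prints 117309, the crossover mentioned above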

The second drawback of the order notation is that it suppresses the constant of proportionality; that is, an algorithm whose running time is 100n^2 has the same order as one whose running time is n^2. This is, however, a distinction that we cannot neglect in practice. In particular, when we compare two different implementations of the same algorithm, the one with the smaller constant of proportionality is more desirable than the one with the larger constant. This is where implementation tricks prove to be important and even indispensable for large-scale applications.

3.2.2. Randomized Algorithms

A deterministic algorithm is one that always follows the same sequence of computations (and thereby produces the same output) for a given input. The deterministic running time of a computational problem P is the fastest of the running times (in the order notation) of the known deterministic algorithms that solve P.

If an algorithm makes some random choices during execution, we call the algorithm randomized or probabilistic. The exact sequence of computations followed by the algorithm depends on these random choices and as a result different executions of the same algorithm may produce different outputs for a given input. At first glance, randomized algorithms look useless, because getting different outputs for a given input is apparently not what one would really want. But there are situations where this is desirable. For example, in an implementation of the RSA protocol, one generates random primes p and q of given bit lengths. Here we require our prime generation procedure to produce different primes during different executions (that is, for different entities on the net).

More importantly, randomized algorithms often provide practical computational solutions for many problems for which no practical deterministic algorithms are known. We will shortly encounter many such situations where randomized algorithms are the simplest and/or fastest algorithms known. However, this sudden enhancement in performance by random choices does not come for free. To explain the so-called darker sides of randomization, we describe two different types of randomized algorithms.

A Monte Carlo algorithm is a randomized algorithm that may produce incorrect outputs. However, for such an algorithm to be useful, we require that the running time be always small and the probability of an error sufficiently low. A good example of a Monte Carlo algorithm is the Miller–Rabin algorithm (Algorithm 3.13) for testing the primality of an integer. For an integer of bit size n, the Miller–Rabin test with t iterations runs in time O(tn^3). Whenever the algorithm outputs false, it is always correct. But an answer of true is incorrect with an error probability ≤ 2^(−2t); that is, it certifies a composite integer as a prime with probability ≤ 2^(−2t). For t = 20, an error is expected to occur less than once in every 10^12 executions. With this little sacrifice we achieve a running time of O(n^3) (for a fixed t), whereas the best deterministic primality testing algorithm (known to the authors at the time of writing this book) takes time O(n^7.5) and hence is not practical.
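Below is a minimal Python sketch of the Miller–Rabin test (our simplified rendering for illustration, not the book’s Algorithm 3.13; t is the iteration count discussed above):

    import random

    def miller_rabin(n, t=20):
        # Monte Carlo primality test: an output of False is always correct;
        # an output of True errs with probability at most 2^(-2t).
        if n < 4:
            return n in (2, 3)
        if n % 2 == 0:
            return False
        s, d = 0, n - 1
        while d % 2 == 0:               # write n - 1 = 2^s * d with d odd
            s, d = s + 1, d // 2
        for _ in range(t):
            a = random.randrange(2, n - 1)
            x = pow(a, d, n)
            if x in (1, n - 1):
                continue
            for _ in range(s - 1):
                x = x * x % n
                if x == n - 1:
                    break
            else:
                return False            # a is a witness: n is certainly composite
        return True                     # n is prime with high probability

    print(miller_rabin(2**61 - 1))      # True: 2^61 - 1 is a Mersenne prime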

A Las Vegas algorithm is a randomized algorithm which always produces the correct output. However, the running time of such an algorithm depends on the random choices made. For such an algorithm to be useful, we expect that for most random choices the running time is small. As an example, consider the problem of finding a random (monic) irreducible polynomial of degree n over a finite field 𝔽_q. Algorithm 3.22 tests the irreducibility of a polynomial in 𝔽_q[x] in deterministic polynomial time. We generate random polynomials of degree n and check the irreducibility of these polynomials by Algorithm 3.22. From Section 2.9.2, we know that a randomly chosen monic polynomial of degree n over a finite field is irreducible with an approximate probability of 1/n. This implies that after O(n) random polynomials are tried, one expects to find an irreducible polynomial. The resulting Las Vegas algorithm (Algorithm 3.23) runs in expected polynomial time. It may, however, happen that for certain random choices we keep on generating reducible polynomials an exponential number of times, but the likelihood of such an accident is very, very low (Exercise 3.5).
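The following self-contained Python sketch implements this Las Vegas strategy over 𝔽_2 for concreteness (our illustration, not the book’s Algorithm 3.23; polynomials are encoded as integers with bit i holding the coefficient of x^i, and irreducibility is checked with Rabin’s criterion rather than Algorithm 3.22):

    import random

    def pm_mul(a, b, f):
        # Multiply polynomials a, b over F_2 (ints, bit i = coeff of x^i) mod f.
        # Assumes deg a, deg b < deg f.
        n, r = f.bit_length() - 1, 0   # n = deg f
        while b:
            if b & 1:
                r ^= a
            b >>= 1
            a <<= 1
            if (a >> n) & 1:           # reduce as soon as deg a reaches n
                a ^= f
        return r

    def pgcd(a, b):
        # gcd of polynomials over F_2
        while b:
            while a and a.bit_length() >= b.bit_length():
                a ^= b << (a.bit_length() - b.bit_length())
            a, b = b, a
        return a

    def prime_divisors(n):
        ps, d = [], 2
        while d * d <= n:
            if n % d == 0:
                ps.append(d)
                while n % d == 0:
                    n //= d
            d += 1
        if n > 1:
            ps.append(n)
        return ps

    def is_irreducible(f):
        # Rabin's criterion: f of degree n is irreducible over F_2 iff
        # x^(2^n) = x (mod f) and gcd(x^(2^(n/q)) - x, f) = 1 for every prime q | n
        n = f.bit_length() - 1
        for q in prime_divisors(n):
            h = 2                      # the polynomial x
            for _ in range(n // q):
                h = pm_mul(h, h, f)    # repeated squaring: h = x^(2^(n/q))
            if pgcd(f, h ^ 2) != 1:    # h ^ 2 encodes h - x over F_2
                return False
        h = 2
        for _ in range(n):
            h = pm_mul(h, h, f)
        return h == 2

    def random_irreducible(n):
        # Las Vegas loop (n >= 2): retry random monic degree-n polynomials
        # (constant term 1, else x divides f) until an irreducible one appears
        while True:
            f = (1 << n) | (random.getrandbits(n - 1) << 1) | 1
            if is_irreducible(f):
                return f

    print(bin(random_irreducible(8)))  # a random degree-8 irreducible over F_2

Since roughly one polynomial in n is irreducible, the while loop above performs O(n) iterations on average, which is exactly the expected polynomial running time claimed for the Las Vegas algorithm.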

An algorithm is said to be a probabilistic or randomized polynomial-time algorithm, if it is either a Monte Carlo algorithm with polynomial worst running time or a Las Vegas algorithm with polynomial expected running time. Both the above examples of randomized algorithms are probabilistic polynomial-time algorithms. A combination of these two types of algorithms can also be conceived; namely, algorithms that produce correct outputs with high probability and have polynomial expected running time. Some computational problems are so challenging that even such probably correct and probably fast algorithms are quite welcome.

We finally note that there are certain computational problems for which the deterministic running time is exponential and for which randomization also does not help much. In some cases, we have subexponential randomized algorithms which are still too slow to be of reasonable practical use. Some of these so-called intractable problems are at the heart of the security of many public-key cryptographic protocols.

3.2.3. Reduction Between Computational Problems

In the last two sections, we have introduced theoretical measures (the order notations) for estimating the (known) difficulty of solving computational problems. In this section, we introduce another concept by which we can compare the relative difficulty of two computational problems.

Let P1 and P2 be two computational problems. We say that P1 is polynomial-time reducible to P2, denoted P1 ≤_P P2, if there is a polynomial-time algorithm which, given a solution of P2, provides a solution for P1. This means that if P1 ≤_P P2, then the problem P1 is no more difficult than P2, apart from the extra polynomial-time reduction effort. In that case, if we know an algorithm to solve P2 in polynomial time, then we have a polynomial-time algorithm for P1 too. If P1 ≤_P P2 and P2 ≤_P P1, we say that the problems P1 and P2 are polynomial-time equivalent and write P1 ≅ P2.

In order to give an example of these concepts, we let G be a finite cyclic multiplicative group of order n and g a generator of G. The discrete logarithm problem (DLP) is the problem of computing, for a given a ∈ G, an integer x such that a = g^x. The Diffie–Hellman problem (DHP), on the other hand, is the problem of computing g^(xy) from the given values of g^x and g^y. If one can compute y from g^y, one can also compute g^(xy) = (g^x)^y by performing an exponentiation in the group G. Therefore, DHP ≤_P DLP, if exponentiations in G can be computed in polynomial time. In other words, if a solution for DLP is known, a solution for DHP is also available; that is, DHP is no more difficult than DLP except for the additional exponentiation effort. However, the reverse implication (that is, whether DLP ≤_P DHP) is not known for many groups.
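A toy Python sketch of this reduction in the group (Z/pZ)* (our illustration; the brute-force dlp_oracle stands in for any DLP solver and is feasible only for tiny groups):

    def dlp_oracle(g, a, p):
        # Solve a = g^x (mod p) by brute force; a stand-in for a real DLP solver.
        x, t = 0, 1
        while t != a:
            t = t * g % p
            x += 1
        return x

    def dhp_via_dlp(g, gx, gy, p):
        # DHP <=_P DLP: one oracle call plus one exponentiation.
        y = dlp_oracle(g, gy, p)       # recover y from g^y
        return pow(gx, y, p)           # (g^x)^y = g^(xy)

    # toy example: p = 101, g = 2 generates a group of order 100
    p, g, x, y = 101, 2, 37, 55
    assert dhp_via_dlp(g, pow(g, x, p), pow(g, y, p), p) == pow(g, x * y, p)

The reduction makes exactly one call to the DLP solver and one exponentiation, so it runs in polynomial time whenever exponentiation in G does, as claimed above.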

So far we have assumed that our reduction algorithms are deterministic. If we allow randomized (that is, probabilistic) polynomial-time reduction algorithms, we can similarly introduce the concepts of randomized polynomial-time reducibility and of randomized polynomial-time equivalence. We urge the reader to formulate the formal definitions for these concepts.

Exercise Set 3.2

3.1
  (a) Sort the following functions in increasing order of growth. (Don’t mind if some of these functions are not defined for a few values of n.)

      10^12, 2^n, 2^(2n), 2^(n^2), 100n^2, 10^(-3) n^3, 1/n, n!, n^n,

      log n, (log n)/n, n/log n, n^2 log n, n(log n)^2, (0.1)^(log n), (log n)^n,

      1/log n, 10^6 (log n)^100, log log n, 2^(log log n), n^(log log n),

      exp(n^(1/3) (ln n)^(2/3)), exp((ln n)^(1/3) (ln ln n)^(2/3)).

  (b) Evaluate the functions of Part (a) at n = 10^i for i = 1, 2, . . . , 10 and conclude that as n gets larger, the asymptotic ordering agrees more and more closely with the actual ordering.

3.2
  (a) Show that for any real a > 1 and b > 0 one has n^b = o(a^n).

  (b) For any positive real c, d, show that (log n)^c = o(n^d).

  (c) Show that if f = O(g) and g = O(h), then f = O(h).

  (d) Give an example to show that f = O(g) does not necessarily imply f = Θ(g).

  (e) Give an example of a function f with f = O(n^(1+ε)) for every ε > 0, but f is not O(n).

3.3 Suppose that an algorithm A takes as input a bit string and runs in time g(t), where t is the number of one-bits in the input string. Let f_b(n), f_w(n), f_a(n) and f_e(n) respectively denote the best, worst, average and expected running times of A for inputs of size n. Derive the following table under the assumption that each of the 2^n bit strings of length n is equally likely.

    Running times

    g(t)    f_b(n)    f_w(n)    f_a(n)        f_e(n)
    t       0         n         n/2           n/2
    t^2     0         n^2       n(n + 1)/4    n^2/4
    2^t     1         2^n       (3/2)^n       2^(n/2)

3.4
  (a) Show that an exponential-space (resp. subexponential-space) algorithm must be (at least) exponential-time (resp. subexponential-time) too. You may assume that at any point of time a computing device can access (read/write) at most a finite number of memory locations.

  (b) Give an example of an algorithm that is exponential-time but polynomial-space.

3.5 Consider the Las Vegas algorithm discussed in Section 3.2.2 for generating a random irreducible polynomial of degree n over 𝔽_q. Assume that a randomly chosen polynomial in 𝔽_q[x] of degree n has (an exact) probability of 1/n of being irreducible. Find the probability p_r that r polynomials chosen randomly (with repetition) from 𝔽_q[x] are all reducible. For n = 1000, calculate the numerical values of p_r for r = 10^i, i = 1, . . . , 6, and find the smallest integers r for which p_r ≤ 1/2 and p_r ≤ 10^(-12). Find the expected number of polynomials tested for irreducibility before the algorithm terminates.
3.6 Let n = pq be the product of two distinct primes p and q. Show that factoring n is polynomial-time equivalent to computing φ(n) = (p − 1)(q − 1), where φ is Euler’s totient function. (Assume that an arithmetic operation (including the computation of integer square roots) on integers of bit size t can be performed in polynomial time (in t).)
3.7 Let G be a finite cyclic multiplicative group and let H be the subgroup of G generated by an element h ∈ G whose order is known. The generalized discrete logarithm problem (GDLP) is the following: given a ∈ G, determine whether a ∈ H and, if so, find an integer x for which a = h^x. Show that GDLP ≅ DLP, if exponentiations in G can be carried out in polynomial time and if DLP in H is polynomial-time equivalent to DLP in G. [H]