Appendix B: The Essentials of Probability

In this section, we give a brief review of the basic probability concepts and definitions used or alluded to in this book. For further information, there are many excellent books on probability (e.g., see the book by Ross, 2002).

The space of all possible outcomes of an experiment is labeled Ω. Events are subsets of the space of all possible outcomes, A ⊂ Ω. Given two events A, B, the event

1. AB is the event either A occurs or B occurs, or they both occur;
2. AB is the event both A and B occur;
3. inline is called the impossible event, while the event Ω itself is called the sure event;
4. Ac = Ω A is called the complement of A, so either A occurs or, if A does not occur, then Ac occurs;

The probability of an event A is written as P(A) or Prob(A). It must be true that for any event A, 0 ≤ P(A) ≤ 1. In addition,

1. P(inline) = 0, P(Ω) = 1;
2. P(Ac) = 1 − P(A);
3. P(AB) = P(A) + P(B)− P(AB);
4. P(AB) = P(A)P(B) if A and B are independent;

Given two events A, B with P(B) > 0, the conditional probability of A given the event B has occurred, is defined by

Unnumbered Display Equation

Two events are therefore independent if and only if P(A | B) = P(A) so that the knowledge that B has occurred does not affect the probability of A. The very important laws of total probability are frequently used:

1. P(A) = P(AB) + P(ABc).
2. P(A) = P(A | B)P(B) + P(A | Bc)P(Bc).

These give us a way to calculate the probability of A by breaking down cases. Using the formulas, if we want to calculate P(A| B) but know P(B|A) and P(B|Ac), we can use Bayes’ rule:

Unnumbered Display Equation

A random variable is a function X: inline that is, a real-valued function of outcomes of an experiment. As such, this random variable takes on its values by chance. We consider events of the form

{ω inline Ω: X(ω) ≤ x} for values of inline For simplicity, this event is written as {Xx} and the function

Unnumbered Display Equation

is called the cumulative distribution function of X. We have

1. P(X > a) = 1 − FX(a).
2. P(a < Xb) = FX(b) − FX(a).

A random variable X is said to be discrete if there is a finite or countable set of numbers x1, x2, …, so that X takes on only these values with P(X = xi) = pi, where 0 ≤ pi ≤ 1, and ∑i pi = 1. The cumulative distribution function is a step function with a jump of size pi at each xi, inline

A random variable is said to be continuous if P(X = x) = 0 for every inline and the cumulative distribution function FX(x) = P(Xx) is a continuous function. The probability that a continuous random variable is any particular value is always zero. A probability density function for X is a function

Unnumbered Display Equation

In addition, we have

Unnumbered Display Equation

The cumulative distribution up to x is the area under the density from −∞ to x. With an abuse of notation, we often see P(X = x) = fX(x), which is clearly nonsense, but it gets the idea across that the density at x is roughly the probability that X = x.

Two random variables X and Y are independent if P(Xx and Yy) = P(Xx)P(Xy) for all inline If the densities exist, this is equivalent to f(x, y) = fX(x) fY(y), where f(x, y) is the joint density of (X, Y).

The mean or expected value of a random variable X is

Unnumbered Display Equation

and

Unnumbered Display Equation

In general, a much more useful measure of X is the median of X, which is any number satisfying

Unnumbered Display Equation

Half the area under the density is to the left of m and half is to the right.

The mean of a function of X, say g(X), is given by

Unnumbered Display Equation

and

Unnumbered Display Equation

With the special case g(x) = x2, we may get the variance of X defined by

Unnumbered Display Equation

This gives a measure of the spread of the values of X around the mean defined by the standard deviation of X

Unnumbered Display Equation

We end this appendix with a list of properties of the main discrete and continuous random variables.

B.1 Discrete Random Variables

1. Bernoulli. Consider an experiment in which the outcome is either success, with probability p > 0 or failure, with probability 1 − p. The random variable,

Unnumbered Display Equation

is Bernoulli with parameter p. Then E[X] = p, Var(X) = p(1 − p).

2. Suppose that we perform an independent sequence of Bernoulli trials, each of which has probability p of success. Let X be a count of the number of successes in n trials. X is said to be a binomial random variable with distribution

Unnumbered Display Equation

Then E[X] = np, Var(X) = np(1 − p).

3. In a sequence of independent Bernoulli trials, the number of trials until the first success is called a geometric random variable. If X is geometric, it has distribution

Unnumbered Display Equation

with mean inline and variance inline

4. A random variable has a Poisson distribution with parameter λ if X takes on the values k = 0, 1, 2, …, with probability

Unnumbered Display Equation

It has E[X] = λ and Var(X) = λ. It arises in many situations and is a limit of binomial random variables. In other words, if we take a large number n of Bernoulli trials with p as the probability of success on any trial, then as n → ∞ and p → 0 but np remaining the constant λ, the total number of successes will follow a Poisson distribution with parameter λ.

B.2 Continuous Distributions

1. X is uniformly distributed on the interval [a, b] if it has the density

Unnumbered Display Equation

It is a model of picking a number at random from the interval [a, b] in which every number has equal likelihood. The cdf is FX(x) = (xa)/(ba), axb. The mean is E[X] = {(a + b)/2, the midpoint, and variance is Var(X) = (ba)2/12.

2. X has a normal distribution with mean μ and standard deviation σ if it has the density

Unnumbered Display Equation

Then the mean is E[X] = μ, and the variance is Var(X) = σ2. The graph of fX is the classic bell-shaped curve centered at μ. The central limit theorem makes this the most important distribution because it roughly says that sums of independent random variables normalized by inline converge to a normal distribution, no matter what the distribution of the random variables in the sum. More precisely

Unnumbered Display Equation

Here, E[Xi] = μ and Var(Xi) = σ2 are the mean and variance of the arbitrary members of the sum.

3. The random variable X ≥ 0 is said to have an exponential distribution if

Unnumbered Display Equation

The density is inline Then E[X] = 1/λ and Var(X) = 1/λ2.

B.3 Order Statistics

In the theory of auctions, one needs to put in order the valuations of each bidder and then calculate things such as the mean of the highest valuation, the second highest valuation, and so on. This is an example of the use of order statistics, which is an important topic in probability theory. We review a basic situation.

Let X1, X2, …, Xn be a collection of independent and identically distributed random variables with common density function f(x) and common cumulative distribution function F(x). The order statistics are the random variables

Unnumbered Display Equation

and automatically X(1)X(2) ≤ ··· ≤ X(n). This simply reorders the random variables from smallest to largest.

To get the density function of X(k), we may argue formally that X(k)x if and only if k − 1 of the random variables are ≤ x and nk of the random variables are > x. In symbols,

Unnumbered Display Equation

This leads to the cumulative distribution function of X(k). Consequently, it is not too difficult to show that the density of X(k) is

Unnumbered Display Equation

where

Unnumbered Display Equation

because there are

Unnumbered Display Equation

ways of splitting up n things into three groups of size k − 1, nk, and 1.

If we start with a uniform distribution f(x) = 1, 0 < x < 1, we have

Unnumbered Display Equation

If k = n − 1, then

Unnumbered Display Equation

Consequently, for X1, …, Xn independent uniform random variables, we have E(X(n−1)) = (n − 1)/(n + 1). The means and variances of all the order statistics can be found by integration.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.116.60.158