Appendix B: The Essentials of Probability

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

In this section, we give a brief review of the basic probability concepts and definitions used or alluded to in this book. For further information, there are many excellent books on probability (e.g., see the book by Ross, 2002).

The space of all possible outcomes of an experiment is labeled Ω. Events are subsets of the space of all possible outcomes, A ⊂ Ω. Given two events A, B, the event

1. A ∪ B is the event either A occurs or B occurs, or they both occur;

2. A ∩ B is the event both A and B occur;

is called the impossible event, while the event Ω itself is called the sure event;

4. Ac = Ω A is called the complement of A, so either A occurs or, if A does not occur, then Ac occurs;

The probability of an event A is written as P(A) or Prob(A). It must be true that for any event A, 0 ≤ P(A) ≤ 1. In addition,

1. P(

) = 0, P(Ω) = 1;

2. P(Ac) = 1 − P(A);

3. P(A ∪ B) = P(A) + P(B)− P(A ∩ B);

4. P(A ∩ B) = P(A)P(B) if A and B are independent;

Given two events A, B with P(B) > 0, the conditional probability of A given the event B has occurred, is defined by

Two events are therefore independent if and only if P(A | B) = P(A) so that the knowledge that B has occurred does not affect the probability of A. The very important laws of total probability are frequently used:

1. P(A) = P(A ∩ B) + P(A ∩ Bc).

2. P(A) = P(A | B)P(B) + P(A | Bc)P(Bc).

These give us a way to calculate the probability of A by breaking down cases. Using the formulas, if we want to calculate P(A| B) but know P(B|A) and P(B|Ac), we can use Bayes’ rule:

A random variable is a function X: that is, a real-valued function of outcomes of an experiment. As such, this random variable takes on its values by chance. We consider events of the form

{ω Ω: X(ω) ≤ x} for values of For simplicity, this event is written as {X ≤ x} and the function

is called the cumulative distribution function of X. We have

1. P(X > a) = 1 − FX(a).

2. P(a < X ≤ b) = FX(b) − FX(a).

A random variable X is said to be discrete if there is a finite or countable set of numbers x1, x2, …, so that X takes on only these values with P(X = xi) = pi, where 0 ≤ pi ≤ 1, and ∑i pi = 1. The cumulative distribution function is a step function with a jump of size pi at each xi,

A random variable is said to be continuous if P(X = x) = 0 for every and the cumulative distribution function FX(x) = P(X ≤ x) is a continuous function. The probability that a continuous random variable is any particular value is always zero. A probability density function for X is a function

In addition, we have

The cumulative distribution up to x is the area under the density from −∞ to x. With an abuse of notation, we often see P(X = x) = fX(x), which is clearly nonsense, but it gets the idea across that the density at x is roughly the probability that X = x.

Two random variables X and Y are independent if P(X ≤ x and Y ≤ y) = P(X ≤ x)P(X ≤ y) for all If the densities exist, this is equivalent to f(x, y) = fX(x) fY(y), where f(x, y) is the joint density of (X, Y).

The mean or expected value of a random variable X is

and

In general, a much more useful measure of X is the median of X, which is any number satisfying

Half the area under the density is to the left of m and half is to the right.

The mean of a function of X, say g(X), is given by

and

With the special case g(x) = x2, we may get the variance of X defined by

This gives a measure of the spread of the values of X around the mean defined by the standard deviation of X

We end this appendix with a list of properties of the main discrete and continuous random variables.

B.1 Discrete Random Variables

1. Bernoulli. Consider an experiment in which the outcome is either success, with probability p > 0 or failure, with probability 1 − p. The random variable,

is Bernoulli with parameter p. Then E[X] = p, Var(X) = p(1 − p).

2. Suppose that we perform an independent sequence of Bernoulli trials, each of which has probability p of success. Let X be a count of the number of successes in n trials. X is said to be a binomial random variable with distribution

Then E[X] = np, Var(X) = np(1 − p).

3. In a sequence of independent Bernoulli trials, the number of trials until the first success is called a geometric random variable. If X is geometric, it has distribution

with mean and variance

4. A random variable has a Poisson distribution with parameter λ if X takes on the values k = 0, 1, 2, …, with probability

It has E[X] = λ and Var(X) = λ. It arises in many situations and is a limit of binomial random variables. In other words, if we take a large number n of Bernoulli trials with p as the probability of success on any trial, then as n → ∞ and p → 0 but np remaining the constant λ, the total number of successes will follow a Poisson distribution with parameter λ.

B.2 Continuous Distributions

1. X is uniformly distributed on the interval [a, b] if it has the density

Unnumbered Display Equation

It is a model of picking a number at random from the interval [a, b] in which every number has equal likelihood. The cdf is FX(x) = (x − a)/(b − a), a ≤ x ≤ b. The mean is E[X] = {(a + b)/2, the midpoint, and variance is Var(X) = (b − a)2/12.

2. X has a normal distribution with mean μ and standard deviation σ if it has the density

Then the mean is E[X] = μ, and the variance is Var(X) = σ2. The graph of fX is the classic bell-shaped curve centered at μ. The central limit theorem makes this the most important distribution because it roughly says that sums of independent random variables normalized by converge to a normal distribution, no matter what the distribution of the random variables in the sum. More precisely

Here, E[Xi] = μ and Var(Xi) = σ2 are the mean and variance of the arbitrary members of the sum.

3. The random variable X ≥ 0 is said to have an exponential distribution if

The density is Then E[X] = 1/λ and Var(X) = 1/λ2.

B.3 Order Statistics

In the theory of auctions, one needs to put in order the valuations of each bidder and then calculate things such as the mean of the highest valuation, the second highest valuation, and so on. This is an example of the use of order statistics, which is an important topic in probability theory. We review a basic situation.

Let X1, X2, …, Xn be a collection of independent and identically distributed random variables with common density function f(x) and common cumulative distribution function F(x). The order statistics are the random variables

Unnumbered Display Equation

and automatically X(1) ≤ X(2) ≤ ··· ≤ X(n). This simply reorders the random variables from smallest to largest.

To get the density function of X(k), we may argue formally that X(k) ≤ x if and only if k − 1 of the random variables are ≤ x and n − k of the random variables are > x. In symbols,

This leads to the cumulative distribution function of X(k). Consequently, it is not too difficult to show that the density of X(k) is

where

because there are

ways of splitting up n things into three groups of size k − 1, n − k, and 1.

If we start with a uniform distribution f(x) = 1, 0 < x < 1, we have

If k = n − 1, then

Consequently, for X1, …, Xn independent uniform random variables, we have E(X(n−1)) = (n − 1)/(n + 1). The means and variances of all the order statistics can be found by integration.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Appendix B: The Essentials of Probability

Create new playlist

Sign In

Sign Up

Table of Contents for
Appendix B: The Essentials of Probability