251distr 11/18/04 (Open this document in 'Outline' view!)

L. Discrete Distributions.

1. Binomial Distribution.

a. Formula: $P(x) = \binom{n}{x} p^x q^{n-x}$, where $q = 1 - p$.

Gives the probability of $x$ successes in $n$ tries. $p$ is the probability of success on one try.

b. Mean: $\mu = np$.

c. Variance: $\sigma^2 = npq$, where $q = 1 - p$.

d. Replacement of $x$ by the observed proportion $\bar{p} = x/n$ in formulas.

$\mu_{\bar{p}} = p$, $\sigma^2_{\bar{p}} = \frac{pq}{n}$

e. Cumulative Distribution - Use of Tables. Always use tables if possible!

Note that $P(x \ge a) = 1 - P(x \le a - 1)$ for any discrete distribution (with integer values for $x$) like the ones in this section. (But for a continuous distribution like the normal or continuous uniform distribution, $P(x \ge a) = 1 - P(x \le a)$.)

Also note that for $p > 0.5$ the problem must be totally recast in terms of failures to use the table.
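The complement rule and the recasting in terms of failures can be checked numerically. The following is a minimal sketch, not a substitute for the tables; the function names and the values $n = 10$, $p = 0.7$ are illustrative assumptions, not taken from the course material.

```python
from math import comb

def binom_pmf(x, n, p):
    """P(x successes in n tries), each try succeeding with probability p."""
    return comb(n, x) * p**x * (1 - p)**(n - x)

def binom_cdf(c, n, p):
    """P(x <= c): the cumulative value a binomial table would list."""
    return sum(binom_pmf(k, n, p) for k in range(0, c + 1))

# Illustrative values (assumed): 10 tries, success probability 0.7.
n, p = 10, 0.7

# P(x >= 8) through the complement rule for integer-valued x.
upper_tail = 1 - binom_cdf(7, n, p)

# The same event recast as failures: 8 or more successes means
# at most 2 failures, and the failure probability is 0.3.
recast = binom_cdf(2, n, 1 - p)

print(round(upper_tail, 4), round(recast, 4))
```

Both routes give the same probability; the second is the one that works with a table that only lists success probabilities up to 0.5.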

2. Geometric Distribution.

a. Formula: $P(x) = q^{x-1} p$, where $q = 1 - p$.

Gives the probability that the first success occurs on try $x$.

b. Mean and Variance: $\mu = \frac{1}{p}$, $\sigma^2 = \frac{q}{p^2}$.

c. Cumulative Distribution: $F(c) = P(x \le c) = 1 - q^c$, where $q = 1 - p$. This is the formula you really use!
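As a quick check on the cumulative formula, the sketch below sums the individual geometric probabilities and compares the total with the closed form $1 - q^c$. The function names and the value $p = 0.2$ are illustrative assumptions.

```python
def geom_pmf(x, p):
    """P(first success occurs on try x), for x = 1, 2, 3, ..."""
    return (1 - p) ** (x - 1) * p

def geom_cdf(c, p):
    """P(first success occurs on or before try c) = 1 - q^c."""
    return 1 - (1 - p) ** c

p = 0.2  # assumed probability of success on one try

# Adding up P(1) + ... + P(5) term by term ...
direct = sum(geom_pmf(k, p) for k in range(1, 6))
# ... agrees with the one-line cumulative formula.
closed = geom_cdf(5, p)

print(round(direct, 5), round(closed, 5))
```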

3. Poisson Distribution.

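a. Formula: $P(x) = \frac{e^{-m} m^x}{x!}$.

Gives the probability of $x$ successes in an interval in which the average number of successes is $m$. Recursive version: $P(x) = \frac{m}{x} P(x-1)$, starting from $P(0) = e^{-m}$.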

b. Mean and Variance: $\mu = m$, $\sigma^2 = m$, where $m$ is the average number of successes in the interval.

c. Cumulative Distribution - Use of Tables.

d. Use Poisson as an approximation for the Binomial Distribution.
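The sketch below builds Poisson probabilities with the recursion $P(x) = \frac{m}{x} P(x-1)$, starting from $P(0) = e^{-m}$, and compares them with exact binomial probabilities when the Poisson mean is set to $np$. The symbol $m$ for the average number of successes, the function name, and the values $n = 1000$, $p = 0.002$ are assumptions chosen for illustration.

```python
from math import comb, exp

def poisson_probs(m, max_x):
    """Return [P(0), ..., P(max_x)] for a Poisson distribution with mean m."""
    probs = [exp(-m)]                   # P(0) = e^(-m)
    for x in range(1, max_x + 1):
        probs.append(probs[-1] * m / x)  # P(x) = (m / x) * P(x - 1)
    return probs

# Assumed example: many tries, small probability of success.
n, p = 1000, 0.002
m = n * p  # Poisson mean matched to the binomial mean

approx = poisson_probs(m, 5)
exact = [comb(n, x) * p**x * (1 - p) ** (n - x) for x in range(6)]

for x, (a, e) in enumerate(zip(approx, exact)):
    print(x, round(a, 5), round(e, 5))
```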

4. Hypergeometric Distribution.

a. Formula: $P(x) = \frac{\binom{M}{x}\binom{N-M}{n-x}}{\binom{N}{n}}$. (Note that $M$ and $N - M$ are integers!)

Gives the probability of $x$ successes in a sample of $n$ taken from a population of $N$ in which there are $M$ successes. There is a recursive formula for the Hypergeometric distribution that can save you some time in repeated calculations with the same distribution - see appendix.

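b. Mean and Variance: If $p = \frac{M}{N}$ and $q = 1 - p$, then $\mu = np$ and $\sigma^2 = npq \frac{N-n}{N-1}$, where $N$ is the population size, $M$ the number of successes in the population and $n$ the sample size.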

c. What if $N$ is infinite? Use the Binomial Distribution! This works if $N \ge 20n$, that is, if the sample is no more than 5% of the population.
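A minimal sketch of the hypergeometric probabilities, with the mean and variance computed directly from the distribution and compared with the standard formulas $np$ and $npq\frac{N-n}{N-1}$ for $p = M/N$. The function name and the values $N = 50$, $M = 20$, $n = 5$ are illustrative assumptions.

```python
from math import comb

def hypergeom_pmf(x, n, N, M):
    """P(x successes in a sample of n from a population of N with M successes)."""
    return comb(M, x) * comb(N - M, n - x) / comb(N, n)

# Assumed example: population 50, of which 20 are successes; sample of 5.
N, M, n = 50, 20, 5
probs = [hypergeom_pmf(x, n, N, M) for x in range(n + 1)]

# Mean and variance straight from the definition ...
mean = sum(x * pr for x, pr in enumerate(probs))
var = sum((x - mean) ** 2 * pr for x, pr in enumerate(probs))

# ... against the shortcut formulas with p = M / N.
p = M / N
mean_formula = n * p
var_formula = n * p * (1 - p) * (N - n) / (N - 1)

print(round(mean, 4), round(mean_formula, 4))
print(round(var, 4), round(var_formula, 4))
```

The factor $\frac{N-n}{N-1}$ approaches 1 as the population grows relative to the sample, which is why the binomial takes over for a very large population.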

5. Summary (See part L5 of 251distrl and 251greatD)

M. Continuous Distributions.

1. Introduction. (For more detail see 251greatD)

a. Normal Distribution

b. Exponential Distribution: $P(x \le a) = 1 - e^{-ca}$ for $a \ge 0$, when the mean time to a success is $\frac{1}{c}$.

c. Chi-squared Distribution.

d. t Distribution.

e. F Distribution.
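For the exponential distribution listed above, the cumulative probability can be computed in one line. This is a minimal sketch assuming the rate is written $c$ and the mean time to a success is $1/c$; the function name and the mean time of 4 are illustrative assumptions.

```python
from math import exp

def expon_cdf(a, c):
    """P(x <= a) for an exponential distribution with rate c (mean 1/c)."""
    if a < 0:
        return 0.0          # waiting times cannot be negative
    return 1 - exp(-c * a)

mean_time = 4.0             # assumed mean time to a success
c = 1 / mean_time           # rate is the reciprocal of the mean

# Probability that the success arrives within 2 time units.
prob_within_2 = expon_cdf(2.0, c)
print(round(prob_within_2, 4))
```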

2. Properties of the Normal Distribution.

a. Use of Standard Normal Tables.

For examples see 251distrex2. For more examples see 251distrex1.

b. Probabilities for Normal Distributions that are not Standardized.

For examples see 251distrex3
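A minimal sketch of the standardization step: convert the endpoints to $z$ values with $z = \frac{x - \mu}{\sigma}$, then look them up in the standard normal distribution. Here Python's `statistics.NormalDist` plays the role of the table; the values $\mu = 100$, $\sigma = 15$ and the interval from 85 to 130 are illustrative assumptions.

```python
from statistics import NormalDist

# Assumed example: x is normal with mean 100 and standard deviation 15.
mu, sigma = 100.0, 15.0
a, b = 85.0, 130.0

# Standardize both endpoints.
z_a = (a - mu) / sigma
z_b = (b - mu) / sigma

# P(a <= x <= b) = P(z_a <= z <= z_b), read from the standard normal.
std_normal = NormalDist()            # mean 0, standard deviation 1
prob = std_normal.cdf(z_b) - std_normal.cdf(z_a)

# Cross-check without standardizing.
x_dist = NormalDist(mu, sigma)
direct = x_dist.cdf(b) - x_dist.cdf(a)

print(z_a, z_b, round(prob, 4))
```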

3. Percentiles and Intervals about the Mean.

Read the table backwards to find z.

For examples see 251distrex4
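Reading the table backwards corresponds to applying the inverse of the cumulative distribution. The sketch below uses `NormalDist.inv_cdf` for that step; the values $\mu = 100$ and $\sigma = 15$ are illustrative assumptions.

```python
from statistics import NormalDist

std_normal = NormalDist()

# z value with 95% of the standard normal distribution below it.
z_95 = std_normal.inv_cdf(0.95)

# Assumed example: x is normal with mean 100 and standard deviation 15.
mu, sigma = 100.0, 15.0

# 95th percentile of x: undo the standardization, x = mu + z * sigma.
x_95 = mu + z_95 * sigma

# Interval about the mean that contains the middle 90% of the distribution.
lower, upper = mu - z_95 * sigma, mu + z_95 * sigma

print(round(z_95, 3), round(x_95, 2), (round(lower, 2), round(upper, 2)))
```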

4. Normal Approximation to the Binomial Distribution.

a. Without Continuity Correction.

$z = \frac{x - np}{\sqrt{npq}}$ is approximately standard normal, if $np > 5$ and $nq > 5$. For examples see 251distrex5.

b. With Continuity Correction.

Expand interval by 0.5 in both directions. Use especially if .
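The effect of the continuity correction can be seen by comparing both normal approximations with the exact binomial probability. This is a sketch under assumed values ($n = 20$, $p = 0.4$, and the event $6 \le x \le 10$), none of which come from the course examples.

```python
from math import comb, sqrt
from statistics import NormalDist

# Assumed example: 20 tries, success probability 0.4, event 6 <= x <= 10.
n, p = 20, 0.4
q = 1 - p
a, b = 6, 10

# Exact binomial probability for comparison.
exact = sum(comb(n, x) * p**x * q ** (n - x) for x in range(a, b + 1))

# Normal distribution with the binomial mean np and standard deviation sqrt(npq).
approx = NormalDist(n * p, sqrt(n * p * q))

# Without the correction: use the endpoints as they are.
uncorrected = approx.cdf(b) - approx.cdf(a)

# With the correction: expand the interval by 0.5 in both directions.
corrected = approx.cdf(b + 0.5) - approx.cdf(a - 0.5)

print(round(exact, 4), round(uncorrected, 4), round(corrected, 4))
```

In this example the corrected interval lands much closer to the exact value than the uncorrected one.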

5. Normal Approximation to the Poisson Distribution.

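$z = \frac{x - m}{\sqrt{m}}$ is approximately standard normal, if the Poisson mean $m$ is sufficiently large.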
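A rough numerical check of the normal approximation to the Poisson distribution, using the fact that a Poisson distribution has mean and variance both equal to its parameter. The symbol $m$ for that parameter and the values $m = 25$ and $c = 30$ are assumptions chosen for illustration.

```python
from math import exp, sqrt
from statistics import NormalDist

m = 25.0   # assumed Poisson mean
c = 30     # we want P(x <= c)

# Exact cumulative probability, built term by term:
# P(0) = e^(-m), then P(x) = (m / x) * P(x - 1).
term = exp(-m)
exact = term
for x in range(1, c + 1):
    term *= m / x
    exact += term

# Normal approximation: mean m, standard deviation sqrt(m).
z = (c - m) / sqrt(m)
approx = NormalDist().cdf(z)

print(round(exact, 4), round(approx, 4))
```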

6. Review of Conditions for Approximation of One Distribution by Another. End of 251greatD.

N. Statistical Sampling.

1. Definitions.

Random Samples, Sampling and Nonsampling Errors. The Law of Large Numbers. Convenience, Judgment and Probability Samples. Simple, Stratified and Cluster Random Samples.

2. Distribution of $\bar{x}$ and $\bar{p}$

a. $\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}$. This is the standard deviation of the sample mean, and is often called the standard error.

b. Finite Population Correction Factor. Use this if sample is more than 5% of population!
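A minimal sketch of the standard error with and without the finite population correction. It assumes the usual forms $\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}$ and $\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}\sqrt{\frac{N-n}{N-1}}$, and applies the correction only when the sample is more than 5% of the population, following the rule stated above. The function name and the numbers are illustrative assumptions.

```python
from math import sqrt

def standard_error(sigma, n, N=None):
    """Standard deviation of the sample mean.

    sigma: population standard deviation
    n:     sample size
    N:     population size, or None for an effectively infinite population
    """
    se = sigma / sqrt(n)
    # Apply the finite population correction only when the sample
    # is more than 5% of the population.
    if N is not None and n > 0.05 * N:
        se *= sqrt((N - n) / (N - 1))
    return se

print(standard_error(10, 25))            # large or unknown population
print(standard_error(10, 25, N=100))     # sample is 25% of the population
print(standard_error(10, 25, N=1000))    # sample is 2.5%: no correction
```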

3. The Central Limit Theorem

a. $\bar{x} \sim N\left(\mu, \frac{\sigma^2}{n}\right)$ approximately, as $n$ becomes large.

b. Problems involving Probabilities for $\bar{x}$ and $\bar{p}$.
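A sketch of a typical probability problem for the sample mean: by the Central Limit Theorem, the sample mean is treated as approximately normal with mean $\mu$ and standard deviation $\frac{\sigma}{\sqrt{n}}$. The values $\mu = 50$, $\sigma = 12$, $n = 36$ and the cutoff 53 are illustrative assumptions.

```python
from math import sqrt
from statistics import NormalDist

# Assumed example: population mean 50, standard deviation 12, sample of 36.
mu, sigma, n = 50.0, 12.0, 36

# Standard error of the sample mean.
se = sigma / sqrt(n)

# P(sample mean > 53): standardize with the standard error, not sigma.
z = (53.0 - mu) / se
prob = 1 - NormalDist().cdf(z)

print(se, z, round(prob, 4))
```

The common mistake this guards against is dividing by $\sigma$ instead of $\frac{\sigma}{\sqrt{n}}$ when the question is about a sample mean rather than a single observation.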

O. Estimation of Parameters. (Go to 251param for expanded version)

1. Point and Interval Estimation. Properties of Estimators.

a. Unbiasedness.

b. Consistency

c. Efficiency.

d. Maximum likelihood

2. A Confidence Interval for $\mu$ When $\sigma$ is Known.

You can only use this when you know the population variance. Don’t forget that there are two formulas for the sample variance depending on sample size!
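A minimal sketch of the interval for the population mean when the population standard deviation is known, assuming the usual form $\bar{x} \pm z_{\alpha/2} \frac{\sigma}{\sqrt{n}}$ with no finite population correction. The sample mean, $\sigma$, sample size and confidence level are illustrative assumptions.

```python
from math import sqrt
from statistics import NormalDist

# Assumed example values.
x_bar = 50.0        # sample mean
sigma = 12.0        # known population standard deviation
n = 36              # sample size
confidence = 0.95

# z value leaving (1 - confidence) / 2 in each tail.
z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)

# Half-width of the interval: z times the standard error.
half_width = z * sigma / sqrt(n)
interval = (x_bar - half_width, x_bar + half_width)

print(round(z, 3), tuple(round(v, 2) for v in interval))
```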

3. A Confidence Interval for $\mu$ When $\sigma$ is Not Known.

This is what you actually use most of the time! All that "$\sigma$ unknown" means is that we do not have a value of the population variance. If you only have the sample variance, use the t table.
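A minimal sketch of the interval based on the sample standard deviation, assuming the usual form $\bar{x} \pm t \frac{s}{\sqrt{n}}$ with $n - 1$ degrees of freedom. Python's standard library has no t distribution, so the t value is entered by hand from a t table, which matches the advice above. The data are made up for illustration.

```python
from math import sqrt
from statistics import mean, stdev

# Made-up sample (assumed for illustration).
sample = [12.0, 15.0, 11.0, 14.0, 13.0]
n = len(sample)

x_bar = mean(sample)
s = stdev(sample)          # sample standard deviation (divides by n - 1)

# From a t table: 4 degrees of freedom, 0.025 in the upper tail (95% interval).
t_table_value = 2.776

half_width = t_table_value * s / sqrt(n)
interval = (x_bar - half_width, x_bar + half_width)

print(x_bar, round(s, 4), tuple(round(v, 3) for v in interval))
```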

Appendix to L - a Recursive Formula for the Hypergeometric Distribution.

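The formula is $P(x) = \frac{M - x + 1}{x} \cdot \frac{n - x + 1}{N - M - n + x} P(x - 1)$, starting from $P(0) = \binom{N-M}{n} \Big/ \binom{N}{n}$, where $N$ is the population size, $M$ is the number of successes in the population and $n$ is the sample size.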

Example: . Assume that , and .

(Particularly easy to calculate because both the numerator and denominator are divided by (7!). , , etc. This amounts to saying that

Notice how each part of the fractions changes by 1 each time until the first denominator rises to $n$ (the sample size), the second denominator rises to $N - M$ (the number of failures in the population) and the second numerator falls to 1. At that point we have all values of $x$ that can occur.
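The recursion can be checked against the direct hypergeometric formula. The sketch below assumes the recursion takes the form $P(x) = \frac{M-x+1}{x} \cdot \frac{n-x+1}{N-M-n+x} P(x-1)$ with $P(0) = \binom{N-M}{n} / \binom{N}{n}$, and that $n \le M$ and $n \le N - M$ so that every $x$ from 0 to $n$ can occur. The function name and the values $N = 20$, $M = 8$, $n = 5$ are illustrative assumptions, not the values of the example above.

```python
from math import comb

def hypergeom_recursive(n, N, M):
    """Return [P(0), ..., P(n)] using the recursive formula.

    Assumes n <= M and n <= N - M, so P(0) > 0 and no denominator is zero.
    """
    probs = [comb(N - M, n) / comb(N, n)]    # P(0): no successes in the sample
    for x in range(1, n + 1):
        ratio = ((M - x + 1) / x) * ((n - x + 1) / (N - M - n + x))
        probs.append(probs[-1] * ratio)      # P(x) = ratio * P(x - 1)
    return probs

# Assumed example: population 20 with 8 successes, sample of 5.
N, M, n = 20, 8, 5
recursive = hypergeom_recursive(n, N, M)

# Direct formula for comparison.
direct = [comb(M, x) * comb(N - M, n - x) / comb(N, n) for x in range(n + 1)]

for x, (r, d) in enumerate(zip(recursive, direct)):
    print(x, round(r, 5), round(d, 5))
```

At the last step ($x = n$) the first denominator is $n$, the second denominator is $N - M$, and the second numerator is 1, matching the pattern described above.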
