Skip to main content

Section 9.2 The Normal Distribution

Definition 9.2.1. The Normal Distribution.

Given two parameters μ and σ, a random variable X over R=(,) has a normal distribution provided it has a probability function given by
f(x)=1σ2πe(xμσ)2/2
The normal distribution is also sometimes referred to as the Gaussian Distribution (often by Physicists) or the Bell Curve (often by social scientists).
Notice that the work needed to complete the integrals over the entire domain above was pretty serious. To determine probabilities for a given interval is however not possible in general and therefore approximations are needed. When using TI graphing calculators, you can use
P(a<x<b)=normalcdf(a,b,μ,σ).
Or you can use the calculator below.
Note that when no mean or standard deviation for normalcdf is provided, the calculator presumes standard normal.
Note that you can use a graphing calculator’s normalcdf(a,b) = Φ(b)Φ(a) to compute probabilities in the standard normal distrubution. If you have a normal distribution other than the standard and don’t want to convert to standard, then the graphing calculator usage is normalcdf(a,b,μ,σ).
Find the following probabilities for the standard normal random variable z:
(a) P(1.83z2.06)=
(b) P(0.91z1.49)=
(c) P(z0.93)=
(d) P(z>0.51)=
Answer 1.
0.946676
Answer 2.
0.750477
Answer 3.
0.823814
Answer 4.
0.694974
Suppose the scores of students on a Statistics course are Normally distributed with a mean of 293 and a standard deviation of 69.
What percentage of the students scored between 155 and 293 on the exam? (Give your answer to 3 significant figures.)
percent.
Answer.
47.5
Note that you can convert the integral above to standard units 5.6.1 so that it is sufficient to show
I=12πez22dz=1
Toward this end, consider I2 and change the variables to get
I2=12πeu22du12πev22dv=12πeu2+v22dudv
Converting to polar coordinates using
dudv=rdrdθ
and
u2+v2=r2
gives
I2=12π02π0er22rdrdθ=12π02πer22|0dθ=12π02π1dθ=12πθ|02π=1
as desired.
The definition of expected value for a continuous variable in this case gives an integral to evaluate since X is continuous. In that integral, it is useful to use a standard change of variables as in basic integral calculus to convert the integral to something easier to evaluate. In this case, you will want to convert the X variable to the standard units variable Z so that
z=xμσ
or by solving for x
x=μ+zσ.
So, it follows that
E[X]=x1σ2πe(xμσ)2/2dx=12π(μ+zσ)ez2/2dz=μ12πez2/2dz+σ12πzez2/2dz=μ1+σ0=μ
and therefore the use of μ as the parameter in the normal distribution probability function is warranted.
E[(Xμ)2]=(xμ)21σ2πe(xμσ)2/2dx=12πσ2z2ez2/2dz=σ22πzzez2/2dz=σ22π[zez2/2|+ez2/2dz]=σ22π[0+2π]=σ2
using integration by parts. So, the use of σ as the other parameter in the normal probability function is also warranted.
Using simple calculus on the normal probability function provides a few tips regarding how to best sketch a nice graph.
Take the derivative of the probability function to get
f(x)=2(μx)e((μx)22σ2)2πσ3
which is zero only when x=μ.
It is easy to see by evaluating to the left and right of this value that this critical value yields a maximum.
Take the second derivative of the probability function to get
f(x)=2(μ+σx)(μσx)e(μ22σ2+μxσ2x22σ2)2πσ5
which is zero only when x=μ±σ.
It is easy to see by evaluating to the left and right of this value that these critical values yield points of inflection.
When using a graphing calculator’s normalcdf(a,b,μ,σ), pay attention to the the order of terms. For normal distributions, the calculator function always requires an interval. If you are looking for a one-sided probability, such as P(X>4) for a problem with (say) mean μ=2 and σ=3, you can replace the infinite upper limit with "large" finite endpoint. Providing something that is more than 10 standard deviations above the mean is for all practical purposes infinity with respect to calculations. So, in this case P(X>4) can be approximated by normalcdf(4,32,2,3). If you are brave, you can go even higher and use normalcdf(4,100000,2,3) if desired.
Length of snowboards in a boardshop are normally distributed with a mean of 151.1 cm and a standard deviation of 0.4 cm. The figure below shows the distribution of the length of snowboards in a boardshop. Calculate the shaded area under the curve. Express your answer in decimal form with at least two decimal place accuracy.
Answer:
Answer.
0.8185948
In the standard normal distribution, we have considered the case where you get the probability when given an interval. However, what about the reverse problem of finding an interval that would result in a given probability? That is, to solve for example the problem
Φ(b)Φ(a)=P(a<z<b)=0.6217.
To deal with this we need an "inverse function" Φ1. Toward that end, consider the simpler problem of solving
Φ(z0)=P(z<z0)=some given probability value=α.
Then, technically the answer would be
z0=Φ1(α).
Since integrating the normal probability function is impossible you can expect that finding a nice formula for the inverse of that integration might also be challenging and that is certainly the case. However, you have two options:
  • Guessing z0 until you get a probabilty that is close enough to α.
  • Use a calculator that has the inverse built in!
For most graphing calculators, there is a function called "invNorm" and that is a way to compute values involving Φ1. Indeed, for example if you wanted to solve
Φ(z0)=P(z<z0)=0.813
then just use invNorm(0.813) to get z00.2198.
Find the value of the standard normal random variable z, called z0 such that:
(a) P(zz0)=0.7699
z0=
(b) P(z0zz0)=0.8568
z0=
(c) P(z0zz0)=0.856
z0=
(d) P(zz0)=0.0905
z0=
(e) P(z0z0)=0.1842
z0=
(f) P(2.26zz0)=0.6250
z0=
Answer 1.
0.738518
Answer 2.
1.46398
Answer 3.
1.46106
Answer 4.
1.33768
Answer 5.
0.479476
Answer 6.
0.350185
While we are at it, can we "go backwards" and figure out the mean and variance if given some probabilities. This requires some problem solving skills and enough provided information to figure things out. For the exercise below, what does it mean to talk about the "middle" percentage of the area? That gives one the mean and also describes a probability of the sort
P(μz0σ<X<μ+z0σ),
where z0 is a z-value obtained by using the inverse normal distribution function. It might help to draw a picture first of the area under the normal curve described in any such exercise.
Consider a normal distribution curve where the middle 65 % of the area under the curve lies above the interval ( 6 , 13 ). Use this information to find the mean, μ , and the standard deviation, σ , of the distribution.
a) μ=
b) σ=
Answer 1.
9.5
Answer 2.
3.74496
The physical fitness of an athlete is often measured by how much oxygen the athlete takes in (which is recorded in milliliters per kilogram, ml/kg). The mean maximum oxygen uptake for elite athletes has been found to be 75 with a standard deviation of 5.8. Assume that the distribution is approximately normal.
(a) What is the probability that an elite athlete has a maximum oxygen uptake of at least 70 ml/kg?
answer:
(b) What is the probability that an elite athlete has a maximum oxygen uptake of 55 ml/kg or lower?
answer:
(c) Consider someone with a maximum oxygen uptake of 27 ml/kg. Is it likely that this person is an elite athlete? Write "YES" or "NO."
answer:
Answer 1.
0.805675
Answer 2.
0.000282000000000004
Answer 3.
NO