Section 9.4 Other "Bell Shaped" distributions
The Normal distribution discussed above is very important when doing statistical analysis. It however is not the only distribution that is symmetrical about the mean and looks like a bell. In this section, we consider two other options--one which is virtually useless and another which is very useful.
Theorem 9.4.2. The Cauchy Distribution.
Proof.
Now that we have a probability function, it is important to determine its mean and variance. It should be obvious that when doing so using the Cauchy probability function, problems quickly arise. Indeed,
which is problematic. Further, even assuming that the distribution is symmetrical and therefore has a mean of 0, for the variance
and note that the integrand does not converge to 0 at the endpoints and therefore the integral is automatically considered divergent. Thus it is reasonable to note that the Cauchy distribution has no variance.
The formula for this curve is so much easier to deal with versus the normal distribution. Perhaps it should be used more. You can see above that it is pretty much inadequate since its theoretical statistics are not well-defined. In the interactive cell below, you might notice some issues right away by comparing the Cauchy probabilty function against a normal probability function (when but with varied standard deviations. Notice especially that as you change the normal distributionβs the the area you see under the normal curve totally overwhelms the area in the stationary Cauchy distribution. That means that the two tails of the Cauchy distribution have a lot more area far away from zero than the nomal distribution. This is one of the issues why the Cauchy doesnβt give good results.
xxxxxxxxxx
# A nice picture of Cauchy compared to normal
f = 1/pi*(1/(1+x^2))
def _(sigma=slider(2/10,4,1/10,1,label="$$ \\sigma_{\\text{normal}}$$")):
g = 1/(sqrt(2*pi*sigma))*e^(-x^2/(2*sigma^2))
G = (plot(f,(x,-6,6),color='blue')
+plot(g,(x,-6,6),color='red'))
T = "Cauchy (blue) vs Normal (red)"
G.show(title=T,figsize=[5,3])
On the other hand, there is another "bell-shaped" distribution that is useful and its random variable can be created by using a mixture of a normal variable and a variable.
Definition 9.4.3. Student-t Distribution.
Theorem 9.4.4. Student t-distribution properties.
Example 9.4.5. Similarity between Normal and t-distributions for larger n.
Consider the probabilities vs for a t-distribution with r=30 degrees of freedom.
For normal,
while for t,
Here is a calculator for obtaining probabilities for the t-distribution over an interval.
xxxxxxxxxx
print("Calculator for t-Distribution")
var("x")
layout=dict(top=[['a', 'b']])) (
def _(a=input_box(-1,width=10,label='$$ a =$$'),
b=input_box(1,width=10,label='$$ b =$$'),
r0=input_box(5,width=8,label='$$ df = $$')):
if (b < a):
t=a
a=b
b=t
T = RealDistribution('t', r0) # use built-in
G = plot(T,x,-5,5)+plot(T,x,a,b,fill=True,fillcolor='green')
show(G,figsize=(3,2))
P = T.cum_distribution_function(b)-T.cum_distribution_function(a)
pretty_print(html("$$ P("+str(a)+" < X < "
+str(b)+") \\approx "+str(P)+"$$"))
As has been our pattern for some time, it is of interest to see what happens to the t-distributionβs graph when (in this case) the number of degrees of freedom increase. The interactive cell below illustrates what happens up till 30 degrees of freedom. Notice at that point, the t-distribution and the normal distribution have almost the same probability function.
xxxxxxxxxx
# Display the Student's t distributions with various
# degrees of freedom and compare to the normal distribution
# Copied from www.statmethods.net
β
x <- seq(-4, 4, length=100)
hx <- dnorm(x)
β
degf <- c(1, 3, 8, 30)
colors <- c("red", "blue", "darkgreen", "gold", "black")
labels <- c("df=1", "df=3", "df=8", "df=30", "normal")
β
plot(x, hx, type="l", lty=2, xlab="x value",
ylab="Density", main="Comparison of t Distributions")
β
for (i in 1:4){
lines(x, dt(x,degf[i]), lwd=2, col=colors[i])
}
β
legend("topright", inset=.05, title="Distributions",
labels, lwd=2, lty=c(1, 1, 1, 1, 2), col=colors)