Section 7.2 Binomial Distribution
Consider a sequence of n independent Bernoulli trials with the likelihood of a success p on each individual trial stays constant from trial to trial with \(0 \lt p \lt 1 \text{.}\) If we let the variable \(X\) measure the number of successes obtained when doing a fixed number of trials n with \(R = \{ 0, 1, ..., n \}\text{,}\) then the resulting distribution of probabilities is called a Binomial Distribution.
Now, let's determine the actual probability function for this distribution.
For R = {0, 1, ..., n},
Theorem 7.2.1. Derivation of Binomial Probability Function.
Proof.
Since successive trials are independent, then the probability of X successes occurring within n trials is given by
Theorem 7.2.2. Verification of Binomial Distribution Formula.
Proof.
Using the Binomial Theorem with a = p and b = 1-p yields
The following interactive cell illustrates the range of choices for a Binomial setup with fixed N and fixed p:
Example 7.2.3. Flipping a coin a fixed number of times.
Let's consider a simple example for flipping coins. Indeed, suppose you flip a coin exactly 20 times and need to determine the probability of getting exactly 10 heads.
This is binomial with n = 20, p = 1/2 and you are looking for f(10). With these values
Note that the mean for this distribution is also 10 so one might expect 10 heads in general
If you rather would prefer to determine the probability of getting 10 or fewer heads requires F(10) = f(0) + f(1) + ... + f(10). There is no "nice" formula for F but this calculation can be performed using a graphing calculator, such as the TI-84 with F(x) = binomcdf(n,p,x). In this case, F(10) = binomcdf(20,1/2,10) = 0.588.
Checkpoint 7.2.4. WebWork - Binomial.
Example 7.2.5. Testing critical components.
Often one will test a critical system components for failure and toward that end collect a sample of 100 of these components from the manufacturer. Suppose the component is listed as having a p = 0.01 probability of breaking and you want to know the likelihood that at most 1 of the tested components actually fails when tested. You find it reasonable to presume that different components succeed or fail independently of each other. So, you can model this situation with a binomial distribution.
If \(X\) measures the number of components that fail when tested, the specific probability function is given by
The probability that at most one component fails is then given by
Utilize the interactive cell below to compute \(f(x)\) and \(F(x)\) for the Binomial distribution
Theorem 7.2.6. Binomial Distribution Statistics.
Proof.
For the mean 1,
Using the change of variables \(k=x-1\) and \(m = n-1\) yields a binomial series
For the variance 2,
Using the change of variables \(k=x-2\) and \(m = n-2\) yields a binomial series
The skewness 3 and kurtosis 4 can be found similarly using formulas involving E[X(X-1)(X-2)] and E[X(X-1)(X-2)(X-3)]. The complete determination is performed using Sage below.
The following uses Sage to symbolically confirm the general formulas for the Binomial distribution.
Once again, you can see that as one of the parameters is allowed to increase without bound this distribution must become symmetrical and bell-shaped.Theorem 7.2.7. Binomial Limiting Distribution.
For the binomial distribution 7.2.1, as \(m \rightarrow \infty\)
and
Proof.
For skewness, take the limit of the skewness result above
Similarly for kurtosis
Checkpoint 7.2.8. WebWork - Binomial.
You can of course get specific values and graph the Binomial Distribution using R as well...