Section 5.3 Probability Functions
In the formulas below, we will presume that we have a random variable 5.2.1 \(X\) which maps the sample space S onto some range of real numbers \(R\text{.}\) From this set, we then can define a probability function \(f(x)\) which acts on the numerical values in \(R\) and returns another real number. We attempt to do so to obtain (for discrete values) P(sample space value s)\(= f(X(s))\text{.}\) That is, the probability of a given outcome s is equal to the composition which takes s to a numerical value x which is then plugged into f to get the same final values.
For example, consider a random variable which assigns a 1 when you roll a 1 on a six-sided die and 0 otherwise. Presuming each side is equally likely, \(f(1) = \frac{1}{6}\) and \(f(0) = \frac{5}{6}\text{.}\)
Given a discrete random variable 5.2.1 \(X\) on a space \(R\text{,}\) a probability mass function on \(X\) is given by a function \(f:R \rightarrow \mathbb{R}\) such that: For \(x \not\in R\text{,}\) you can use the convention \(f(x)=0\text{.}\)
Definition 5.3.1. Probability "Mass" Function.
Given a continuous random variable \(X\) on a space \(R\text{,}\) a probability density function on \(X\) is given by a function \(f:R \rightarrow \mathbb{R}\) such that: For \(x \not\in R\text{,}\) you can use the convention \(f(x)=0\text{.}\)
Definition 5.3.2. Probability "Density" Function.
For the purposes of this book, we will use the term "Probability Function" to refer to either of these options.
Consider \(f(x) = x/10\) over R = {1,2,3,4}. Then, f(x) is obviously positive for each of the values in R and certainly Therefore, f(x) is a probability mass function over the space \(R\text{.}\)
Example 5.3.3. Discrete Probability Function.
Consider \(f(x) = x^2/c\) for some positive real number c and presume \(R\) = [-1,2]. Then f(x) is nonnegative (and only equals zero at one point). To make \(f(x)\) a probability density function 5.3.2, we must have In this instance you get Therefore, \(f(x)\) is a probability density function over \(R\) provided \(c = 3\text{.}\)
Example 5.3.4. Continuous Probability Function.
Given a random variable \(X\) on a space \(R\text{,}\) a probability distribution function on \(X\) is given by a function
Definition 5.3.5. Distribution Function.
Using \(f(x) = x/10\) over \(R\) = {1,2,3,4} again, note that \(F(x)\) will only change at these four domain values. We get
Example 5.3.6. Discrete Distribution Function.
X
F(x)
\(x \lt 1\)
0
\(1 \le x \lt 2\)
1/10
\(2 \le x \lt 3\)
3/10
\(3 \le x \lt 4\)
6/10
\(4 \le x \)
1
Consider \(f(x) = x^2/3\) over \(R\) = [-1,2]. Then, for \(-1 \le x \le 2\text{,}\) Notice, \(F(-1) = 0\) since nothing has yet been accumulated over values smaller than -1 and \(F(2) = 1\) since by that time everything has been accumulated. In summary:
Example 5.3.8. Continuous Distribution Function.
X
F(x)
\(x \lt -1\)
0
\(-1 \le x \lt 2\)
\(x^3/9 + 1/9\)
\(2 \le x\)
1
\(F(x)=0, \forall x \lt \inf(R)\) where inf is the infimum...the "minimum" but in a limit sense.
Theorem 5.3.10.
Proof.
Let a = inf(\(R\)). Then, for \(x \lt a,\)
since none of the x-values in this range are in \(R\text{.}\)
\(F(x)=1, \forall x \ge \sup(R)\) where sup is the supremum...the "maximum" but in a limit sense.
Theorem 5.3.11.
Proof.
Let b = sup(\(R\)). Then, for \(x \ge b,\)
since all of the x-values in this range are in R and therefore will either sum over or integrate over all of \(R\text{.}\)
\(F\) is non-decreasing
Theorem 5.3.12.
Proof.
Case 1: \(R\) discrete
Case 2: \(R\) continuous
For \(x \in R, f(x) = F(x) - F(x-1)\)
Theorem 5.3.13. Using Discrete Distribution Function to compute probabilities.
Proof.
Assume \(x \in R\) for some discrete \(R\text{.}\) Then,
For \(a \lt b, (a,b) \in R, P(a \lt X \le b) = F(b) - F(a)\)
Theorem 5.3.14. Using Continuous Distribution function to compute probabilities.
Proof.
For a and b as noted, consider
For continuous distributions, \(P(X = a) = 0\text{.}\)
Corollary 5.3.15.
Proof.
We will assume that \(F(x)\) is a continuous function. With that assumption, note
Take the limit as \(\epsilon \rightarrow 0^+\) to get the result noting that
If \(X\) is a continuous random variable, \(f\) the corresponding probability function, and \(F\) the associated distribution function, then
Theorem 5.3.16. \(F(x)\) vs \(f(x)\text{,}\) for continuous distributions.
Proof.
Assume \(X\) is continuous and \(f\) and \(F\) as above. Notice, by the definition of \(f\text{,}\) \(\lim_{x \rightarrow \pm \infty} f(x) = 0\) since otherwise the integral over the entire space could not be finite.
Now, let \(A(x)\) be any antiderivative of \(f(x)\text{.}\) Then, by the Fundamental Theorem of Calculus,
Hence, \(F'(x) = A'(x) - \lim_{u \rightarrow -\infty} A'(u) = f(x)\) as desired.
For \(0 \lt p \lt 1\text{,}\) the \(100p^{th}\) percentile is the largest random variable value c that satisfies For continuous random variables over an interval \(R = [a,b]\text{,}\) you will solve for c in the equation For discrete random variables, it is unlikely that a particular percentile will land exactly on one of the elements of \(R\) but you will want to take the smallest value in \(R\) so that \(F(c) \ge p\text{.}\) The 50th percentile (as before) is also known as the median.
Definition 5.3.17. Percentiles for Random Variables.
For our earlier example with \(f(x) = x^2/3\) on R = [-1,2], the 50th percentile (i.e. the median) is found by starting with p = 0.5 and then solving or or After solving for c, you find
Example 5.3.18. Continuous Percentile.
TBA, using one of the table examples from above.
Example 5.3.19. Discrete Percentile.