Section 2.1 Creating Models
From each person who would like to participate, collect their height (in inches) and shoe size. Note that in the USA it is common for "women’s" shoe sizes to be 2 more than the similarly sized "men’s" show size so determine whether one group should add 2 or the other should subtract 2 to make comparisons similar. If anyone is sensitive about either of these, feel free to adjust these numbers as needed. Consider:
- Plot this data as ordered pairs on a graph.
- Comment on whether it might be reasonable to adjust the height data to be the number of inches from (say) 6 feet rather than just total inches tall. Replot.
- Notice any trends that the data points might illustrate.
- Suggest ways that trends might be expressed mathematically.
- How might a mathematical formula that describes the trend be utilized in practice?
When analyzing an issue mathematically, you might find it useful to have a formula of some sort that allows you to plug in some (independent) value and get out some (dependent) value. Functions in your algebra and calculus classes served that purpose quite well. Indeed, the formula for converting from temperature in degrees Fahrenheit (F) to degrees Celcius (C) is given by
\begin{equation*}
C = \frac{5}{9}(F-32).
\end{equation*}
This way, the boiling point of water at \(212^o\)F maps to \(100^o\)C and the freezing point of water at \(32^o\)F maps to \(0^o\)C as everyone should know. Since only two points are prescribed in this mapping then a line appears to be a good way to correlate values from Fahrenheit and Celcius at all points in between (interpolation) and beyond (extrapolation).
In general, given two distinct points there is one line which passes exactly through both. If the points are \((x_1,y_1), (x_2,y_2)\) then presuming the x-values are different gives the equation
\begin{equation*}
y = \frac{y_2 - y_1}{x_2 - x_1}(x - x_1) + y_1
\end{equation*}
is the linear function which passes through both points. If the x’s are equal then
\begin{equation*}
x = x_1
\end{equation*}
is your linear equation. However, once you collect three or more points it is likely that there is no line which exactly "interpolates" all of the points. So, to accommodate this increased complexity, the complexity of the model also must increase. For example, a quadratic might be a good model to fit three points or a 6th degree polynomial might be a good model to fit seven points. However as the number of points increases significantly, the use of a model that exactly interpolates at each of the data points becomes problematic due to its complexity and the trouble expended in order to obtain the model.
Since lower complexity models are easier to implement, it becomes necessary to remove the restriction that data points are met exactly by only requiring that the model be "close" to the data points using some measure of closeness. This chapter provides a scheme using calculus extrema for creating an approximate model that "best" approximates any number of data points but with a relatively low complexity model. You will use the resulting model in this chapter as a descriptive measure for the data. Other courses may use similar techniquest to make inferences but that is for one of those courses.