Histogram

The histogram is a representation similar to the diagram of bars. It is usually used when the volume of data is larger, or when the variables are continuous.

The heights, in meters, of a basketball team are the following: $$$1.74, 1.84, 1.87, 1.92, 1.92, 1.97, 2.02, 2.03, 2.07, 2.11, 2.22$$$ We can do several different histograms. If we define equal height intervals or classes (for example $$10$$ cm).

The middle points of each interval are the classes marks.

The diagram will be equivalent to a diagram of bars with the following data:

Heights interval (or classes) Mark of classes Absolute frequency
$$[1.70, 1.80]$$ $$1.75$$ $$1$$
$$( 1.80,1.90]$$ $$1.85$$ $$2$$
$$( 1.90, 2.00]$$ $$1.95$$ $$3$$
$$( 2.00, 2.10]$$ $$2.05$$ $$3$$
$$( 2.10, 2.20]$$ $$2.15$$ $$2$$
$$( 2.20, 2.30]$$ $$2.25$$ $$1$$

And the histogram is like a bars diagram, since the intervals have a constant size:

imagen

However, we can do histograms with different intervals. Let'suppose we want to see bars that indicate the quantity of players in each position.

If the lowest two players are bases, the following three guards, the following four forwards, and the three highest are centers. Which intervals can be defined?

Position Height interval
base $$[1.70, 1.85]$$
guard $$( 1.85, 1.95]$$
orward $$( 1.95, 2.09]$$
center $$( 2.09, 2.25]$$

Notice that there are different intervals that can be defined.

The representation of this new histogram is:

imagen

The heights of the rectangles are defined so that the area of each one has the value of the absolute frequency of the interval: $$h_i=\displaystyle \frac{f_i}{a_i}$$ where $$h$$ is height of the rectangle corresponding to the interval, $$a_i$$ to its largeness, and $$f$$ to its frequency.

Note: All the representations explained in this level can be done with frequencies other than the absolute. For example, they can be done using relative frequencies (the histograms would have the same form but normalized heights), or using cumulative frequencies, so that the histogram would have an increasing form from $$0$$ to $$1$$ similar to a probability distribution function.