4 Common Graphs (A to N)

This section covers covers common graphs and visualization techniques for examining your sample data.

Visualization Techniques:

  • Bar Charts:
  • Histograms:

4.1 Area Chart

4.1.1 Overview and Summary

4.1.2 Common Applications

4.1.3 Interpretation

4.1.4 Doing It In R

Coming soon

4.1.5 Sources and Useful Links

Sources:

Useful Links:

4.2 Bar Chart

An example of a bar chart is shown below:

4.2.1 Overview and Summary

Bar charts are used to compare categorical variables. Usually on one axis is the categorical variable and on the other axis is a quantitative measure of that variable such as a count or the mean. Bar charts with the categorical variable on the x-axis are sometimes called column charts while bar charts with categorical variables on the y-axis are sometimes called row charts. In the above, we see a column chart with the categorical variable on the x-axis.

Bar charts are very similar to histograms except that typically histograms are used to depict quantitative variables as opposed to categorical variables.

4.2.2 Common Applications

Bar charts are useful to summarize and compare groups (or levels) of a categorical variable.

4.2.3 Interpretation

Generally, the height of a bar corresponds to the value of the quantitative measure. Do be careful during interpretation to examine the range on the axis for the quantitative measure. It is possible that the axis does not start at zero, thus leading to deceiving bar heights. This is seen below.

In the below, we see two charts that depict the exact same data and information. On the left, the bar chart has a y-axis ranging from 0 to 30. Note that the values are actually pretty similar to each other. On the right, we changed the range of the y-axis to be between 10 and 30. Note how visually, now it seems that the values are now much further from each other. The middle bar looks like it is less than 4 times the height of other bars.

Typically, practitioners do not adjust the y-axis to distort bar heights.

4.2.4 Doing It In R

In the below, we create bar charts where the categorical variable is planets using R. The below depict the number of hours in a day for different planets. Data is pulled on 9/27/2020 from NASA.

4.3 Bubble Chart

An example of a bubble chart is shown below:

4.3.1 Overview and Summary

Bubble charts are useful to depict three dimensional data. That is, when y

4.3.2 Common Applications

Bar charts are useful to summarize and compare groups (or levels) of a categorical variable.

4.3.3 Interpretation

Generally, the height of a bar corresponds to the value of the quantitative measure. Do be careful during interpretation to examine the range on the axis for the quantitative measure. It is possible that the axis does not start at zero, thus leading to deceiving bar heights. This is seen below.

In the below, we see two charts that depict the exact same data and information. On the left, the bar chart has a y-axis ranging from 0 to 30. Note that the values are actually pretty similar to each other. On the right, we changed the range of the y-axis to be between 10 and 30. Note how visually, now it seems that the values are now much further from each other. The middle bar looks like it is less than 4 times the height of other bars.

Typically, practitioners do not adjust the y-axis to distort bar heights.

4.3.4 Doing It In R

In the below, we create bar charts where the categorical variable is planets using R. The below depict the number of hours in a day for different planets. Data is pulled on 9/27/2020 from NASA.