About 10% of people are left-handed.
Is this descriptive or inferential?
Solution:
This is descriptive. It is a summary.
There are two main approaches in statistics, one focuses on presenting a situation and the other is about predicting. You may notice we like to offer ways to remember concepts so here’s another one: “d” with “d” and “i” with “i”. Descriptive statistics deals with describing data while inferential statistics is used to infer.
Statistics techniques used mainly for summarizing data are called descriptive statistics.
So, in descriptive statistics the goal is to describe a situation. We want accurate information and then let others know about this data. Descriptive statistics involve collecting data, organizing data, summarizing data, and presenting data. There are different ways to present data, including charts and tables. It can also be powerful for a summary to be just a numerical value. For example, April 2020 headlines mention US unemployment claims were about 17 million people in three weeks.
Statistical techniques used primarily to draw conclusions about populations are called inferential statistics.
An inference is a decision based on samples drawn from populations. Inferential statistics uses probability, which is the chance of an event occurring. For example, the Congressional Budget Office expects the unemployment rate will be 9 percent at the end of 20211. Can you tell why this is an inference rather than a description? We are currently in 2020 so referring to a year in the future is a good indicator we have inferential information. The words “will be” instead of “are” can also be a tip off we are dealing with an inferential statement. (Note - we will try to use other examples but with a pandemic going on it is difficult.)
Let’s take a look at some examples and decide if our statements are descriptive or inferential.
Example 3.1 (Left Handed People)
About 10% of people are left-handed.
Is this descriptive or inferential?
Solution:
This is descriptive. It is a summary.
\[ \tag*{$\blacksquare$} \]
Example 3.2 (Instagram)
9 out of 10 students in a certain major use Instagram.
Is this descriptive or inferential?
Solution:
This is descriptive. It is a summary. Which majors do you think are most likely to use Instagram? Please let us know!
\[ \tag*{$\blacksquare$} \]
Example 3.3 (Bubble Tea)
Drinking bubble tea can lower cholesterol levels by 1%.
Is this descriptive or inferential?
Solution:
This is inferential. It is a prediction.
\[ \tag*{$\blacksquare$} \]
One of the authors is obsessed with bubble tea and made this statistic up! That’s a reason why statistics can be powerful and dangerous. Someone may find my false inferential statement and use it as a fact. We will talk about the misuse of statistics soon but please be on the lookout for sources when you are given any data or statements!
Example 3.4 (Median Household Income:)
According to a Census, the US median household income is $63,1792.
Is this descriptive or inferential?
Solution:
This is descriptive. It is a summary.
\[ \tag*{$\blacksquare$} \]
Example 3.5 (Harvard Class of 2024)
40,246 prospective students applied to the Harvard Class of 2024.
Is this descriptive or inferential?
Solution:
This is descriptive. It is a summary.
Did this one trick you because of a future year? It is descriptive because the class of 2024 enters four years prior to this date and applies months earlier than that!
\[ \tag*{$\blacksquare$} \]
Example 3.6 (New Courses at FIT)
In the year 2030, about 45 new courses will be offered at FIT.
Is this descriptive or inferential?
Solution:
This is inferential. It is a prediction; we cannot know for sure how many courses will be approved in the next couple years. We hope to have a few in our department!
\[ \tag*{$\blacksquare$} \]
Note that we should usually use the word "about" because we cannot retrieve information from every single person in the US or even every single person at FIT! We are using samples of our target population to represent the entire group. If you notice there are two cases in the examples above where we do not use “about” and supposedly have the total population. For applications at a university this does seem reasonable to achieve! For the US Census, we truly want the information to represent the entire population.
Let's look more at inferential statistics. We take data values (called the sample) to make inferences (draw conclusions) about an entire group we're interested in (called the population). After we analyze collected sample data, we reach a conclusion.
Here is an example:
Example 3.7 (Average Height of Females)
Based on a random sample of 100 females 18 years or older in New York City, we believe that the average height of all females 18 year or older in New York City is 64 inches with a maximum error of 3 inches. We are 90% confident of this.
Is this descriptive or inferential?
Solution:
This is inferential.
Here we are using the data values from our sample (the 100 New York City females) to estimate the average height of the entire population of females in New York City. We get this estimate by doing a calculation based on the data values from the sample. It should be mentioned that we do not find out the height of every single female 18 or older in NYC! Rather, we can only estimate the average and talk about how certain we are of our estimate. Again, we draw conclusions about a certain population based on our specific sample.
\[ \tag*{$\blacksquare$} \]
Why don't you try to answer the following for extra practice in determining whether descriptive or inferential statistics have been used:
Once you're done scroll down for the answers!
Example 3.8 (Average Life Expectancy)
The average life expectancy in (the) Netherlands is 81.6 years.
Is this descriptive or inferential?
\[ \tag*{$\blacksquare$} \]
Example 3.9 (Potassium Lowers Blood Pressure)
A diet high in potassium will lower blood pressure.
Is this descriptive or inferential?
\[ \tag*{$\blacksquare$} \]
Example 3.10 (Bubble Tea Budget)
The total amount of money a household spent on Bubble Tea in one week was $35.40. One of the author’s statistics and by the entire household, it is just me (my partner and kids do not drink it).
Is this descriptive or inferential?
\[ \tag*{$\blacksquare$} \]
Example 3.11 (Number of High School Students)
In 2033, the number of high school students will be 3.6 million.
Is this descriptive or inferential?
\[ \tag*{$\blacksquare$} \]
Here are the solutions: