16.13 Correlation and Causality

Correlation is a statistical measure describing how two variables move together. In contrast, causality (or causation) goes deeper into the relationship between two variables by looking for cause and effect.

Correlation is a statistical property that summarizes the way in which two variables move either over time or across people (firms, governments, etc.). The concept of correlation is quite natural to us, as we often take note of how two variables interrelate. If you think back to high school, you probably have a sense of how your classmates did in terms of two measures of performance: grade point average (GPA) and the results on a standardized college entrance exam (SAT or ACT). It is likely that classmates with high GPAs also had high scores on the SAT or ACT exam. In this instance, we would say that the GPA and SAT/ACT scores were positively correlated: looking across your classmates, when a person’s GPA is higher than average, that person’s SAT or ACT score is likely to be higher than average as well.

As another example, consider the relationship between a household’s income and its expenditures on housing. If you conducted a survey across households, it is likely that you would find that richer households spend more on most goods and services, including housing. In this case, we would conclude that income and expenditures on housing are positively correlated.

When economists look at data for a whole economy, they often focus on a measure of how much is produced, which we call real gross domestic product (real GDP), and the fraction of workers without jobs, called the unemployment rate. Over long periods of time, when GDP is above average (the economy is doing well), the unemployment rate is below average. In this case, GDP and the unemployment rate are negatively correlated, as they tend to move in opposite directions.

The fact that one variable is correlated with another does not inform us about whether one variable causes the other. Imagine yourself on an airplane in a relaxed mood, reading or listening to music. Suddenly, the pilot comes on the public address system and requests that you buckle your seat belts. Usually, such a request is followed by turbulence. This is a correlation: the announcement by the pilot is positively correlated with air turbulence. The correlation is of course not perfect because sometimes you hit some bumps without warning, and sometimes the pilot’s announcement is not followed by turbulence.

But—obviously—this does not mean that we could solve the turbulence problem by turning off the public address system. The pilot’s announcement does not cause the turbulence. The turbulence is there whether the pilot announces it or not. In fact, the causality runs the other way. The turbulence causes the pilot’s announcement.

We noted earlier that real GDP and unemployment are negatively correlated. When real GDP is below average, as it is during a recession, the unemployment rate is typically above average. But what is the causality here? If unemployment caused recessions, we might be tempted to adopt a policy that makes unemployment illegal. For example, the government could fine firms if they lay off workers. This is not a good policy because we do not think that low unemployment causes high real GDP. Neither do we necessarily think that high real GDP causes low unemployment. Instead, based on economic theory, there are other influences that affect both real GDP and unemployment.

More Formally

Suppose you have N observations of two variables, x and y, where xi and yi are the values of these variables in observation i = 1, 2,…, N. The mean of x, denoted μx, is the sum over the values of x in the sample is divided by N; the scenario applies for y.




We can also calculate the variance and standard deviations of x and y. The calculation for the variance of x, denoted σx2, is as follows:

σx2=(x1μx)2+(x2μx)2+... (xNμx)2N.

The standard deviation of x is the square root of σx2:

σx=(x1μx)2+(x2μx)2+... (xNμx)2N.

With these ingredients, the correlation of (x,y), denoted corr(x,y), is given by

corr(x,y)=(x1μx)(y1μy)+(x2μx)(y2μy)+... (xNμx)(yNμy)Nσxσy.