Money is like muck, not good except it be spread.
In 1974, Richard Easterlin
published an important article. It was called ‘Has economic growth improved the human lot?’.Footnote 1 In it he focused on two alleged facts that were not apparently consistent with each other:
(1) At a point in time, richer people are on average happier than poorer people, (though there is a huge overlap).
(2) Over time, increases in national income per head do not lead to increases in happiness.
This was the so-called Easterlin paradox. If richer people are happier than poorer people, you would think that, when a country becomes richer, it will also become happier. But, Easterlin claimed, this is not what happens. And his explanation was that, when national income increases, everyone increases the norm against which they compare their own income. The most obvious norm for comparison is the income of other people. If it is only their relative income
that matters to people (rather than their absolute income
), then that could explain why economic growth does not increase the nation’s happiness.
In this chapter, we shall explore this hypothesis and much else besides.
First, we shall look at individual happiness and how this is affected by a person’s income. Similarly, we shall look at happiness and income across countries at a moment in time.
Then we shall look at national happiness over time.
Finally, we shall explore the role of relative income compared with absolute income and the policy implications of this difference.
Differences Between Individuals
As between individuals, three central findings of wellbeing research are these:
(1) In every country, richer people are on average happier than poorer people.
(2) This difference is quite small and explains around 2% of the variance of wellbeing within the population.
(3) The effect of additional income gets smaller, the richer you are.
Figure 13.1 shows the position in the UK. Each bar shows the average wellbeing of people at each level of income. As you see, richer people are on average happier than poorer people. But there is also a huge overlap. This is shown by the thin lines around each average figure. The lines span from 1 standard deviation below the average to 1 standard deviation above (i.e., they show the range of wellbeing for the middle two-thirds of people at each level of income). As the figure shows, there are many poor people who are happier than the average rich person.
The diminishing marginal utility of income
The diagram also shows something else. Extra income makes more difference to wellbeing at the bottom (left-hand) end of the scale than it does at the upper end of the scale. An extra $ of income produces a smaller and smaller amount of extra wellbeing the richer the person is. This old idea is now called ‘the diminishing marginal utility of income’; and before the behaviourist revolution (discussed in Chapter 2), it was a central belief of every economist. Indeed, it was why most economists favoured some redistribution of income – because, when $1 was transferred from a rich person to a poor person, the rich person lost less wellbeing than the poor person gained. So overall there was a net gain in wellbeing.
Until the last few decades, this was simply a speculative belief. But the new science now makes it possible to measure the quantitative effect with some accuracy. And, once we know how income affects wellbeing, we can compute by how much income inequality reduces average wellbeing, compared with a situation where everyone received the existing average level of income (see Box 13.1).
We are interested here in the difference between average wellbeing as it now is and average wellbeing as it would be if everyone received the current average level of income. This difference equalsFootnote 15
Using a quadratic Taylor’s series expansion of log Yi this becomes
In a typical advanced country, Var(Y)/Ȳ2 is around 0.8,Footnote 16 so the direct cost of inequality is 0.12 points (out of 10). There are also of course indirect effects through the pattern of human relationships and so on.
In investigating the effect of income on wellbeing, there are two issues:
(1) What is the functional form of the relationship?
(2) What is the actual size of the effect of income on wellbeing?
On the functional form, we can investigate this empirically, and it turns out that the semi-logarithmic linear relationship is a very good fit to the data in a whole range of surveys.Footnote 2
Thus
where Y is household income per equivalent adult. This fits in with the general law, known as the Weber–Fechner effect (discussed in Chapter 1), which says that, whether the experience is of light or sound, the perceived size of a change depends linearly on the proportional change in the thing that is changing. In a similar way, our feelings about a change in income depend linearly on the proportional change in income.
This gives us a direct measure of the marginal utility of income,Footnote 3 since
So, a person’s marginal utility of income is inversely proportional to her income: an extra $1 matters 10 times more to a poor person than to someone who is 10 times richer. If equation (2) is exactly right, the marginal utility of income will always be positive, even though it can be very low when a person’s income is high; and there is some evidence that, in fact, marginal utility becomes zero at some point – the point of ‘satiation’.Footnote 4
The size of the effect of income on wellbeing
However, we still need to know the size of the coefficient To find it, we could estimate a cross-sectional equation of the following form
where the Xjs are the other things that determine wellbeing – acting in this case as control variables. These controls should obviously include age, age squaredFootnote 5 and gender, but what else is not completely clear. For there are at least five problems that arise when estimating equation (3).
Problems
(1) If income is measured inaccurately, the estimate of will be biased downwards.
(2) If we include controls that are themselves affected by income (and thus ‘mediate’ its effect), we shall underestimate the total effect of income upon wellbeing.
(3) If, on the other hand, we omit variables that are positively correlated with both income and wellbeing, we shall overestimate the effects of income, by omitting these ‘confounding’ variables.
(4) The relationship in equation (3) may not be properly identified, because there may also be a relation between income and wellbeing in which wellbeing is causing income. This is the problem of reverse causality.
(5) If people are affected by other people’s incomes, we should include these in the controls.
We shall deal with these problems as follows.
(1) We measure income as income per person in the household (children being converted into ‘equivalent adults’). This is a proxy for living standards. (Though consumption data would be preferable, they are not generally available in surveys where we also have data on wellbeing.) In most surveys, incomes are self-reported, but they are no less accurate than data on, for example, health.
(2) We shall show estimates with and without including mediating variables.
(3) To handle confounding variables that are omitted, we shall exploit the panel nature of the data, by including in each equation a person fixed effect fi . This will remove the effect of any omitted variables that do not vary over time. It gives an equation
(4)
The estimate of is now based on comparisons ‘within-person’ (rather than across persons). As we shall find, such panel equations tend to give very low estimates of ,Footnote 6 but this is partly due to enhanced effects of measurement errorFootnote 7 and partly because the timing of effects may not be properly represented by the form of the equation.
(4) To deal with the problem of reverse causality,Footnote 8 we shall show the results of including an element of income that is completely random and therefore exogeneous. This is people’s winnings in a lottery.
(5) We shall normally include the log income per head of a person’s comparator group (people of similar age, gender and region). This is to estimate the role of relative income as compared with absolute income. However, we leave reporting these comparator effects till later in the chapter.
Results
We can begin with simple cross-section results for Britain, Germany, Australia and the United States. The data for the first three countries are from annual longitudinal household studies (UKHLS, SOEP and HILDA) and for the United States, they are from the annual cross-sectional BRFSS.
As Table 13.1 shows, the effects are not large. To take the US case, one additional point of log income, corresponding to nearly a tripling of income, will produce an extra 0.31 points of wellbeing (out of 10 points maximum).Footnote 9
Britain | Germany | Australia | United States |
---|---|---|---|
0.16 | 0.26 | 0.16 | 0.31 |
(.01) | (.01) | (.01) | (.01) |
Note: Standard errors in brackets. ‘Controls’ include all those in Figure 8.1. Estimates omitting comparator income are very similar.
It is interesting to see how much of the variance of happiness is explained by income inequality. Remember that the partial correlation coefficient is
So the share of the variance of W in the United States explained by income inequality is
This 2.56% compares with an R2 of around 19% for the full set of influences shown in Figure 8.2. Yet, some economists claim that ‘absolute income is the dominant fact determining wellbeing’.Footnote 10 That is clearly not correct. Income is one important influence but one among many.
Moving on to the issue of omitted variables, one possible solution is to use a fixed effects equation like that shown in Chapter 7 and thus exploit the panel nature of the data. As Table 13.2 shows, this produces smaller coefficients.Footnote 11 But we should probably ignore these time-series estimates due to the problems noted earlier.
A final way to handle omitted variables and reverse causation is to use data on lottery winnings. In the majority of studies, these are followed up over short periods and are therefore difficult to interpret. But in a remarkable study, Lindqvist and others followed up people who played the Swedish lottery for a further 22 years after that.Footnote 12 Among the players, the winnings could reasonably be counted as random. And the size of the winnings had a similar effect on wellbeing over all the 22 years. If we convert these one-off winnings into an equivalent income stream, the effect of a unit change in log income is to raise wellbeing (0–10) by 0.38 points.
Britain | Germany | Australia |
---|---|---|
0.04 | 0.08 | 0.06 |
(.01) | (.01) | (.01) |
Note: Standard errors in brackets. ‘Controls’ include all those in Figure 8.1. Estimates omitting comparator income are very similar.
One further issue: are the effects of log income different in poorer countries to the effects in richer ones? We can examine this, using data from the Gallup World Poll. The results are shown in Table 13.3. The coefficients are remarkably similar in countries at all levels of income. This is less surprising than might appear since (due to the diminishing marginal utility of income) an extra dollar in the hands of a poor person is worth 20 times more than in the hands of someone who is 20 times richer. We should also note that for high income countries the coefficient is somewhat higher than those shown in Table 13.1, which makes sense since in Gallup data the Table 13.1 countries do have coefficients somewhat below the high-income country average.
High | Upper middle | Lower middle | Low | All |
---|---|---|---|---|
0.37 | 0.43 | 0.45 | 0.35 | 0.40 |
(.04) | (.04) | (.03) | (.04) | (.02) |
It is helpful to have in mind a benchmark coefficient for the effect of log income on wellbeing. In picking one, we should also bear in mind the danger that the Table 13.3 coefficients are exaggerated through reverse causality. Thus we would suggest that a figure of 0.30 is a useful benchmark.Footnote 13
This gives us straight away two vital pieces of information. The first is the marginal utility of income (meaning the change in WELLBYs for an extra dollar of annual income). As we explained earlier, this is given by . So if annual income per head is $30,000, the marginal utility of income is 1/100,000.Footnote 14 And 1 extra WELLBY is equivalent to some $100,000 (spread across a group of people).
Second, we can measure the direct impact of inequality on average wellbeing. As Box 13.1 explains, in the typical country this is of the order of 0.12 points (out of 10) – that is how much average wellbeing would rise if average income remained the same but income inequality was abolished. It is a surprisingly small figure.
Differences Between Countries
We can turn now to differences in income between countries and how these are reflected in the country’s average level of wellbeing. Figure 13.2 shows the scatter diagram of countries together with the line of best fit.
As with individuals, there is overwhelming evidence of diminishing marginal utility of income, and the line of best fit has been estimated using the logarithmic formulation. If no other variables are included, the effect of log income is very substantial. But, as we argued in Chapter 8, this is partly because high income is correlated with many other cultural variables including trust, social support, freedom and generosity. We do not know how far these other characteristics have been caused by national income. But suppose we are asking How much would it help if households were given more income? Then clearly we should be holding these other things constant. In that case, when estimated across countries, the coefficient of wellbeing on log income is 0.33 (se = 0.07).Footnote 17 This is conveniently similar to the coefficient across individuals.Footnote 18
Time-Series for Countries
We turn now to the second part of the Easterlin paradox. His claim is that, as time passes, higher national income does not produce higher national wellbeing. In the cross-section of individuals we have found that
One would therefore expect that over time at the country level
But Easterlin says it does not.
So what are the facts? The most striking fact supporting Easterlin is the story of the United States. As Figure 13.3 shows, average happiness has not increased there since the 1950s, despite rapid economic growth that was widely shared at least till the 1970s.
However, Figure 13.3 does not prove that in the United States higher income did not improve wellbeing. It might have done so, with other factors offsetting this effect. In any case, the experience of one country can prove little. So the first step is to see whether countries with higher long-term economic growth have had higher growth in wellbeing. It is important to stress the word long-term because there is no doubt that happiness rises in booms and falls in slumps (for all kinds of reasons we shall come to). But do countries with high long-term growth do better in terms of happiness?
Long-term growth and wellbeing
To investigate this, the best available long time-series is for European countries, where the Eurobarometer survey has been conducted regularly in many countries since the early 1970s.Footnote 19 To investigate the effect of long-term growth, it is of course necessary to control for booms and slumps (which drive wellbeing up and then down). This is done in the following equation, by including percentage unemployed (u) and annual percentage inflation (π), as well as GDP per head and country fixed effects.
As the equation shows, income is estimated to have a positive effect but with a huge standard error.Footnote 20 This is typical of multi-country time-series – reflecting in part the very different growth rates of wellbeing in different countries with similar rates of income growth.Footnote 21 Thus, the conclusion on the second part of the Easterlin hypothesis has to be ‘sometimes yes and sometimes no’. But for citizens of the United States there is a particular challenge – of why they are no happier on average than people were in the 1950s.
The Role of Income Comparisons and Adaptation
There is one obvious reason why national increases in income over time might produce lower changes in wellbeing than individual increases in income at one point in time. It is social comparisons.
Suppose that each of us has a comparator group with whom we compare our incomes and that much of our concern about income is focused on our relative income rather than our absolute income. Then a person’s wellbeing depends positively on her wellbeing but negatively on the income of her comparators. The relationship could beFootnote 22
where is the average income in the person’s comparator group. This can also be written as the sum of the effects of absolute income (Yi) and of relative income:
If is substantial, comparators’ income is a substantial force reducing our happiness.
So what is the evidence on the effect of comparator income? In the great majority of studies, it is negative and large.Footnote 23 But the findings sometimes depend on what group people are assumed to compare themselves with – very local (such as neighbours or colleagues) or people from the same region, age group and gender.Footnote 24 In the cross-sectional study reported in Table 13.1, we used the latter definition. The results are shown in Table 13.4.
Britain | Germany | Australia | United States | |
---|---|---|---|---|
Own income | 0.16 (.01) | 0.26 (.01) | 0.16 (.01) | 0.31 (.01) |
Comparator income | −0.23 (.07) | −0.25 (.04) | −0.17 (.06) | −0.19 (.03) |
Note: Standard errors in brackets. ‘controls’ include all those in Figure 8.1.
In these countries, the effects of comparator income are negative and large. Thus, the effects of relative income are positive and large. Meanwhile, the effects of absolute average income, which is the sum of the two rows in Table 13.4, are small. If these numbers are anywhere near right, they provide an obvious explanation of the Easterlin paradox:
When an individual has a higher income, holding constant, she is happier. This is mainly because her relative income is higher.
But when the whole society becomes richer, rises and relative incomes do not change. (Some people may go up in relative terms and others down but the average of relative income remains constant.) So at the level of society the only effect of economic growth is the weaker effect of absolute income.
There is much other evidence that people care about relative income as well as absolute income. Some of it comes from neuroscience, led by Armin Falk of the University of Bonn.Footnote 25 His team organised an experiment where participants had to undertake a task while undergoing a functional MRI measurement of brain activity in the brain’s reward centre, the ventral striatum. Those who successfully completed the task were given a financial reward, which was varied randomly. They were also told of the reward, if any, received by the person with whom they were paired. The findings were remarkable. The measure of activity in the ventral striatum increased by 0.92 units for every €100 they themselves received and fell by 0.67 units for every €100 their pair received. So, relative income had double the effect of absolute income.
In another ingenious experiment, David Card (another winner of the Nobel Prize) and his colleagues examined the effect of knowing the incomes of your colleagues. It happened that the University of California, where he works, had recently put all faculty salaries online. But most people did not know about it. So Card informed a random selection of the faculty members that these data existed. He also measured the wellbeing of the treatment and control group before and after he did this. Those who learned about colleagues’ salaries became on average less satisfied.Footnote 26 So relative income clearly matters.Footnote 27
Adaptation
However, some psychologists favour a different explanation of the Easterlin paradox – adaptation. According to this, people may enjoy an increase in income for a time but then they get used to the higher income and their wellbeing reverts to its former level. Unfortunately, not many studies have tested the effect of social comparisons and adaptation simultaneously. In Table 13.5 we do this, using the same panel data that was used cross-sectionally in Table 13.4. But this time we include a fixed effect for each individual, and we include not only social comparison income but also the lagged values of own income:
When this fixed effects model is run, the effect of one’s own income is lower than in cross-section (as discussed earlier). But the negative effect of comparator income is much more substantial than of lagged income from the previous three years. In any case, adaptation cannot be the main reason for the Easterlin paradox: in any community, most richer people have always been richer and poorer people have been poorer, and yet the richer people are on average happier.Footnote 28
Britain | Germany | Australia | |
---|---|---|---|
Log own income | 0.06 (.01) | 0.19 (.01) | 0.06 (.01) |
Log comparator income | −0.09 (.06) | −0.12 (.04) | 0.01 (.04) |
Log previous 3 yrs’ income | −0.02 (.02) | −0.08 (.01) | −0.01 (0.01) |
Note: Standard errors in brackets. ‘Controls’ include all those in Figure 8.1. Note that comparator income has little movement except for people who move regions.
Policy Implications of Income Comparisons
The analysis in this chapter has major implications. First, as we have said, diminishing marginal utility is a powerful argument for the redistribution of income to the point where it ceases to raise average wellbeing. This point has been understood for many years. But the role of social comparisons introduces a completely new argument in favour of taxation (not based on the diminishing marginal utility of income nor on the need to fund public goods).Footnote 29
To see this, assume a world in which everybody is equal. It will still be the case that when someone works longer and therefore earns more, she increases the income level against which everyone else compares their income. This is a negative externality, and it leads to an inefficient scale of work unless something is done. What level of tax would suffice? Let us suppose that there are (N + 1) identical individuals each paid (for simplicity) a salary of one unit per hour worked – meaning that a person’s earnings equal the hours she works. So suppose that individual wellbeing depends additively on own log income, log comparator income and the cost of work effort. Then, if there are no taxes,
where is the average income of the other N people, and the last term is the psychic cost of working Hi hours.Footnote 30
When anyone works longer to improve their own income, they also raise the average income against which others compare their income. In the absence of a tax, the individual chooses Hi to maximise Wi, ignoring the effect this has on So the individual optimum in the absence of a tax is found by setting dW/dHi equal to zero, which gives
The value of marginal income is equated to the psychic cost of obtaining the marginal income.
But the social optimum would also allow for the negative effect of increased Hi on the wellbeing of the other N people. When Hi goes up by one unit, goes up by 1/N. This in turn reduces the wellbeing of each of the other N people by. But there are N people who are affected in this way, so the total loss of wellbeing is. The social optimum takes this into account and is therefore given byFootnote 31
This implies fewer hours of work.
So what is the optimum corrective tax?Footnote 32 Suppose it is structured as a linear tax with a constant marginal tax rate (t) and the yield is returned to the workers in lump-sum form. Then the individual optimum is now
We now want to find that value of t that ensures that the individual optimum is the same as the social optimum. This requires
So the optimum marginal tax rate is – the ratio of the pollution effect to the own income effect. Given the estimated values of, this could justify quite high marginal tax rates on grounds of efficiency. At the very least, this externality argument should be put against the traditional argument that taxation has an ‘excess burden’ – because it inefficiently discourages work.Footnote 33 If when people earn more they are imposing a cost on others, this should be taken into account in any argument about what is efficient.
Similarly, with cost-benefit analysis. If a project is paid for out of higher taxes paid by everybody, any loss of wellbeing from the tax will be partially softened by the fact that everybody else is losing as well.
It is sometimes said that this argument is less forceful if people only compare themselves with small numbers of other people. But, as our reasoning earlier about N showed, the argument applies whatever the size of the group.Footnote 34
Another important implication of social comparisons is for us as individuals. People who make fewer comparisons are on average happier.Footnote 35 So we should train our tastes, as far as we can, to reduce Libertarians argue that, if we don’t do this, that is our lookout, and governments should ignore human failings. But this approach is inconsistent with an evidence-based approach to ethical questions.
A second argument in support of corrective taxation comes from unforeseen adaptation. If people work harder to increase their income but overestimate the effects of this on their own wellbeing,Footnote 36 that is another reason why some marginal taxation could be good for efficiency.
Economic Fluctuations
We turn finally to fluctuations of income over the business cycle. What is certain is that wellbeing rises in booms and falls in slumps. Two processes are at work here. The first is adaptation. Morale is higher when income is high relative to previous income and low in recessions (when the reverse applies). And the second is loss-aversion. This time we are talking about ex post loss aversion (not the ex ante loss aversion that affects decisions). There is powerful evidence that the loss of wellbeing when income falls by a given amount is roughly double the increase in wellbeing when income rises by the same amount.Footnote 37 This has profound implications.
First, it helps to explain the weak long-term relationship between income and wellbeing, since the years of income decline have such strong negative effects.
Second, in terms of policy, it argues strongly for the importance of economic stability. In his famous Presidential Address to the American Economic Association, the neo-classical economist Robert Lucas argued that economic cycles were unimportant compared with the rate of long-term economic growth. Cycles could therefore be tolerated if they increased long-term growth. The implication of wellbeing research is the opposite: higher long-term growth ought not to be pursued if it leads to economic instability. Humans like stability and it is the job of policy to provide it.
Conclusions
The Easterlin paradox states that
(1) in a given context richer people are on average happier than poorer people,
(2) but over time greater national income per head does not cause greater national happiness.
Statement (1) is certainly true. We reviewed a mass of evidence and concluded that, as a benchmark, a unit increase in log income raises wellbeing by 0.3 points (out of 10). The share of the within country variance in wellbeing explained by income inequality is 3% or less. So income is in no sense a proxy for wellbeing.
Across countries, the effect of a unit change in log income per capita (other things equal) is also around 0.3 points of wellbeing.
Over time, wellbeing has increased with income in some countries but not in others. Thus statement (2) is still the subject of ongoing research.
From direct studies on individual data, it is clear that in most cases a rise in other people’s income reduces your own wellbeing. This means that the effect of one person’s income on that individual’s wellbeing overestimates the effect of economic growth on the overall wellbeing of society.
From a policy point of view, income comparisons mean that when a person earns more she imposes a cost on other people. This is a negative externality and one way to control it would be by corrective taxation. If the externality is as large as our estimates suggest, this could mean that quite high rates of marginal tax are efficient.
The final issue is economic fluctuations. Wellbeing rises in booms and falls in slumps. One important reason is loss-aversion – people dislike a loss of income twice as much as they like a gain of equal size (both likes and dislikes being measured in units of ex post wellbeing). This may be a partial explanation of the Easterlin paradox. In terms of policy, it means that economic stability is enormously important and ought not to be sacrificed in pursuit of small increases in long-term economic growth.
Questions for discussion
(1) Are the estimated effects of income plausible?
(2) Why do they not differ between rich and poor countries?
(3) Are national time-series estimates of the effect of income consistent with within-country estimates based on individuals?
(4) How important are social comparisons and what are their policy implications?
(5) How important is adaptation?