The Washout Argument Against Longtermism
Eric Schwitzgebel
Department of Philosophy
University of California, Riverside
Riverside, CA 92521
USA
June 21, 2024
The Washout Argument Against Longtermism
Abstract: We cannot be justified in believing that any actions currently available to us will have a non-negligible positive influence on the billion-plus-year future. I offer three arguments for this thesis. According to the Infinite Washout Argument, standard decision-theoretic calculation schemes fail if there is no temporal discounting of the consequences we are willing to consider. Given the non-zero chance that the effects of your actions will produce infinitely many unpredictable bad and good effects, any finite effects will be washed out in expectation by those infinitudes. According to the Cluelessness Argument, we cannot justifiably guess what actions, among those currently available to us, are relatively more or less likely to have positive effects after a billion years. We cannot be justified, for example, in thinking that nuclear war or human extinction would be more likely to have bad than good consequences in a billion years. According to the Negligibility Argument, even if we could justifiably guess that some particular action is likelier to have good than bad consequences in a billion years, the odds of good consequences would be negligibly tiny due to the compounding of probabilities over time.
Word count: ~6500 words
Keywords: decision theory; futurism; longtermism; MacAskill, William
The Washout Argument Against Longtermism
According to what I will call longtermism, our decisions should be influenced by our expectations for their effects on the very long-term future – not just the next ten years or even the next thousand years, but the next billion-plus years. MacAskill’s 2022 book What We Owe the Future is perhaps the best-developed articulation of this view (see also Beckstead 2019; Ord 2020; Greaves and MacAskill 2021; Tarsney 2023). I will argue, on the contrary, that our decisions should be not at all influenced by our expectations about their effects a billion years in the future.
My grounds are this: There are no practically available actions – nothing we can actually do now – that we are justified in believing will have a non-negligible positive influence on events a billion or more years from now, due to the many massively complex possible causal ramifications of any action. Partly following Greaves and MacAskill (2021), I will call this the washout argument against longtermism.[1] I offer three versions of the washout argument:
· an infinite version, which explores the unintuitive consequences of granting even a small chance that our actions have infinitely many consequences;
· a finite version, which asserts cluelessness about what current actions would have good versus bad impacts after a billion years;
· another finite version, which asserts that even if we could justifiably guess what current actions would have good versus bad impacts in a billion-plus years, the expected impact of any currently available action is negligible.
I will conclude by cheering longtermists’ practical suggestions for devoting resources to the medium-term (several-hundred-year) future. Indeed a more modest version of “longtermism” might commit only to being influenced by expectations over this much shorter time frame. Still, estimates over much longer durations are distinctive of the current longtermist movement; and leading longtermist thinkers are at least implicitly committed to billion-year longtermism in virtue of not discounting future lives while encouraging temporally unbounded estimates of the impact of our actions.
1. Infinite Washout.
The most straightforward interpretation of current physical theory appears to imply that the universe will endure infinitely and that almost all of our actions will have infinitely many positive and negative effects into the future. I have defended this view in detail elsewhere (Schwitzgebel and Barandes 2024). I will only sketch the general idea here. If you raise your hand now, that will cause many particles to change their trajectories, which will then cause many other particles to change their trajectories, which will then cause many other particles to change their trajectories, in an ever-expanding ripple of effects that extends into the post-heat-death universe. Since there’s no reason to expect a future temporal boundary at which time ceases, a reasonable default assumption is that the universe will endure infinitely post-heat-death. Most physical theories imply there will occasionally be chance thermodynamic or quantum fluctuations of very large magnitude, and there appears to be no cap on the size of these fluctuations. If so, then given infinite time to wait, whole new galaxies will eventually fluctuate into existence by massively improbable chance. Alternatively, black holes or some other process might seed new cosmic inflations, giving rise to new galaxies by those more ordinary-seeming means. Either way, the still-rippling effects of your current actions will then influence what occurs in those galaxies. Eventually – maybe after a googolplex-to-the-googolplex years – a photon that wouldn’t have been exactly where it is, with exactly the energy it in fact has if I hadn’t raised my hand just now, will strike a future radioastronomer’s device in just the right way to push a triggering mechanism over a threshold, causing the astronomer to check a device and then write an article that wins a prize, forever changing her career. Another photon will cause another ripple of effects that gives some future diplomat cancer, resulting in a power shift that precipitates a terrible war. Continuing into the infinite future, if this cosmological model is correct, virtually every action you perform will have infinitely many positive and infinitely many negative effects. Call this the Infinitary Cosmology.
This is a problem
if your decision procedure requires weighing up all the expected positive and
negative effects of your actions, with no temporal discounting. Suppose Action A is giving a million dollars
to an effective charity, with benefit m,
and Action B is using that money to burn down the house of a neighbor with an
annoying dog, with benefit n (n might be negative). The total expected value of Action A might be
modeled as m + ∞ + -∞. The total expected value of Action B might be
modeled as n + ∞ +
-∞. Neither value is mathematically
defined. They can’t be weighed against
each other. Nor can we escape the
problem by assuming, for example, that over the long term, the positive effects
outweigh the negative effects. Then the
expected value of A would be m
+ ∞ and the expected
value of B would be n + ∞. These values are equal. As the saying goes, infinity plus one is just
infinity.
We also can’t escape the problem
by assuming that in the long run the positive and negative effects of both
actions approximately balance. As long
as positive and negative effects are randomly distributed in both the Action A stream and the Action B stream, there will be no finite limit to the size of a run of good or bad effects in either stream: Neither series will converge to a zero sum. What
might converge to zero is the ratio
of positive to negative effects as the series of effects goes to infinity. But then again, if we’re looking at ratios
rather than sums, there’s nothing to choose between Actions A and B: Both
converge toward an equal ratio of positive vs. negative effects: In the ratio
limit, the initial m and n wash out.
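To make the washout in the ratio limit concrete, here is a small illustrative Python simulation (not from the paper; the plus-or-minus-one effect sizes and the stand-in values for m and n are my assumptions). It models each action as a finite initial value followed by a long stream of chance positive and negative effects.

```python
# Illustrative simulation (toy numbers, not the paper's): model each action as
# a finite initial value (m or n) followed by a long stream of chance +1 / -1
# effects.
import random

random.seed(0)

def simulate(initial_value, n_effects=1_000_000):
    total = initial_value
    positives = negatives = 0
    for _ in range(n_effects):
        effect = random.choice((1, -1))  # later effects are pure chance
        total += effect
        if effect > 0:
            positives += 1
        else:
            negatives += 1
    return total, positives / negatives

for label, initial in [("Action A (charity, m = +100)", 100),
                       ("Action B (arson, n = -100)", -100)]:
    total, ratio = simulate(initial)
    print(label, "| running sum:", total, "| pos/neg ratio:", round(ratio, 4))
```

The running sums behave like a random walk and do not settle toward any finite value, while both positive-to-negative ratios approach 1: in the ratio limit, the initial difference between m and n disappears.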
Now you might regard the Infinitary Cosmology as unlikely. Maybe you even have a high credence that it’s
false – say, 99.9% credence. The
possible infinitudes will still destroy your long-term expected value
calculations. Suppose that you know that
if the Infinitary Cosmology is false, the expected
value of A will be m. Given your 0.1% credence in the Infinitary Cosmology, the expected value of your action
will be .999 * m + .001 * (∞ +
-∞) – in total, an undefined value.
You might think that symmetry or
dominance reasoning can help you escape this conclusion. Not so.
Consider the following.
Eventually, among the long-term effects of your million-dollar
charitable donation (value m) will be
the value equivalent of burning down your neighbor’s house (value n).
Eventually, among the long-term effects of your burning down the house
(value n) will be the value equivalent
of donating a million dollars to charity (value m). Of course, there will
still remain infinitely many positive and negative effects of both actions:
Pulling an n or an m out of the infinitude will not make it
any less an infinitude. Thus, we can model
the effects of A and B as equally m +
n + ∞ + -∞. Actions A and B can then be treated as
symmetric, with neither dominating the other.
In general, trying to subtract away infinitudes, or balance them against
each other, or engage in other linear transformations of infinitude, leads to
paradox unless we confine ourselves carefully to the mathematics of limits. Maybe this is unintuitive. But infinitude is unintuitive.
If there’s a non-zero chance
that your actions have infinitely many positive and negative effects in an Infinitary Cosmology of the sort described, then unless
there’s some temporal discounting, decision theory collapses. The lesson I draw is that there’s a problem
with one of the background assumptions of longtermism
as formulated by MacAskill and others. Longtermists’ application of standard decision theory with
no temporal discounting will inevitably generate either indifference, paradox,
or undefined values. The tools break,
and it is at best an open question whether they can be appropriately modified
to handle the full range of epistemically possible infinitary cases.[2] We must think about our decisions in some
other way.
2. Finite Washout: Cluelessness.
Let’s suppose that the longtermist ducks the Infinite Washout objection by
implementing a temporal cutoff – say, at the heat death of the universe. They might justify this cutoff by pleading
radical ignorance about good or bad effects after heat death: The good or bad
effects of our current actions on any particular spatiotemporal region post
heat death will be sheer unpredictable chance, not worth figuring into our
decision calculus.
That is a reasonable response. But notice that it is already a significant
concession. It abandons the
time-indifferent calculus that longtermists tend to
prefer in favor of discounting the far future.
And it requires what is arguably an inelegant and unjustified cutoff at
heat death – inelegant and unjustified in part because arguably we should
assign some tiny credence to the possibility that our descendants persist even
after heat death, for example, by discovering new sources of energy in
violation of current understandings of the laws of thermodynamics. Longtermists could
perhaps find a more elegant solution by employing a smoother discount function
– but smooth discounting functions generally have other unintuitive
consequences and are directly contrary to the spirit in which longtermism is typically offered.[3] Furthermore, if the longtermist
appeals to post-heat-death ignorance to justify disregarding consequences that
far in the future, they open themselves up to the possibility of an earlier
ignorance-based cutoff date.
The last point is the one I will
press in this section. We are radically
ignorant about the value of our current actions for the state of things a
billion years in the future. A billion
years is on the neartermish end of the time between
now and heat death: the Sun won’t swallow Earth for about another 8 billion years,
and the last stars of the observable portion of the universe aren’t estimated
to burn out for many trillions of years.
If MacAskill’s and most other longtermists’ reasoning is correct, the world is likely to
be better off in a billion years if human beings don’t go extinct now than if
human beings do go extinct now, and decisions we make now can have a
non-negligible influence on whether that is the case. In the words of Toby Ord, humanity stands at
a precipice. If we reduce existential
risk now, we set the stage for possibly billions of years of thriving
civilization; if we don’t, we risk the extinction of intelligent life on Earth. It’s a tempting, almost romantic vision of
our importance. I also feel drawn to
it. But the argument is a card-tower of hand-waving
plausibilities.
Equally breezy towers can be constructed in favor of human
self-extermination or near-self-extermination.
Let me offer two.
The
Dolphin Argument. The most obvious solution to the
Fermi Paradox is also the most depressing.
The reason we see no signs of intelligent life elsewhere in the universe
is that technological civilizations tend to self-destruct in short order. If technological civilizations tend to gain
increasing destructive power over time, and if their habitable environments can
be rendered uninhabitable by a single catastrophic miscalculation or a single
suicidal impulse by someone with their finger on the button, then the odds of
self-destruction will be non-trivial, might continue to escalate over time, and
might cumulatively approach nearly 100% over millennia. I don’t want to commit to the truth of such a
pessimistic view, but in comparison, most other solutions seem like wishful
thinking – for example, that the evolution of intelligence requires
stupendously special circumstances (the Rare Earth Hypothesis) or that technological
civilizations are out there but sheltering us from knowledge of them until
we’re sufficiently mature (the Zoo Hypothesis).
Anyone who has had the good
fortune to see dolphins at play will probably agree with me that dolphins are
capable of experiencing substantial pleasure.
They have lives worth living, and their death is a loss. It would be a shame if we drove them to
extinction.
Suppose it’s almost inevitable
that we wipe ourselves out in the next 10,000 years. If we extinguish ourselves peacefully now –
for example, by ceasing reproduction as recommended by antinatalists
– then we leave the planet in decent shape for other species, including
dolphins, which might continue to thrive.
If we extinguish ourselves through some self-destructive catastrophe –
for example, by blanketing the world in nuclear radiation or creating
destructive nanotech that converts carbon life into gray goo – then we probably
destroy many other species too and maybe render the planet less fit for other
complex life. To put some toy numbers on
it, in the spirit of longtermist calculation, suppose
that a planet with humans and other thriving species is worth X utility per
year, a planet with other thriving species with no humans is worth X/100
utility (generously assuming that humans contribute 99% of the value to the
planet!), and a planet damaged by a catastrophic human self-destructive event
is worth an expected X/200 utility. If
we destroy ourselves in 10,000 years, the billion-year sum of utility is 10^4 * X + (approx.) 10^9 * X/200 = (approx.) 5 * 10^6 * X. If we peacefully bow out now, the sum is 10^9 * X/100 = 10^7 * X. Given
these toy numbers and a billion-year, non-human-centric perspective, the best
thing would be humanity’s peaceful exit.
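For readers who want to check the arithmetic, here is a minimal Python sketch of the toy calculation, treating X as one unit of planetary value per year; the scenario and numbers are the ones just given, and the code is only an illustration.

```python
# A minimal check of the toy numbers above, with X = 1 unit of planetary value
# per year.
X = 1.0
YEARS = 10**9

# Humans persist 10,000 years at full value X, then a catastrophic
# self-destruction leaves a damaged planet worth an expected X/200 per year.
value_catastrophe = 10**4 * X + (YEARS - 10**4) * (X / 200)

# Humans bow out peacefully now; other species thrive at X/100 per year.
value_peaceful_exit = YEARS * (X / 100)

print(f"self-destruction in 10,000 years: {value_catastrophe:.3g} * X")  # ~5e+06
print(f"peaceful exit now:                {value_peaceful_exit:.3g} * X")  # 1e+07
```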
Now the longtermists
will emphasize that there’s a chance we won’t wipe ourselves out in a terribly
destructive catastrophe in the next 10,000 years; and even if it’s only a small
chance, the benefits could be so huge that it’s worth risking the
dolphins. But this reasoning ignores a counterbalancing
chance: that if human beings stepped out of the way, a better species might evolve on Earth. Cosmological evidence suggests that
technological civilizations are rare; but it doesn’t follow that civilizations are rare. There has been a general tendency on Earth,
over long, evolutionary time scales, for the emergence of species with
moderately high intelligence. This
tendency toward increasing intelligence might continue. We might imagine the emergence of a highly
intelligent, creative species that is less destructively Promethean than we are
– one that values play, art, games, and love rather more than we do, and
technology, conquering, and destruction rather less – descendants of dolphins
or bonobos, perhaps. Such a species
might have lives every bit as good as ours (less visible to any ephemeral
high-tech civilizations that might be watching from distant stars), and they
and any like-minded descendants might have a better chance of surviving for a
billion years than species like ours who toy with self-destructive power. The best chance for Earth to host such a
species might, then, be for us humans to step out of the way as expeditiously
as possible, before we do too much harm to complex species that are already
partway down this path. Think of it this
way: Which is the likelier path to a billion-year happy, intelligent species:
that we self-destructive humans manage to keep our fingers off the button
century after century after century somehow for ten million centuries, or that
some other more peaceable, less technological clade finds a non-destructive
stable equilibrium? I suspect we flatter
ourselves if we think it’s the former.
This argument generalizes to
other planets that our descendants might colonize in other star systems. If there’s even a 0.01% chance per century
that our descendants in Star System X happen to destroy themselves in a way
that ruins valuable and much more durable forms of life already growing in Star
System X, then it would be best overall for them never to have meddled, and
best for us to fade peacefully into extinction rather than risk producing
descendants who will expose other star systems to their destructive touch.
The
Nuclear Catastrophe Argument. MacAskill and other longtermists emphasize the importance of humans learning to
take existential risks seriously. They
also generally hold that a nuclear war would not likely permanently destroy the
species. This suggests a method for
teaching humanity to take existential risk seriously: Start a nuclear war.
Suppose that a nuclear war has a
5% chance of destroying humanity, but that if it doesn’t destroy us, for the
next 10,000 years the survivors take existential risk more seriously, reducing
the existential risk to humanity from, say, 2% per century to 1.9% per
century. Assuming that risk per century
is otherwise constant and independent, the 10,000-year survival odds are as follows:
without nuclear war, 13.3%; with nuclear war, 14.0%. Now might in fact be the optimal time to
engage in all-out nuclear war. If our
weapons of mass destruction grow more powerful over the next hundred years,
all-out war will likely be riskier. Our
odds of surviving all-out worldwide war in a hundred years might only be 75%
instead of 95%. Best, then, to act in
the sweet spot where there’s enough destruction to durably scare us into a
sober attitude, while it’s still very likely we’d survive!
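The survival odds just cited are easy to verify; here is a brief Python check using the toy parameters above (a 5% chance that the war itself ends us, then a constant and independent per-century risk).

```python
# A quick check of the 10,000-year survival odds, using the toy parameters
# above: 2% existential risk per century, reduced to 1.9% if a survivable
# nuclear war (5% chance of ending us outright) sobers humanity up.
CENTURIES = 100  # 10,000 years

p_no_war = (1 - 0.02) ** CENTURIES            # risk stays at 2% per century
p_with_war = 0.95 * (1 - 0.019) ** CENTURIES  # survive the war, then 1.9% per century

print(f"without nuclear war: {p_no_war:.1%}")   # 13.3%
print(f"with nuclear war:    {p_with_war:.1%}") # 14.0%
```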
Is this implausible? I don’t think so. Humans learn much better by hard experience
than by philosophical treatises about relative risks and benefits. Humanity might be like the reckless teen
driver whose nearly fatal accident finally teaches them caution. Humanity is far too cavalier about
existential risk, longtermists say. Well, what could teach us more powerfully and
unforgettably?
My aim with the Dolphin Argument
and the Nuclear Catastrophe Argument is not to convince readers that humanity
should bow out for the sake of other species, much less that we should start a
nuclear war. Rather, my thought is this:
It’s easy to concoct stories about how what we do now might affect the
billion-year future, and then to attach decision-theoretic numbers to those
stories. We lack good means for
evaluating these stories. We are likely
just drawn to one story or another based on what it pleases us to think and
what ignites our imagination.
I suggest that it is no more plausible that the best thing we
can do for the billion-year future is, as MacAskill suggests, fund studies of existential
risk in hopes that we can permanently lock in positive values than that the
best thing we can do for the state of the world in a billion years is suffer
near-term extinction or global catastrophe.
I’m not saying that extinction
or nuclear war (or whatever) are almost
as likely to have good as bad billion-year consequences. If our credences are close to parity but
uneven – say, 50.1% good vs. 49.9% bad – the longtermist
argument can get a toehold by appeal to extremely high stakes (50.1% times
trillions is much better than 49.9% times trillions). I am making a stronger claim of radical
uncertainty. One way of characterizing
this radical uncertainty is to balance the credences and values precisely. Maybe for all values of x, contingent on
assuming that the world is either better or worse by x amount over the course
of a billion years as a result of having a nuclear war now, our credence that
the world is better by x amount should be exactly 50%. Or maybe better: If the expected
short-to-medium term impact of an action is y, we should always treat the
expected billion-year-impact as also y, unless there is sufficient
justification to think otherwise, meeting an epistemic standard that is not met
in these cases. Alternatively, maybe the
best approach to radical uncertainty is to say that here, as in the infinite
case, mathematical decision theory fails – more likely to mislead than
enlighten.[4]
We are radically clueless about
what we can do now that would be likely to have good effects after a billion
years. We can reasonably estimate some things about the billion-year
future. It’s a good guess that the Sun
will still exist. But the positive or
negative utilities in Year One Billion of current efforts to prevent human
extinction or near-extinction wash out in a fog of radical inscrutability.[5]
3. Finite Washout: Negligibility.
Suppose we ignore the problem of
infinite utilities, and suppose we also reassuringly guess that various things
we can do now can reasonably be estimated to improve the long-term future by
improving the odds of continued human existence. The question arises: How much impact on the far-distant future should we expect from any
action that is possible today? There’s
reason to think that the expected impact would be negligibly small.[6]
To get a sense of how small,
let’s start with some toy numbers.
Suppose that I can do some Action A now (perhaps a $100,000 donation to
the Longtermism Fund) with the following expected
consequences in the temporal range from Year One Billion to heat death: a one
in a trillion chance of leading to a trillion happy descendants who wouldn’t
otherwise exist, plus a one in ten trillion chance of ten trillion happy descendants,
plus a one in a hundred trillion chance of a hundred trillion happy
descendants, and so on. Assuming for
simplicity that all descendants’ lives are of equal value and applying the
expected value formula in the usual way, Action A would have infinite value. Ah – but wait, that would presumably require
infinite resources, and this is supposed to be the finite portion of the argument. So let’s cap the possible number of happy
descendants. If we cap at 10^16,
and if we value one future happy life that wouldn’t otherwise exist at one util, then the expected value of Action A is 1 + 1 + 1 + 1
+ 1 = 5 utils.
If we more generously cap the possible number of happy descendants at 10^50,
then the value is 39 utils – quite possibly worth my
$100,000 if there’s no temporal discounting.
But tweaking the probabilities yields
very different numbers. Suppose that
Action A has a one in ten quadrillion chance of leading to a trillion happy
descendants who wouldn’t otherwise exist, a one in 200 quadrillion chance of
leading to ten trillion happy descendants who wouldn’t otherwise exist, a one
in four quintillion chance of leading to a hundred trillion happy descendants,
and so on. Now, even with no cap on the
number of descendants, the expected value of Action A is 1/10000 + 1/20000 +
1/40000 + 1/80000 + … = 2/10000 descendants – less than one thousandth of a
happy life. You might do much better
spending that $100,000 on mosquito nets, or maybe just having some unprotected
sex.
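Here is a short Python sketch of both toy schedules, using exact rational arithmetic; the probabilities and caps are the ones given above, and the code is purely illustrative.

```python
# Sketch of the two toy probability schedules above. Outcomes: 10^12, 10^13,
# 10^14, ... extra happy lives; one life counts as one util.
from fractions import Fraction

def expected_utils(prob_of, cap_exponent):
    # Sum prob(10^k lives) * 10^k over outcomes 10^12 up to 10^cap_exponent.
    return sum(prob_of(k) * 10**k for k in range(12, cap_exponent + 1))

# First schedule: a 1/10^12 chance of 10^12 lives, 1/10^13 of 10^13, and so on.
first = lambda k: Fraction(1, 10**k)
print(expected_utils(first, 16))           # cap at 10^16 -> 5 utils
print(expected_utils(first, 50))           # cap at 10^50 -> 39 utils

# Second schedule: 1/10^16 chance of 10^12 lives, 1/(2*10^17) of 10^13,
# 1/(4*10^18) of 10^14, ... (10x the lives, 1/20th the probability per step).
second = lambda k: Fraction(1, 10**16 * 20**(k - 12))
print(float(expected_utils(second, 200)))  # ~0.0002 utils, even with a huge cap
```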
So, which set of numbers is more
reasonable? For comparison, consider the
chance of changing the outcome of an election by voting. Political scientists have estimated this in
various ways, but in a simple popular-vote election with ten million voters, a
not-unreasonable ballpark estimate is that there’s a one in ten million chance
that your vote is decisive (Gelman, Katz, and Bafumi 2004; Barnett 2020).
I suggest that the confidence you should have that any action you
perform now will lead to a trillion or more additional good lives in the
long-term future should be many orders of
magnitude less than the confidence you should have that your vote in a
ten-million person election will be the decisive vote for your favored
candidate. A one in a trillion chance –
what we assumed in our first calculation – seems far too close in magnitude to
one in ten million, only five orders of magnitude different, despite an
intervening billion years. In a billion
years, a lot can happen to ruin your plans.
Before any action of yours could give rise to a trillion descendants,
there must be at least 999 billion other people with various other plans. All of these many billions of people will be
temporally closer to that billion-year future than you are and thus presumably
able to influence that future more directly than you can, with fewer
intervening causal threads and fewer contingencies that need to work out
favorably.
You cast your stone into the
pond, hoping for a billion-year positive ripple. You donate that $100,000. Suppose you estimate a 50% chance that in a
hundred years your action is still having some positive consequences, versus a
50% chance that it is washed out, counteracted, or lost into the noise of the
past, with no continuing overall positive influence of the sort you were hoping
for. (A washed out action, in this
sense, might still have random or unpredictable positive and negative ripples
in the manner described in Section 1, but it would take a great stroke of luck
for all those ripples to add up to plus-one-trillion good lives, and once the
effects become random, symmetry suggests similar odds of similar magnitude
negative effects.) Will it continue to
have the positive effects you were hoping for in another hundred years, and
another, and another? If we assume a 50%
washout chance per hundred years, the odds that your action is still having those
overall positive consequences in a billion years is one in 2^10,000,000 – that is, under one in 10^1,000,000.
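The compounding is simple to check; here is a short Python calculation of the figure just given (10^7 hundred-year periods, each with a 50% washout chance).

```python
# 50% washout chance per century, compounded over a billion years.
from math import log10

centuries = 10**9 // 100              # 10,000,000 hundred-year periods
log10_odds = centuries * log10(0.5)
print(f"about 10^{log10_odds:.0f}")   # ~10^-3010300: one in 2^10,000,000,
                                      # far smaller than one in 10^1,000,000
```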
Presumably with calculations of
this sort in mind, MacAskill, Ord, and other longtermists
suggest that we are currently in an especially
crucial time. If by surviving the next 10,000 years we could
almost eliminate existential risk and maybe also permanently lock in good
values, then our actions now are particularly significant and stand a
reasonable chance of not being washed out in the ten million centuries between
now and a billion years from now.
It’s not completely implausible
to think that we live in a uniquely crucial time. Maybe existential risk would fall
dramatically if we were to colonize other star systems sufficiently independent
from one another that a catastrophe in one is unlikely to precipitate
catastrophe throughout. Maybe we’re now
in a uniquely risky ten-thousand-year window, pre-colonization. Or maybe we have a unique opportunity over
the next few millennia to durably lock in a set of future values by using emerging
AI to create a rigid, unchangeable, billion-year-long world governance system
based on our current values. To me, this
seems somewhat less likely than space colonization. If the system is too inflexible, that might
increase existential risk via inability to respond flexibly to unforeseen
problems. More fundamentally, if future
generations have the kinds of creative, intelligent capacities that human
philosophers tend to prize and that presumably the longtermists
want to preserve, it seems unlikely that we can bind them forever to our
current value systems.
But suppose we grant a 1% chance
of the near-elimination of existential risk and the enduring lock-in of good
values if we survive the next 10,000 years, after which there’s just a 1%
chance per million years of extinction or some other type of washout of your
current good intentions. A 99% chance of
continuing good effects per million years for a thousand million-year eras
compounds to (0.99)^1000 – only a 0.004% chance of continuing good
effects a billion years out. Even under these
generous assumptions, that’s still a seven-orders-of-magnitude decrement, atop
whatever small chance there is of your action continuing its intended good
effects over the first 10,000 years. It’s
hard to see how any action you take now could realistically have anything like
a one in a trillion chance of leading to a trillion more good lives in a
billion years.
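Again the figures are simple to check in Python; the 1% lock-in chance and the 1%-per-million-year washout rate are the generous assumptions granted above.

```python
# The generous scenario above: a 1% chance of durable lock-in, then a 1%
# washout chance per million years for 1,000 million-year eras.
p_lock_in = 0.01
p_still_good = 0.99 ** 1000
print(f"{p_still_good:.4%}")               # ~0.0043%, the text's ~0.004%
print(f"{p_lock_in * p_still_good:.1e}")   # ~4.3e-07: roughly a
                                           # seven-orders-of-magnitude decrement
```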
To think otherwise is, I suspect,
to not vividly appreciate how long a billion years really is or how quickly
chances multiply. It only takes twelve
independent one-in-ten chances to fall to odds of one in a trillion. It seems likely that for any action of yours
to cause a trillion more good lives to exist a billion years or more in the
future would require much more than twelve independent one-in-ten chances to land
your way. Suppose, instead, it requires
a thousand one-in-ten chances. Maybe
every million years on average, something a little lucky needs to happen. (Of course, these thousand independent
chances needn’t be distributed equally over the billion years.) To me, even this estimate seems extremely
optimistic. The expected value of that
action is then 10^12/10^1000 – well under a googolth of a life.
Adding further one-in-ten chances of bigger impacts – say, another 1 in
10 chance for every extra order of magnitude of lives up to a maximum of 10^50 (10^13/10^1001, 10^14/10^1002, etc.) – still doesn’t bring the total expectation above a googolth of a life. Only if we calculate long past heat death
will we get a different result. But
unless there’s some principled reason to start discounting long after heat
death, such a move risks casting us back into the problems with the Infinitary Cosmology.[7]
I’m not committed to particular
odds or magnitudes, other than the general observation that the odds are tiny. My main thought is, I hope, sufficiently plausible
that we should accept it absent some rigorous argument to the contrary. Any action you engage in now has only a minuscule
chance of making the world vastly better in a billion years, and the chance
grows precipitously more minuscule the larger the imagined benefit. Our decision-making should be grounded in nearer-term
consequences. Whatever you draw in the
beach sand today has no material chance of enduring a billion years; and even
if the sands tumble around differently evermore, due to chaotic ripple effects
upon ripple effects, we cannot possibly know whether these effects will sum
more toward the good or the bad.
The Infinite Washout argument
establishes the incoherence of standard longtermist
decision theory without relying on any particular empirical facts, as long as
we have a non-zero credence in the Infinitary
Cosmology. In contrast, the finite
versions of the washout argument do depend on particular empirical facts. For example, if there were some action
available to us now that would entirely obliterate planet Earth, then I would
concede that it’s reasonable to think there’s a good chance that the
billion-year future would be much worse if we perform that action. (One might doubt even this, though, if one
worries that future life might be so miserable, on average, that it’s better to
just end things now.) Thus, unlike
Infinite Washout, Cluelessness and Negligibility are not arguments against longtermism in principle, regardless of situation or
background knowledge. They are only
arguments that we ourselves should
not be longtermists.
4. The Costs of Longtermist Thinking.
The argument of Section 3, if successful, establishes only that billion-plus-year expectations should have at most a very small influence on our decisions.
But suppose we rationally come to believe that Options A and B are
identical in their less-than-billion-year expected value while Option A has a
better billion-year-plus expected value?
Could we then rationally use those very long term expectations to
justify selecting Option A?
A case can be made against even
that seemingly innocuous application of longtermist
reasoning if there’s reason to think that longtermist
reasoning is costly. I see three ways in
which longtermist reasoning might be costly:
First, longtermist
reasoning might be effortful. If the
expected benefit of longtermist reasoning is a
quadrillionth of a life, it might not be worth even a millisecond of cognitive
effort to try to get the decision right; one’s cognitive resources are better
expended elsewhere.[8]
Second, longtermist
reasoning might have negative effects on other aspects of one’s cognitive life,
for example, by encouraging inegalitarian or
authoritarian fantasies, or a harmful neo-liberal quantification of goods, or
self-indulgent rationalization, or a style of consequentialist thinking that
undervalues social relationships or already suffering people (Srinivasan 2015;
Torres 2021).
Third, given its difficulty and
uncertainty, billion-year-plus thinking adds significant noise and risk of
error to our decision making. For
example, overconfidence is endemic in forecasting, and even expert geopolitical
and economic forecasters tend to decline toward chance over five- to ten-year
time frames (Tetlock and Gardner 2015). It’s a reasonable conjecture that decisions
driven by billion-year-plus forecasts, to the extent they diverge from
decisions driven by nearer-term forecasts, will tend to be mistakes of
overconfidence. One might optimistically
hope to correct for cognitive biases by being appropriately modest in one’s
thinking, but the literature on rationalization, motivated reasoning, and “bias
blind spots” suggests that people tend to be poor judges of their own degree of
bias, making effective self-correction difficult (Pronin,
Gilovich, and Ross 2004; Kahan
2013; Scopelliti et al. 2015; Schwitzgebel
and Ellis 2017; Porter et al. 2022). On
higher-order epistemic grounds, we should avoid billion-year longtermist reasoning as likelier to lead to costly
mistakes than to generate valuable insight.
The first and third claims seem
plausible on their face, and the second seems plausible if the third is
plausible, since cognitive complexity is fertile soil for cognitive vice. If any of these three claims about the costs
of longtermist thinking is correct, they can be
appended to earlier considerations about negligibility to derive the conclusion
that we shouldn’t give any weight to billion-year-plus outcomes in our decision
making, even just to resolve apparent medium-term ties. The cost of billion-year thinking outweighs
the tiny potential expected benefits.
5. The Medium Term.
A thousand years is more visible
than a billion. No evolutionary
descendants of dolphins or bonobos will be constructing civilizations within a
thousand years, at least not without the help of humans or our descendants or
constructions. A nuclear war would
plausibly be overall bad in its thousand-year implications. And some people or organizations who exist
today might perform actions whose consequences retain a stable overall positive
or negative balance a thousand years from now – though if we imagine a medieval
scholar attempting to predict the various good and bad thousand-year
consequences of agricultural innovations, or of the reunification of China by
the Song Dynasty, or of the policies of the medieval Catholic Church, we begin
perhaps to see the challenges we face in anticipating the various good and bad
thousand-year consequences of, say, developments in Artificial Intelligence or
world governance. If the arguments of
Sections 2 through 4 are correct, at some point after a thousand years but
before a billion years – ten thousand years? a million years? ten million
years? – any available action you could now perform is so swamped over with
unpredictable good and bad possible effects that you should give that distant
future no weight in your decisions.
Still, a thousand years is long
term compared to quarterly profits and next year’s elections, and we should
probably attempt some humble, uncertain conjectures concerning what would be
relatively good or bad for the world over that time frame. Since the specific actions that longtermists like MacAskill and Ord recommend are plausibly
good over a thousand-year time frame, I think we can support such actions under
that conception. It’s worth taking AI
risk and climate change seriously, not just for their possible effects on our
grandchildren but also for their possible effects in six hundred years; and
there’s certainly a sense of “longtermism” in which
this more visible time frame is the exclusive driver of practical decisions. However, let’s not try to bolster the case
for action by conjecturing about one-in-a-trillion chances of particular good
or bad consequences a billion years in the future.[9]
References
Arntzenius, Frank (2014). Utilitarianism, decision theory and eternity. Philosophical
Perspectives, 28, 31–58.
Barnett, Zach (2020). Why you should vote to change the
outcome. Philosophy & Public Affairs, 48, 422-446.
Beckstead, Nick (2019). A brief argument for the overwhelming
importance of shaping the far future. In
H. Greaves and T. Pummer, eds., Effective altruism. Oxford
University Press.
Beckstead, Nick, and Teruji
Thomas (2024). A paradox for tiny
probabilities and enormous values. Noûs, 58, 431-455.
Bostrom, Nick (2009). Pascal’s mugging. Analysis,
69, 443-445.
Bostrom, Nick (2011). Infinite ethics. Analysis
and Metaphysics, 10, 9-59.
Easwaran, Kenny (2021). A new method for value aggregation. Proceedings
of the Aristotelian Society, 121, 299-326.
Gelman, Andrew, Jonathan N. Katz, and Joseph Bafumi (2004). Standard voting power indexes do not work: An empirical analysis. British Journal of Political Science, 34, 657-674.
Greaves, Hilary (2016). XIV – Cluelessness. Proceedings of the Aristotelian Society, 116, 311-339.
Greaves, Hilary, and William
MacAskill (2021). The case for strong longtermism. Global Priorities Institute, working paper
no. 5-2021.
Kahan,
Dan (2013). Ideology, motivated reasoning,
and cognitive reflection. Judgment and Decision Making, 8,
407-424.
Lenman, James (2000). Consequentialism
and cluelessness. Philosophy & Public Affairs, 29, 342-370.
MacAskill, William (2022). What we
owe the future. Basic Books.
Meacham, Christopher J. G.
(2020). Too much of a good thing:
decision-making in cases with infinitely many utility contributions. Synthese 198, 7309-7349.
Monton, Bradley (2019). How to avoid maximizing expected
utility. Philosophers’ Imprint, 19 (18).
Mogensen, Andreas L. (2021). Maximal cluelessness. Philosophical
Quarterly, 71, 141-162.
Nelson, Mark T. (1991). Utilitarian eschatology. American
Philosophical Quarterly, 28, 339-347.
Ord, Toby (2020). The
precipice. Hachette.
Porter, Tenelle, Abdo Elnakouri,
Ethan A. Meyers, Takuya Shibayama, Eranda Jayawickreme, and Igor
Grossmann (2022). Predictors and
consequences of intellectual humility. Nature Reviews Psychology, 1, 524-536.
Pronin,
Emily, Thomas Gilovich, and Lee Ross (2004). Objectivity in the eye of the beholder:
Divergent perceptions of bias in self versus others. Psychological
Review 111, 781-799.
Schwitzgebel, Eric (2024a). Formal
decision theory is an optional tool that breaks when values are huge. Blog post at The Splintered Mind (May 9).
Schwitzgebel, Eric (2024b). The weirdness of the world. Princeton University Press.
Schwitzgebel, Eric (forthcoming).
Repetition and value in an infinite universe. In S. Hetherington, ed., Extreme Philosophy. Routledge.
Schwitzgebel, Eric, and Jacob Barandes (2024). Almost
everything you do causes almost everything (under certain not wholly implausible
assumptions); or infinite puppetry. In
E. Schwitzgebel, The
weirdness of the world. Princeton
University Press.
Schwitzgebel, Eric, and Jonathan E. Ellis
(2017). Rationalization in moral and philosophical
thought. In J.-F. Bonnefon
and B. Tremoliere, eds., Moral inferences. Psychology
Press.
Scopelliti,
Irene, Carey K. Morewedge, Erin McCormick, H. Lauren
Min, Sophie Lebrecht, and Karim S. Kassam (2015). Bias
blind spot: Structure, measurement, and consequences. Management
Science, 61, 2468-2486.
Srinivasan,
Amia (2015).
Stop the robot apocalypse. London Review of Books, 37 (18),
September 24.
Tarsney, Christian (2023). The epistemic challenge to longtermism. Synthese, 201 (195).
Tarsney, Christian and Hayden Wilkinson (2023). Longtermism in an infinite world. Global Priorities Institute working paper no. 4-2023.
Thorstad, David (2021).
The scope of longtermism. Global
Priorities Institute, working paper no. 6-2021.
Thorstad, David (2023). High risk, low reward: A challenge to the astronomical
value of existential risk mitigation. Philosophy and Public Affairs, 51, 373-412.
Thorstad, David (2024).
Mistakes in the moral mathematics of existential risk. Ethics.
Titelbaum, Michael (2021). Normative modeling. PhilSci Archive no. 18670.
Torres, Émile
P. (2021). The dangerous ideas of “longtermism” and “existential risk”. Current
Affairs, July 8.
Wilkinson, Hayden (2021a). In defence of
fanaticism. Global Priorities Institute,
working paper no. 4-2020.
Wilkinson, Hayden (2021b). Infinite aggregation: Expanded addition. Philosophical Studies, 178, 1917-1949.
[1] This definition of washout
is somewhat broader than “washing out” as it is sometimes used in the
literature, encompassing Greaves and MacAskill’s (2021) “washing out” and
“cluelessness”, Thorstad’s (2021) “rapid diminution”
and “washing out”, and Tarsney’s (2023) “epistemic
challenge”.
[2] Schwitzgebel
(forthcoming) also presents a version of this argument; see also Nelson (1991)
for a precedent. Bostrom (2011) argues that no formal approach to decision making in
an infinite context is likely to be entirely elegant and intuitive, since the
mathematics of infinitude can’t handle all the plausible cases, and various
discounting regimes appear to generate unintuitive consequences. Easwaran (2021)
offers a dominance-reasoning based solution for a limited range of cases, but
the Infinitary Cosmology does not belong to that
range. Wilkinson (2021b) also offers no good
decision-theoretic solution to the problem, finding the outcomes in cases like
charity-vs-house-burning to diverge to either
+∞ or -∞ with equal probability, and thus no
decision-theoretical basis to choose one over the other. Arntzenius (2014)
offers another dominance-style approach, but as Meacham (2020) notes, it unintuitively privileges dense over sparse distributions of
seemingly equivalent good infinite outcomes (e.g., an outcome in which everyone
stays home will be preferable to one in which everyone accelerates away from
each other without changing their happiness); it also depends on controversial
assumptions about the structure of spacetime. Meacham’s more conservative alternative,
however, plausibly does not apply to the real case, since without a
spatiotemporal basis for ordering outcomes, “permutation invariance” is
probably not satisfied in physically plausible infinitary
cosmologies. Tarsney
and Wilkinson (2023) likewise acknowledge a range of epistemically
possible infinitary cases where extensions of standard
decision theory fail.
[3] See discussions in Bostrom 2011 and Ord 2020.
[4] “Fanaticism” – in which our
decisions are shaped by extremely tiny chances of extremely huge benefits
(Wilkinson 2021a) – might be another type of case in which the standard tools
of decision theory break. See my
discussion of Beckstead and Thomas 2024 in Schwitzgebel 2024a. More
generally, see Titelbaum 2021 on “normative modeling”.
[5] Although the appeal to
cluelessness about long-term effects might bring to mind Lenman
(2000), I accept Greaves’s (2016) critique of the Lenman argument for finite cases. Perhaps the dolphin and nuclear war cases can
be treated as the more intractable form of “complex cluelessness” described by
Greaves; see also Mogensen (2021) on the complexity
of comparing donation to Make-A-Wish vs. Against Malaria. The Infinite Washout argument, however, can
be seen as an adaptation of Lenman’s argument for a
particular infinite cosmological possibility.
[6] See also Thorstad (2021, 2023, 2024) for detailed critiques
of some of the numerical assumptions underlying longtermist
arguments.
[7] One might think of the longtermist argument as a Pascal’s mugging argument in
disguise (Bostrom 2009). Pascal’s mugging only succeeds if your
credence in the mugger’s promised outcomes does not decrease sufficiently
quickly to offset the additional goods promised. For example, if your credence that the mugger
will later give you $0-$1 is 1/1000, that he will give you $1.01-$2 is 1/4000,
that he will give you $2.01-$3 is 1/8000, etc., you won’t pay him even a
penny. A reasonable response to both
Pascal’s mugger and longtermist speculations about 10^30
or more happy future lives is to decrease one’s credence sharply with each
additional order of magnitude of hypothesized benefit. (Another solution is to generally reject
standard decision-making models under conditions of extremely low probabilities
of extremely high or low value outcomes, as discussed in, e.g., Monton 2019 and applied to longtermism
in Tarsney 2023.)
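To check the footnote’s arithmetic, assume for illustration that the bracket probabilities continue to halve after the first step; then even an upper-bound estimate of the expected payout (in Python) stays below a penny.

```python
# Illustrative check, assuming brackets $0-$1 at 1/1000, $1.01-$2 at 1/4000,
# $2.01-$3 at 1/8000, $3.01-$4 at 1/16000, and so on (halving thereafter).
# Using each bracket's maximum payout overestimates the expected gain.
upper_bound = (1 / 1000) * 1 + sum(k / (1000 * 2**k) for k in range(2, 200))
print(f"${upper_bound:.4f}")   # ~$0.0025: even the upper bound is under a penny
```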
[8] See Schwitzgebel 2024b, ch. 4, for a similar argument in favor of disregarding outcomes with less than a 10^-30 chance of occurring, if one has not already paid the cognitive cost of considering them.
[9] For helpful discussion,
thanks to William MacAskill, Brice Bantegnie, Jonathan
Birch, Richard Yetter Chappell, Kenny Easwaran, Myisha Cherry, Alex
Guerrero, Toby Ord, Matthew Southey, Jeff Sebo, Christian
Tarsney, David Thorstad,
and Michael Titelbaum; the audience at the Global
Priorities Institute; and the various people who have commented on relevant
posts on my blog and social media accounts.