Introduction
Causation is central to our understanding of the world and ourselves. The sciences provide causal explanations. Geology and physics explain the mechanisms that cause earthquakes and the causal processes by which earthquakes cause tsunamis. Biology explains the causes of the appearance of new biological species and of the extinction of other species. Historians explore the causes of antisemitism and the Holocaust. Our understanding of all these cases of causation is itself causally structured, and psychologists study the causal processes of learning, remembering and reasoning. Scientists and philosophers have also practical motivations for improving our understanding of causation. It is after all crucial for improving our means of changing things. Understanding the causes of the increase in economical inequality is inseparable from finding out by what means we might slow it down or reverse it. Understanding the causes of cancer, multiple sclerosis, or Covid-19 is required for conceiving of means for diminishing the incidence of these diseases, for soothing their symptoms and treating them. Understanding causation is also fundamental in the law. Individuals and firms are condemned on the basis of the courts’ judgment that they have caused harm. Causation poses a challenge both to philosophy of science and to metaphysics. What do scientists do when they look for and provide causal explanations? The challenge for metaphysics is to understand what reality must be like if causal explanations are true.
Before we look at philosophical theories of causation, we may start with the observation that the concept of causation that is used both in common sense and in science is structured around the following “central connotations of causation” (Reference SchafferSchaffer 2000a, p. 289).
1. The causal relation holds between distinct events.Footnote 1
2. The causal relation is intrinsic to these events. In particular, whether a causes b is determined locally, at the spatiotemporal location of a, b and in the region between them. It does not depend on what happens at other times and places.
3. Causes raise the chance of their effects.
4. Causes precede their effects, or at least don’t follow them, in time.
5. Asymmetry: if a causes b, it is not the case that b also causes a.
6. Objectivity: whether a causes b does not depend on us or any other observers or explanation-seekers.
7. The cause can be used as a means for making the effect happen.
8. One can infer the effect from knowledge of the cause, and one can infer the cause from knowledge of the effect.
9. Agents are morally and legally responsible for what they cause.
10. Effects depend on their causes. This dependence can be expressed in counterfactuals.
This Element presents the major attempts to account for causation as a metaphysical concept, in terms of (1) regularities and laws of nature, (2) probabilities and Bayes nets, (3) necessitation between universals and causal powers, (4) counterfactual dependence, (5) interventions, (6) causal models, and (7) processes or mechanisms. I will argue that causal models and process accounts are complementary. Causal models provide an account of causation at the level of types, whereas processes account for causation between particular events. The last part presents some contemporary debates in the metaphysics of causation: on the relation between grounding and causation, eliminativism with respect to causation in physics, the challenge against “downward” causation from the Closure and Exclusion principles, robust and proportional causation, and degrees of causation.
1 Regularities and Laws of Nature
David Hume’s account of causation is one of the main sources of contemporary attempts to analyze causation in terms of regularity. Hume himself did not offer a metaphysical theory of causation, as it is independently of us, because he was skeptical about the possibility of knowing what it takes, in a metaphysical sense, for one event to cause another.Footnote 2 Rather, he offers an epistemological theory of what makes us judge that one event causes another. There are two necessary conditions for a causing b (where a and b are particular eventsFootnote 3) that Hume claims we can know by observation: that a is earlier than b and that a and b touch each other (in technical jargon, that they are contiguous). Both can be challenged, especially on the background of contemporary physics,Footnote 4 but let us leave this to one side here. According to Hume, the origin of the difference between a causing b and a merely preceding and touching b, is psychological. When we judge that a causes b, rather than just following it, this judgment is based on an expectation that is the result of a psychological process of habituation.
1.1 Regularity Theory
Hume’s criteria of contiguity, anteriority and regularity can be used to construct a metaphysical theory of causation.
(R) “c causes e iff
i. c is spatiotemporally contiguous to e;
ii. e succeeds c in time; and
iii. All events of type C (i.e., events that are like c) are regularly followed by (or are constantly conjoined with) events of type E (i.e., events like e)” (Reference Psillos, Vanderbeeken and D’HooghePsillos 2009, p. 131).
Interpreting (R) as a metaphysical account of what makes it the case that c causes e excludes interpreting it as, for example, a semantic analysis of the meaning of the term “cause,” or as providing an epistemic tool for how the concept of cause can or should be used.Footnote 5
Here are some challenges to (R). It can be doubted (1) that each of conditions (i) – (iii) is individually necessary and (2) that all three are jointly sufficient for it to be the case that c causes e.
1. (i, ii, iii not jointly sufficient for causation) Thomas Reid has famously objected to Hume’s theory of causation that it has the absurd consequence that night is the cause of day and day the cause of night.Footnote 6 This doesn’t seem to be the only case of regular contiguous succession that is not a case of causation.
2. ((ii) not necessary: simultaneous causation) Stable situations seem to contain simultaneous causation: Kant mentions “a ball that lies on a stuffed pillow and makes a dent in it” (Reference Kant, Guyer and WoodKant 1781/1998, A203/B248, p. 312). Bricks and mortar and their composition cause the stability of a house. In a Newtonian model in which two bodies gravitate around each other only under the influence of the gravitational attraction of their masses, each body’s attraction toward the other is, at each instant, caused by their masses at that instant. All of these relations seem to be simultaneous, so that temporal precedence cannot be used to distinguish cause and effect.
3. ((ii) not necessary: backward causation) The possibility of causal influence running in the opposite direction of time, often called “backward causation,” is a matter of debate within physics.Footnote 7 Therefore, it would be ill-advised to build temporal precedence into the analysis of the concept of causation. This would make backward causation conceptually incoherent. It would mean to construe the concept in a way that is incompatible with its use in science, so that something would come out as conceptually impossible, although science takes it to be possible. Philosophy should construct concepts that are fit for being used in contemporary science.Footnote 8
4. ((iii) not necessary: singular causation) It is probably not correct, from a psychological point of view, that humans (and other animals) learn about causal connections between events only after having observed a succession of events several times. One can learn that flames burn our skin after a single experience of putting our hand in a fire. Moreover, the requirement of regular succession seems difficult to justify from a metaphysical point of view,Footnote 9 and it is not clear whether it could be expressed by a formal criterion: If F is a predicate that applies only to a single event a, and if Ga is also true, then Fa and Ga are formally an instance of a regular succession (U) (x)(Fx→Gx), though only in a trivial sense.Footnote 10
5. ((iii) not necessary: indeterministic causation) Another fundamental problem for regularity theories is that many, and probably most, causal regularities allow for exceptions. Benfluorex causes death by heart failure, although it doesn’t do this invariably but only in a significant proportion of cases. Even the most effective drug does not result in recovery in 100% of cases. It is now standard to require of an adequate theory of causation that it be at least adaptable to causal influences that make an effect more probable than it would have been otherwise, without making it certain to occur.Footnote 11 C may cause E even if C raises E’s probability from 0% only to some very small percentage, such as 0.1%.
1.2 Deductive-Nomological Theory
An influential version of regularity theory has been developed by the philosophers of the school of logical empiricism. As empiricists, they require that scientific facts must be observable at least in principle. Logical empiricism condemns metaphysics as systematically meaningless (Reference CarnapCarnap 1931). Nevertheless, the logical empiricist theory of what determines the truth value of causal statements can be interpreted as a metaphysical theory of the causal relations expressed by those statements. “A statement about a causal relation […] describes an observed regularity of nature, nothing more” (Reference Carnap and GardnerCarnap 1966/1995, p. 201). Carnap’s thesis is that the truth value of singular causal statements is determined by the truth value of universal statements expressing regularities. The metaphysical interpretation (not intended by Carnap himself) is a regularity theory of singular causation. The relevant regularities are taken to be (a certain type of) laws of nature.Footnote 12 According to this interpretation, whether a causes b is determined by whether there are laws of nature relating type (or description) A, to which a belongs, to type B, to which b belongs. “What is meant when it is said that event B is caused by event A? It is that there are certain laws in nature from which event B can be logically deduced when they are combined with the full description of event A” (Reference Carnap and GardnerCarnap 1966/1995, p. 194).Footnote 13
According to the so-called Deductive-Nomological (DN) model of scientific explanationFootnote 14 (Reference HempelHempel 1965, p. 247/248), a scientific explanation is a sound argument, that is, a valid deductive argument with true premises. The conclusion of the argument expresses the fact that is up for explanation: the so-called explanandum; the premises of the explanation (the so-called explanans) of a particular fact or event essentially contain (1) a description of particular facts or events, the so-called initial conditions, and (2) at least one law of nature. “Essentially” means that the deductive argument would lose its validity if some part of those premises were omitted. Carnap takes the conditions for the existence of a causal relation to be the same as the conditions for scientific explanation and prediction,Footnote 15 so that it is equivalent to say that C causes E and that E can be explained by a valid DN argument on the basis of C.Footnote 16
(DN-C) C causes E if and only if
1. C and E have empirical content and are true,
2. there is a valid argument that has as its premises C and at least one law of nature L, and E as conclusion.
3. Both C and the laws L are indispensable for the validity of the argument.
A useful way to express the link between the DN accounts of explanation and causation is to say that, if we ignore the epistemological aspects of the DN relation between explanans and explanandum and concentrate on the objective facts required to make the explanation correct, we get the metaphysical doctrine according to which causal relations are instances of lawful regularities.
(DN-C) requires the premises to contain laws, that is, regularities that are nomological and not merely accidental. However, defenders of (DN-C) typically identify laws with regularities. “Repetition is all that distinguishes the causal law from a mere coincidence” (Reference ReichenbachReichenbach 1951, p. 158).Footnote 17 If the argument were valid without any laws of nature, but only by virtue of logic, E would follow from C by conceptual or logical necessity. The link between C and E would be tighter than a causal link, which is traditionally thought to be contingent.Footnote 18
Like other theories that take regularities to be constitutive of causal relations between particular events, the DN account of causation conflicts with the intuition that causation is an intrinsic relation.Footnote 19 A property P of an object x is intrinsic if and only if the question whether x has P depends only on x itself and not on x’s relations to other things. The relation R between x and y is intrinsic if and only if the question whether x and y stand in relation R depends only on x and y, and not on their relations to other things.
The DN account of causation has eventually been abandoned because non-causal dependence can satisfy the conditions for (DN-C). According to (DN-C), the existence of a law (or laws) to the effect that all As are Bs (expressed by the formula (x)(Ax→Bx)) is necessary for an event of type A to cause an event of type B.Footnote 20 This has been called the “Principle of the Nomological Character of Causality.”Footnote 21 However, the reverse implication does not hold. The law that all As are Bs is not sufficient to conclude that events of type A cause events of type B. So-called laws of coexistence or association are counterexamples that show that not all DN explanations are causal.Footnote 22 Physics has many laws expressing one quantity as a mathematical function of others. The law of the simple pendulum says that the period T of a simple pendulum is proportional to the square root of its length L ( ), where g represents the acceleration of massive objects near the surface of the Earth. However, “one surely would not say that that the pendulum’s having a period of two seconds was caused by the fact that it had a length of 100 centimeters” (Reference HempelHempel 1965, p. 350). It seems even more obvious that the period of a pendulum is not among the causes of its length.
According to the ideal gas law, expressed by the formula PV=nRT, the product of the pressure (P) and the temperature (T) of an ideal gas is proportional to the volume (V) occupied by the gas, and to the quantity of gas, measured in moles (n), where R represents the ideal gas constant. Given a fixed volume occupied by a gas, maybe the gas’s pressure is a cause of its temperature and maybe its temperature is a cause of its pressure, but both are certainly not true at the same time for the same sample.Footnote 23
The law of the simple pendulum and the ideal gas law express mutual functional dependencies between different variables characterizing some physical system at a time. They are “laws of association” or “laws of coexistence,” in opposition to laws of succession. In the case of equations expressing the mutual functional dependence between variables, nothing in the law provides any ground for the asymmetry of cause and effect, that is, for taking some of the variables figuring in the equation to correspond to a cause and other variables to correspond to an effect. Furthermore, assuming transitivity, some variables would indirectly cause themselves. If pressure caused temperature and temperature caused pressure, the temperature of a sample of gas at a given moment would indirectly (by way of causing its pressure) cause itself.
Here is another sort of law of association whose instances do not yield instances of causation. The solubility of salt in water is grounded on its molecular structure (NaCl) and on the chemical interactions between sodium (Na+) and chloride (Cl−) ions and water molecules. However, the ions composing a given sample of salt do not cause the salt’s solubility. The dependence of the chemical properties of a macroscopic sample of substance on the properties of its molecular and atomic components is tighter than causal dependence. Tighter still is conceptual dependence. High-density cholesterol is cholesterol, but the high-density cholesterol in someone’s blood isn’t a cause of the cholesterol in her blood, because the cholesterol is the union of high-density cholesterol and low-density cholesterol.Footnote 24
All this shows that causation cannot be directly reduced to the application of a law to a particular situation. Here is another way of expressing the point. If (DN-C) were interpreted as an implicit definition of “causation,” all scientific explanations would be causal. This would result in modifying the sense of the word “causal” so that it becomes equivalent to “scientific.” By depriving the concept of causation of its specific content, such a justification of the use of the concept of causation would be only verbal; it would be equivalent to its elimination.
Reference HempelHempel (1965, p. 352) suggests that a sufficient condition for causation can be obtained by restricting the laws mentioned in (DN-C) to laws of succession: If a (of type A) occurs and there is a law of succession “all As are followed by Bs,” then a causes b (of type B). However, this restriction doesn’t fill the bill. Depending on how “is followed by” is interpreted, it makes the analysis either tautological or false.
1) If “is followed by” is taken to mean “is followed causally by,” the analysis is circular: “If a occurs and there is a law of succession according to which all As are causally followed by Bs then a causes b.”
2) If “is followed by” is taken to mean (1) all As are Bs, and (2) if some a is A at t1, then at some t2 >t1, there is a b that is B at t2, then the analysis is refuted by any stable system to which a law of association applies. If a is A at t1, then if the system is stable (in the sense that the values of variables A and B do not change between t1 and t2), then a is A at t2. If As are Bs by virtue of a law of association, then if a is A at t2, a is B at t2. By transitivity, if a is A at t1, a is B at t2. However, there is no reason why non-causal dependence and stability together would make for causation.
It remains to be seen whether there is another interpretation of “law of succession” that makes the criterion avoid both circularity and refutation and that could be used in a sufficient condition for causation. Differential equations such as F=ma do not provide a sufficient condition for causation either. In a deterministic framework, the equations of motion that determine future states of a system on the basis of its present state also determine past states of the system on the basis of its present state. Let us call “Galileo’s law” the law according to which all massive objects that fall near the surface of the Earth, undergo constant acceleration.Footnote 25 From Galileo’s law and the speed of a body at time t, one can determine its speed at both later and earlier times. However, the present state can only be the cause of later states of the system, not of earlier states.
Reference HempelHempel (1965, p. 353) offers another reason for which the existence of a law does not, together with initial conditions, provide a sufficient condition for causation. Take a beam of light that travels from point A to point B on a path that crosses an interface separating one optical medium from another at some point C; say the beam enters from air into water. Fermat’s principle of least time can be used to calculate the position of C, making the time for the light to get from A to B a minimum. (DN-C) yields the wrong result that the light’s being at B is among the causes of the beam’s crossing the interface at C.
To sum up our critical examination of regularity theories of causation, there is a deep link between causation and lawful regularities but causation cannot simply be reduced to lawful regularity. Regularities are observable indicators, or fallible symptoms, of lawful dependencies and causal influences. We learn that flames burn by noticing that burning systematically follows getting too close to a flame. We infer that fluorine causes increased resistance to tooth decay from the existence of a correlation between a high concentration in fluorine in drinking water and lower incidence of tooth decay. The DN theory is an improvement over a simple regularity theory of causation, in that nomological deducibility is a more reliable indicator of causation then mere regular co-occurrence. But from a metaphysical point of view, regularity theories make the mistake of confusing a fallible criterion used as a means for getting knowledge about causation, with causation itself.Footnote 26 Causal influence of one factor on another is one thing, and our means for finding out and getting knowledge about that causal influence is something else.
2 Probabilities and Bayes Nets
We have seen that (DN-C) does not provide a sufficient condition for causation: facts that are related by valid nomological deduction are not always causally related. In statistics, great efforts have been deployed to develop an account of the conditions under which causal conclusions can be derived directly from observations (or data) without any use of the controversial notion of a law of nature. The idea is to construct criteria for causal dependence on the basis of conditional probabilities, which are estimated on the basis of the observation of relative frequencies.Footnote 27 The central hypothesis of statistical analyses of causation is that the fact that the probability of factor B is higher (or lowerFootnote 28) in the presence of factor A than in its absence, is a (fallible) indicator of the existence of a causal influence from factor A on factor B.
The attempt of finding statistical methods for discovering relations of causal influence has one big advantage over (DN-C), over and above avoiding the notion of law. It naturally accounts for the fact that causal influence manifests in observable data only statistically, not universally. Smoking causes lung cancer, but not every smoker gets lung cancer, and some nonsmokers do. In complex situations, which are characteristic of all sciences outside fundamental physics, each case is different and a large number of factors contribute positively or negatively to such an eventuality as the development of lung cancer in a particular person. Furthermore, contemporary fundamental physics contains irreducibly probabilistic processes such as radioactive decay, where only a certain percentage of nuclei undergo the process of decomposition. The effect of the radioactive decay of a given particular 14C nucleus, that is, the coming into existence of a 14N nucleus, cannot be deduced by a DN argument from the presence of the 14C nucleus, together with laws. Only its conditional probability can be deduced.
Most proposals for analyzing causation in terms of conditional probabilities do not even attempt to develop an analysis of causation at the level of particular events.Footnote 29 Even if it could be established on purely statistical grounds (which is not the case, as we will see), that “smoking causes lung cancer,” such a causal influence at the level of populations does not entail that a particular smoker will get lung cancer, or if she gets cancer, that this cancer was caused by her smoking. The causal influence of S on C, at the level of population level factors, is often called “type-level causation,” or “causation between types.” The relation between type-level causation and causation between particular events falling under these types, or “token-level causation,” is controversial. The type-level causal influence of S on C is neither necessary nor sufficient for the token-level fact that person a’s smoking causes her to get cancer.
All attempts to analyze causation in terms of conditional probabilities start with the requirement that the cause be correlated with the effect. A necessary condition for S to be a (type-level) cause of C, is that P(C/S)>P(C) or, equivalently, P(C/S)>P(C/non-S). However, correlation is not sufficient for causation. If both lung cancer (C) and cardiovascular disease (D) are due to a common cause, such as smoking (S), then C and D can be correlated although none of C and D has any direct causal influence on the other. Reference ReichenbachReichenbach (1956, p. 159) has proposed to characterize the concept of common cause in purely probabilistic terms, with the help of the concept of a screening factor. If C and D are correlated so that P(C/D)>P(C/non-D), but if there is no correlation once the probabilities are evaluated conditional on factor S, to that P(C/S&D)=P(C/S&non-D), S is said to “screen off” C from D.Footnote 30 This is indeed an indicator of the fact that S is a common cause of C and D, which explains C and D’s correlation. However, it is only a fallible indicator, in the sense that the fact that C is a screening factor for A and B is not sufficient for C being a common cause of A and B.Footnote 31
The inference from correlation to causation requires the absence of screening factors. However, it is not possible to obtain a sufficient condition for causation in purely probabilistic terms by using the criterion of the absence of screening factors. Either the absence of screening factors is explicitly expressed as a causal condition or not. In the first case, as in Reference Cartwright and CartwrightCartwright’s (1979, p. 423) principle: (CC) C causes E iff P(E/C&K)>P(E/K) for all other causes K of E that are not intermediate between C and E, we don’t have an analysis of causation in terms of probabilities alone because the analysans contains information about causes.
In the second case, the requirement that there are no screening factors is expressed in purely probabilistic terms. This doesn’t provide an appropriate analysis of causation either, for the following reasons. (1) It yields a requirement that is empirically unrealistic: conditionalizing on all other factors means comparing populations that are homogeneous with respect to all other factors.Footnote 32 Such populations do not exist outside of fundamental physics. (2) If we could somehow observe homogenous populations (homogenous except for the factors C and E), we wouldn’t find probabilities different from 0 and 1 except if factors C and E belong to fundamental physics.Footnote 33 Intermediate probabilities (e.g., in medicine or social sciences) stem from the fact that empirically accessible populations are not homogeneous.
Now, here is the main problem we face in all situations where frequencies are measured in populations that are not homogeneous: as long as conditional probabilities are estimated from the observation of non-homogeneous populations, in other words, as long as not all factors different from C and E are held fixed, measures of conditional probabilities are always provisional. Judgments of comparative conditional probability are always at risk of being reversed once new factors are taken into account. Such reversal of conditional probabilities is known as Simpson’s reversal or Simpson’s paradox. Here is an example.Footnote 34 A study on thyroid disease found that the survival rate over twenty years (L) of smokers (S) was higher than that of non-smokers. However, it would be mistaken to conclude from P(L/S)>P(L/non-S) that smoking is a causal factor increasing the chance of survival. Age turns out to be a so-called confounding factor. Once frequencies are calculated conditional on age, the inequality is reversed: In six out of seven age groups, the rate of survival was lower for smokers than for non-smokers, and the difference was minimal in the seventh.
This means that we can safely judge whether the higher conditional probability of E given C than given not-C is due to the causal influence of C on E, only if we know all confounding factors. However, confounding factors are typically common causes of C and E. In other words, it is possible to extract information about causal influence from information about conditional probabilities only on the basis of causal knowledge. This is often expressed by the slogan “no causes in, no causes out.”Footnote 35
The development of Bayesian networks from the 1980s has much improved statistical methods for discovering (type-level) causal relations between variables. Bayes nets provide powerful means for estimating conditional probabilities on the basis of other probabilities. A Bayes net is a model, which can be represented as a graph that consists of (1) nodes, occupied by variables representing features of target systems, (2) directed edges linking some of these nodes to others, and (3) a probability distribution of each variable conditional on its “parents” in the graph. The set of parents of a variable A is the set of variables that are at the origin of an edge pointing toward A. Bayes nets are constructed on the basis of that probability distribution, together with a fundamental assumption, the “Markov condition.” The Markov condition says that each variable is probabilistically independent of its non-descendants, conditional on its parents in the graph. In other words, all correlations between variables that are not related as cause and effect are due to parent variables, and disappear once the probabilities are calculated conditional on these parent variables. Bayes nets are required to be acyclic, in the sense that they must not contain any circular path, where a path is a sequence of adjacent edges whose directions are aligned.
Here is an example.Footnote 36
In Figure 1, variable X5 has, for example, only parent variable, X4. By construction of the net, the probability of X5, given X4, is independent of all other variables, that is, X1, X2, and X3.
Although the links in a Bayes net are oriented (from parents to descendants or “children”), they become causal only by virtue of an interpretation. A Bayes net becomes a causal net by interpreting the arrows as representing direct causal influence between variables. In such a causal interpretation, the Markov condition becomes the causal Markov condition, according to which every variable in the net is probabilistically independent of its non-effects, conditional on its direct causes.Footnote 37 If a causal net is interpreted as representing a certain system or type of systems, one can read off from that causal net the relations of causal influence between variables. Whether the results are correct of some real system depends on several hypotheses: that the set of variables V represents all causally relevant factors, that the probability distribution P over these variables represents the physical probabilities of the factors represented by these variables, and that the causal net represents the smallest directed acyclic graph on V that satisfies the Markov condition with respect to P.
From a metaphysical point of view, what we want to know is whether causal nets are a reliable tool in the sense of representing relations of causal influence in reality. First, causal nets have been introduced as tools for discovering and representing causal influence between general factors, which provides no direct information about causation between particular events.Footnote 38 Second, powerful as they are, they constitute a tool that is as fallible as the hypotheses that are presupposed by their construction.
The Causal Markov Condition implies the following version of Reichenbach’s Principle of the Common Cause (Reference WilliamsonWilliamson 2009, p. 200):
(PCC) If variables A and B are probabilistically dependent then one causes the other or there is a set U of common causes in V which screens off A and B, that is, renders them probabilistically independent.
There seem to be many systems for which PCC is not correct because they contain factors that stand in non-causal dependence relations. Such dependence can be the consequence of logical, mathematical, or semantical constraints or of non-causal association laws; correlations may also exist in the absence of any common causes and any other constraint (Reference SoberSober 2001). Variables representing aspects of physical systems that are non-causally related can be probabilistically correlated although neither is a cause of the other and there is no characteristic that plays the role of a common cause so that it could be represented by a variable that screens them off from each other.Footnote 39
Examples may be found in quantum theory. It has been argued that EPR experiments in quantum mechanics give rise to correlations between the states of entangled particles that are not screened off by common causes yet cannot be causal either.Footnote 40 However, we can sidestep the controversial interpretation of quantum correlations. Here is Reference CartwrightCartwright’s (2007, p. 122) example of non-causally correlated variables whose correlation is not screened off by any common causes. Z is a chemical reaction that produces substance X and as a by-product substance Y. The probability of getting X (the valueFootnote 41 of variable X is 1; in short X=1) from Z (Z=1) is 0.8. We get Y (Y=1) if and only if we get X (X=1). Z is the common cause of X and Y. X and Y are correlated, yet the correlation does not vanish upon conditionalizing on the common cause (Z=1). P(X=1/Y=1&Z=1)=1> P(X=1 / Z=1)=0.8.
In Cartwright’s example, X=1 is not a cause of Y=1 although the variables X and Y are probabilistically correlated and although their correlation is not screened off by their common cause Z. Such examples show that probabilistic correlation that persists, once the probabilities are calculated conditional on common causes, is not sufficient for causation.
However, it would be a mistake to conclude that they refute the Causal Markov Condition (CMC). The CMC is a condition that holds by convention, in the sense that it is an assumption that is used to construct causal nets.Footnote 42 What Cartwright’s case shows is that the assumptions built into causal nets, and the CMC in particular, make it problematic to draw realistic conclusions on the causal influence relations in real systems, from causal nets built on the basis of those assumptions. In other words, if Cartwright is correct, then our conventions may prohibit us from constructing a causal net which models the world’s actual causal structure.
Maybe a model such as Cartwright’s, containing only variables X, Y, and Z, which does not satisfy the CMC, could in principle be completed (Reference PearlPearl 2000, p. 62) by the addition of some hitherto unknown and unobserved variable intermediate between Z, X, and Y, to yield a model that satisfies CMC. However, there is no guarantee that one can always find such variables that can be interpreted as representing features of the real systems that are the target of the model.
A similar reply is that one should avoid choosing variables that are non-causally related.Footnote 43 The PCC holds of all models that are constructed without such variables. However, such a rule would undermine the prospect of using causal nets for the search for causal relations in real systems. For it would mean that we could use the PCC as a criterion for causal relations in real systems only if we knew beforehand (1) which variables to include in the model because they might be causes of the variables whose causes we are looking for and (2) which dependence relations are causal and which are not. If we have that knowledge before we even start building our models, we can build models using only causally related variables (satisfying the CMC). Statistical correlation without any common cause represented in the model can then be used as a sufficient condition for causation in the systems represented by the model.Footnote 44
The hypothesis that probabilistic correlation is a necessary condition for causation is problematic too. Some real systems contain causal influence relations that are not reflected in the corresponding conditional probabilities. We will look at an example illustrating this in a moment. First note that although, in general, positive causal influence of factor C on factor E increases the probability of E, given C, that is, P(E/C)>P(E/not-C), and negative causal influence of C on E manifests as P(E/C)<P(E/not-C), this is not always the case. Take the radioactive decay of nuclei of the isotope 223Fr of Francium,Footnote 45 represented in Figure 2.Footnote 46
99.994% of 223Fr decay into 223Ra, whereas the rest (0.006%) decays into 219At. Both 223Ra nuclei and 3% of 219At nuclei then decay into 219Rn. Take a nucleus of 219At which is at the causal origin of a nucleus of 219Rn. It is much less likely to obtain a 219Rn, given a 219At nucleus, than it would have been if it had been a 223Ra nucleus. After all, only a small portion of 219At become 219Rn (97% of 219At become 215Bi), whereas all 223Ra become 219Rn. This is a case where 219At has a positive causal influence on 219Rn, in the sense that the former nucleus is the causal source of the latter. However, this positive causal influence is not reflected in a higher probability of the effect given the cause, so that probability raising is not necessary for positive causal influence: The probability of having an effect of type 219Rn is lower given the cause (a nucleus of type 219At) than given the alternative possibility that the nucleus be of type 223Ra.Footnote 47
It is neither necessary that the probability of the effect is higher given the cause (as we have just seen) nor that it is lower given the cause. Some causes are probabilistically independent of their effects. Take a roulette wheel without a “0.” B is the event of throwing a ball in the roulette; B takes value b0 if the ball had fallen on black after the preceding throw; B takes value b1 if it had fallen on red after the preceding throw; variable C takes value c if the ball falls on a pair number. Within a framework specified by these variables, B is the only possible cause of C=c, with P(c/b0)=P(c/b1)=1/2. Here B is a cause of C but B and C are probabilistically independent.Footnote 48
Moreover, many (and probably most) causal interactions at the level of particular events are conceived by concepts which do not give rise to any correlations. Here is Dowe’s example: “Two men collide in the corridor. One gets a bruise on his arm, the other drops his papers. […] No one would expect to find any useful statistical correlations between bruises and sets of disordered papers” (Reference DoweDowe 1992, p. 206).Footnote 49
Causal influence relations between variables do not always manifest in the form of the raising or lowering of the probability of the effect, conditional on the cause. Is this an obstacle to the use of causal nets to represent causal relations in real systems? The situation is similar to the problem raised by non-causal relations that refute the PCC. Causal influence without probability raising is excluded from causal nets by stipulation, by means of the so-called Causal Faithfulness Condition (CFC). A model satisfies the CFC if and only if every conditional independence relation true in the probability distribution P corresponding to the graph (V,E), where V is a set of variables V and E a set of edges, is entailed by the CMC (Reference Spirtes, Glymour and ScheinesSpirtes et al. 2000, p. 31).
The CFC is another condition that is, in addition to the Causal Markov condition, stipulated as part of the standard algorithm for the construction of causal nets from data on probability distributions over a set of variables. The Causal Markov condition roughly says that all statistical correlations between variables X and Y are either due to the fact that X causally influences Y or to the fact that X and Y have common causes. The Causal Faithfulness Condition roughly says that, if variables X and Y are probabilistically independent, neither causally influences the other and they do not have common causes. The Faithfulness condition rules out the possibility, among others,Footnote 50 that X causally influences Y through two different paths whose effects exactly cancel each other out, in such a way that the total effect of X on Y is null. It is controversial whether this can happen in real systems. On one hand, external perturbations (i.e., noise) that influence each variable independently of the others make violations of the CFC by exact cancelation very improbable.Footnote 51 On the other hand, given that the values of the variables characterizing real systems can be measured only with finite precision, the cancelation of measured values seems possible. Indeed, many biological and artificial systems possess features that are designed to remain stable under external influences.Footnote 52 Let T represent the temperature of an animal’s body, H the heat in the environment, and C the state of a cooling mechanism. If C works well, the causal influences of H and C on T will cancel out, so that T remains stable under variations of H. If the values of these variables are discrete and correspond to measures with finite precision, it may well be that the cancelation is exact, so that H and T are not correlated.Footnote 53
Let me conclude this section on the link between probability raising and causation with the following case.Footnote 54 Consider an atom that has a very low probability, say .01, of decaying during a given time interval. The atom is hit by a neutron, which raises its probability of decaying in that interval to .99. The probabilities can be interpreted both as epistemic and, by the lights of contemporary physics, as physical chances independent of our knowledge. The collision of the atom with the neutron raises its probability of decaying although the collision is neither necessary nor sufficient for its decaying. The probabilities alone provide no reason why the fact that the collision raises the probability of the decay should entail that it causes the decay. True, the collision made the decay more probable, yet nothing precludes the possibility that the decay was exclusively due to factors internal to the atom, and thus that the collision was not the cause of the decay.
The fact that one factor A raises the probability of a second factor B, independently of the background as in the case just considered, or against the background of a causal net, is a strong but fallible indicator of the fact that A causally influences B. It is fallible indicator in the sense that A may not have any causal influence on B, although A raises B’s probability, and A may causally influence B although this isn’t manifest in the probabilities.
3 Necessitation between Universals and Causal Powers
If regularities and statistical correlation are only fallible indicators of causation, what is causation itself? Let us look at attempts to analyze causation with the help of specifically metaphysical concepts, such as necessitation between universals and causal powers.Footnote 55
David Armstrong analyses singular causation in terms of the instantiation of structural universals.Footnote 56 Every case of causation results from the instantiation of a causal law; and a causal law is a relation of necessitation of one universal by another. “The fundamental causal relation is a nomic one, holding between state-of-affairs types, between universals. Singular causation is no more than the instantiation of this type of relation in particular cases” (Reference ArmstrongArmstrong 1997, p. 227). Armstrong gives the example of “guillotining (that state-of-affairs type) causing immediate decapitation (a further state-of-affairs type). It is a second order state of affairs, a relation holding between the universals involved. This second order state of affairs must itself be a universal, a structural universal involving a certain linking of universals, a linking of state-of-affairs-types” (Reference ArmstrongArmstrong 1997, p. 226/7).
The postulate of universals is a controversial metaphysical hypothesis, which we cannot evaluate here. Let me mention three difficulties that stand in the way of the possibility of explaining causation in terms of universals. (1) Even if universals are accepted, the postulate of a second order universal of causal necessitation relating one universal (guillotining) to another (decapitating) raises the worry that Armstrong’s theory explains something familiar, that is, causation between states of affairs, in terms of something that is much more mysterious than what it is supposed to explain. If we don’t know what causation is at the level of singular events or states of affairs, does it help to learn that it is the instantiation of a causal relation between universals? (2) The “instantiation” of causation between universals by causation between instances of those universals raises what van Fraassen has called the “inference problem” (Reference LewisLewis 1983, p. 366; Reference van Fraassenvan Fraassen 1989, p. 96). How can laws, as relations between universals, explain regularities at the level of singular instances of these universals? (3) Armstrong’s theory also suffers from a more specific problem that it shares with the DN account: given that it takes all laws to be causal, it cannot account for non-causal association laws.
Others have suggested explaining causation in terms of the manifestation of causal powers (Reference Bird and MarmodoroBird 2010; Reference Mumford, Lill Anjum and MarmodoroMumford and Anjum 2010, Reference Mumford and Lill Anjum2011). The metaphysical theory of causal powers (Reference EllisEllis 2001; Reference MolnarMolnar 2003) challenges the assumption that events are metaphysically independent of one another, and that causation is an external relation between them. Rather, causation is built into the natural properties constituting events. These properties are causal powers that are, by their very nature, disposed to manifest in specific ways, according to the circumstances. According to the power theory of causation, the relation between cause and effect can be analyzed in terms of the relation between a power and its manifestation. Given that powers are causal by definition, such an analysis is not reductive. Mumford and Anjum’s version of this theory makes several original and controversial claims. Causes do not necessitate their effects, in the sense of being sufficient conditions for them, although they necessarily give rise to a tendency toward their effects. This tendency is a sui generis form of modality, irreducible to contingency and necessity (Reference Anjum and StephenAnjum and Mumford 2018a). Causes and effects are always simultaneous, and causation is non-symmetric and non-transitive (Reference Mumford and Lill AnjumMumford and Anjum 2011). The combined action of several powers can be represented in terms of the addition of vectors. Causation can be directly perceived, in particular in proprioception. I can do justice here neither to the metaphysics of powers nor to these specific theses about causation. However, it can be challenged whether it is possible to account for causation that extends over time, on the basis of the thesis that all causation is simultaneous (Reference ChakravarttyChakravartty 2013; Reference McKitrick, Mumford and AnjumMcKitrick 2013). It can also be challenged whether the analogy between the combination of causes and vector addition is satisfactory, given that causes often interact, in the sense that the effect of several causes acting together is not simply the cumulative result of the addition of the effects of each of them taken individually (Reference Fenton-GlynnFenton-Glynn 2012; Reference ChakravarttyChakravartty 2013).
4 Counterfactual Dependence
Some correlations do not correspond to any causal influence (and some cases of causal influence do not manifest in correlations). What distinguishes those correlations that are grounded on causal influence from those that are not?
David Hume developed the first regularity theory of causation. However, in one of his definitions he characterizes the relation between cause and effect in counterfactual form. “We may define a cause to be an object, followed by another, and where all the objects similar to the first are followed by objects similar to the second. Or in other words where, if the first object had not been, the second never had existed” (Reference Hume, Selby-Bigge and NidditchHume 1777/1975, p. 76, italics in original). Reference Lewis and LewisDavid Lewis (1973b/1986) and othersFootnote 57 have developed this counterfactual condition, which Hume presents as an alternative but equivalent formulation of the regularity theory, into a new theory of causation. Counterfactual theories of causation are motivated by the strong intuitive link between causation and certain counterfactual statements. Event c is a cause of event e if and only if c makes some difference to e, and difference-making can be expressed in counterfactual form: If c had not occurred, e would not have occurred.Footnote 58
The counterfactual approach to causation between particular events takes causation to be equivalent to a certain form of dependence. True, causation is a form of dependence. However, not every dependence is causal. Counterfactual theories of causation have attempted to find appropriate additional constraints on dependence in order to obtain a sufficient condition for causation. These constraints result from a reflection on cases of dependence that seem clearly non-causal; the aim is to construct an account that avoids those counterexamples.
One constraint on cause and effect is that they are distinct particular events in the sense that they do not share any part. This allows for avoiding the result that the following cases of (counterfactual) dependence be wrongly considered as causal, which they are not. If I had not spoken this sentence, I would not have spoken the first half of it.Footnote 59 The events of my speaking this sentence and of my speaking the first half of it are not distinct because the second is a part of the first. Without the constraint according to which cause and effect must be distinct events, this would be a “false positive,” a counterexample, which the counterfactual theory of causation wrongly categorizes as causal.
Other constraints have been introduced to avoid that later events be wrongly considered as causes of earlier events although there seem to be many cases of so-called “backtracking” counterfactual dependence.
(1) Let us suppose that there are situations in which e1 causes e2 in such a way that the occurrence of e1 is a sufficient condition for the occurrence of e2. Nothing could prevent e2 once e1 has happened. A plausible case may be two successive events e1 and e2 on the trajectory of a particle traveling in a straight line through empty space. If e1 is a sufficient condition for e2, e2 is a necessary condition for e1: if e2 hadn’t happened, that can only be because e1 has not happened. So it would seem that the backtracking counterfactual ¬e2→¬e1 is true. Without additional constraints, this is a counterexample that refutes the analysis because e2 is not a cause of e1.
(2) Cases of counterfactual dependence between so-called epiphenomena are another source of counterexamples. If c is a common cause of e1 and e2, then there can be counterfactual dependence of one effect, e2, on the other e1. Let c represent a drop in atmospheric pressure, e1 a drop in my barometer’s reading, and e2 a storm. If c is a sufficient condition for e1 and c is a necessary condition of e2, then ¬e1→¬c and ¬c→¬e2, and it may also be the case (though there is no valid inference to this effect) that ¬e1→¬e2. The storm may be counterfactually dependent on the fall of the barometer but it is not caused by it.
Both problems can be solved if backtracking counterfactuals are excluded. This requirement may be added as an additional constraint, or it may be argued that it results from the semantics of counterfactuals, that is, that it is implicit in the meaning of counterfactual statements.Footnote 60
Additional constraints are also required to yield a form of counterfactual dependence that is necessary for causation. Without such constraints, there are cases of causation where the effect does not counterfactually depend on its cause. These are “false negatives,” that is, counterexamples that the counterfactual theory of causation wrongly categorizes as not causal.
If a cause is accompanied by a backup event, the effect is not counterfactually dependent on the cause, for the effect would have occurred in the absence of the cause, thanks to the presence of the backup. Redundant causation corresponds to situations where both events are effective in bringing out the effect. In situations where the backup cause is not effective, the effective cause is said to “preempt” the backup cause. The backup cause has the potential to cause the effect in the absence of the effective, preemptive cause. There is counterfactual dependence of the effect on the cause neither in cases of redundant causation nor in cases of preemption. Redundant and backup causes are widespread in biology. If the causal role of some organ A is vital, organisms will have higher fitness if they have a redundant or backup organ B that takes over whenever A is not well functioning. Each of our kidneys is redundant given the presence and functioning of the other. Here is a biological example of preemption.Footnote 61 Even in a healthy brain, neurons die at a certain rate, which increases with age. As dead cells and cellular debris accumulate, they harm surrounding cells, which in turn accelerates neuron death and causes neurodegenerative diseases such as Alzheimer’s disease. Microglia are cells that remove cellular debris from the brain. However, if microglia fail to accomplish their function, astrocytes, another kind of brain cell, step in to remove debris as a backup to microglia.
Cases of redundant causes and preemption point to a deep problem with all counterfactual theories. According to counterfactual theories, whether c causes e depends not only on c, e, and what happens in the spatiotemporal region between these events, but also on what goes on elsewhere. This contradicts the intuition that causation is intrinsic to the relata and the region between them.Footnote 62
Reference Lewis and LewisLewis (1973b/1986) has adapted the counterfactual analysis to this type of situation by requiring that there exist a series of intermediate events between the cause and effect so that each of the events in the series is counterfactually dependent on its predecessor. In cases of so-called “early preemption,” there is indeed no stepwise counterfactual dependence of the effect on the preempted (i.e., non-effective) backup cause. However, this solution raises the problem that it builds transitivity into the concept of causation. It is controversial whether transitivity is part of the concept of causation.Footnote 63
Anyway, the requirement of the existence of an intermediate chain with stepwise counterfactual dependence cannot account for some types of preemption, in particular “late pre-emption” and “trumping pre-emption” (Reference Schaffer, Collins, Hall and PaulSchaffer 2000/2004),Footnote 64 which corresponds to the overdetermination of an event by two independent causes. To take Schaffer’s example of trumping preemption, if both the Major and the Sergeant give the Corporal at the same time the same order, for example, the order to advance, the behavior of the Corporal is overdetermined, in the sense that it is caused twice over. Given that the Major’s rank is higher, it is plausible that his order is the cause of the Corporal’s moving, but this effect does not counterfactually depend on its cause.
The counterfactual approach has mostly been interpreted by its advocates as belonging to conceptual analysis. The intuition that our concept of causation has at least a strong counterfactual component provides enough motivation to develop the account so as to cope with counterexamples. It is indeed plausible that the semantics of causal statements can be analyzed in terms of counterfactuals, whose truth conditions can in turn be analyzed in the framework of possible worlds. The psychological mechanisms of reasoning about causation also seem to involve reasoning about counterfactual situations.Footnote 65 However, the role of counterfactuals for metaphysics is more controversial. Other possible worlds can be interpreted not only as a semantic tool but also as what makes true counterfactual statements in a metaphysical sense.Footnote 66 Alternatively, the truth-makersFootnote 67 of counterfactuals may be taken to be part of the actual world: either laws of nature, or powers or potentialities.Footnote 68
5 Interventions and Causal Models
A strong motivation for searching knowledge about causation is that such knowledge enables us to control things. If I know that A causes B I can exploit that knowledge to obtain B by manipulating A. The same connection allows us, in the other direction, to obtain causal knowledge by manipulation. Observing the effects of our own manipulations plays an important role in the psychological mechanisms that allow humans to acquire causal knowledge (Reference Gopnik and SchulzGopnik and Schulz 2007; Reference WaldmannWaldmann 2017). In experimental sciences, interventions are a crucial method for discovering causal influences, in addition to the observation of correlations. Under certain conditions (to which we return shortly), if an experimenter manipulates a variable C and observes a subsequent variation in variable E, this indicates that C causally influences E.
Structural equations (SE), a formal tool first introduced in genetics (Reference WrightWright 1921) and econometrics (Reference HaavelmoHaavelmo 1943), can be used to build models of causal structures on the basis of information obtained in this way, completing the models built in the framework of Bayes nets.Footnote 69 The philosophical analysis of such causal models integrates insights of older philosophical theories of causation in terms of manipulation by a free agent, according to which “an event A is a cause of a distinct event B just in case bringing about the occurrence of A would be an effective means by which a free agent could bring about the occurrence of B.”Footnote 70 However, in that form, such an account suffered from two major defects: circularity and anthropocentrism. The latter is implicit in the thesis that an event can be a cause only if its occurrence can be the result of the decision of a free agent.Footnote 71 We will see shortly that recent accounts in terms of causal models replace the concept of manipulation by the concept of intervention, which is not tied to free action, thus avoiding anthropocentrism. As for circularity, it seems impossible to build a non-circular analysis of causation that is grounded on manipulation or intervention, insofar as both are forms of causation. Recent interventionist theories of causation such as Reference WoodwardWoodward’s (2003) are explicitly intended not to be reductionist.Footnote 72
Each of the directed edges in a causal net (as introduced earlier) represents the direct influence of a “parent” variable on a “child” variable. The dependence of a variable on its parent variables can also be represented, in a more precise way, by a structural equation. SEs provide an alternative and complementary tool for the construction of causal models, aside from probability distributions, which we saw earlier. Contrary to ordinary equations, SE are defined to be asymmetrical: the SE of an “endogenous” variable V expresses the functional dependence of V on other variables in the model. A variable is called “exogenous” if its value is determined by factors external to the causal system whose model is being built.
SE have an interventionist interpretation and imply counterfactuals.Footnote 73 Let Y be a function of X, and let X and Y have the values X=x1 and Y=y1 in some actual situation. The SE Y=f(X) entails that if X had some value ≠ x1, for example, x2, Y would have the value Y=y2=f(x2), and it entails that if an intervention set X to value x2, Y would take value y2.
Let me illustrate the construction of a causal net on the basis of structural equations with the biological case of preemption mentioned earlier, which presents a challenge to earlier counterfactual accounts.
A simple deterministic model can be built with the following binary variables:Footnote 74
MA=1 microglia active; MA=0 microglia not active;
MR=1 microglia remove debris; MR=0 microglia don’t remove debris;
AA=1 astrocytes active; AA=0 astrocytes not active;
AR=1 astrocytes remove debris; AR=0 astrocytes don’t remove debris;
BW=1 brain without debris; BW=0 brain with debris.
Each endogenous variable is associated with a SE. To apply the model to a situation, one needs to specify the values of the exogenous variables. Set MA=1 and AA=1. The value of the endogenous variable MR is then determined by the value of MA, according to the SE MR=f(MA)=MA. If the microglia are active, they remove the debris (MA=1 and MR=1) and if they aren’t, they don’t (MA=0 and MR=0).
The preemption of the process beginning with active astrocytes is expressed by the SE for AR: AR = min (AA, 1-MR). The astrocytes remove the debris only if (1) the astrocytes are active and if (2) the microglia haven’t removed the debris. The variable BW representing the state of the brain is also endogenous: BW = max (MR, AR). The brain is free of debris either because the microglia have removed the debris or because the astrocytes have removed them.
The content of the set of SE can be represented in a graph (Figure 3), where each variable corresponds to a node in the graph. An arrow from variable X to variable Y represents the fact that the value of Y depends on the value of X; in this case, X is called a “parent” of Y. A directed path from X to Y is a sequence of adjacent arrows leading from X to Y.
Each arrow and each SE represents a set of counterfactual conditionals. Once a model is constructed, it can be used to determine the truth-value of new counterfactuals that do not simply correspond to one arrow. Say we want to know what would have happened if the microglia, though active, would somehow not have removed the debris. To find this out, one sets the variable corresponding to the antecedent of the counterfactual to the value it would have if the antecedent were true. In this case, one sets MR=0. This represents an “atomic intervention” (Reference PearlPearl 2000, p. 70). One does not take into consideration the past that might have led to the truth of the antecedent. Rather, the value of the antecedent (here, MR) is set while the values of all variables corresponding to the past of the antecedent keep the values they actually have. In the graphical representation, this means that all arrows leading to the variable MR are erased, which is equivalent to transforming MR into an exogenous variable.
In the interventionist interpretation of this formalism, this corresponds to a localized experimental intervention on variable MR, which originates from outside the system and is direct in the sense that it is not obtained indirectly by intervening on factors that influence MR within the system.Footnote 75
Just as Lewis’ miracles, this conception of interventions guarantees that no “backtracking” counterfactual can be true. When the value of variable X is modified, the variables situated upstream from X are left untouched. In the standard representation, these are the variables figuring at the left of X. The values that the variables downstream from (i.e., to the right of) X take in a situation in which X takes the stipulated value can then be determined on the basis of the equations corresponding to the arrows starting at X.
Reference PearlPearl (2000, p. 70) defines the causal effect of X on Y, written “P(y/do(x)),” as the probability distribution of the different values y of Y, given that an intervention (“do”) has set variable X to the value x. This has the consequence that all factors different from X that also influence Y are included in X’s impact on Y. To avoid this result, Reference WoodwardWoodward (2003, p. 98) imposes additional constraints on interventions I appropriate to determining whether X causes Y.Footnote 76 (1) I causes X. (2) I is the only cause of X, in the sense that all other influences on X are cut. (3) I does not cause Y through any paths that do not go through X. Say we want to find out whether the calculating activity of my computer (X) heats it up (Y). If I, the intervention of switching the computer on, not only triggers X but also turns on the cooler Z, I is not an intervention on X in the technical sense of Woodward’s conditions, for whether the value of Y is changed by I does not only depend on X but also on Z. In particular, the fact that Y doesn’t vary when I varies (because the heating influence of X on Y is offset by the cooling influence of Z on Y) does not justify the conclusion that X doesn’t influence Y. (4) I is statistically independent of all variables Z that influence Y through paths that do not go through X. If, in order to find out whether the indication X of a barometer causes thunderstorms Y, my interventions I on X depend on (my knowledge of) air pressure Z, then Y may vary as a function of the values that I imposes on X, whereas X does of course not cause Y.
Causal models built from SE can be used to define different concepts of causal influence. The concept of an actual cause is illuminating in situations of preemption.Footnote 77 In the situation represented by Figure 3, BW depends counterfactually neither on MA nor on AA. However, it would be wrong to conclude that neither MA nor AA causes BW. Indeed, in each situation, either one or the other causes BW. The concept of actual cause can be used to justify the intuition that MA is the cause of BW in situations where MA=1, although BW doesn’t depend counterfactually on MA.
MA taking value 1 (MA=1) is an actual cause of BW=1. X=1 is an actual cause of Z=1 if and only if there is an “active causal route” (Reference HitchcockHitchcock 2001, p. 287) from X to Z in an appropriate causal model <V, E>. A “route” between X and Z in the set V is an ordered sequence of variables <X, Y1, … , Yn, Z> such that each variable in the sequence is in V and is a parent of its successor in the sequence. The route <X, Y1, … , Yn, Z> is active in the causal model <V, E> if and only if Z depends counterfactually on X within the new system of equations E’ constructed from E, by setting the values of all variables that do not lie on the route to their actual values (Reference HitchcockHitchcock 2001, p. 286).
In our example, MA=1 comes out as an actual cause of BW=1 because, if AR, which does not lie on the route from MA to BW is set to its actual value, that is, 0, BW counterfactually depends on MA.Footnote 78
The structural equations framework provides the means for distinguishing different causal notions that can all be expressed by the common-sense word “cause.” This shows the fecundity of this approach, although it cannot provide a non-circular analysis of causation. As we have seen earlier, a variable can influence another variable in two independent ways in such a way that these influences cancel each other out. Switching on (S) the calculating activity X of my computer raises its temperature Y,Footnote 79 but it also causes the onset of the cooling system Z, which lowers its temperature. It is possible that the positive direct influence of X on Y is exactly compensated by the negative influence of Z on Y, so that S has zero net influence on Y.Footnote 80 In such a case, it seems intuitively both correct to say that switching the computer on heats it up and that it doesn’t heat it up. However, this involves no paradox insofar as the two judgments contain different notions of cause.Footnote 81 The former is correct if “heats it up” is taken to express the concept of being a contributing cause, the latter is correct if “heats it up” is taken to express the concept of being a total cause. In the situation sketched, S is not a “total cause” of Y but S is a “contributing cause” of Y.
(TC) X is a total cause of Y if and only if there is a possible intervention on X that will change Y or the probability distribution of Y (Reference WoodwardWoodward 2003, p. 51).
(CC) X is a contributing cause of Y with respect to a set of variables V, if and only if (1) there is a directed path from X to Y so that each variable on the path directly influences its immediate descendant, and (2) there is an intervention on X that changes the value of Y if the values of all variables in V that do not lie on that path are held fixed at some value (Reference WoodwardWoodward 2003, p. 57).
Indeed, if we hold Z in our example fixed, we find that an intervention on X modifies the value of Y, so that S is a contributing cause of Y, although the application of condition (TC) shows that it is not a total cause of Y: if Z is not held fixed, switching on the computer does not make its temperature rise.
Older versions of the manipulability theory make the judgment “X causes Y” depend on the possibility of acting on X. This seems to make it impossible to apply the concept of causation to events that are in principle outside the sphere of influence of human interventions. However, eruptions of volcanoes and explosions of supernovae are causes although no possible human action could ever bring them about or modify them. The notion of intervention solves this problem because it is defined without any reference to human action. Analyses of causation in terms of structural equations and directed graphs avoid anthropocentrism because the intervention that sets the value of the putative cause need not be the result of a human action. Natural events entirely independent of intentional actions can satisfy the formal conditions on an intervention modifying the value of the putative cause. Such “natural experiments”Footnote 82 provide just as good a basis for judging causal influence as intentional interventions by human experimenters. In neuropsychology, the hypothesis that the activation X of one brain region causally influences the activation Y of another brain region is confirmed by the observation that a modification of X due to accident or illness is systematically followed by a modification of Y.
However, there seem to be causal relations on which even interventions as defined in this sense seem to be impossible. To judge whether the gravitational attraction of the moon causally influences the tides, one must examine the consequences of an intervention on the position or the mass of the moon. It can be doubted whether interventions on the moon are physically possible: such an intervention would require modifying the position or the mass of the moon by some means that does not also directly influence the tides.Footnote 83
The clearest limit to the applicability of the concept of intervention concerns cosmology. The possibility of an intervention on a system requires the system under study to be smaller than the whole universe because the origin of an intervention on some system must lie outside that system.Footnote 84
Interventions provide a strong epistemic criterion for causal influence. However, we have seen that it may not always be possible, even in principle, to carry out interventions in the required sense. Even if interventionist conditions are sufficient, they are not necessary for causation. In this respect, the interventionist account does not provide a complete metaphysical theory of causation.
6 Methodological Interlude
We have had a look at four reductive approaches to causation and at the framework of causal models, which can be interpreted with the conceptual tools of earlier accounts (lawful regularity, conditional probability, counterfactual dependence, intervention) so as to yield a powerful formal method for representing causal knowledge and the search for causes. Before I introduce and evaluate the approach to causation in terms of physical processes linking spatiotemporally localized particular events, let me make some methodological remarks.
6.1 Type and Token Causation
Approaches in terms of regularities, laws, counterfactuals, and manipulation were first introduced as theories of causation between particular events. However, accounts in terms of conditional probabilities represent relations of causal influence between factors or types of events. Causal models, on the basis of Bayes nets or structural equations, were also initially developed as representing causal influence between types of events. The variables that occupy the nodes of causal nets and that are related by SE have initially been interpreted as representing types of events. However, it is possible to build models of causal influence among particular events, with the help of the concept of actual causation. In models of actual causation, values of variables, rather than those variables themselves, play the role of the terms of causal influence.
In causal models, causation relates variables or values of variables. It may seem straightforward to conclude that “it is most perspicuous to think of causal relationships as relating variables” (Reference WoodwardWoodward 2003, p. 39; italics in original). However, in the context of a metaphysical enquiry into causation as a feature of reality, it is crucial to note that variables are a representational device, which are created and postulated together with models by human model-builders. If a model correctly represents some part of reality, these variables represent features of reality. If a model contains variables X and Y with a direct edge pointing from X to Y, then if the model represents reality correctly, there are features of reality represented by X and Y so that the feature represented by X causally influences the feature represented by Y. Causation can be studied both at the level of types of events and at the level of particular events. Causal dependence at the level of token events does not reduce to causal dependence at the level of types of events, yet these dependence relations are not independent of each other either.
No consensus has been reached concerning the notion of event that is most appropriate for understanding causation among particular events (Reference GallowGallow 2022). Events can be construed in coarse-grained and fine-grained ways. According to a coarse-grained construal, an event is the content of a space-time zone (Reference Quine, LePore and McLaughlinQuine 1985), whereas according to a fine-grained construal (Reference Kim, Brand and WaltonKim 1976), events are triples, constituted by an object, a property and a time. Take Reference DavidsonDavidson’s (1969/1980, p. 178) example of a metal ball that rotates and gets warmer simultaneously. According to the coarse-grained conception, the ball’s warming up and the ball’s rotating are two properties of the same event; whereas they are constitutive of two different events in the fine-grained conception. The structure of such fine-grained events is analogous to that of facts. One can distinguish between the fact that the ball warms up and the fact that it rotates; these facts (and the corresponding fine-grained events) have different causes and effects.
The distinction between type-level and token-level causation may shed light on the controversy over whether omissions and preventions can be causes. Is Alice’s omitting to water Ben’s plants a cause of their death?Footnote 85 It is much more natural to construe omissions as facts rather than as events. Causal propositions in which the cause and/or the effect is/are a negative fact(s) are incompatible with three intuitive properties of causation noted by Reference HallHall (2004): a causal process is local (in the sense that the cause is linked to the effect by an intermediate series of events), intrinsic (it does not depend on what happens or is the case elsewhere), and transitive. If a can cause b by omission or prevention, then certain causal relations obey neither to locality nor to intrinsicality nor to transitivity. Three (incompatible) consequences can be drawn from this.
1 Omissions and preventions are not instances of causation although they appear to us as such, because we often conflate causal and non-causal dependence and explanation or because we conflate causation and moral responsibility.Footnote 86
2 Omission and prevention are forms of causation, which means that locality, intrinsicality, and transitivity are not after all necessary conditions for causation (Reference Schaffer, Collins, Hall and PaulSchaffer 2000/2004).
3 There are two concepts of causation or two aspects of the concept of causation: One corresponds to dependence between variables (associated with nomological dependence, counterfactual dependence, and probability raising), the other corresponds to the existence of a transmission process at the level of particular events.Footnote 87 Omission and prevention are causal according to the first concept but not the second.
One way of resolving this debate is to interpret omission statements as expressing type-level dependence at the level of variables. Omissions and other negative facts can be represented by values of variables, just as “positive” facts. However, omission statements do not directly make reference to relations between particular events. Rather, they give counterfactual information about possible causal relations among events,Footnote 88 often with implicit information about norms or rules: Alice has promised to water Ben’s plants, and if she hadn’t omitted to water them, they wouldn’t have died. There is a difference making relation between the omission and the plants’ death, which violates the norms of promises.
The distinction between causation at the level of particular events and causal influence at the level of factors or types can also shed light on the question whether causation is intrinsic. A stone thrown against a window followed by the window’s breaking it is one of philosophers’ favorite examples of causation. The thesis that causation is intrinsic means that only facts determined locally at the spatiotemporal region stretching from the stone throw to the window breaking determine whether this is a case of causation, or merely a case of non-causal succession.
The thesis that causation is intrinsic is plausible for causal relations between particular events.Footnote 89 Whether or not c causes e is determined exclusively by spatiotemporally localized events c and e and the space-time region between them. However, accounts of causation in terms of difference-making at the level of types of events entail that causation is not intrinsic. This is clear for regularity theories: Whether or not the stone breaks the window is not determined only by what happens in the spatiotemporal surrounding of the stone’s impact and the window’s breaking, but by regularities concerning what happens to other events elsewhere in time and space.Footnote 90
Another controversial question is whether causation has a contrastive structure rather than the structure of a relation or a process between events (Reference SchafferSchaffer 2005). A contrastive causal claim is that c rather than c* causes e rather than e*. This idea can be analyzed in an interventionist framework, in terms of two interventions, the first setting the value of a cause variable to c, the second setting it to c*. The contrastive claim then expresses the fact that the first intervention leads to the effect variable taking value e, whereas the second leads to its having value e*.
6.2 Conceptual Analysis and Metaphysics of Science
Theories of causation may pursue one of several aims.Footnote 91 They may aim at analyzing (a) the common-sense (or “folk”) notion of causation (as in Lewis’ theory of causation in terms of counterfactual dependence), (b) representations of causation in human minds, together with the nature and psychological mechanisms of causal reasoning,Footnote 92 (c) the concept of causation as it is implicit in scientific methods, and made explicit in algorithms for constructing Bayes Nets and causal models, (d) what causation is in reality.
Here is a straightforward way to see the difference between theories that aim at analyzing the concept of causation and theories of what causation is in reality. Theories of the former sort are intended to apply not only to reality but also to possibilities: Our concept of causation puts us in a position to understand causation as it occurs in fairy tales, in science fiction novels and films, and more generally causation as it could be. We easily understand fairy tales in which magicians transform princes in frogs, even if their spells act at a temporal and spatial distance without any intermediate processes. Theories of causation as it is in the real world have nothing to say on magical causation, but they must at least in part be informed by contemporary science, whereas theories of concepts of causation are not so restricted. The distinction has implications for their respective criteria of adequacy: theories about the concept of causation are assessed by a priori criteria. They must be coherent in light of other concepts and intuitions; they can be refuted by thought experiments or situations featuring conceptually possible events and processes, such as magical spells, teletransportation, or time travel. They cannot be refuted by their incompatibility with what physics tells us about physically possible influences. By contrast, theories of causation as it is in the real world are assessed by empirical criteria and can only be refuted by being incompatible with scientific facts or theories concerning the real world.
One can express the difference between the aims of such theories in terms of two interpretations of metaphysics. Theories like (a) and (b), which aim at analyzing the concept of causation, can be seen as belonging to descriptive metaphysics (Reference StrawsonStrawson 1959), which aims at describing the structure of reality as it is according to common sense. Theories of causation as it is in the real world (d) can be taken to belong to the metaphysics of science.Footnote 93 Causal models (c) can be interpreted in both ways. The application of the so-called “Canberra plan” (introduced in Section 8), which consists in trying to provide a functional reduction of the concept of causation, integrates both conceptual analysis and empirical hypotheses, and attempts to achieve the objectives of both descriptive metaphysics and metaphysics of science. Reference WoodwardWoodward (2021a) shows how and why the enquiry of what causation is “in the world” is inseparable from inquiries about how we humans think about causation, how we go about finding out about what causes what and how we use that knowledge for manipulating the world.Footnote 94
7 Processes and Mechanisms
We have examined accounts of causation in terms of regularity, probability raising, counterfactual dependence, intervention, and causal models. The first three of these suffer from counterexamples and the fourth cannot be applied in all situations. The approach in terms of causal models, which analyzes causation in terms of difference-making at the level of types of events, presupposes the distinction between causal and non-causal dependence.Footnote 95 Causal models are built under the constraint of using only variables that are “independently fixable,” that is, that do not stand in non-causal dependence relations.Footnote 96 Therefore, they do not provide a framework that can be used to explain the difference between causal and non-causal dependence relations.
This distinction may be made at the level of particular events, on the basis of the empirical hypothesis that causation is a local physical process that stretches between two events that are localized in space and time. We will see that this empirical hypothesis can be used as a complement to accounts of causal influence in terms of causal models.
One historical source of process accounts of causation is Reference RussellRussell’s (1948) analysis of causation in terms of “causal lines,” which is inspired by the physical notion of a world line (Reference ReichenbachReichenbach 1928/1958). The world line of an object or process is its spatiotemporal trajectory. Russell defines a causal line as a world line that possesses qualities or structures that are either constant or change only smoothly (Reference RussellRussell 1948, p. 477). However, being a causal line is neither necessary nor sufficient for being a real causal process. It is not sufficient because the continuity of structure or quality can also characterize what Reichenbach calls an “unreal sequence” (Reference ReichenbachReichenbach 1928/1958, §23, p. 148) and Reference SalmonSalmon (1984, p. 141ff) a “pseudo-processes.” Pseudo-processes are world lines that give human observers the illusory impression of a causal process: they are not causal, but their qualitative continuity qualifies them as Russellian causal lines. Take Reference ReichenbachReichenbach’s (1928/1958, §23, p. 148) fast rotating projector casting a spot of light on a distant wall so that the spot sweeps over the wall. The world line consisting of the series of places on the wall at the times at which the light spot appears on them is a causal line without being a causal process. It exhibits qualitative continuity but cannot be causal. This can be seen from the fact that its speed can exceed the speed of light if the distance between the projector and the wall is sufficiently large and the speed of rotation of the projector sufficiently fast. Each spot at x at t is the end point of a causal process originating in the projector, without having any influence on the adjacent spot. Being a causal line is not necessary for being a causal process either. Some causal processes lack continuity of structure and exhibit large and fast qualitative changes, for example, when several particles of different types follow each other in a “cascade” of radioactive decomposition (see Section 2).
Reference SalmonSalmon (1984) has combined Russell’s and Reichenbach’s analyses, suggesting that a causal process is a process that (1) has structure or qualities that are either permanent or only changing continuously and (2) is capable of transmitting a mark, which is defined as a local modification of structure.Footnote 97 The light spot gliding along the wall is not a causal process because, if one modifies its color by inserting a red filter between the projector and the wall at one point, this modification will not propagate to the subsequent world line of the spot.
This analysis in terms of continuity of structure and mark transmission raises several difficulties.Footnote 98 The mark transmission criterion is shown to be insufficient by pseudo-processes capable of transmitting marks. Kitcher (1989, p. 463) mentions derivative marks: when a passenger in a car holds a flag out of the window, the shadow cast by the car as it passes along a wall bears the mark of the flag. Moreover, the analysis of the notions of mark and of causal interaction seems to be circular:Footnote 99 A mark is a modification of structure introduced into a process by a causal interaction, but an interaction is causal if it leads to the introduction of a mark.
An alternative proposal characterizes a causal process as a world line along which some physical quantity is transmitted or transferred, such as energy, momentum,Footnote 100 or more generally, some conserved quantity.Footnote 101 Processes construed in terms of transference can be used to justify causal judgments that challenge difference-making accounts. If active microglia preempt active astrocytes in cleaning the brain of debris, there is no (chain of) counterfactual dependence between active microglia and the cleaning of debris. Transference accounts suggest that what grounds the fact that the microglia’s activity causes the cleaning of debris is that there is a physical process transmitting energy and momentum to the debris in the microglia’s case, but not in the astrocytes’. However, the latter claim can be challenged. Given that there is physical transmission everywhere (except for events that are spacelike separate in the sense of relativity theory), the astrocytes also transmit physical quantities to the debris.
Or take the radioactive decay of nuclei of the isotope 223Fr of Francium (see Section 2). This case refutes accounts of causation in terms of probability raising because the probability of having an effect of type 219Rn is lower given the cause (there being a nucleus of type 219At) than given the alternative possibility that it be of type 223Ra. It seems doubtful whether interventions can be used to overcome this difficulty. But transference of energy and other conserved quantities provides a ground for the judgment that the decay of a 219At nucleus causes the coming into being of a 219Rn nucleus. More generally, transference justifies causal judgments in cases where (1) the cause belongs to a type that lowers the probability of events of the type of the effect, (2) cause and effect are categorized in terms of types that have only one member, so that probabilistic approaches are inapplicable.
The hypothesis that causation is grounded on a process of transmission fits the intuitions of locality and intrinsicality, according to which the existence of a causal relation between a and b only depends on processes stretching from a to b. Difference-making accounts, in terms of nomological dependence, probability raising, counterfactuals, or in terms of causal models, run counter that intuition. They all construe causation as non-intrinsic.
Challenges against the hypothesis that causation is grounded on transference can be grouped in two sorts. Objections of the first sort are based on considerations having to do with the content of physical theory; they accept the project of finding out what causation is in the real world according to science, but object to the hypothesis that it is transference of an amount of a conserved quantity. The second type of challenge is conceptual and based on counterexamples: cases of causation without transference or cases of transference without causation. Let me look at the former challenges first.Footnote 102
1. In order to give sense to the idea that something x is transferred from a to b, x must persist over time. It can be questioned whether particular quantities of conserved quantities, such as energy, can persist through time like substances.Footnote 103 However, the statement that an amount q of energy (or some other conserved quantity) is transferred between space-time regions A and B, where events a and b are located, can be justified without requiring that q is a substance. A necessary and sufficient condition for the transference of energy (or another conserved quantity) between A and B is the existence of a time-like or light-like curve connecting these regions along which a field carrying energy (or another conserved quantity) is propagated.
2. In the framework of the theory of general relativity, it is not possible to derive a generally valid form of the conservation of energy. However, the notion of transmission presupposes that of conservation: only what is conserved can be transmitted. Therefore, transference of energy cannot be what makes true causal propositions bearing on large scale cosmological events, for example, that the big bang is the distant cause of the present expansion of the Milky Way, or on events in regions with non-negligible space-time curvature (Reference CurielCuriel 2000; Reference LamLam 2005). That leaves open the possibility that transference provides a necessary and sufficient local condition for causation between events that are situated in approximately flat regions of space-time.
3. A third argument against the transference hypothesis relies on the existence of non-local dependencies in entangled systems in quantum mechanics. In a variant of the experiment first conceived by Reference Albert, Boris and RosenEinstein, Podolsky and Rosen (1935), two electrons are prepared in an entangled singlet state of spin ½, and then move in opposite directions. According to quantum mechanics, measurements of a given component of the spins of such entangled pairs of particles are strictly correlated. Whether or not the results of the measurements are determined before the measurements (as hidden-variable theories assume) or not (as the orthodox version of quantum mechanics assumes), the result of one measurement seems to have an instantaneous causal influence at a distance on the other measurement. If causation is grounded on transference, the dependence of one measurement on the other in an EPR-style experiment cannot be causal. The non-causal character of the dependence of one result of measurement on the other can also be brought out by the fact that it is impossible to manipulate one result by intervening on the other.Footnote 104 The correlations between measurement events on entangled pairs of particles that are spacelike separated may be cases of non-local and non-causal determination.Footnote 105
4. A necessary and sufficient condition for the transference of energy (or other conserved quantities) between A and B (according to the transference hypothesis, itself necessary and sufficient for causation) is the existence of a time-like or light-like curve connecting these regions along which a field carrying energy (or another conserved quantity) is propagated. This seems to make the account circular because “carry” and “propagate” are causal notions. Attempts to analyze the meaning of these causal notions in terms of transmission would either lead to an infinite regress or a vicious circle. It seems more promising to analyze the meaning of these expressions in terms of laws of nature. This yields a theory in two parts: (1) Causation is grounded on transference. (2) Transference of amounts of conserved quantities is grounded on laws of physics. The transference theory is a variant of the nomological theory because, at bottom, laws of physics ground causation. However, not any law of any type, or even any type of law of evolution, is sufficient for causation, as it is according to (DN-C). The only laws that ground causation are the laws of propagation of energy and other conserved quantities along time-like or light-light curves.
The fundamental role of laws raises two problems. (1) It threatens to make the account lose its locality and intrinsicality, which seemed to give it an advantage over non-local theories, such as causal models.Footnote 106 (2) If the relevant laws are reversible, the asymmetry of causation is not a consequence of the physical transmission process that grounds causation in the actual world, but has an independent ground,Footnote 107 such as the fact that our region of the universe contains many irreversible processes that are all oriented in the same direction, according to the second law of thermodynamics.Footnote 108
Now let us look at challenges that have been raised against process accounts from a conceptual point of view.
5. Process theories take causation to be a type of relation or process whose characteristics can only be discovered by empirical research; they contain empirical hypotheses about what it is, physically, that makes a relation or process causal. Such hypotheses are fallible, that is, may turn out to be false on empirical grounds. That is a strength: after all, the fallibility of a theory proves that it is not tautological (Reference Popper KarlPopper 1934). But the fact that these theories have empirical content can also been seen as a weakness: Contrary to theories that aim at analyzing causation in an a priori manner by pure conceptual analysis, an empirical theory aims only at correctly characterizing causation as it is in the real world.Footnote 109 One reply to this worry is that it is inappropriate to criticize process theories for not satisfying criteria that are not appropriate for them. An empirical theory of causation in reality has no ambition to account for all possible sorts of causation, in particular to those featuring in science fiction novels or fairy tales. Another reply is that process theory, although fallible as an empirical hypothesis, might well be, if it is true, necessarily true, in analogy with Kripke’s thesis that the statement that water is H2O is a posteriori, but nevertheless, if true, necessarily true.Footnote 110
6. Another objection is that transmission processes are everywhere. Events that are spatiotemporally sufficiently close to each other are, for example, often linked by transmissions of photons. Therefore, transmission theory seems condemned to lead to a plethora of true causal judgments.Footnote 111 It can be replied first that those plethoric causal judgments are true but lack communicational relevance.Footnote 112 Second, the relevant causal processes can be chosen on perfectly objective grounds, on the basis of the properties of the effect that is indicated in the explanandum of the causal explanation one is looking for. If one asks for the cause of Peter’s standing up, the relevant causal process is at the physiological and psychological level and leads to the instantiation of the physiological and psychological properties constitutive of standing up.
7. An objection to this latter reply is that the transference hypothesis applies only to physical causal processes and is thus inadequate to ground the truth of ordinary causal judgments involving non-physical, for example, psychological, properties. Mary asks Peter to stand up, and he stands up. The cause of Peter’s standing up has to do with his understanding of the content of Mary’s request but apparently it does not seem to be relevant to consider the underlying causal process from the point of view of energy transmission.Footnote 113 The transference hypothesis entails indeed that all causes and effects are physical. What the objection shows is that a satisfactory account of causation must take the influence of properties into account, over and above the mere existence of causal relations based on transference. We will look at proposals for integrating the process account with causal models, in which the relevant properties are represented by variables, next.
8. Process accounts seem to be refuted by omissions and preventions. If I kill a plant by omitting to water it, it seems that I have caused its death without having transmitted anything to it.Footnote 114 If on the contrary I prevent the plant’s death by watering it, the event of the plant’s death does not take place and cannot therefore be the target of any transmission. Reference Schaffer, Collins, Hall and PaulSchaffer (2000/2004) argues that there are many common-sense causal propositions bearing on situations in which no transmission seems to be involved. Striking cases are propositions expressing double prevention (Reference HallHall 2004, p. 241), in which the cause prevents the prevention of an effect. In the operon theory of gene expression,Footnote 115 gene expression is triggered by an “inductor” molecule that prevents a “repressor” molecule from preventing it. Omission and prevention are forms of difference-making. In the next section, we will look at attempts to integrate processes with difference making.
8 Integrating Causal Models with Processes
We have seen that accounts in terms of (1) nomological deducibility, (2) conditional probabilities, (3) counterfactual dependence, (4) interventions, all yield fallible criteria for causation.Footnote 116 The framework of causal models can be interpreted using the conceptual tools of (1) – (4) so as to solve many of the problems of earlier accounts. However, causal models provide only a part of a complete theory of causation. The construction of a model presupposes the distinction between causal and non-causal dependence. A set of variables is appropriate for the construction of a causal model only if it doesn’t contain any variables related by forms non-causal dependence. Otherwise, the variables do not respect “independent fixability.”Footnote 117 Causal models do not provide a framework within which one could represent the question which dependence relations are causal, and which are logical, conceptual or metaphysical, because their construction presupposes an answer to that question.
Accounts in terms of mechanisms can be seen as complementary to difference-making accounts, and in particular to causal models. This complementarity can be seen from two sides. Pearl says that “each parent-child relationship in the network represents a stable and autonomous physical mechanism” (Reference PearlPearl 2000, p. 22). These physical mechanisms are not characterized within causal models; rather, they can be interpreted as constituting the metaphysical reality that explains the functional dependence between variables expressed in structural equations and causal models. See Section 5.
Mechanisms have been introduced in the philosophy of causal explanation and causation as a reaction to the inadequacy of the DN model for the analysis of the scientific analysis of causation in the life sciences.Footnote 118 Accounts in terms of probability raising, counterfactuals, interventions, or causal models all seem inappropriate for causation in biology and neuroscience, where causal explanation does not in general rely on the discovery of exceptionless regularities, or manipulability at the level of variables, but on the discovery of mechanisms. There are at least two ways of analyzing the concept of a mechanism. One is Salmon’s and Dowe’s in terms of processes (see Section 7). A second analysis takes mechanisms to be systems of interacting parts.Footnote 119 “A mechanism underlying a behavior is a complex system which produces that behavior by the interaction of a number of parts according to direct causal laws” (Reference GlennanGlennan 1996, p. 52). The direct links between variables in causal models can be interpreted in terms of mechanisms, which motivates the mechanistic theory of token level causation: “two events are causally related when and only when they are connected by an intervening mechanism” (Reference Glennan, Beebee, Hitchcock and MenziesGlennan 2009, 316).Footnote 120
So here is one aspect of the complementarity of causal models and mechanistic accounts: Causal models and other difference-making theories represent networks of causal links between variables, and mechanistic theories aim at identifying the metaphysical processes that underlie each direct link between variables (Reference GlennanGlennan 2017, p. 168).
Here is a second way in which mechanisms and difference-making relations (corresponding to direct links between variables in causal models) complement each other. Influence relations between variables representing higher-level features of, for example, biological systems, are “mechanically explicable” (Reference GlennanGlennan 1996, pp. 61–3). They are the result of interactions between parts of the system. The interactions between parts that help explain a given causal link at level n lie at lower levels. The concept of levels of properties is controversial and can receive different interpretations. In a mereological interpretation, the properties of a whole object lie at a higher level than the properties of its parts.Footnote 121 In general, interactions at level n-1 are themselves mechanically explicable in terms of still lower levels, n-2 and beyond. One example is the multilevel mechanism of long-term memory (Reference CraverCraver 2007, pp. 165–70). The mechanistic account of causation is not reductive and must be completed by some complementary account of what makes true the most fundamental interactions. The account of the fundamental account may be in terms of laws of nature, as Reference GlennanGlennan (1996) and Reference PsillosPsillos (2004) suggest, or in terms of direct causal influence between variables in causal models (Reference WoodwardWoodward 2011). Fundamental influence relations may be grounded on transference.
The complementarity of difference-making and mechanistic/process accounts must not be confused with pluralism, according to which “cause” is a polysemic word that is ambiguous between different concepts. For pluralism, what is meant by a causal statement depends on the context. Many varieties of pluralism are conceivable and many forms have been explored.Footnote 122 It is undeniable that there exist several causal concepts, all useful in certain contexts and irreducible to one another, such as direct and indirect cause, or contributing and total cause. The complementarity thesis just sketched is very different from the pluralist claim that causal statements like “smoking causes cancer” are ambiguous and express either the concept of causation as difference-making or the concept of causation as process or mechanism.Footnote 123
The application to causation of the so-called “Canberra plan”Footnote 124 provides another important way to conceive of the complementarity of difference-making accounts (such as causal models) and process accounts (Reference LewisLewis 1994a, Reference Lewis2004; Reference MenziesMenzies 1996; Reference BontlyBontly 2006). The Canberra plan is a general strategy for the naturalization of philosophical concepts, that is, for integrating common-sense notions with scientific knowledge. It is in part conceptual and in part empirical. In a first a priori step, the content of a concept is derived from ordinary common-sense intuitions or “platitudes” about the correct application of the concept: The concept is characterized in terms of a “functional role.” In a second, empirical step, scientific knowledge is used to produce a hypothesis about what in reality plays the functional role identified in the first step. Kim offers the example of the biological notion of a gene. In the first step the functional role corresponding to the notion of gene is identified as “that mechanism in a biological organism causally responsible for the transmission of heritable characteristics from parents to offsprings” (Reference KimKim 1998, p. 25). In the second step, (parts of) DNA molecules are identified as what fills the role.
Applying this strategy to causation, a first step identifies the conceptual role corresponding to the notion of cause. It is plausible that this conceptual role can be characterized by virtue of all or some subset of “the central connotations of causation” (Reference SchafferSchaffer 2000a, p. 289) I have mentioned at the very beginning of this Element.Footnote 125 The second step consists in making a hypothesis, informed by science, about what fills the role identified in the first step. Menzies mentions as one promising hypothesis for the filler of the role described by the folk concept of causation, Reference FairFair’s (1979) proposal that “the causal relation is the relation of transfer of energy-momentum from cause to effect” (Reference MenziesMenzies 1996, p. 104).
One worry is that the Canberra plan takes the functional role identified in the first step to be a causal role.Footnote 126 This makes it seem impossible to apply the reduction procedure to the concept of causation itself. But the role identified by a concept need not be a causal role. More substantial worries are that there are no clear-cut and universally shared intuitions about causation,Footnote 127 and that these intuitions are not purely a priori because they are in part shaped by what science tells us about possible role-fillers at the second step.
Another proposal is that causation at the level of particular events is analyzed in a first step in physical terms, in terms of mechanisms or processes of transference, and that we select, in a second step, from the vast number of physical causal relations, those that make a difference, in terms of probability raising, counterfactuals or causal models. According to “causal foundationalism” (Reference NeyNey 2009), difference-making causation between events at the level of types results from our selecting those among the physical causal relations that are important for our purposes of explanation, prediction and action.
According to the complementarity thesis, there are two components in what makes true ordinary causal statements such as “Bob’s coughing caused Carol’s waking” (Reference GlennanGlennan 2010, p. 365). (1) A physical process between the relevant particular events and (2) a difference-making relation at the level of relevant types under which these events fall, that is, in this case coughing and waking.Footnote 128 Both aspects of causation are required: on one hand “the spatiotemporal aspects of causation” (Reference Hitchcock, Machamer and WoltersHitchcock 2007, p. 214), on the other hand, the fact “that effects depend upon their causes, or that causes make a difference for their effects” (Reference Hitchcock, Machamer and WoltersHitchcock 2007, p. 214). What makes it true that c’s being F is causally responsibleFootnote 129 for e’s being G, has two parts: there is a physical process linking c and e; 2) F makes a difference to G. The types by which events make a difference are in general not physical, but in a physicalist framework, it is supposed that all facts superveneFootnote 130 on the set of physical facts. If this is correct, the process of Bob’s coughing waking Carol up supervenes on a physical process.Footnote 131
9 Contemporary Debates
9.1 Grounding and Causation
One common thread that runs through modern philosophy of science is the idea that modern science replaces causation with functional dependence. We have seen earlier that causation can be construed as a particular form of dependence. But there are many non-causal forms of dependence.Footnote 132
Grounding is a very general concept of dependence that has recently been the focus of much research in logic and metaphysics.Footnote 133 Grounding is formally defined as a relation of strict partial ordering, that is, one that is asymmetric, irreflexive, and transitive.Footnote 134 It applies to both formal (or logical) and empirical forms of dependence. The propositions p and q (logically) ground together the conjunction p∧q. Two H atoms and one O atom, when in a bound state, together (empirically) ground an H2O molecule. Both the grounding of a conjunction on its terms and the grounding of a molecule on its component atoms are non-causal, even though the latter is a form of empirical, not conceptual, dependence. Grounding can be represented by structural equations, just as causation, because grounding and causation are both forms of dependence.Footnote 135 However, this is not enough to show that causation is a sort of groundingFootnote 136 or that grounding is a form of causation: “metaphysical causation.”Footnote 137 The latter thesis modifies the traditional concept of causation, so that it can encompass many forms of dependence usually opposed to causation. It thereby abolishes the useful distinction between causal and non-causal dependence.Footnote 138
9.2 Eliminativism with Respect to Causation in Physics
It has often been claimed that the concept of cause is obscure and anthrophomorphic and has no place in science. “There is no cause nor effect in nature; nature has but an individual existence; nature simply is” (Reference MachMach 1960, p. 483; italics in original). “The reason why physics has ceased to look for causes is that, in fact, there are no such things” (Reference RussellRussell 1912/1919, p. 180). According to Mach and Russell, the use of the concept of cause results from anthropomorphic prejudice.
[M]any imagine they understand motions better when they picture to themselves the pulling forces; and yet the accelerations, the facts, accomplish more, without superfluous additions. I hope that the science of the future will discard the idea of cause and effect, as being formally obscure; and in my feeling that these ideas contain a strong tincture of fetishism, I am certainly not alone. (Reference MachMach 1894, p. 254; italics in original)
“The word ‘cause’ is so inextricably bound up with misleading associations as to make its complete extrusion from the philosophical vocabulary desirable” (Reference RussellRussell 1912/1919, p. 180).
Eliminativists argue that the concept of cause should not be used in a worldview informed by science because in contemporary science, “the concept of cause is replaced […] by the concept of function” Reference MachMach (1960, p. 555).Footnote 139 Russell’s claim that the most developed parts of physics do not use causal vocabulary is incorrect,Footnote 140 but it remains controversial whether the use of causal vocabulary should be interpreted literally. The fact that formulas and mathematical models don’t have any causal characteristics does not show that physical theories have no causal content: Causation might be part of the interpretation of the formulas and models, which is required to use them to predict and explain real phenomena.Footnote 141
Here are two arguments for why the functions expressed by equations used in physics cannot be interpreted causally:Footnote 142 (1) Causation is local, but according to classical mechanics it is the state of the whole universe that determines what happens at a given instant. (2) Causation is asymmetrical but no such asymmetry can be found in the laws. Classical mechanics and at least parts of quantum mechanics are time reversible. The state of the universe at a given time determines its state at other times, both in the past and in the future.
In reply, instead of concluding that physics gives us grounds to think there is no causation in physical nature, we may conclude that physics give us reasons to revise our intuitive concept of causation. Causation is less local than we might have expected. Here are two proposals for articulating the symmetry of fundamental physics with macroscopic asymmetrical causation. (1) David Reference AlbertAlbert (2000) proposes that macroscopic asymmetry of causation can be explained by the “past-hypothesis” (which says that the entropy of the universe was much lower in the very distant past than it is now) and statistical mechanical principles, although fundamental laws are time-symmetric. (2) Price and IsmaelFootnote 143 argue that the asymmetry of macroscopic causation is “perspectival.” Instead of being a metaphysical feature of the world, it results from a projection of the asymmetry of deliberation onto the world, which appears only to agents like ourselves.
It is generally accepted that causation is indispensable in practice and plays a major role in all applications of science, such as engineering. Causal knowledge is indispensable for finding effective strategies (Reference Cartwright and CartwrightCartwright 1979). However, it is controversial whether certain physical theories contain or imply causal hypotheses, in particular linear response theory, and special and general relativity.Footnote 144
Causal eliminativism with respect to physics seems incompatible with causal realism with respect to models that are developed outside fundamental physics, given the (partial) unity of science (Reference TahkoTahko 2021), and in particular the existence of reduction relations (Reference NagelNagel 1961). If causal models are interpreted realistically, how could causation “emerge” as a relation of variables outside of fundamental physics, although the variables of the special sciences supervene on those of physics? The tension can be resolved in an instrumentalist and pragmatic way by interpreting causal models as tools for prediction, not to be taken as descriptions of reality (Reference HitchcockHitchcock 2007a, p. 52).Footnote 145 Moreover, it has been suggested that the tools provided by causal models have no application in fundamental physics.Footnote 146
Eliminativist and antirealist arguments often rely on controversial assumptions. Such arguments can be criticized by showing that those assumptions can be dropped.
1) Reference Norton JohnNorton (2003, p. 3) suggests that if causation were real it would have to be analogous to energy. However, causation can be realistically interpreted without postulating the existence of “causity”Footnote 147 as a type of substance or quantity, analogous to energy.
2) Many authors argue that a realistic interpretation of causation is incompatible with quantum mechanics because causation requires necessitation of the effect by the cause.Footnote 148 This assumption is dropped by probabilistic accounts.
3) Russell does not clearly distinguish between the claim that there are causes and the truth of the “law of causality” or “principle of causality,” “the principle ‘same cause, same effect’” (Reference RussellRussell 1912/1919, p. 188).Footnote 149 Causal realism is compatible with abandoning the law of causality.
4) Reference Norton JohnNorton (2003, p. 8) suggests that the fact that classical mechanics is not a deterministic theory entails that there are “uncaused events” and “acausality” in classical physics. Probabilistic accounts, such as causal models, show that the concept of causation is independent of determinism. Counterfactual and process accounts are compatible with the existence of events to which classical mechanics doesn’t even attribute any probability.Footnote 150
9.3 The Challenge against Mental and “Downward” Causation from Closure and Exclusion
The idea that the mind causally influences the physical world is often claimed to be incompatible with physicalism.Footnote 151 Physicalism is the doctrine according to which (1) everything is either physical or exclusively composed of physical parts, and (2) all properties of all objects supervene on the physical properties of those objects.Footnote 152 According to the construal of physicalism in terms of supervenience, mental properties, events, and processes are distinct from physical properties, events, and processes.
Reference KimKim (1998, Reference Kim2005) argues that downward causation is incompatible with this construal of physicalism: It can never be literally correct that a mental event causes a physical event, because the causes of physical events are always exclusively physical.Footnote 153 If the argument were sound, it might be generalized to argue for the inefficacy of all higher-level causes.Footnote 154
1) (Closure) The causal closure of the physical domain.Footnote 155 If a system p has at t1 a physical property R, then there is, at each time t0 preceding t1, a physical property N such that the fact that p has N at t0 is causally responsibleFootnote 156 for the fact that p has R at t1.
2) (Exclusion) Principle of causal exclusion.Footnote 157 If the fact that p has N at t0 is causally responsible for the fact that p has R at t1, there cannot be any property M distinct from N, and in particular no property M at some level higher than N, such that p has M at t0 and such that the fact that p has M at t0 is also causally responsible for the fact that p has R at t1.
3) No downward causation. Therefore, no higher-level level property M is such that the fact that p has M at t0 is causally responsible for the fact that p has R at t1.
This result is troubling because it seems obvious that its conclusion is wrong and that our minds do influence physical events. My thoughts (M) cause my fingers to move over the keyboard (R) at this moment. It also seems to be incompatible with the scientific exploration of downward influence from psychological on physiological features of persons, for example, in the study of physiological effects of psychotherapy.Footnote 158
Both Closure and Exclusion can be challenged. Closure, which is also called the principle of the “completeness of the physical,” is regularly challenged in the context of the defense of emergentism.Footnote 159 Against Exclusion, it may be argued that mental (and other higher-level properties) can exercise their own autonomous causal influence, in parallel with the influence of their physical supervenience base.Footnote 160 It may also be argued that higher-level variables (or properties) can be efficacious by being identical with underlying physical variables (Reference Heil and RobbHeil and Robb 2003), or because they are composed of causal powers that are a subset of the causal powers of the underlying physical properties (Reference ShoemakerShoemaker 2007; Reference WilsonWilson 2011).
9.4 Robust and Proportional Causation
In the preceding section, we have considered an important challenge to the conceivability of mental causation. Mental events seem to be unable of influencing both physical and mental events, because any such influence is always exclusively due to the physical events underlying those mental events. One strategy for answering that challenge and thus justifying the possibility that mental events are causes merits a closer look,Footnote 161 if only because it also has other fruitful applications, for example, in biology (Reference WoodwardWoodward 2010). Dependence relations can be conceived in different ways. If Y depends on X, and if the dependence relation can be mathematically represented as functional dependence between variables, one can further distinguish whether the dependence is robust (or stable), and whether it is proportional (or specific). Both of these features are connected to the usefulness of knowing them for providing causal explanations and making causal predictions.
Take a causal model M that contains variables X and Y, so that X directly causally influences Y. Direct causation can be defined with the help of the notion of a counterfactual intervention (Reference WoodwardWoodward 2010). X causes Y if and only if, for some set of values of all other variables in set M, that is, in set B = M\{X, Y}, if an intervention set the value of X to a different value from its actual value, the value of Y, or the probability distribution of Y, would change.
The stability or robustness of the causal influence of X on Y can be defined with respect to the range of the values of the variables in B, for which the influence of X on Y exists.Footnote 162 Stability comes in degrees: the causal influence of X on Y is the more stable the larger the set of values of B variables for which it persists. If X causes Y indirectly through a chain of intermediate variables X1, X2, …, where each direct link between adjacent variables is less than perfectly stable, the indirect (or distal) causal influence of X on Y will in general be less stable than each direct link, and will in general be the less stable the longer the chain. This may be one source of the intuition that causation is not always transitive. But causation must not be confused with stable causation. A defender of transitivity, such as Lewis, can argue that long chains preserve causal influence, though they have little stability. The relative instability of causal relations between variables in biology may be used to justify the thesis that there are no laws in biology, or the weaker thesis according to which such laws are “system laws” (Reference SchurzSchurz 2002), which hold only under special circumstances, characterizing what Cartwright calls “nomological machines” (Reference CartwrightCartwright 1999), whereas “laws of nature” or fundamental laws are more stable.Footnote 163
Another important concept is proportional causation. Let me introduce this concept with Reference YabloYablo’s (1992, p. 257) example of a pigeon that has been conditioned to peck at all and only red targets. It reliably pecks at all targets of all shades of red but not at any targets of any other color. Now consider a type of situation in which a scarlet target is shown and in which the perception of that scarlet target causes the pigeon to peck. If P is a variable whose values represent pecking and not pecking, the influence of the pigeon’s perception of a colored target on P can correctly be represented both (1) in a model in which R is a variable whose values represent the perception of a red target and the absence of such a perception, and (2) in a model in which S is a variable whose values represent the perception of a scarlet target and the absence of such a perception. In the former model (1) is true, in the latter model, (2) is true:
(1) R causes P (perception of red target causes the pigeon to peck).
(2) S causes P (perception of scarlet target causes the pigeon to peck);
However, (1) seems intuitively “more correct” than (2) because (1) “furnishes a better explanation” than (2) (Reference WoodwardWoodward 2010, p. 298). The difference between (1) and (2) can be explained in terms of proportionality. R is proportional to P, whereas S is not. Each modification of the value of R leads to a modification of the value of P,Footnote 164 whereas this is not the case for S with respect to P. Some modifications of the value of S do not lead to any change in the value of P: if the value of S is switched from scarlet to some non-scarlet shade of red, this will not change the value of P.
Mental and other higher-level causes may be proportional to their effect, whereas the underlying physical causes are not.Footnote 165 Thus, mental and other higher-level variables can be (1) causes, in parallel to underlying lower-level variables, and (2) more relevant for causal explanation than those underlying lower-level variables.Footnote 166
9.5 Degrees of Causation
Most events have more than one cause. Often, it seems possible to compare different factors influencing the same event according to the strength of their influence. This can be important for many reasons, scientific, medical, moral, or legal. The issue can be raised both with respect to particular events and with respect to types of events. Take a river that is polluted by the sewage of several chemical plants. A scientific question bears on the contribution of each pollutant on different aspects of the modification of the ecosystem and in particular on the populations, or death, of animals and plants of various species. A medical issue is the contribution of each source of pollution to the incidence of certain diseases in the population living downstream who drink or otherwise use the polluted water. Moral and legal issues arise when it comes to determine the amount of moral and legal responsibility of each of the polluting plants for the damage to people’s health and to the ecosystem.
Several conceptual frameworks make room for the introduction of a concept of degree of causation or strength of causal influence.Footnote 167 Several measures of degrees of causal influence have been offered within the framework of probabilistic theories of causation.Footnote 168
However, Sartorio argues that “the appearance that causal contributions come in degrees is just an illusion” (Reference SartorioSartorio 2020, p. 346), on the basis of her analysis of a “puzzle about the relationship between degrees of causation and degrees of responsibility” (Reference Bernstein and ShoemakerBernstein 2017, p. 165). Degrees of causal influence and degrees of moral responsibility are closely connected. Other things being equal, the amount of moral responsibility of an agent for an outcome seems to be “proportionate” (Reference Bernstein and ShoemakerBernstein 2017) to the degree to which the agent’s actions caused that outcome.Footnote 169 The puzzle arises from the fact there seem to be two incompatible criteria for evaluating moral responsibility. To take Bernstein’s example, in a first situation (“Victim”), two assassins independently kill a victim by shooting at her exactly at the same time, where the bullet shot by each is alone sufficient for her death. A second situation (“Hardy Victim”) resembles the first in all respects but for the fact that both bullets are necessary for the victim’s death, none of the individual bullets shot by the two assassins being alone sufficient (either because the bullets are weaker or because the victim is stronger).
According to one criterion for degrees of causal influence of an agent for an outcome, an action is more of a cause of a given outcome to the extent that it comes closer to being a sufficient condition for that outcome. According to another criterion, an action is more of a cause of a given outcome to the extent that it comes closer to being a necessary condition for that outcome.Footnote 170 The puzzle arises from the fact that the two criteria sometimes yield incompatible verdicts: according to the sufficiency criterion the assassins are more of a cause of the death in Victim than in Hardy Victim because each is sufficient in Victim but not in Hardy Victim, whereas the opposite holds according to the necessity criterion: each assassin is more of a cause in Hardy Victim because her contribution is necessary for the death, which is not the case in Victim.
There are at least two strategies to react to the puzzle, which both preserve the existence of degrees of causation. One strategy is to hold that the puzzle arises only in the field of judgments of moral responsibility, whereas closeness to sufficiency is the only appropriate measure of degree of causation.Footnote 171 Another strategy is to accept that the puzzle concerns intuitive judgments of causal influence but that this is just one more case in which there are several causal concepts, each of which comes with its own degrees of causal influence.Footnote 172
10 Concluding Remarks
This Element began with the observation that causation is crucial for our conception of the world and ourselves, from the perspective of both common sense and science, but that its philosophical analysis remains controversial. So far, all efforts to elaborate a reductive account able to account for all uses of the concept of causation have failed. This has led many authors to abandon the attempt at finding a unique reductive analysis. Various forms of pluralism have been explored according to which there are many concepts of causation. One form of pluralism consists in interpreting various concepts of causation as so many conceptual tools that are useful in limited contexts. The strategy of elaborating causal concepts as tools has proven very fruitful in the framework of formal methods of causal modeling. These formal methods have allowed making much progress in elaborating different concepts of causation, such as contributing and total cause, direct and indirect cause, and actual cause. It has become clear that the failure to elaborate a reductive account is no obstacle to our improving our understanding of causation. In particular, it has turned out that one gets a better grasp on the concept of cause by clearly understanding various types of situations that have been obstacles to all simple reductive accounts, such as cases involving omissions, preventions, and cases where causation doesn’t seem to be transitive. Just as important is progress that brings into sharp focus unresolved issues that constitute challenges for future philosophical research on causation. Here are two of them.
Most contemporary philosophers accept both that causation, understood with the help of formal models such as structural equations and causal nets, is crucial for science and common sense and that causation has no place in physics. But how can there be causation in our ordinary life, and in chemistry, biology, psychology, economy and the law, if there is no physical causation? It seems difficult to reconcile this in particular with the widely accepted, and often presupposed, physicalist worldview, according to which all events and processes described in terms of the concepts of common sense or in the terms of some science other than physics, depend on physical events and processes. The tension is certainly alleviated once reductionism is abandoned: if one abandons the thesis of logical empiricism that all other sciences are in principle reducible to physics, there is no contradiction in holding both that there is no causation in physics and that there is causation everywhere else although everything depends in some sense on physics. But there remains a tension. This tension is apparent in arguments for eliminativism in the philosophy of mind, with respect to cognitive states and events. According to the so-called exclusion argument, cognitive states and events cannot be causes because their causal powers are excluded by the causal powers of the physiological and physical states of the cognitive systems entertaining these cognitive states. This argument presupposes that those underlying physical states have causal powers. Therefore, it is incompatible with the thesis that there is no physical causation.
Another topic that awaits further exploration is the distinction between causal and non-causal dependence. The construction of causal models presupposes this distinction because causal nets can only be built with variables that are chosen so that, for each pair of variables, there are only two possibilities: either they are independent of each other or they stand in a relation of causal influence. Variables must be chosen so that they do not depend on each other in any other way: a causal model must not contain variables that stand in logical, conceptual or supervenience relations, or that are related by non-causal association laws. Thus, the possibility of analyzing causation in terms of causal models depends on the distinction between causal and non-causal dependence. However, no generally accepted account of this distinction has yet emerged. From a metaphysical point of view, what is needed is a theory of what makes properties or events independent enough of each other so as to be able to stand in causal relations. Only variables that represent properties or events of that sort are appropriate for use in structural equations and causal nets. The challenge is to find a non-circular way of distinguishing causal and non-causal dependence and of characterizing various non-causal forms of dependence. As long as the choice of variables that are appropriate for causal models remains outside of philosophical accounts of the elaboration and use of such models, our understanding of causation remains incomplete.
Acknowledgments
Many people have helped me by providing me with critical comments on earlier versions of the manuscript. I would like to thank Thomas Blanchard, Toby Friend, Sam Kimpton-Nye, Etienne Ligout, Yanis Pianko, Tom Wysocki, and two anonymous referees for Cambridge Elements. Many thanks to the series editor Tuomas Tahko for his generous support. I have carried out the research for this Element during a partial research leave funded by Institut Universitaire de France.
Tuomas E. Tahko
University of Bristol
Tuomas E. Tahko is Professor of Metaphysics of Science at the University of Bristol, UK. Tahko specializes in contemporary analytic metaphysics, with an emphasis on methodological and epistemic issues: ‘meta-metaphysics’. He also works at the interface of metaphysics and philosophy of science: ‘metaphysics of science’. Tahko is the author of Unity of Science (Cambridge University Press, 2021, Elements in Philosophy of Science), An Introduction to Metametaphysics (Cambridge University Press, 2015) and editor of Contemporary Aristotelian Metaphysics (Cambridge University Press, 2012).
About the Series
This highly accessible series of Elements provides brief but comprehensive introductions to the most central topics in metaphysics. Many of the Elements also go into considerable depth, so the series will appeal to both students and academics. Some Elements bridge the gaps between metaphysics, philosophy of science, and epistemology.