To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Those sciences, created almost in our own days, the object of which is man himself, the direct goal of which is the happiness of man, will enjoy a progress no less sure than that of the physical sciences, and this idea so sweet, that our descendants will surpass us in wisdom as in enlightenment, is no longer an illusion. In meditating on the nature of the moral sciences, one cannot help seeing that, as they are based like physical sciences on the observation of fact, they must follow the same method, acquire a language equally exact and precise, attaining the same degree of certainty.
Nicolas de Condorcet
There is . . . progress in the social sciences, but it is much slower [than in the natural sciences], and not at all animated by the same information flow and optimistic spirit. Cooperation is sluggish at best; even genuine discoveries are often obscured by bitter ideological disputes. For the most part, anthropologists, economists, sociologists, and political scientists fail to understand and encourage one another . . . Split into independent cadres, they stress precision in words within their specialty but seldom speak the same technical language from one specialty to the next. A great many even enjoy the resulting overall atmosphere of chaos, mistaking it for creative ferment.
Edward O. Wilson
The subject of this book is the set of disciplines known as the social sciences (which in earlier times would have been referred to as the moral or human sciences). By this is meant a scientific study of human action focusing on elements of thought and behavior that are in some degree social (nonbiological). “The object of the social sciences,” writes Hans Morgenthau, “is man, not as a product of nature but as both the creature and the creator of history in and through which his individuality and freedom of choice manifest themselves.” Wherever nurture matters more than nature, or where some significant decisional element is involved, we are on the turf of social science. (This does not mean that genetic dispositions are eliminated from consideration; indeed, they comprise an active research agenda in the social sciences today. However, one presumes that any outcome of interest to the social sciences is not entirely biologically determined; there must be a significant component of choice.)
Having discussed the formal (super-empirical) criteria of a good argument, we turn now to the empirical portion of social science research, the hoped-for encounter with reality. This stage may be referred to variously as analysis, assessment, corroboration, demonstration, empirics, evaluation, methods, proof, or testing. (While acknowledging the subtle differences among these terms, I shall treat them as part of the same overall enterprise.)
Of course, the distinction between theory formation and theory-testing is never clear and bright. As is the case everywhere in social science, tasks intermingle. One cannot form an argument without considering the empirical problem of how to appraise it, and vice versa. Moreover, the task of (dis)-confirming theories is intimately conjoined with the task of forming theories. As Paul Samuelson notes, “It takes a theory to kill a theory.”
I hope that my chosen approach to social science methodology strikes readers as commonsensical. Indeed, none of the tasks, strategies, and criteria were invented by the author (though I have chosen labels for things that do not have established names), and most have received extensive discussion. From this perspective, the present book qualifies as a compendium of truisms – a function shared, I might add, by any integrative work on methodology. The first, and perhaps most important, justification for the proposed framework is that it represents a formalization of what we already know.
Nevertheless, readers are bound to have qualms about some elements of the argument. They might take issue with the criterion of generality, for example. They might like what I have to say about description, but not about causation. On what grounds might one adjudicate this sort of dispute?
There is perhaps no more controversial practice in social and biomedical research than drawing causal inferences from observational data. When interventions are assigned to subjects by processes not under the researcher’s control, there is always the real possibility that the treatment groups are not comparable in the first place. Then, any inferences about the role of the interventions are suspect; what one takes to be the causal effects of the interventions perhaps derive from “preexisting” differences among the treatment groups. Despite such problems, observational data are widely available in many scientific fields and are routinely used to draw inferences about the causal impact of interventions. The key issue, therefore, is not whether such studies should be done, but how they may be done well.
Richard Berk
Chapter 9 set forth general criteria applying to research designs whose purpose is to assess causal relationships. In this chapter and Chapter 11 I lay out specific strategies of causal inference. To be sure, there is a high degree of overlap between these topics. Indeed, each strategy can be analyzed according to the criteria that it fulfills (or does not fulfill). Yet the shift from criteria to strategies is an important one. While Chapter 9 is about principles, Chapters 10 and 11 come closer to a “how-to” guide, complete with detailed discussions of specific studies.
Strategies of causal inference will be divided into three categories: (1) randomized designs; (2) nonrandomized designs; and (3) strategies that move beyond X and Y, as summarized in Table 10.1. Because this covers a lot of ground the material is divided into two chapters, with this chapter focused on the first two topics and Chapter 11 on the third.
Surely, if there be any relation among objects which it imports to us to know perfectly, it is that of cause and effect. On this are founded all our reasonings concerning matter of fact or existence. By means of it alone we attain any assurance concerning objects which are removed from the present testimony of our memory and senses. The only immediate utility of all sciences is to teach us how to control and regulate future events by their causes. Our thoughts and enquiries are, therefore, every moment, employed about this relation: Yet so imperfect are the ideas which we form concerning it, that it is impossible to give any just definition of cause.
David Hume
I argued in Part II for a resuscitation of descriptive inference within the social sciences, both as a topic of methodology and as a topic of substantive research. However, I do not suppose that description will displace causation as the reigning motif of social science. We wish to know not only what happened but also, perhaps more critically, why these things happened.
Causation is the central explanatory trope by which relationships among persons and things are established – the cement of the universe, in Hume’s much quoted words. Without some understanding of who is doing what to whom we cannot make sense of the world that we live in, we cannot hold people and institutions accountable for their actions, and we cannot act efficaciously in the world. Without a causal understanding of the world it is unlikely that we could navigate even the most mundane details of our lives, much less matters of long-term policy. This is obvious in the policy world, where causal understanding undergirds any rational intervention. And it is obvious in other areas of politics, for example, in social movements, lobbying, voting, and revolutionary change. Anyone who engages in these activities must be conscious of oneself as a causal actor in the world, and accordingly, must make assumptions (implicit or explicit) about what one’s actions might achieve – whether one supports the status quo or wishes to undermine it. Lenin, like Metternich, was vitally concerned with the causes of revolution. Even where causal understanding does not lead to social change (for not all causal analysis is directly relevant to public policy, and more to the point, not all policy proposals are implemented), we are likely to be reassured when we can order events around us into cause-and-effect relationships. “When we have such understanding,” notes Judea Pearl, “we feel ‘in control’ even if we have no practical way of controlling things.”
Most people who bother with the matter at all would admit that the English language is in a bad way, but it is generally assumed that we cannot by conscious action do anything about it. Our civilization is decadent and our language – so the argument runs – must inevitably share in the general collapse. It follows that any struggle against the abuse of language is a sentimental archaism, like preferring candles to electric light or hansom cabs to airplanes. Underneath this lies the half-conscious belief that language is a natural growth and not an instrument which we shape for our own purposes.
Now, it is clear that the decline of a language must ultimately have political and economic causes: it is not due simply to the bad influence of this or that individual writer. But an effect can become a cause, reinforcing the original cause and producing the same effect in an intensified form, and so on indefinitely. A man may take to drink because he feels himself to be a failure, and then fail all the more completely because he drinks. It is rather the same thing that is happening to the English language. It becomes ugly and inaccurate because our thoughts are foolish, but the slovenliness of our language makes it easier for us to have foolish thoughts. The point is that the process is reversible. Modern English, especially written English, is full of bad habits which spread by imitation and which can be avoided if one is willing to take the necessary trouble. If one gets rid of these habits one can think more clearly, and to think clearly is a necessary first step toward political regeneration: so that the fight against bad English is not frivolous and is not the exclusive concern of professional writers.
George Orwell
I cannot resist inserting a few words on the stylistic properties of social science. Although not methodological in the strict sense of the term, it is nonetheless difficult to separate the desiderata of good writing from the desiderata of good argumentation. Kristin Luker comments:
Writing engages a very different part of the brain than reading and talking do . . . [It] is the door that opens out to the magic. Someone once asked Balzac, who supported himself by writing reviews of plays, how he liked a play he had just seen. “How should I know?” he is reported to have answered. “I haven’t written the review yet!” Balzac was onto something: I find that when I write things down, I write and think things that I’ve never really thought before. Novelists sometimes say that their characters do things that surprise their authors, and I guess this is the sociological version of that phenomenon.
In this respect, writing shares certain characteristics in all realms in which it is employed. Good writing is good thinking, to paraphrase Orwell.
B: Oh, I’m sorry, just one moment. Is this a five minute argument or the full half hour?
Monty Python, "The Argument Clinic"
Argumentation in contemporary social science descends from the ancient art of rhetoric and the equally ancient science of logic. A complete argument consists of a set of key concepts, testable hypotheses (aka propositions), and perhaps a formal model or larger theoretical framework. A causal argument should also contain an explication of causal mechanisms (Chapter 8). An argument is what we speculate might be true about the world; it engages the realm of theorizing.
Sometimes, it is important to distinguish among arguments lying at different levels of abstraction. The most abstract may be referred to as macro-level theories, theoretical frameworks, or paradigms. Examples would include structural functionalism, modernization theory, exchange theory, symbolic interactionism, or conflict theory. At a slightly less abstract level one finds meso-level theories or models. And at the most specific level one speaks of hypotheses, inferences, micro-level theories, or propositions, which are assumed to be directly testable. (Explanations may apply to any level.) So, for example, work on the topic of school vouchers might include a general theory about why consumer choice enhances the educational process, a formal model incorporating various elements of that theory, and a specific hypothesis or set of hypotheses regarding the impact of a voucher-based intervention on educational attainment.
One of the mistakes commonly made in contemporary social research is assuming the existence of standards or procedures that are applicable in a mechanical way to evaluate data (as, say, variance explained, predictive power, tests of significance, and the like). Social science methodology should not be viewed as developing a foolproof system in which data are plugged in at one end and the best answer is generated at the other. As is the case in all areas where evidence is gathered to evaluate a theory, success is often marked by imaginative and creative efforts. This is at least partly what might be called an “art.”
Stanley Lieberson and Joel Horwich
We have now reviewed randomized and nonrandomized approaches to the assignment problem, and to causal analysis more generally. Our discussion has focused on the ways in which these strategies attempt to isolate the covariational pattern between a causal factor, X, and an outcome, Y. This pattern may be of many sorts: positive or negative, proximal or distal, and so forth. There are many types of causal relationships (as summarized in Table 9.2), and each presupposes a somewhat different covariational pattern. The key point is that X/Y covariation is a necessary condition of causality. If Y does not vary with X, at least some of the time and in some real or imagined sample, then X cannot be a cause of Y. Accordingly, the discovery of X/Y covariation may provide strong evidence of causality, if various assumptions hold.
However, the problem of confounders is ubiquitous when one is dealing with nonexperimental data. (Additionally, background factors lying orthogonal to the factor of theoretical interest may present an obstacle to causal inference if the signal is overwhelmed by stochastic noise.) Consequently, one is often obliged to move beyond an exclusive focus on X and Y. Other factors will need to be measured, and conditioned, if the covariation between X and Y is to be interpreted as evidence of a causal relationship.
During my career in science, now nearly a half century in duration, I have grown more and more aware that success in science, paralleling success in most careers, comes not so much to the most gifted, nor the most skillful, nor the most knowledgeable, nor the most affluent of scientists, but rather to the superior strategist and tactician. The individual who is able to maneuver with propriety through the world of science along a course that regularly puts him or her in a position of serendipity is often the one who excels.
Jack Oliver
Broadly stated, the goal of science is to discover new things about the world and to appraise the truth-value of extant propositions about the world. Consider our exemplars, democracy and vouchers, introduced in Chapter 1. We want to uncover new things about the process of democratization and the impact of vouchers on school performance. At the same time, we want to test extant theories about these two subjects. Social science may, therefore, be understood as a twin quest for discovery and for appraisal, as summarized in Table 2.1.
The chapter begins by introducing these goals, followed by a review of their implications for more specific methodological tasks. The next section approaches the goal of discovery through the concrete task of finding a research question. Since the remaining chapters of the book assume that a research question – perhaps even a specific hypothesis – has been identified, this chapter functions as a prologue to the rest of the book.
The history of the social sciences is and remains a continuous process passing from the attempt to order reality analytically through the construction of concepts – the dissolution of the analytical constructs so constructed through the expansion and shift of the scientific horizon – and the reformulation anew of concepts on the foundations thus transformed . . . The greatest advances in the sphere of the social sciences are substantively tied up with the shift in practical cultural problems and take the guise of a critique of concept-construction.
Max Weber
As we are . . . prisoners of the words we pick, we had better pick them well.
Giovanni Sartori
Description will be understood in this book as any empirical argument (hypothesis, theory, etc.) about the world that claims to answer a what question (e.g., how, when, whom, or in what manner). By contrast, wherever there is an implicit or explicit claim that a factor generates variation in an outcome the argument will be regarded as causal. The distinction between these two key concepts thus hinges on the nature of the truth-claim – not on the quality of the evidence at hand, which may be strong or weak. Description is the topic of Part II, while causation is the topic of Part III. Description rightly comes first; one must describe in order to explain (causally). However, the reader will find many comparisons and contrasts across the two topics interwoven throughout the book.
Because this book is focused on generalizing statements about the world (Chapter 1), I am not concerned with descriptions that reflect only on individual cases or events (without any attempt to exemplify larger patterns). Consequently, in this book description is always an inferential act. To generalize is to infer from what we know (or think we know) to what we do not know. One sort of inferential leap is from observations within a sample that are deemed secure to those that are uncertain or missing (problems of “measurement error” or “missing data”) and to dimensions that are inherently unobservable (“latent characteristics”). Another sort of inferential leap is from a studied case or sample to a larger (unstudied) population. In both respects, descriptive models offer a “theory” about the world, “a ‘formula’ through which the data can be reproduced.”
Our problem as methodologists is to define our course between the extremes of inert skepticism and naive credulity . . .
Donald Campbell
If you insist on strict proof (or strict disproof) in the empirical sciences, you will never benefit from experience, and never learn from it how wrong you are.
Karl Popper
All the tasks, strategies, and criteria noted in the previous chapters are considered valid, ceteris paribus. And yet ceteris is not always paribus. A key theme of this book is that methodological choices frequently involve tradeoffs. Satisfying one dimension may involve sacrificing another. Tasks, strategies, and criteria often conflict. Accordingly, for every dimension listed in Table 1.1 and in subsequent tables throughout the book one can locate conflicting imperatives.
The call for discovery is at odds with the call for accurate appraisal. Indeed, exploratory research is typically carried out in a different fashion than research whose primary goal is confirmatory (falsificationist), as emphasized in Chapter 2.
Obviously there is no classification of the Universe not being arbitrary and full of conjectures. The reason is quite simple: we do not know what the universe is.
Jorge Luis Borges
What the devil is going on around here?
Abraham Kaplan
How do social scientists describe a social reality? What arguments do we employ in our attempts to bring order to the great blooming, buzzing confusion of the world? One might suppose that the shape of a descriptive inference is limited only by the social phenomenon that we seek to describe, the models (cognitive, linguistic, mathematical, and visual) that we have at our disposal, and our imagination. In practice, however, descriptive inferences draw from a standard itinerary of tropes.
I shall argue that most descriptive claims can be classified as indicators, syntheses, typologies, or associations, along with their various subtypes as illustrated in Table 6.1. This is how social scientists carve up nature at the descriptive level. These are the patterns that we look for when attempting to describe classes of events in the social world.