To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
The public’s support for the rule of law is a key democratic value and a cornerstone concept in the study of public support for courts. We provide the most systematic analysis to date of its measurement, correlates, and stability. We validate an updated measure of the public’s support for the rule of law, drawing on original survey data. We demonstrate that support for the rule of law is highest among the most politically sophisticated and those with strong support for democratic values. Further, we draw upon thousands of survey responses in the United States and an original six-wave survey panel in Germany to demonstrate the temporal stability of the public’s support for the rule of law at both the aggregate and individual levels. Finally, we illustrate the predictive validity of our measure through the analysis of an original survey experiment.
The last decades have seen important progress in the economic analysis of institutions, with increasing concern about the need to ‘unbundle’ this concept and the diversity of situations it covers. It is so because of the complexity of the systems the concept intends to capture and the ambiguity of definitions often perceived as catch-all ideas without a clear connection to a research strategy. This essay contributes to the literature emphasising that overcoming these difficulties requires a theoretical framework identifying and characterising distinct institutional layers. The content of this framework is substantiated through the analysis of the nature and role of the long-ignored intermediate layer of ‘meso-institutions’. Meso-institutions designate devices and transmission mechanisms linking general rules, norms and beliefs established at the macro-institutional level with their perception, adaptation, and implementation (or challenge) by the actors populating the micro-level. Operationalising this framework relies on a research strategy that proceeds from a ‘substantive theory’ of institutions to the collection and processing of ‘empirical evidences’ through the development of ‘auxiliary theories’ designed to capture specific institutional objects. References to several empirical studies support the relevance of this approach.
This article develops the first dynamic method for systematically estimating the ideologies and other traits of nearly the entire federal judiciary. The Jurist-Derived Judicial Ideology Scores (JuDJIS) method derives from computational text analysis of over 20,000 written evaluations by a representative sample of tens of thousands of jurists as part of an ongoing, systematic survey initiative begun in 1985. The resulting data constitute not only the first such comprehensive federal-court measure that is dynamic, but also the only such measure that is based on judging, and the only such measure that is potentially multi-dimensional. The results of empirical validity tests reflect these advantages. Validation on a set of several-thousand appellate decisions indicates that the ideology estimates predict outcomes significantly more accurately than the existing appellate measures, such as the Judicial Common Space. In addition to informing theoretical debates about the nature of judicial ideology and decision-making, the JuDJIS initiative might lead courts scholars to revisit some of the lower-court research findings of the last two decades, which are generally based on static, non-judicial models. Perhaps most importantly, this method could foster breakthroughs in courts research that, until now, were impossible due to data limitations.
Strengthening the research workforce is essential for meeting the evolving needs and challenges in the health and biomedical fields. To do so effectively, it requires an understanding of how the experiences of a researcher shift over time and how one’s research career evolves, particularly as supports are put in place to foster research. This narrative review provides a summary of published individual-level assessment measures and survey tools from 2000–2024. All measures were abstracted, classified, and coded during analyses to describe the areas of focus, and they were organized into one of six research categories. The review identified a range of measures and methods across all categories. However, the measures were often narrow, focused on outputs, and not ideal for assessing the full range of experiences a researcher may have throughout their career. The most common metrics were related to research productivity and bibliometric measures. Our review of survey tools revealed a gap in comprehensive approaches available to assess an individual’s research experience, efforts, supports, and impact. As efforts expand to evaluate and study the research workforce, tools that focus on a broad range of individual-level measures, tied to specific underlying constructs and drawn from the literature, may prove useful.
Having relevant indicator(s) of students’ school adjustment is the basis for making educational decisions with regard to an enormous scope of topics that refer to either individual students, a specific class, or even the entire school level. Thus, a major cornerstone in the effort to promote students’ school adjustment is the ability to correctly and accurately measure it. The problem of defining a given student’s state is a multi-aspect and multi-level challenge that is shaped by the local authorities’ guidelines, cultural norms, economic circumstances, and the student’s intellectual qualifications and personal characteristics. The existing literature suggests a rich list of measurements of the student’s feelings and the teacher’s evaluation of the student’s academic achievements, but parental and peer reflections are relatively underrepresented. This chapter advocates the priority of students’ subjective evaluations. Such measurements appear under various titles: students’ belonging, engagement, attitude, feelings, satisfaction, well-being, liking, burnout, sentiment, and more. In order to conduct routine evaluations of students’ school adjustment there is a need for a short and easily administered scale. For this end, the School Adjustment Questionnaire (SAQ) is presented as a possible example.
We apply a synthesis review to revisit the concept, measurement, and operationalisation of social inclusion and exclusion in the context of comparative social policy, integrating the vast literature on the concepts, with the aim of elucidating a clearer understanding of the concepts for use by scholars and policymakers around the planet. In turn, we outline the conceptual development of the concepts, how they have been operationalised through social policy, and how they have been measured at the national and individual levels. Through our review, we identify limitations in extant conceptualisation and measurement approaches and suggest directions for refining conceptual and measurement frameworks to enhance their utility in social inclusion policy, emphasising the concepts’ multidimensional, multilevel, dynamic, and relational essence and highlighting their connection to related concepts such as social capital, social integration, and social citizenship.
The increasing multimodality (e.g., images, videos, links) of social media data presents opportunities and challenges. But text-as-data methods continue to dominate as modes of classification, as multimodal social media data are costly to collect and label. Researchers who face a budget constraint may need to make informed decisions regarding whether to collect and label only the textual content of social media data or their full multimodal content. In this article, we develop five measures and an experimental framework to assist with these decisions. We propose five performance metrics to measure the costs and benefits of multimodal labeling: average time per post, average time per valid response, valid response rate, intercoder agreement, and classifier’s predictive power. To estimate these measures, we introduce an experimental framework to evaluate coders’ performance under text-only and multimodal labeling conditions. We illustrate the method with a tweet labeling experiment.
Risks and priorities change during the management of public health incidents. Here we describe a new tool, the Incident Management Measurement Tool (IMMT), that can be used to inform midcourse corrections during public health emergencies and realistic exercises.
Methods
We developed the IMMT through a literature review and subject matter expert interviews. We field tested the tool in 23 incidents ranging in size, duration, and complexity, making changes based on user feedback.
Results
The IMMT consists of 2 modular data collection methods, a survey of the incident management team and a protocol for a peer assessor. Pilot testing suggested that the tool is valid, reliable, feasible, and useful.
Conclusions
Measurement of public health incident management is feasible and may be useful for improving response times and outcomes. Moreover, a limited set of standard measures is relevant to a wide range of incident response contexts.
TOTs are inherently subjective experiences; only the experiencer can really know whether one is happening and what it feels like as it does. As such, methodologies and their nuances are extremely important. This chapter covers the various methods that have been employed to measure TOTs. The prospecting method – now the most widely used method of studying TOTs – was first developed by Brown and McNeill (1966) in their seminal paper. Present-day use of the method commonly employs word definitions, general-information questions, or faces of famous people. The method can also be adapted to new learning. It is also important to determine how accurate TOTs are at predicting later memory, and we discuss approaches to doing so. Another approach to studying TOTs involves diary studies, in which people are asked to record their naturally occurring TOTs and their qualities and characteristics over a set period of time. How TOT rates should be computed remains an important issue. Depending on one's theoretical approach, it can make sense to divide the number of TOTs over all unrecalled items, or it may be better to divide the number of TOTs over TOTs plus correctly recalled items.
The introduction outlines four major tasks of this study: (1) to present evidence of disability-based intergroup economic disparity in the United States; (2) to engage the lived experiences of individuals and communities experiencing multiple simultaneous axes of oppression, including disability-based oppression; (3) to contribute to emerging understandings of the importance of intersectionality to economic research and policy; and (4) to contribute to stratification economics in applied terms through direct engagement with policy proposals for a federal jobs guarantee and federal “baby bonds” program. It provides an overview of disability and the US economy, disability and economic research methods, common models of disability, and the challenge of race/disability analogies.
Emotions and their sociopolitical impact have received increasing scholarly attention. However, it remains largely unclear whether emotional expression within surveys is subject to social desirability bias. By drawing on impression management theory and the disclosure decision model, I argue that emotional expression is likely prone to social desirability bias in interviewer-administered survey modes and test my hypotheses on mixed-mode ANES data. The findings demonstrate that respondents significantly underreport negative emotions—anger and fear—when interviewed face-to-face as compared to online. Furthermore, positive emotions, such as hope and pride, are not exempt from biased reporting related to interview mode. These results highlight the risks of estimating emotions and their salience by either relying on interviewer administration or combining survey modes.
The interpersonal and relational dimensions of mentoring have been identified as critical components of effective mentorship. However, no scale currently exists to assess this specific aspect of the relationship. This study introduces a new instrument, the mentorship working alliance (MWA) – mentee version, and presents initial evidence supporting its validity in evaluating the interpersonal elements of mentoring relationships.
Methods:
Through a series of pilot tests and revisions, we developed a 12-item scale that assesses two dimensions of the MWA: relational quality (6 items), which captures how a mentee feels about the relationship, and relational effectiveness (6 items), which reflects the mentee’s perception of their mentor’s actions in facilitating or advancing the working relationship. To evaluate the scale’s construct validity and reliability, we conducted a confirmatory factor analysis (CFA) and internal consistency reliability analysis on a sample of 345 graduate students.
Results:
CFA provided evidence for the validity of the two-dimensional MWA scale, which assesses relational quality and relational effectiveness, with Cronbach’s alpha coefficients of 0.96 and 0.89, respectively. All parameter estimates for individual items were significant, with standardized factor loadings ranging from 0.66 to .83.
Conclusions:
The MWA scale – mentee version enables researchers to assess the interpersonal dimensions of mentoring relationships, offering valuable insights into the components of effective mentorship. By introducing this scale, we pave the way for further investigation into how mentorship interventions influence the MWA, thereby enhancing the overall quality of mentoring experiences. Additionally, we offer recommendations for future studies.
A growing literature explores the effect of economic inequality in citizens’ surrounding environment on their political attitudes and behavior. This literature typically relies on measures of income concentration or gap-size, which reflect under-tested presumptions about how citizens perceive the economic conditions surrounding them. Utilizing survey data to explore perception of economic inequality in Americans’ residential environment, this note finds that measures capturing income concentration or gap-size perform poorly relative to a measure capturing the joint prevalence of “haves” and “have-nots.” These results suggest that commonly used measures of economic inequality may not fully capture the features of people’s daily environment used to perceive the existence or magnitude of inequality. The results guide future research toward using contextual indicators that treat inequality as a compound phenomenon involving manifestations of poverty and affluence.
Conditioning on variables affected by treatment can induce post-treatment bias when estimating causal effects. Although this suggests that researchers should measure potential moderators before administering the treatment in an experiment, doing so may also bias causal effect estimation if the covariate measurement primes respondents to react differently to the treatment. This paper formally analyzes this trade-off between post-treatment and priming biases in three experimental designs that vary when moderators are measured: pre-treatment, post-treatment, or a randomized choice between the two. We derive nonparametric bounds for interactions between the treatment and the moderator under each design and show how to use substantive assumptions to narrow these bounds. These bounds allow researchers to assess the sensitivity of their empirical findings to priming and post-treatment bias. We then apply the proposed methodology to a survey experiment on electoral messaging.
We use data on Latino children in the United States who have been randomly assigned calculation tests in English or Spanish to check for the so-called bilingual advantage, the notion that knowing more than one language improves individuals’ other cognitive skills. After controlling for different characteristics of children and their parents, as well as children's time in the US, we find a bilingual advantage among children who read or write in English and Spanish but not for those who only speak or understand both languages. In particular, bilingual readers or writers perform one-fourth to one-third of a standard deviation better than monolingual children, equal to learning gains of an additional school year. Applying the Oster test, we find that selection on unobservables would need to be 3–4 times stronger than selection on observables to explain away our results. The bilingual advantage is stronger among children in two-parent households with siblings and for those at the upper end of the ability distribution.
This chapter clarifies the theoretical arguments through discussion of issues and questions that may arise in conceptualizing, testing, and evaluating not only comprehensive deterrence theory (CDT) but also, more generally, that can arise in deterrence research. For example, it discusses the nature of punishment. Deterrence scholarship understandably has examined the idea that punishments may deter. What has not been systematically theorized or empirically studied is punishment itself. Historical accounts exist, of course. And numerous scholars certainly have detailed many aspects of certain types of punishment, such as the death penalty. However, deterrence scholarship lacks a coherent foundation for predicting the effects of a wide variety of legal punishments, or how to distinguish when one type of punishment meaningfully differs from another. Similarly, there is a great deal of confusion about legal vs. extralegal punishment as well as specific vs. general deterrence. The chapter examines these and other issues with an eye towards clarifying CDT and charting directions for improving deterrence scholarship.
Mass polarization is one of the defining features of politics in the twenty-first century, but efforts to understand its causes and effects are often hindered by empirical challenges related to measurement and data availability. To address these challenges and provide a common standard of analysis for researchers, this Element presents the Polarization in Comparative Attitudes Project (PolarCAP). PolarCAP clearly defines polarization as a property of group relations and uses a Bayesian measurement model to estimate smooth panels of ideological and affective polarization across ninety-two countries and forty-nine years. The author uses these data to provide a descriptive account of mass polarization across time and space. They further show how PolarCAP facilitates substantive inference by applying it to three sets of variables often hypothesized as causes or consequences of polarization: institutional design, economic crisis, and democracy. Open-source software makes PolarCAP easily accessible to scholars and practitioners.
The field of criminology is limited by a 'hidden' measurement crisis. It is hidden because scholars either are not aware of the shortcomings of their measures or have implicitly agreed that scales with certain properties merit publication. It is a crisis because the approaches used to construct measures do not employ modern systematic psychometric methods. As a result, the degree to which existing measures have methodological limitations is unknown. The purpose of this Element is to unmask this hidden crisis and provide a case study demonstrating how to build a measure of a prominent criminological construct through modern systematic psychometric methods. Using multiple surveys and item response theory, it develops a ten-item scale of procedural justice in policing. This can be used in primary research and to adjudicate existing measures. The goal is to reveal the nature of the field's measurement crisis and show a strategy for solving it.
Over the past quarter-century, the literature on gender, peace, and security has evolved into a substantial interdisciplinary field. In this line of work, researchers have investigated the interplay between state security and women’s security, or how gender equality at the state level affects the occurrence of international and intranational conflict. The conclusion is that more gender-equal countries are less prone to engage in warfare, pointing toward a link between women’s security and national security. Various indicators have been used to capture gender equality in this literature, such as the representation of women in parliamentary roles, the proportion of women participating in the labor force, and school enrollment among girls relative to boys.
The standard measure of authoritarianism asks respondents about desirable qualities in children. Although these questions are gender-neutral, respondents may differ in the gender of the child in their heads when answering. The items also may tap into gendered expectations about boys' and girls' behavior. We conducted three experiments that randomly assigned respondents to be asked about a child, a boy, or a girl in the items. We compare the means, measurement properties, and correlation between authoritarianism and other important variables across the conditions. Asking respondents about a girl creates significant differences in the level and measurement of authoritarianism, which is partially driven by the respondents' sexism. There are, fortunately, few other significant differences in the correlates of authoritarianism.