1. Introduction
Overconfidence (and underconfidence) is a miscalibrated belief about own absolute or relative performance. It is a pervasive individual trait, with significant implications, and has been explored across various domains, including economics, psychology, and behavioral sciences. Overconfidence may be viewed as beneficial, as it may enhance ambition, morale, and persistence, creating a reinforcing mechanism where inflated confidence increases the likelihood of success (Bénabou & Tirole, Reference Bénabou and Tirole2002). For example, in a controlled laboratory experiment, Reuben et al., (Reference Reuben, Rey-Biel, Sapienza and Zingales2012) found that men tend to be more overconfident than women, leading to male overrepresentation in leadership positions. Using a British Cohort study, Adamecz-Völgyi and Shure (Reference Adamecz-Völgyi and Shure2022), estimate that overconfidence explains about 5–11% of the gender gap in top job employment for midlife adults.
However, research on overconfidence is not only relevant to understanding gender gapsFootnote 1 and overconfidence is not always beneficial. Inflated overconfidence can have a range of serious negative outcomes, such as increased new business entry and failure (Camerer & Lovallo, Reference Camerer and Lovallo1999), stock market bubbles (Glaser & Weber, Reference Glaser and Weber2007), ideological extremism (Ortoleva & Snowberg, Reference Ortoleva and Snowberg2015), wars (Johnson, Reference Johnson2004), and excessive litigation (Neale & Bazerman, Reference Neale and Bazerman1985). Overconfidence can have perverse effects including poor financial decision-making (Daniel et al., Reference Daniel, Hirshleifer and Subrahmanyam1998), more myopic investments (Kaplan et al., Reference Kaplan, Sørensen and Zakolyukina2022), and academic complacency (Vancouver et al., Reference Vancouver, More and Yoder2008). This begs the question, why are some people more overconfident than others?
We focus on the foundations of overconfidence by asking: ‘To what extent are interpersonal differences in overconfidence explained by differences in genes and in environment?.’ More accurately, and consistent with related studies, we analyze a miscalibrated belief in which positive values denote overconfidence and negative values underconfidence. Only two twin studies estimate the genetic heritability of overconfidence. Cesarini et al., (Reference Cesarini, Lichtenstein, Johannesson and Wallace2009) find that 16%–34% of interpersonal variation in the tendency to overestimate own relative performance (overplacement) is due to differences in genes among adult Swedish twins. Vogt et al. (Reference Vogt, Zheng, Briley, Malanchini, Harden and Tucker-Drob2022) attribute 18%–28% of interpersonal differences in overestimation (measured as a residual between subjective belief in and objective performance) to genes in a sample of children aged 7–15 years in Texas. The remaining variation in overconfidence in both studies was mainly due to unique personal circumstances.
We extend earlier work by measuring different types of overconfidence in one sample. We consider both overestimation, the overconfidence in own absolute performance, and overplacement, the overconfidence in own relative performance. Overestimation and overplacement are conceptually (and empirically) distinct (Moore & Healy, Reference Moore and Healy2008), with overestimation relying on imperfect information about own performance and overplacement additionally relying on imperfect beliefs about other’s performance. We estimate the extent to which these distinct traits are explained by genes and the extent to which their correlation is explained by shared genetic influences. The potential interaction between genes and the common social environment means that the heritability of behavioral traits may vary across social contexts. Existing twin studies provide only two data points: for adult Swedes and for children in Texas. Here, we study a different social context (Australia) and check whether previous findings generalize.
2. Data
We analyze the Australian Twins Economic Preferences Survey (ATEPS) (Kettlewell & Tymula, Reference Kettlewell and Tymula2021). 560 twin pairs (ages 18–66, mean 44.7) completed the online survey in 2020–21. The monozygotic pairs (identical) total 401, and 159 are dizygotic (fraternal).Footnote 2 83% of the sample is female and 59% have a university degree. Further demographic details are in Tables A1–A2.
In one task, participants undertook a cognitive challenge in which they were incentivized to solve ten puzzles from the matrix reasoning item bank, developed by Chierchia et al. (Reference Chierchia, Fuhrmann, Knoll, Pi-Sunyer, Sakhardande and Blakemore2019). Participants had 30 seconds to select the correct image from four available options to complete a sequence (Figure A1 presents an example).Footnote 3 They completed a practice round before starting the task. Puzzles had different difficulty levels, with an expected average score of 6/10. Participants earned AU$2 per correct answer if this task was chosen for payment.
Upon completing the task, participants reported ‘How many of the ten puzzles do you think you got right?’ and ‘Where do you think you will rank in the puzzle task compared to other twins in this study, out of 100?.’ As in Cesarini et al. (Reference Cesarini, Lichtenstein, Johannesson and Wallace2009), the raw differences between perceived and actual score and rank are our baseline measures of overconfidence (overestimation and overplacement).Footnote 4 However, raw differences can be a biased measure of overconfidence (see e.g., Cronbach & Furby, Reference Cronbach and Furby1970; Edwards, Reference Edwards2001; Juslin et al., Reference Juslin, Winman and Olsson2000). For example, we worry that very low/high scores constrain the ability to reveal under- and overconfidence due to floor and ceiling effects. Consequently, for our twin decomposition analysis, we again follow Cesarini et al. (Reference Cesarini, Lichtenstein, Johannesson and Wallace2009) and estimate our models controlling for performance in the individual puzzle task.Footnote 5
Participants correctly answered 5.0 (s.d. = 1.9) puzzles on average. They are slightly underconfident about their score (
$\mu $= − 0.57 s.d. = 2.23) but overconfident about their rank (
$\mu $=13.42 s.d. = 39.96). The distributions of both types of overconfidence look normal (Figure A2) and overestimation and overplacement are slightly correlated (
$r$=0.20).
3. Methodology
To decompose the variation in overconfidence into genetic and environmental effects, we exploit that monozygotic (MZ) twins are genetically identical, while dizygotic (DZ) twins share 50% of the genes on average (assuming no assortative mating). Greater discordance between DZ than MZ twins is then indicative of genetic heritability. Despite growing access to genomic data, twin studies relying on discordance remain the dominant approach to estimate heritability (Becker et al., Reference Becker, Burik, Goldman, Wang, Jayashankar, Bennett, Belsky, Karlsson Linnér, Ahlskog, Kleinman and Hinds2021). This is in part because of practical advantages – relatively small samples of twins are required to estimate heritability whereas tens or hundreds of thousands of genomes may need to be mapped. Additionally, twin studies have other appealing features such as capturing rare genetic variations and not being affected by incomplete genetic mapping (Friedman et al., Reference Friedman, Banich and Keller2021; Wainschtein et al., Reference Wainschtein, Jain, Zheng, Cupples, Shadyab, McKnight, Shoemaker, Mitchell and Psaty2022).Footnote 6
Formally, denote the variance of a trait
$Y$ as
$\sigma _Y^2 = \sigma _a^2 + \sigma _d^2 + \sigma _c^2 + \sigma _e^2$ where the independent components are additive genetic effects (
$\sigma _a^2$), dominance genetic effects (where expression of genes depends on interactions at particular loci) (
$\sigma _d^2$), environmental effects common to siblings (e.g., from shared parenting) (
$\sigma _c^2$), and environmental effects that are unique (
$\sigma _e^2$), which include noise. Identification comes from the assumption that the common family environment contributes to overall variance; the same for MZ and DZ twins. Because additive and dominance genetic effects cannot be separately identified,
$\sigma _d^2 = 0$ is frequently imposed. Under these assumptions, the correlation of
$Y$ between MZ twins (
${r_{MZ}}$) captures
$\left( {\sigma _a^2 + \sigma _c^2} \right)/\sigma _Y^2$ and between DZ twins (
${r_{DZ}}$)
$\left( {0.5\sigma _a^2 + \sigma _c^2} \right)/\sigma _Y^2$. These correlations can be manipulated to estimate narrow sense heritability:
${h^2} = \sigma _a^2/\sigma _Y^2 = 2*\left( {{r_{MZ}} - {r_{DZ}}} \right)$.
In practice, researchers assume normality for the variance components and use structural equation modeling (SEM) to estimate variance shares (Neale & Cardon, Reference Neale and Cardon1992). The SEM approach is appealing because it ensures variance shares sum to one, it can accommodate controls and can be extended to multivariate decompositions, here allowing to estimate the degree to which overlapping genes explain the correlation between different overconfidence types. We follow the SEM tradition and estimate the bivariate AC(D)E-Cholesky model, using the umx package for R (Bates et al., Reference Bates, Maes and Neale2019). We report our estimates as shares of explained variance.
We estimate four versions of the model and compare their fit. Our baseline model imposes
$\sigma _d^2 = 0$ (ACE), most common in the literature. Next, we impose
$\sigma _d^2 = \sigma _c^2 = 0$ (AE), then
$\sigma _a^2 = \sigma _d^2 = 0$ (CE), and finally
$\sigma _c^2 = 0$ (ADE). As discussed earlier, we follow Cesarini et al. (Reference Cesarini, Lichtenstein, Johannesson and Wallace2009) and estimate our models with and without controlling for individual puzzle task score (rank), since very low/high scores (ranks) constrain the ability to reveal under- and overconfidence. We use non-parametric bootstrap clustered at the twin pair level (999 replications) to estimate standard errors and confidence intervals.
4. Results
First, we compare pairwise correlations in overconfidence for MZ and DZ twins (Fig. 1).Footnote 7 For overestimation,
${r_{MZ}} = 0.11$ (s.e. = 0.049) and
${r_{DZ}} = - 0.02$ (s.e. = 0.089). The stronger correlation for MZ twins hints at the importance of genes. The lack of correlation for DZ twins suggests dominance genetic effects might be important (i.e., genes act multiplicatively); however, given fewer DZ twins this may reflect imprecision. Overplacement correlations are stronger,
${r_{MZ}} = 0.31$ (s.e. = 0.047) and
${r_{DZ}} = 0.11$ (s.e. = 0.080), pointing to genes accounting for part of the interpersonal differences.

Fig. 1 Overconfidence correlations
Next, we formally estimate the variance shares using SEM (Table 1). The SEM path diagram (Fig. 2) shows the ACE model standardized factor loadings, including the covariances. When adjusting for the task score (Panel A), we estimate genetic heritability of 17% [CI = 0.11%, 27%] for overplacement and 19% [12%, 27%] for overestimation in the ACE specification (AE is virtually identical). AE model fits best and ADE fits slightly better than ACE. The implied heritability is similar across model specifications with little evidence for the importance of common family environment. There is strong negative correlation between genes that explain overplacement and overestimation (Fig. 2). Following Kuntsi et al. (Reference Kuntsi, Eley, Taylor, Hughes, Asherson, Caspi and Moffitt2004), we calculate the genetic correlation coefficient = − 1.00 [−0.59, − 1] implying a near perfect relationship (for unadjusted overconfidence corr. = − 0.85 [−0.04, − 1]). This implies that genes that increase the tendency for overplacement decrease the tendency for overestimation.
Table 1 AC(D)E-Cholesky decomposition estimates

Notes: SEM estimates for the additive genetic effects (
${\hat a^2}$), dominance genetic effects (
${\hat d^2}$), common environment (
${\hat c^2}$), and unique environment (
${\hat e^2}$). Adjusted estimates control for the puzzle task score in the overestimation equation and the percentile rank in the overplacement equation; all estimates control for sex.
${{\Delta }}{\chi ^2}$ p-val tests the improvement in log-likelihood relative to ACE. Nonparametric bootstrap 95%-level confidence intervals clustered at the twin pair level in parentheses.

Fig. 2 SEM path diagram (standardized coefficients). Notes: Standardized factor loading for ACE model controlling for puzzle task score/rank
Without controlling for the task score (Panel B), the genetic variance estimates for overplacement are larger (29% [23%, 36%] in the best-fitting AE model), which is also the case in Cesarini et al. (Reference Cesarini, Lichtenstein, Johannesson and Wallace2009) (34% unadjusted, 16% adjusted). However, our estimates for overestimation become smaller (10% [3%, 18%]). In either case, conclusions are qualitatively similar – unique experiences explain most variation, but genes are a non-trivial factor.
5. Discussion
We provide estimates for the extent to which interpersonal differences in overconfidence are explained by genes versus environment in an Australian adult twins sample. Our unique contribution is that we study two types of overconfidence – overplacement and overestimation – in the same sample to uncover whether these distinct measures of overconfidence have different and related heritability. Two previous twin studies estimated the heritability of overconfidence. Cesarini et al. (Reference Cesarini, Lichtenstein, Johannesson and Wallace2009) estimated the heritability of overplacement in adult Swedes and Vogt et al. (Reference Vogt, Zheng, Briley, Malanchini, Harden and Tucker-Drob2022) estimated the heritability of overestimation in Texan children.
We find that genetic variation can account for 17%–29% of variation in overplacement, an estimate that is in line with Cesarini et al. (Reference Cesarini, Lichtenstein, Johannesson and Wallace2009). We estimate the heritability of overestimation at 10%–19%, less than 18–28% estimated for Texan children in Vogt et al. (Reference Vogt, Zheng, Briley, Malanchini, Harden and Tucker-Drob2022). While it is true that Vogt et al. (Reference Vogt, Zheng, Briley, Malanchini, Harden and Tucker-Drob2022) sample differs from ours, particularly in age, we cannot definitively say why our estimates are smaller than theirs.
Additionally, we find a negative genetic correlation between overplacement and overestimation suggesting that individuals who are genetically predisposed to exhibit more overplacement will show less overestimation. This estimate is fairly imprecise, and we caution against overinterpretation; nonetheless, the idea that the genes that drive overplacement might also nudge us away from overestimation is intriguing and could have practical implications. For example, researchers are increasingly using polygenetic scores to proxy for different traits. Our results imply that if a score is developed for overconfidence, researchers should be very careful in understanding what type of overconfidence the score reflects before applying it to their work, to avoid misinterpreting results. Moore and Healy (Reference Moore and Healy2008) defined three separate forms of overconfidence. In addition to overplacement, and overestimation studied in this paper, they also recognize overprecision which is an excessive certainty in the accuracy of one’s knowledge. The implications of our finding are similar to Moore and Healy (Reference Moore and Healy2008) who, using a basic Bayesian belief updating model, predict a negative correlation between overplacement and overestimation and support this theoretical prediction with their own data and previous literature. They find that in difficult tasks people tend to overestimate their own absolute but underplace their own relative performance, and in easy tasks people underestimate and overplace. Moore and Healy urged researchers to not assume that all different types of overconfidence are the same and our results underscore this recommendation.
Our estimates suggest overconfidence is largely due to idiosyncratic circumstances. The relatively modest pressure from genetic forces could mean this trait is quite malleable (although it bears mentioning genes are not deterministic and merely increase the chances of certain behaviors). Given the impact that both over- and underconfidence have on various aspects of our lives, and considering their malleability, future research should focus on understanding how to achieve optimal levels across all types of overconfidence. We estimate that the variance due to genes is smaller for overestimation than for overplacement, suggesting that overestimation should be more easily influenced by non-genetic factors than overplacement.
Finally, we note that our study focused on a specific sample – Australian adults – and measured overconfidence with a single task. While our results align with previous findings for adults (Cesarini et al., Reference Cesarini, Lichtenstein, Johannesson and Wallace2009), further research utilizing a variety of tasks designed to measure different types of overconfidence is necessary.
Supplementary material
The supplementary material for this article can be found at https://doi.org/10.1017/esa.2025.10016
Competing interest
Nothing to declare.
Funding
This research was facilitated through access to Twins Research Australia, a national resource supported by the National Health & Medical Research Council (NHMRC) since 1981, administered by the University of Melbourne. We received funding support from the University of Technology Sydney and from the Australian Research Council’s Centre of Excellence for Children and Families over the Life Course (Project ID CE140100027 and CE200100025).