1. Introduction
The first papers describing nearly complete genome sequences were typically entitled ‘The genome of …’. Implicit in these titles was the assumption that much of the genome is shared between different members of a species. Looking back today, it is clear that a great deal has been learned from the study of genes conserved in each species, the proteins they encode, their regulatory elements and so on. Similarly, the commonalities as well as fixed differences between both closely and more distantly related species have greatly informed biology, including our understanding of evolution over different time scales.
As interesting as all this knowledge is, we cannot fully understand biology without considering the genetic differences between individuals. These are not only at the root of adaptation to specific environments but also underlie susceptibility to disease and abiotic factors. Such seemingly deleterious variants are often linked by evolutionary trade-offs, where genes and alleles that are favourable in one environment become a liability in a different environment. The original genome papers for different species, therefore, were often quickly followed by attempts to record interindividual differences at the whole-genome level. Pioneering in this regard were scientists working with the plant Arabidopsis thaliana. The initial genome paper already contained information on shotgun sequences from a second strain in addition to the one from which the genome sequence had been primarily generated (The Arabidopsis Genome Initiative, 2000). Soon thereafter, the first genome-wide polymorphism analyses were published, at increasingly higher resolution, relying on a series of different technologies (Clark et al., Reference Clark, Schweikert, Toomajian, Ossowski, Zeller, Shinn, Warthmann, Hu, Fu, Hinds, Chen, Frazer, Huson, Schölkopf, Nordborg, Rätsch, Ecker and Weigel2007; Kim et al., Reference Kim, Plagnol, Hu, Toomajian, Clark, Ossowski, Ecker, Weigel and Nordborg2007; Nordborg et al., Reference Nordborg, Borevitz, Bergelson, Berry, Chory, Hagenblad, Kreitman, Maloof, Noyes, Oefner, Stahl and Weigel2002, Reference Nordborg, Hu, Ishino, Jhaveri, Toomajian, Zheng, Bakker, Calabrese, Gladstone, Goyal, Jakobsson, Kim, Morozov, Padhukasahasram, Plagnol, Rosenberg, Shah, Wall, Wang and Bergelson2005). Some of the approaches could interrogate in principle every position in the reference genome, but were limited by the extent of sequence divergence that could be recorded, and in highly divergent or missing regions, the exact sequence remained unknown (Clark et al., Reference Clark, Schweikert, Toomajian, Ossowski, Zeller, Shinn, Warthmann, Hu, Fu, Hinds, Chen, Frazer, Huson, Schölkopf, Nordborg, Rätsch, Ecker and Weigel2007). This remained the case when much cheaper short-read sequencing entered the scene, although with increasing lengths of short reads, more and more of the genome became accessible to polymorphism analyses (Ossowski et al., Reference Ossowski, Schneeberger, Clark, Lanz, Warthmann and Weigel2008)]. Importantly, short-read sequencing provided access to sequences that were not present in the original reference genome, and some of these could even be anchored to reference positions (Cao et al., Reference Cao, Schneeberger, Ossowski, Günther, Bender, Fitz, Koenig, Lanz, Stegle, Lippert, Wang, Ott, Müller, Alonso-Blanco, Borgwardt, Schmid and Weigel2011; Gan et al., Reference Gan, Stegle, Behr, Steffen, Drewe, Hildebrand, Lyngsoe, Schultheiss, Osborne, Sreedharan, Kahles, Bohnert, Jean, Derwent, Kersey, Belfield, Harberd, Kemen, Toomajian and Mott2011; Long et al., Reference Long, Rabanal, Meng, Huber, Farlow, Platzer, Zhang, Vilhjálmsson, Korte, Nizhynska, Voronin, Korte, Sedman, Mandáková, Lysak, Seren, Hellmann and Nordborg2013; Ossowski et al., Reference Ossowski, Schneeberger, Clark, Lanz, Warthmann and Weigel2008; Schneeberger et al., Reference Schneeberger, Ossowski, Ott, Klein, Wang, Lanz, Smith, Cao, Fitz, Warthmann, Henz, Huson and Weigel2011). The picture for other plant species, especially the crops rice and maize, was broadly similar, albeit typically with a delay of a few years (Gao et al., Reference Gao, Gonda, Sun, Ma, Bao, Tieman, Burzynski-Chang, Fish, Stromberg, Sacks, Thannhauser, Foolad, Diez, Blanca, Canizares, Xu, van der Knaap, Huang, Klee and Fei2019; Li et al., Reference Li, Zhou, Ma, Jiang, Jin, Zhang, Guo, Zhang, Sui, Zheng, Zhang, Zuo, Shi, Li, Zhang, Hu, Kong, Hong, Tan and Qiu2014; Zhao et al., Reference Zhao, Feng, Lu, Li, Wang, Tian, Zhan, Lu, Zhang, Huang, Wang, Fan, Zhao, Wang, Zhou, Chen, Zhu, Li, Weng and Huang2018).
The use of short reads for the identification of sequence polymorphisms that distinguish the focal individual from the reference strain begins with the mapping of reads against the reference genome sequence. A limitation, therefore, is the reference bias that is caused by the degree of mismatch between a specific short read and its target. Although mismatches may still allow confident mapping, more reads can be mapped with a more closely related genome. This insight led early on to the suggestion of producing synthetic reference genome sequences that represented all possible combinations of polymorphisms across the genome, including those not yet discovered in the genomes analysed (Schneeberger et al., Reference Schneeberger, Hagmann, Ossowski, Warthmann, Gesing, Kohlbacher and Weigel2009).
A less obvious problem has been that a specific sequence might be present only once in one genome but multiple times in another genome. And even if a sequence is present only once, it might occur in different regions of the genome across individuals. In both cases, short-read mapping can be misleading. For example, heterozygosity may be inferred when in reality there are two closely related but not identical duplicated fragments in the short-read sequenced genome.
Rather than relying on a single reference genome, the concept of the pangenome was introduced to capture all sequence variation within a species. First applied in bacterial genomes (Tettelin et al., Reference Tettelin, Masignani, Cieslewicz, Donati, Medini, Ward, Angiuoli, Crabtree, Jones, Durkin, DeBoy, Davidsen, Mora, Scarselli, Peterson, Hauser, Sundaram, Nelson and Fraser2005), this framework quantified core and dispensable genes across populations, highlighting extensive diversity. As sequencing costs declined, the pangenome approach was extended to eukaryotic genomes, including humans and Arabidopsis thaliana (Cao et al., Reference Cao, Schneeberger, Ossowski, Günther, Bender, Fitz, Koenig, Lanz, Stegle, Lippert, Wang, Ott, Müller, Alonso-Blanco, Borgwardt, Schmid and Weigel2011; Li et al., Reference Li, Li, Zheng, Luo, Zhu, Li, Qian, Ren, Tian, Li, Zhou, Zhu, Wu, Qin, Jin, Li, Cao, Hu, Blanche and Wang2010; Sudmant et al., Reference Sudmant, Kitzman, Antonacci, Alkan, Malig, Tsalenko, Sampas, Bruhn, Shendure and Eichler2010). Over time, this concept evolved into a broader model encompassing the full genomic landscape of a population, species or clade. A somewhat unfortunate fact is that ‘pangenome’ these days more often refers to the collection of a specific set of genomes rather than the ideal of all reasonably common variants in a population.
2. The evolution of building pangenome graphs
One intuitive way to overcome the limitations of a linear reference genome is through genome graphs, which offer a compact data structure where sequences are represented as node-labelled graphs with edges connecting variants across multiple genomes. Unlike traditional linear representations, genome graphs preserve original coordinates by tracing paths through the sequence graph, accommodating both shared and unique genomic regions. The interpretability of pangenome graphs and their level of detail exist on a two-dimensional spectrum. At one extreme, highly abstract graphs (Figure 1a,b), such as those representing every nucleotide variant with numerous loops and alternative paths, may be difficult to understand and have limited practical use. At the other extreme, unaligned genomic sequences, while easy to interpret, can obscure meaningful genomic differences. Depending on the approach, fixed k-mer-based de Bruijn graphs (Figure 1a,b) and fully aligned multi-individual genomes (Figure 1c–h) represent different points along this continuum (Figure 1i).

Figure 1. Multiple approaches to building pangenome graphs. (a) A graph that has only four nodes, corresponding to the four DNA bases, with all possible connections between the nodes. (b) A k-mer graph based on short sequences (here, triplets). (c) and (d) The same sequences are combined in different representations, which highlights the equivalency of a multiple genome alignment and a genome graph. (e) The same sequences shown in VCF format in a symbolic manner. (f), (g) and (h) represent the entire chromosome 1 from five A. thaliana individuals. (f) A Biforst 21-mer graph. (g) A Minigraph-Cactus graph. (h) A PGGB graph. (i) Summary of the trade-offs in node size and complexity for different types of graphs. Note that even with the same method, parameter choice will result in different graphs from identical sequences.
2.1. Variation-first: Augmenting the reference
Early pangenome graph construction was constrained by practical realities: high-quality genome assemblies were expensive and rare, while catalogues of variation from resequencing projects were abundant. This imbalance led to reference-augmented approaches that embedded known variants into linear reference backbones. These methods typically integrate variant genotyping within the same workflow, though here we focus on the graph construction aspect.
Schneeberger et al. (Reference Schneeberger, Hagmann, Ossowski, Warthmann, Gesing, Kohlbacher and Weigel2009) pioneered this concept with GenomeMapper, demonstrating that incorporating known polymorphisms could reduce mapping bias in Arabidopsis thaliana. The approach was later extended to the major histocompatibility complex region in humans (Dilthey et al., Reference Dilthey, Cox, Iqbal, Nelson and McVean2015), where high diversity makes linear references particularly inadequate. These early successes established the fundamental principle that graph representations could capture variation more faithfully than linear sequences. It was further generalised by several groups to the whole genomes of thousands of human individuals, with each group using distinct strategies (Eggertsson et al., Reference Eggertsson, Jonsson, Kristmundsdottir, Hjartarson, Kehr, Masson, Zink, Hjorleifsson, Jonasdottir, Jonasdottir, Jonsdottir, Gudbjartsson, Melsted, Stefansson and Halldorsson2017; Garrison et al., Reference Garrison, Sirén, Novak, Hickey, Eizenga, Dawson, Jones, Garg, Markello, Lin, Paten and Durbin2018; Rakocevic et al., Reference Rakocevic, Semenyuk, Lee, Spencer, Browning, Johnson, Arsenijevic, Nadj, Ghose, Suciu, Ji, Demir, Li, Toptaş, Dolgoborodov, Pollex, Spulber, Glotova, Kómár and Kural2019). GraphTyper (Eggertsson et al., Reference Eggertsson, Jonsson, Kristmundsdottir, Hjartarson, Kehr, Masson, Zink, Hjorleifsson, Jonasdottir, Jonasdottir, Jonsdottir, Gudbjartsson, Melsted, Stefansson and Halldorsson2017) iteratively realigned, clipped and unaligned short reads with an embedded genome graph for small variant calling. The VG toolkit (Variation Graph toolkit, (Garrison et al., Reference Garrison, Sirén, Novak, Hickey, Eizenga, Dawson, Jones, Garg, Markello, Lin, Paten and Durbin2018; Hickey et al., Reference Hickey, Heller, Monlong, Sibbesen, Sirén, Eizenga, Dawson, Garrison, Novak and Paten2020) emerged as the first comprehensive open-source framework for this reference-augmented paradigm. VG constructs variation graphs by threading known variants from VCF files into reference genomes or directly from genome alignment, creating alternative paths that represent different allelic states. It supports complex structural variations, which include duplications and inversions, using a bidirectional cyclic graph. Graph Genome pipeline (Rakocevic et al., Reference Rakocevic, Semenyuk, Lee, Spencer, Browning, Johnson, Arsenijevic, Nadj, Ghose, Suciu, Ji, Demir, Li, Toptaş, Dolgoborodov, Pollex, Spulber, Glotova, Kómár and Kural2019) also supports structural variants (SVs) for genotyping with high speed, but it is limited to human genomes and is not openly distributed.
Due to the flexibility of constructing graphs from precomputed VCFs, VG has become the backbone of many pipelines. By incorporating variants derived from genome assemblies or filtered long-read alignments, VG-based workflows have been successfully applied across diverse species, including humans and crops (Liu et al., Reference Liu, Du, Li, Shen, Peng, Liu, Zhou, Zhang, Liu, Shi, Huang, Li, Zhang, Wang, Zhu, Han, Liang and Tian2020; Qin et al., Reference Qin, Lu, Du, Wang, Chen, Chen, He, Ou, Zhang, Li, Li, Li, Liao, Gao, Tu, Yuan, Ma, Wang, Qian and Li2021; Sirén et al., Reference Sirén, Monlong, Chang, Novak, Eizenga, Markello, Sibbesen, Hickey, Chang, Carroll, Gupta, Gabriel, Blackwell, Ratan, Taylor, Rich, Rotter, Haussler, Garrison and Paten2021; Zhou et al., Reference Zhou, Zhang, Bao, Li, Lyu, Zan, Wu, Cheng, Fang, Wu, Zhang, Lyu, Lin, Gao, Saha, Mueller, Fei, Städler, Xu and Huang2022).
2.2. The alignment-first paradigm: Towards unbiased representation
With decreasing costs and improving quality of long-read sequencing, generating multiple high-quality genome assemblies has become increasingly feasible, shifting the bottleneck from data generation to comparative analysis. This shift enabled a new paradigm in pangenome graph construction – moving from reference-based variant threading to graph building through whole-genome alignments directly, adopting an ‘alignment-first’ approach. Theoretically, this approach can reduce reference bias and better capture complex structural variations, including inversions, duplications and rearrangements that are hard to encode using VCF-based models.
Even before genome graphs were formally introduced, a multiple genome alignment (MGA) already served as an implicit representation of shared and divergent sequence features across assemblies. Multiple sequence alignments (MSAs) naturally lend themselves to representation as partially ordered sequence (POA) graphs (Lee et al., Reference Lee, Grasso and Sharlow2002), which have been extended into A-Bruijn graphs (Raphael et al., Reference Raphael, Zhi, Tang and Pevzner2004) and cactus graphs (Paten et al., Reference Paten, Earl, Nguyen, Diekhans, Zerbino and Haussler2011) to better accommodate genome rearrangements and duplications. Mauve (A. C. E. Darling et al., Reference Darling, Mau, Blattner and Perna2004) and TBA (Threaded Blockset Aligner) (Blanchette et al., Reference Blanchette, Kent, Riemer, Elnitski, Smit, Roskin, Baertsch, Rosenbloom, Clawson, Green, Haussler and Miller2004) represent some of the earliest efforts to align genome regions across multiple species. Vaughn and colleagues (Vaughn et al., Reference Vaughn, Branham, Abernathy, Hulse-Kemp, Rivers, Levi and Wechter2022) recently used progressiveMauve (Darling et al., Reference Darling, Mau and Perna2010) to align melon genomes and convert them into a genome graph for genotyping.
To bridge traditional alignment and graph construction, several intermediate tools have been developed. REVEAL (Recursive Exact-Matching Aligner) (Linthorst et al., Reference Linthorst, Hulsman, Holstege and Reinders2015) employs a recursive exact-matching strategy to construct alignments, while tools like NovoGraph (Biederstedt et al., Reference Biederstedt, Oliver, Hansen, Jajoo, Dunn, Olson, Busby and Dilthey2018) and Seq-seq-pan (Jandrasits et al., Reference Jandrasits, Dabrowski, Fuchs and Renard2018) utilise progressive or block-based alignment strategies to scale MGAs to a large number of genomes. ProgressiveCactus (Armstrong et al., Reference Armstrong, Hickey, Diekhans, Fiddes, Novak, Deran, Fang, Xie, Feng, Stiller, Genereux, Johnson, Marinescu, Alföldi, Harris, Lindblad-Toh, Haussler, Karlsson, Jarvis and Paten2020; Paten et al., Reference Paten, Earl, Nguyen, Diekhans, Zerbino and Haussler2011) dramatically improves scalability using a guide-tree-based alignment strategy. Its output can be used as alignment input to the VG toolkit, enabling the inclusion of large duplications and inversions in yeast (Garrison et al., Reference Garrison, Sirén, Novak, Hickey, Eizenga, Dawson, Jones, Garg, Markello, Lin, Paten and Durbin2018). This approach provided the first workflow for converting an MGA into a graph that can be used both to infer genotype information and to map short reads. SibeliaZ (Minkin & Medvedev, Reference Minkin and Medvedev2020) generalised these ideas based on information from a de Bruijn graph to construct improved MGAs.
The Human Pangenome Reference Consortium (HPRC) (Liao et al., Reference Liao, Asri, Ebler, Doerr, Haukness, Hickey, Lu, Lucas, Monlong, Abel, Buonaiuto, Chang, Cheng, Chu, Colonna, Eizenga, Feng, Fischer, Fulton and Paten2023) has greatly advanced the field by releasing an initial pangenome draft from 47 humans, constructed using methods like Minigraph, Minigraph-Cactus and Pangenome Graph Builder (PGGB). Minigraph (Li et al., Reference Li, Feng and Chu2020) extended the minimap2 chaining algorithm to progressively add large SVs (>50 bp) into the graph. Minigraph-Cactus (Hickey et al., Reference Hickey, Monlong, Ebler, Novak, Eizenga, Gao, Marschall, Li and Paten2024) recruits the graph from Minigraph as a backbone. It then adds base-level alignments after clipping sequences that are highly divergent from a chosen reference sequence (‘clipping’ is the technical term for removing the portion of a query sequence that cannot be confidently aligned to the target genome). The details of these graphs will depend on the order of the input of sequences or the divergence between samples in the collection of genomes (Garrison & Guarracino, Reference Garrison and Guarracino2023), but it simplifies the graph structure and makes the graph suitable for downstream genotyping tasks. Similarly, ACMGA (AnchorWave-Cactus Multiple Genome Alignment) (Zhou et al., Reference Zhou, Su and Song2024) combines cactus with AnchorWave, which improves the alignment of long repetitive sequences in the plant genomes, for detection of large SVs (Song et al., Reference Song, Marco-Sola, Moreto, Johnson, Buckler and Stitzer2022). Huijse et al. (Reference Huijse, Adams, Burton, David, Julian, Meshulam-Simon, Mickalide, Tafesse, Calonga-Solís, Wolf, Morrison, Augusto and Endlich2023) found that AnchorWave outperformed Minigraph-Cactus in producing alignments in the highly divergent MHC region of human genomes. The PGGB (Garrison et al., Reference Garrison, Guarracino, Heumos, Villani, Bao, Tattini, Hagmann, Vorbrugg, Marco-Sola, Kubica, Ashbrook, Thorell, Rusholme-Pilcher, Liti, Rudbeck, Golicz, Nahnsen, Yang, Mwaniki and Prins2024) tries to capture all variations in the input sequences by constructing and all-to-all genome alignment by wfmash and rendering it with seqwish (Garrison & Guarracino, Reference Garrison and Guarracino2023) and GFAffix, then further consensus with smoothxg. While this approach offers a comprehensive representation of variations, the computational demands of all-to-all alignments are substantial. Instead of building a whole genome graph, PGR-TK (PanGenomic Research Took Kit) (Chin et al., Reference Chin, Behera, Khalak, Sedlazeck, Sudmant, Wagner and Zook2023) rapidly constructs subgraphs of specific regions using data structures designed for long-read assembly (Chin & Khalak, Reference Chin and Khalak2019; Li, Reference Li2016); it was shown to be very fast in rebuilding the complex variations in MHC haplotypes, though its use demands substantial expertise for parameter tuning and result interpretation.
2.3. Scalable alternatives to whole-genome alignments
Over the past decade, the complexity and scalability challenges of constructing and querying large genome graphs have become increasingly apparent. As a result, researchers have explored pangenomes using analyses based on specific sequence blocks – such as orthologous genes or k-mers – rather than base-resolution DNA sequences. Different strategies have been developed to make pangenome analyses more scalable, each with its own trade-offs. K-mer based approaches are computationally efficient, making them attractive for large-scale comparisons. However, they sacrifice sequence context and struggle to distinguish between repeats, particularly in complex eukaryotic genomes. In contrast, gene-based methods are more interpretable and extensible across genomes but depend heavily on good gene annotation. Annotation quality in turn is dependent on a range of factors, such as the availability of RNA and proteomics data, whether a genome is from a taxon that contains other well-annotated genomes and so on. The good news is that ever more comprehensive sampling, at the level of individuals (tissues and conditions), populations, species and higher-order groupings will undoubtedly improve gene annotations.
In bacterial pangenomics, gene presence–absence matrices generated by orthogroup clustering with OrthoMCL have been the standard (Contreras-Moreira & Vinuesa, Reference Contreras-Moreira and Vinuesa2013; Li et al., Reference Li, Stoeckert and Roos2003; Page et al., Reference Page, Cummins, Hunt, Wong, Reuter, Holden, Fookes, Falush, Keane and Parkhill2015). This strategy was subsequently extended by incorporating gene graphs in tools like PPanGGOLiN (Gautreau et al., Reference Gautreau, Bazin, Gachet, Planel, Burlot, Dubois, Perrin, Médigue, Calteau, Cruveiller, Matias, Ambroise, Rocha and Vallenet2020) and Panaroo with partitioned and fixed annotation error (Tonkin-Hill et al., Reference Tonkin-Hill, MacAlasdair, Ruis, Weimann, Horesh, Lees, Gladstone, Lo, Beaudoin, Floto, Frost, Corander, Bentley and Parkhill2020). Genome Complexity Browser visualised and quantified variability with orthogroup inference (Manolov et al., Reference Manolov, Konanov, Fedorov, Osmolovsky, Vereshchagin and Ilina2020). PanPA constructs graphs based on protein sequence alignments (Dabbaghie et al., Reference Dabbaghie, Srikakulam, Marschall and Kalinina2023), and Pangene leverages rapid protein alignments to build gene graphs for eukaryotic genomes, enabling analysis of gene copy number changes and orientation – remarkably, it can build a graph from 100 human haplotypes in under one minute (Li et al., Reference Li, Marin and Farhat2024).
Although implicit graphs constructed from fixed k-mers provide a valuable snapshot of genomic diversity, their resolution is inherently limited, and other tools have taken different routes. PanTools (Sheikhizadeh et al., Reference Sheikhizadeh, Schranz, Akdel, de Ridder and Smit2016) detects homology groups with k-mers and builds a database for pan-proteome query, while PanKmer (Aylward et al., Reference Aylward, Petrus, Mamerto, Hartwick and Michael2023) and Panagram (Benoit et al., Reference Benoit, Jenike, Satterlee, Ramakrishnan, Gentile, Hendelman, Passalacqua, Suresh, Shohat, Robitaille, Fitzgerald, Alonge, Wang, Santos, He, Ou, Golan, Green, Swartwood and Lippman2025) decompose assembled genomes into a k-mer database with further ability to locate specific positions in assemblies. Furthermore, methods like Biforst (Holley & Melsted, Reference Holley and Melsted2020) and mdBG (Ekim et al., Reference Ekim, Berger and Chikhi2021) efficiently construct de Bruijn graphs for storage and rapid querying; they can be applied to genotype variable tandem repeats with short reads (Lu et al., Reference Lu and Chaisson2021), though they fall short in accurately representing complete loci for downstream analyses (Andreace et al., Reference Andreace, Lechat, Dufresne and Chikhi2023).
3. Variant calling in the graph era
Once a pangenome graph is constructed, it can serve as an enhanced reference for genotyping resequenced samples – either by aligning reads or matching k-mers – capturing a broader range of sequence variation than linear references. While many current tools rely on read mapping or k-mer comparison to identify SNPs and SVs, some have advanced to support haplotype reconstruction and the detection of novel variants – capabilities that are particularly effective with long-read resequencing.
Among these, one of the most widely adopted and versatile tools is the Variation Graph Toolkit (VG), which provides a comprehensive framework for mapping, small variant calling, and SV genotyping. VG has become popular since its first open-source release (Garrison et al., Reference Garrison, Sirén, Novak, Hickey, Eizenga, Dawson, Jones, Garg, Markello, Lin, Paten and Durbin2018; Hickey et al., Reference Hickey, Heller, Monlong, Sibbesen, Sirén, Eizenga, Dawson, Garrison, Novak and Paten2020). It also reduces reference bias in ancient samples (Martiniano et al., Reference Martiniano, Garrison, Jones, Manica and Durbin2020). Another VG module, Giraffe (Sirén et al., Reference Sirén, Monlong, Chang, Novak, Eizenga, Markello, Sibbesen, Hickey, Chang, Carroll, Gupta, Gabriel, Blackwell, Ratan, Taylor, Rich, Rotter, Haussler, Garrison and Paten2021), was developed as a successor of VG map to accelerate the process for large-scale genotyping. PHG (Practical Haplotype Graph) utilises established tools for mapping against linear references (e.g., GATK) for genotyping in the offspring of crops (Bradbury et al., Reference Bradbury, Casstevens, Jensen, Johnson, Miller, Monier, Romay, Song and Buckler2022). DRAGEN (Dynamic Read Analysis for GENomics) (Behera et al., Reference Behera, Catreux, Rossi, Truong, Huang, Ruehle, Visvanath, Parnaby, Roddey, Onuchic, Finocchio, Cameron, English, Mehtalia, Han, Mehio and Sedlazeck2024) is currently the fastest for mapping and genotyping against pangenome references, exploiting hardware acceleration and tricks from machine learning, but it requires a commercial license. Apart from directly mapping to a graph, mapping reads to multiple references first and then injecting them into a graph based on mapping coordinates is another direction; one example is Gfa2bin (Vorbrugg et al., Reference Vorbrugg, Bezrukov, Bao and Weigel2024) and cosigt (Bolognini et al., Reference Bolognini, Halgren, Lou, Raveane, Rocha, Guarracino, Soranzo, Chin, Garrison and Sudmant2024), which uses node coverage across multiple references for genotyping with mapping by bwa (Li, Reference Li2013). Such approaches benefit from the maturity of linear reference mapping and the compatibility of their outputs with downstream graph-based analyses. Mapping long reads directly to genome graphs has become increasingly viable. Graphaligner (Rautiainen & Marschall, Reference Rautiainen and Marschall2020) is the first tool to achieve long-read mapping to a graph with a seed-and-extend strategy, with much higher speed than VG. Minigraph (Li et al., Reference Li, Feng and Chu2020) can find approximate mapping locations without base-level alignment, while Minichain (Chandra et al., Reference Chandra, Gibney and Jain2024) introduces a recombination penalty for long reads mapping to the graph.
To sidestep the computational cost of full mapping, many tools employ k-mer comparison strategies that match sequencing reads to known variants encoded in the graph. PanGenie (Ebler et al., Reference Ebler, Ebert, Clarke, Rausch, Audano, Houwaart, Mao, Korbel, Eichler, Zody, Dilthey and Marschall2022) and KAGE (Grytten et al., Reference Grytten, Dagestad Rand and Sandve2022, Reference Grytten, Rand and Sandve2023) compare k-mers from reads to a pangenome graph to reduce run time and mapping bias. Ensemble Variant Genotyper (Du et al., Reference Du, He and Jiao2024) is a framework designed to standardise the performance of various genotyping tools by accounting for the genomic features specific to plant species. Varigraph (Du et al., Reference Du, He, Xiao, Hu, Yang and Jiao2025) further optimised the k-mer-based approach with memory efficiency and extended the model for dosage estimation in autopolyploid genomes. A drawback is that these tools only genotype the known variations independently and thus cannot reconstruct the haplotypes in the population. To address this gap, Locityper (Prodanov et al., Reference Prodanov, Plender, Seebohm, Meuth, Eichler and Marschall2025) and cosigt (Bolognini et al., Reference Bolognini, Halgren, Lou, Raveane, Rocha, Guarracino, Soranzo, Chin, Garrison and Sudmant2024) have been developed to utilise read alignment profiles to locate the closest haplotype in the graph.
Furthermore, SV calling directly from pangenome graphs remains a critical challenge. To overcome the issues, SVarp (Soylev et al., Reference Soylev, Ebler, Pani, Rausch, Korbel and Marschall2024) tackles this by locally assembling potential SV alleles from long-read data, while PALSS (Denti et al., Reference Denti, Bonizzoni, Brejova, Chikhi, Krannich, Vinar and Hormozdiari2025) augments the graph with the consensus from sample-specific long reads without mapping.
In summary, the field of pangenome graph construction is dynamic, with no single tool dominating; the optimal tool choice depends on the specific research objectives and the desired resolution. Reference-based variation graphs, for instance, facilitate population genetics analyses across extensive cohorts but may omit certain genomic variations. Tools like PGGB offer comprehensive graph representations; however, their complexity can pose challenges for downstream applications such as VG Giraffe alignment, necessitating tailored pruning strategies for effective read mapping. Notably, developing and benchmarking efforts have predominantly centred on human genomics. Given that non-human species, including plant genomes, are often much more diverse than human genomes, there is a need for expanded evaluation across diverse species of tools for the building and use of pangenomes.
4. Functional pangenomics: Linking variation to mechanism
Reference bias not only affects variant discovery. Its shortcomings have knock-on effects in downstream functional analyses, including the comparison of chromatin accessibility, gene expression, or DNA methylation (Galli et al., Reference Galli, Chen, Ghandour, Chaudhry, Gregory, Feng, Li, Schleif, Zhang, Dong, Song, Walley, Chuck, Whipple, Kaeppler, Huang and Gallavotti2025; Igolkina et al., Reference Igolkina, Vorbrugg, Rabanal, Liu, Ashkenazy, Kornienko, Fitz, Collenberg, Kubica, Mollá Morales, Jaegle, Wrightsman, Voloshin, Bezlepsky, Llaca, Nizhynska, Reichardt, Bezrukov, Lanz and Nordborg2025). Compared to the growing adoption of genome graphs for structural variation calling and genotyping, much more still needs to be done to take advantage of graph-based frameworks for functional genomics.
Grytten et al. (Reference Grytten, Rand, Nederbragt, Storvik, Glad and Sandve2019) implemented Graph Peak Caller to identify ChIP-seq peaks using a variation graph in A. thaliana, identifying more than twice as many base pairs absent from the linear reference than had been found with previous methods. DNA methylation studies have revealed analogous benefits – and also underscore the extent of reference bias in functional assays. In cattle, using the wrong reference genome can lead to substantial errors in methylation quantification, with up to ~2% global bias and large numbers of methylated cytosines being affected by breed-specific variation (MacPhillamy et al., Reference MacPhillamy, Chen, Hiendleder, Williams, Alinejad-Rokny and Low2024). In Arabidopsis thaliana, methylation profiling was even more sensitive to reference choice, with only ~88% of sites being consistent between reference and focal strain, with one major reason being that transposable elements, which are prime targets of DNA methylation, have been much more active in this species than, for example, in humans (Igolkina et al., Reference Igolkina, Vorbrugg, Rabanal, Liu, Ashkenazy, Kornienko, Fitz, Collenberg, Kubica, Mollá Morales, Jaegle, Wrightsman, Voloshin, Bezlepsky, Llaca, Nizhynska, Reichardt, Bezrukov, Lanz and Nordborg2025). To address this, methylGrapher (Zhang et al., Reference Zhang, Macias-Velasco, Zhuo, Belter, Tomlinson, Garza, Tekkey, Li and Wang2025) introduced the first graph-based approach for mapping bisulfite sequencing data. Compared to traditional methods such as Bismark, it uniquely identified 2.2–2.9 million mCpGs across five human samples, many of which were absent from the reference or misclassified as unmethylated before.
Reference bias also affects RNA-seq analysis. In Arabidopsis thaliana, expression estimates diverged for a subset of genes depending on whether reads were mapped to the reference genome or to the accession’s own genome; these genes were strongly enriched for transposable elements and copy number–variable loci (Igolkina et al., Reference Igolkina, Vorbrugg, Rabanal, Liu, Ashkenazy, Kornienko, Fitz, Collenberg, Kubica, Mollá Morales, Jaegle, Wrightsman, Voloshin, Bezlepsky, Llaca, Nizhynska, Reichardt, Bezrukov, Lanz and Nordborg2025). Similar trends were observed in barley, but at an even higher rate, where mapping transcriptomic reads to a pan-transcriptome built with 20 genotypes improved the mapping rate by around 11% compared to a single linear reference (Guo, Schreiber et al. Reference Guo, Schreiber, Marosi, Bagnaresi, Jørgensen, Braune, Chalmers, Chapman, Dang, Dockter, Fiebig, Fincher, Fricano, Fuller, Haaning, Haberer, Himmelbach, Jayakodi, Jia and Waugh2025). VG rpvg (Sibbesen et al., Reference Sibbesen, Eizenga, Novak, Sirén, Chang, Garrison and Paten2023) extends genome graph approaches to RNA-seq analysis by building spliced pangenome graphs and quantifying expression along haplotype-resolved paths (Sibbesen et al., Reference Sibbesen, Eizenga, Novak, Sirén, Chang, Garrison and Paten2023). These methods improve accuracy and enable haplotype-specific quantification, even without prior haplotype phasing, but they are ideally based on comprehensive pan-transcriptome annotation, which is absent in most species. Haplotype information in turn is immensely useful in outbred species, and perhaps even more so, in polyploid species with their complex allele ratios (Bao et al., Reference Bao, Li, Li, Wang, Peng, Cheng, Li, Zhang, Li, Huang, Ye, Dong, Cheng, VanderZaag, Jacobsen, Bachem, Dong, Zhang, Huang and Zhou2022; Bird et al., Reference Bird, Brock, Grabowski, Harder, Healy, Shu, Barry, Boston, Daum, Guo, Lipzen, Walstead, Grimwood, Schmutz, Lu, Comai, McKay, Pires, Edger and Kliebenstein2025; Du et al., Reference Du, He, Xiao, Hu, Yang and Jiao2025).
Despite these advances, graph-based approaches to functional genomics remain in their infancy. Few tools have been developed, and most remain proof-of-concept applications limited to model species. Even where tools exist, broader adoption has been slow, partly due to the lack of comprehensive functional annotations and the complexity of graph-aware analytical workflows. Expanding these approaches across multiple omics layers – including methylation, expression, chromatin states and chromatin accessibility – and to diverse species with more complex genomes remains a critical challenge for future research (Figure 2).

Figure 2. Functional pangenomics. (a) A pangenome graph can integrate diverse layers of functional annotations (e.g., genes, transposons, methylation level) in its reference coordinate system and serve as a unified platform for cross-genome comparison. (b) Graph nodes can be used directly for genome-wide association analyses. The colors represent different length-based categories for the node, while node shapes indicate whether the sequence originates from the chosen reference genome. Broken line indicates nominal significance threshold. This figure was adapted from (Vorbrugg, Bezrukov, Bao, Xian, et al., 2024).
5. Navigating the tangled graph: visualisation, comparison and scalability
Although there are multiple strategies for graph construction, most approaches now adopt the Graphical Fragment Assembly (GFA) format to store graph information. Unfortunately, querying large-scale pangenomes remains challenging due to the inherent complexity and enormous size of these graphs. For instance, the VG toolkit offers a versatile suite of functions to construct, convert and manipulate genome graphs, but even with VG, extracting information from Gb-scale pangenomes can be nontrivial. To overcome scalability issues, several specialised tools have been developed (Figure 3a). ODGI (Optimised Dynamic Genome/Graph Implementation) (Guarracino et al., Reference Guarracino, Heumos, Nahnsen, Prins and Garrison2022) implements scalable algorithms to visualise graphs at multiple resolutions, to extract specific loci and to compare path similarities. Meanwhile, tools such as Gretl (Vorbrugg et al., Reference Vorbrugg, Bezrukov, Bao, Xian and Weigel2024) are designed to evaluate the quality of multiple graphs by providing a range of quantitative metrics for graph description and comparison. PANCAT (Dubois et al., Reference Dubois, Zytnicki, Lemaitre and Faraut2025) characterises differences among variation graphs derived from the same sequence set using edit distance metrics.

Figure 3. Timeline of pangenome graph algorithms. (a) The differently coloured circles indicate the main functions of tools; some workflows/tools may have multiple usages. For tools described in journals, we use the date of publication, but we note that many colleagues in this area are very generous and often release their tools long before formal publication. Given the rapid development in this area, it is perhaps not surprising that some tools only have a public GitHub repository. In the text, we provide hyperlinks as references. (b) The number of graph-based tools developed for different purposes.
On the visualisation side, early GUI-based tools like Bandage (Wick et al., Reference Wick, Schultz, Zobel and Holt2015) and GfaViz (Gonnella et al., Reference Gonnella, Niehus and Kurtz2019) provide whole-graph views of assembly graphs but are limited when it comes to base level or Gb-scale pangenome graphs. VG view and VG viz can display sequences up to about 100 kb, whereas SequenceTubemap (Beyer et al., Reference Beyer, Novak, Hickey, Chan, Tan, Paten and Zerbino2019) adopts an intuitive visualisation model (inspired by public transport network maps) to display variation graphs along with read mappings at the appropriate scale. Momi-G (Yokoyama et al., Reference Yokoyama, Sakamoto, Seki, Suzuki and Kasahara2019) extends this concept for large-scale SV inspection in human variation graphs, and ODGI viz further expands on the VG viz layout by exporting rasterised images suitable for chromosome-scale genome graphs.
Efforts to integrate graph layouts with functional annotation are also emerging. For example, VRPG, a visualisation and interpretation framework for linear reference–projected pangenome graphs (Miao & Yue, Reference Miao and Yue2025), extracts subgraphs based on reference path coordinates and annotations, while PPanG (Liu, Zhang et al., Reference Liu, Zhang, Lu, Xue, Dong, Li, Xu, Wang and Wei2024) adapts the SequenceTubemap framework to display multiple genome annotations through embedded JBrowse2 components in real time. Additionally, Gfaestus (https://github.com/chfi/gfaestus) leverages GPU frameworks to visualise full graphs from projects like HPRC, and waragraph (https://github.com/chfi/waragraph) can integrate annotation information into ODGI layouts interactively.
Compared to graph construction, the visualisation and comparison of pangenome graphs have lagged significantly. While multiple tools exist for assembling and processing variation graphs, there is still no comprehensive, scalable and interactive visualisation framework that can handle large-scale pangenomes efficiently and connect the functional annotation (Figure 3b). As the pangenome expands to hundreds of individuals rapidly, it could even go beyond species, and extracting biological knowledge from the complex tangles in graphs requires better tools than what is currently available.
6. Conclusion and perspectives
The development of eukaryotic pangenomics has entered a transformative phase. Advances in sequencing technologies and assembly algorithms have made it feasible to generate high-quality genomes at population scale (Antipov et al., Reference Antipov, Rautiainen, Nurk, Walenz, Solar, Phillippy and Koren2025; Cheng et al., Reference Cheng, Asri, Lucas, Koren and Li2024; Koren et al., Reference Koren, Bao, Guarracino, Ou, Goodwin, Jenike, Lucas, McNulty, Park, Rautiainen, Rhie, Roelofs, Schneiders, Vrijenhoek, Nijbroek, Nordesjo, Nurk, Vella, Lawrence and Phillippy2024). As a result, pangenome references constructed from tens to hundreds of assemblies now exist for a growing number of species, including foundational species such as Arabidopsis thaliana (Kang et al., Reference Kang, Wu, Liu, Liu, Zhu, Han, Liu, Chen, Song, Tan, Yin, Zhao, Yan, Lou, Zan and Liu2023; Lian et al., Reference Lian, Huettel, Walkemeier, Mayjonade, Lopez-Roques, Gil, Roux, Schneeberger and Mercier2024; Wlodzimierz et al., Reference Wlodzimierz, Rabanal, Burns, Naish, Primetis, Scott, Mandáková, Gorringe, Tock, Holland, Fritschi, Habring, Lanz, Patel, Schlegel, Collenberg, Mielke, Nordborg, Roux and Henderson2023), key crops (Cheng et al., Reference Cheng, Wang, Bao, Zhou, Guarracino, Yang, Wang, Zhang, Tang, Zhang, Wu, Zhou, Zheng, Hu, Lian, Ma, Lassois, Zhang, Lucas and Huang2025; Guo et al., Reference Guo, Li, Lu, Zhao, Kurata, Wei, Wang, Wang, Zhan, Fan, Zhou, Lu, Tian, Weng, Feng, Huang, Zhang, Gu, Wang and Han2025; Hufford et al., Reference Hufford, Seetharam, Woodhouse, Chougule, Ou, Liu, Ricci, Guo, Olson, Qiu, Della Coletta, Tittes, Hudson, Marand, Wei, Lu, Wang, Tello-Ruiz, Piri and Dawe2021; Liu et al., Reference Liu, Du, Li, Shen, Peng, Liu, Zhou, Zhang, Liu, Shi, Huang, Li, Zhang, Wang, Zhu, Han, Liang and Tian2020; Lynch et al., Reference Lynch, Padgitt-Cobb, Garfinkel, Knaus, Hartwick, Allsing, Aylward, Bentz, Carey, Mamerto, Kitony, Colt, Murray, Duong, Chen, Trippe, Harkess, Crawford, Vining and Michael2025; Zhou et al., Reference Zhou, Zhang, Bao, Li, Lyu, Zan, Wu, Cheng, Fang, Wu, Zhang, Lyu, Lin, Gao, Saha, Mueller, Fei, Städler, Xu and Huang2022) and humans (Liao et al., Reference Liao, Asri, Ebler, Doerr, Haukness, Hickey, Lu, Lucas, Monlong, Abel, Buonaiuto, Chang, Cheng, Chu, Colonna, Eizenga, Feng, Fischer, Fulton and Paten2023). Its application has shown that additional variations capture some of the heritability previously missed (Zhou et al., Reference Zhou, Zhang, Bao, Li, Lyu, Zan, Wu, Cheng, Fang, Wu, Zhang, Lyu, Lin, Gao, Saha, Mueller, Fei, Städler, Xu and Huang2022), find more associations between variations and agronomic traits (Hufford et al., Reference Hufford, Seetharam, Woodhouse, Chougule, Ou, Liu, Ricci, Guo, Olson, Qiu, Della Coletta, Tittes, Hudson, Marand, Wei, Lu, Wang, Tello-Ruiz, Piri and Dawe2021) and can uncover the complex evolution history of well-studied loci (Bolognini et al., Reference Bolognini, Halgren, Lou, Raveane, Rocha, Guarracino, Soranzo, Chin, Garrison and Sudmant2024).
Nevertheless, capturing the full spectrum of variation across a species in an unbiased and comprehensive way remains a challenge. While tools such as Minigraph-Cactus use iterative construction to simplify the process of graph alignment, they are sensitive to input order and tend to discard sequences that diverge too much from the reference – an issue especially problematic in high-diversity species (Cheng et al., Reference Cheng, Wang, Bao, Zhou, Guarracino, Yang, Wang, Zhang, Tang, Zhang, Wu, Zhou, Zheng, Hu, Lian, Ma, Lassois, Zhang, Lucas and Huang2025; Garrison & Guarracino, Reference Garrison and Guarracino2023). On the other hand, all-to-all alignment approaches, such as PGGB, provide more complete graphs but require substantial computational resources, making them impractical for datasets involving hundreds of genomes (Lynch et al., Reference Lynch, Padgitt-Cobb, Garfinkel, Knaus, Hartwick, Allsing, Aylward, Bentz, Carey, Mamerto, Kitony, Colt, Murray, Duong, Chen, Trippe, Harkess, Crawford, Vining and Michael2025). Similarly, genotyping tools face scalability constraints for large graphs: VG Giraffe, for instance, typically downsamples to 64 haplotypes prior to mapping (Sirén et al., Reference Sirén, Monlong, Chang, Novak, Eizenga, Markello, Sibbesen, Hickey, Chang, Carroll, Gupta, Gabriel, Blackwell, Ratan, Taylor, Rich, Rotter, Haussler, Garrison and Paten2021).
Progress in these areas depends on the availability of high-quality benchmark datasets for validation. Yet such resources are scarce in non-human species, and even in human genomics, benchmarking is often confined to a few well-characterised individuals (Dwarshuis et al., Reference Dwarshuis, Kalra, McDaniel, Sanio, Alvarez Jerez, Jadhav, Huang, Mondal, Busby, Olson, Sedlazeck, Wagner, Majidian and Zook2024). This creates systematic bias and limits our ability to assess how well genome graphs capture rare, complex or population-specific variation. Developing robust metrics and comparative frameworks to evaluate graph quality remains a crucial direction for the field.
Moreover, the continued reliance on biallelic SNP models restricts the development of population genetic theory capable of explaining the full complexity of pangenomic variation. However, SVs are generated by a wide range of distinct mutational mechanisms – including non-homologous end joining, non-allelic homologous recombination, template-switching, nested transposon insertions and tandem repeat expansion – that often result in multi-allelic loci rather than simple binary variants (Collins & Talkowski, Reference Collins and Talkowski2025). In association studies, the assumption of biallelic variation also introduces confounding effects, particularly in the presence of genetic heterogeneity (Liu et al., Reference Liu, Fu, Xu and Nordborg2025). Incorporating haplotype-aware models may reveal additional associations that are otherwise missed due to the underlying complexity of SVs and the impact of multi-allelic loci (Smith et al., Reference Smith, Strausz, FinnGen, Spence, Ollila and Pritchard2025; Zhou et al., Reference Zhou, Zhang, Bao, Li, Lyu, Zan, Wu, Cheng, Fang, Wu, Zhang, Lyu, Lin, Gao, Saha, Mueller, Fei, Städler, Xu and Huang2022). As such, there is a growing need to develop new population genetic models.
Looking ahead, integrating pangenome graphs with evolutionary models remains a wide-open frontier. Ancient hybridisation, incomplete lineage sorting, and structural rearrangements complicate cross-species alignment, increasing the difficulty of graph construction. Yet global biodiversity sequencing projects (Lewin et al., Reference Lewin, Robinson, Kress, Baker, Coddington, Crandall, Durbin, Edwards, Forest, Gilbert, Goldstein, Grigoriev, Hackett, Haussler, Jarvis, Johnson, Patrinos, Richards, Castilla-Rubio and Zhang2018) are beginning to fill the tree of life with genome assemblies. Embedding phylogenetic history directly into graph construction – rather than treating it as a downstream layer – may help generate more meaningful, interpretable graphs.
In contrast to advances in variant calling, the interpretation, visualisation and benchmarking of pangenome graphs are still in early stages. There is still no equivalent of IGV for intuitive graph exploration, and widely adopted formats for complex variant representation are lacking. While tools like ODGI offer useful summaries and visualisations, they lack interactivity and scalability for Gb-scale graphs. Even more critically, the integration of functional genomics with graph frameworks remains far behind. At present, RNA-seq, methylation, and chromatin accessibility data cannot be seamlessly analysed in graph-aware contexts. Bridging this gap will require unified, scalable methods for aligning and interpreting multi-omic data within graph-based references for genotype-phenotype association.
As we enter the next phase of pangenomic research, the field faces substantial computational and modelling hurdles. Yet the growing ecosystem of graph-based methods offers more than just an ever-expanding toolkit – it provides the foundation for a new paradigm in genomics. By embracing the full complexity of genomic variation, pangenome graphs have the potential to reshape how we conduct association studies, trace evolutionary history and interpret regulatory landscapes. Moving beyond reference bias and linear constraints, these graphs can unify population-scale diversity, functional readouts and comparative signals across the tree of life. Realizing this vision will require not only scalable tools and new theoretical frameworks but also sustained community efforts in benchmarking, visualisation and data integration. Still, the promise is profound: to build genomic models that reflect biological reality more accurately, and in doing so, to understand evolution in unprecedented detail.
Open peer review
To view the open peer review materials for this article, please visit http://doi.org/10.1017/qpb.2025.10028.
Acknowledgements
We thank both our colleagues from the 1001 Genomes Project in the Weigel and Nordborg labs and the participants of the International Genome Graph Symposium (IGGSy'24) in Ascona 2024 for inspiration, and especially Andrea Guarracino and Erik Garrison from the University of Tennessee Health Science Center for extensive discussions.
Competing interest
D.W. holds equity in Computomics, which advises plant breeders. D.W. also consults for KWS SE, a globally active plant breeder and seed producer.
Author contributions
Z.B. wrote the draft of the manuscript. Z.B. and D.W. revised the manuscript.
Funding statement
Our work is supported by the International Max Planck Research School (IMPRS) ‘From Molecules to Organisms’ (Z.B.), the Novozymes Prize of the Novo Nordisk Foundation (D.W.), and the Max Planck Society.




Comments
Dear QPB Team,
Apologies for being so late in submitting our review. The good news is that we are really happy with the outcome (and that there are by now enough recent papers for this to be meaningful).
Best,
Detlef Weigel