Introduction
Patterns in the diversity and distribution of mineral-forming elements have pointed to systematic trends in the chemical character of these elements, as proposed by the Goldschmidt geochemical classification (Goldschmidt, Reference Goldschmidt1937; Moore et al., Reference Moore, Golden, Morrison, Hao and Spielman2022; Hummer et al., Reference Hummer, Ma, Xiang, Zhang, Liu, Hazen, Golden and Downs2023; Lodders and Hazen, Reference Lodders and Hazen2026). This useful classification model groups elements into one of four preferred hosts phases, including siderophile (in iron and other metals), chalcophile (in sulfides and other chalcogenides), lithophile (in oxygen-based rocks), and atmophile (in the atmosphere as gases). Most mineral-forming elements occur predominantly with other elements in their geochemical group.
As Goldschmidt emphasised, many elements, especially those with multiple oxidation states, have affinities for more than one host. For example, iron occurs as a siderophile native element and as chalcophile or lithophile Fe2+ and Fe3+ ions (e.g. Wood and Kiseeva, Reference Wood and Kiseeva2015). Recent efforts to expand mineral formulas to include oxidation states (see https://rruff.info/ima/#, accessed 05 February 2025; Lafuente et al., Reference Lafuente, Downs, Yang, Stone, Armbruster and Danisi2015) have resulted in the tabulation of ions in most mineral species. Thus, for example, the iron-bearing olivine fayalite (Fe2SiO4) is recast as Fe2+2Si4+O2–4. The resulting formulas offer the opportunity to expand the Goldschmidt approach by identifying the preferred host phases for redox-sensitive, mineral-forming ions in these minerals. Furthermore, the availability of the comprehensive RRUFF and Mindat (https://mindat.org, accessed 05 February 2025) data resources, coupled with mineral informatics methods of network analysis with community detection and heatmap analysis with agglomerative clustering, offer a rigorous, data-driven approach to this problem.
Mineral-forming ions
The concept of a mineral-forming ‘ion’ is a flawed but useful fiction. Despite the common description of many mineral structures in terms of ionically bonded cations and anions, theoretical models and experimentally determined electron density studies demonstrate that the chemical bonds in oxide and sulfide minerals feature both significant ionic and covalent aspects (Pauling, Reference Pauling1960; Gibbs et al., Reference Gibbs, Ross, Cox and Rosso2014). For example, Si–O bonds in silicates, C–O bonds in carbonates and S–O bonds in sulfates are intermediate on the spectrum between predominantly ionic (i.e. with the transfer of electron density from the cation/metal to the anion/oxygen), such as Na–Cl bonds in halite (NaCl), and predominantly covalent (i.e. sharing of electron density across the bond), as in As–S bonds in orpiment (As2S3).
One can employ relative electronegativities (Pauling, Reference Pauling1932), χ, to assign cations versus anions, as well as to estimate the relative ionicity of those bonds. For example, Na+ (χ = 0.93) is the positively-charged cation relative to the anion Cl– (χ = 3.16); the relatively large electronegativity difference of 2.23 units suggests a predominantly ionic bond in halite (Na+Cl–). Similarly, As3+ (χ = 2.18) is the cation relative to S2– (χ = 2.58) in the arsenic sulfide orpiment (As3+2S2–3), though the 0.40 difference in electronegativity is significantly less than in NaCl and points to a predominantly covalent bond.
In this study we focus on ions that co-exist in a mineral (as opposed to ions directly bonded to each other), as recorded in each mineral’s chemical formula using the International Mineralogical Association (IMA) approved formulae at https://rruff.info/ima/# (accessed 5 February 2025). We only consider minerals with anions O2– and/or halogens (F–, Cl–, Br–, I–), as well as select minerals with C4–, N3–, or S2–. Consequently, we do not include several important mineral groups with likely metal–metal, cation–cation, or anion–anion bonding, including native elements; predominantly covalently bonded sulfides, arsenides and other chalcogenides; sulfosalts; hydrocarbon minerals; or borides, carbides, nitrides, phosphides and silicides. These restrictions result in the removal in this study of eight noble metal elements (Au, Ir, Os, Pd, Pt, Re, Rh and Ru) from the list of 72 mineral-forming elements.
We acknowledge that nominal oxidation states can be informative even in the case of minerals with non-ionic bonding. For example, the reaction of pyrite (FeS2) to pyrrhotite (Fe7S8) requires a change in the number of bonded electrons, because Fe remains effectively Fe2+ but a portion of S transitions from S1– to S2– (Letard et al., Reference Letard, Sainctvit and Deudon2007, and references therein). Even though the distribution of electrons is not strongly centred around a given Fe or S atom, owing to nearly metallic bonding, the nominal oxidation states still reveal how the electron distribution is different between different materials. The problem with including most sulfides and sulfosalts is that for many of these minerals the nominal oxidation state is unknown or highly debated (e.g. Pearce et al., Reference Pearce, Pattrick, Vaughan, Henderson and van der Laan2006). Therefore, trying to include them in this study would add uncertainties to the dataset that are not present in minerals with oxygen or halogens as anions.
These constraints result in 4834 IMA-approved minerals for which the oxidation states of 64 essential chemical elements, including 87 different ions, are unambiguously defined (Table 1). Among the characteristics of these ions and ion pairs:
(1) The 25 most common ions in the 4834 mineral species considered, all with more than 250 occurrences, are O2– (4741), H+ (3230), Si4+ (1618), Ca2+ (1428), Al3+ (1157), Na+ (1107), Fe3+ (740), Mg2+ (738), P5+ (701), S6+ (609), K+ (545), Mn2+ (545), Fe2+ (488), Cu2+ (467), As5+ (463), F– (458), Cl– (405), Ti4+ (391), C4+ (381), Pb2+ (372), B3+ (302), U6+ (283), Zn2+ (279), REE3+ (274) and Ba2+ (260).
(2) By contrast, four of the 87 ions (C4–, Hf4+, I3+ and V2+) have been recorded in only one mineral species, whereas two ions (In3+ and S2+) occur only twice in our compilation. (Note that, though ‘S2+’ is a useful way to designate thiosulfate, the thiosulfate ion in fact incorporates equal numbers of S6+ and S2–).
(3) The majority of the 87 mineral-forming ions are cations, including 1+ (10 examples), 2+ (17), 3+ (18, including ‘REE3+’, which lumps all species-determining trivalent rare earth elements), 4+ (17), 5+ (10) and 6+ (7).
(4) Of the 87 mineral-forming ions tabulated, eight are anions, including 1– (F–, Cl–, Br–, I–), 2– (O2–, S2–), 3– (N3–), and 4– (C4–).
(5) Most ion pairs never form. With 87 different mineral-forming ions, there exists 3741 {i.e. [(87 × 87) – 87]/2} possible pairs between two different ions. However, only 1392 (37.2%) of these possible pairs are found to co-occur one or more times in the 4834 minerals in our survey. Furthermore, 280 of those observed pairs (20.1% of the 1392) are only found in one mineral.
(6) The five most commonly co-occurring ions are also bonded cation–O2– pairs, including: H+–O2– (3222 pairs), Si4+–O2– (1611), Ca2+–O2– (1412), Al3+–O2– (1139), and Na+–O2– (1084), with significant co-occurrence of Fe3+–O2– (735), Mg2+–O2– (731), P5+–O2– (699), and S6+–O2– (607). The most commonly co-occurring pairs of cations (not bonded pairs) are Si4+–H+ (1058), Ca2+–H+ (985), Na+–H+ (719), Mg2+–H+ (719), Al3+–H+ (880), Ca2+–Si4+ (699), and Al3+–Si4+ (618), while the most common co-occurring anion-anion pairs (also not bonded) are O2––F– (407) and O2––Cl– (367).
Table 1. Distribution of 87 mineral-forming ions among Clusters and Communities

* Based on a survey of 4834 minerals; see Supplementary Table 1.
** Not included in this visualisation.
*** All trivalent lanthanide elements are lumped into REE3+.
Methods
Employing data recorded at https://rruff.info/ima/# (accessed 5 February 2025), we compiled an xlsx spreadsheet with 4834 rows, each representing a different mineral species approved by the International Mineralogical Association’s Committee on New Minerals, Nomenclature, and Classification (IMA-CNMNC; Burke Reference Burke2006; Mills et al., Reference Mills, Hatert, Nickel and Ferrais2009; Schertl et al., Reference Schertl, Mills and Maresch2018), and 87 columns, each representing a different mineral-forming ion (Supplementary Table 1).
The case of trivalent rare earth elements deserves special attention, because official IMA formulas for REE minerals are inconsistent. In some cases (at least 36 instances) ‘REE’ appears in an approved mineral formula. In two additional minerals, aeschynite-Nd and aeschynite-Y, the symbol ‘Ln’, presumably for ‘lanthanide’, appears in the formula. However, in many other minerals the formula specifies a dominant rare earth element, including Ce3+ (167 minerals), Dy3+ (1), Er3+ (1), Gd3+ (2), La3+ (63), Nd3+ (34), Sm3+ (2), and Yb3+ (5). Given that all 14 stable rare earth elements in the 3+ oxidation state will co-occur in all these mineral species, we have combined these varied instances in the data analysis process into one ion—REE3+. In addition, note that six occurrences of Ce4+ are treated separately in this study (see Supplementary Tables 1, 2 and 3).
Unipartite network analysis and Louvain community detection of coexisting ions
A principal objective of this study is to apply community detection and agglomerative clustering to discern patterns in the co-occurrence of mineral-forming ions. Network analysis reveals communities of associated members of a group, whether people in a social setting or minerals in a natural assemblage, through links between nodes (Kolaczyk, Reference Kolaczyk2009; Newman, Reference Newman2013; Morrison et al., Reference Morrison, Liu, Eleish, Prabhu, Li, Ralph, Downs, Golden, Fox, Hummer, Meyer and Hazen2017; Hazen et al., Reference Hazen, Morrison, Prabhu, Walter and Williams2023a). Each node represents a member of the population, while each link represents an association between two nodes. Figure 1 displays a unipartite network that illustrates the coexistence of 79 mineral-forming ions, as listed in Table 1.

Figure 1. A unipartite network of 79 mineral-forming ions (coloured circles), with links between pairs of ions when at least 18% of the less common ion also occurs with the more common ion. Node sizes indicate the relative abundances of ions, while colours indicate six communities of ions that were determined using Louvain community detection (see text). Each of these six communities (see text) corresponds to groups of ions that co-occur relatively frequently. This figure is a static image of an interactive web-based graphic. One can vary the percentage cutoff for links between nodes by sliding ‘Ion Cutoff’ vernier in the interactive version of this graph at https://observablehq.com/d/726edcffdbc8fecc. Slide the‘Ion cutoff by ABS to remove the least common ions, based on the absolute numbers of occurrences. Use the ‘remove Ion’ feature to remove one or more ions from the network. Click and hold your cursor on any node to move that node and identify links to other nodes.
This graph was built on ‘Observable’ (https://observablehq.com/), using D3js (Bostock et al., Reference Bostock, Ogievetsky and Heer2011). The networks use the D3-force algorithm (https://d3-wiki.readthedocs.io/zh_CN/master/Force-Layout/) for its network layout. This force-directed graph treats edges between nodes as an array of interconnected springs. The interactive network can be rotated into varied orientations; therefore, the visual distances between nodes will vary based on the projection employed. The code and an interactive version of this network can be found at: https://observablehq.com/d/726edcffdbc8fecc (for instructions, see Fig. 1 caption).
Each of the 79 nodes in Fig. 1 represents a mineral-forming ion. Note that eight of the 87 ions in Table 1 and Supplementary Table 1 are not incorporated as nodes in Fig. 1 to avoid skewing the results and interpretation of network analysis. We apply two criteria for removing these eight ions:
(1) We exclude the two most common ions, O2– [found in 4741 (98.1%) of the 4834 minerals studied] and H+ [in 3230 minerals (66.8%)]. These ions are so abundant as to span all potential network communities—a situation that significantly increases the density of the network while diminishing the discrimination among several communities.
(2) We exclude six of the rarest ions—C4–, Hf4+, I3+, In3+, Mo4+, S2+ and V2+—because all coexisting pairs that incorporate any of these ions are only known from one mineral. Links between nodes in our network models are based on the percentage occurrence of the rarer ion with the more common ion. Thus, ion pairs of the rarest ions will inevitably result in a high-percentage linkage that skews network topologies.
In Fig. 1, the node size indicates the number of occurrences of that ion in our tabulations of 4834 mineral species (Supplementary Table 1). Links between pairs of nodes indicate ion co-occurrence. Figure 1 illustrates the case where at least 18% of minerals incorporating the less common ion also incorporate the more common ion (based on percentages of ion co-occurrences tabulated in Supplementary Tables 1 and 3). Figure 1 is a static rendering of one state of the online dynamic interactive network in which several attributes of the network can be varied, including: (1) the minimum co-occurrence percentage, P, which can be varied from 1 to 70%; (2) the minimum number of minerals in Supplementary Table 1 with two co-occurring ions, which can be varied from 1 to 500; (3) the node size and the font size of node labels; and (4) the force constant of the force-directed network. Features also include the ability to exclude specific ions from the network and to recalculate distribution of ions among communities in real time as network parameters change. In addition, one can choose among four different community detection algorithms: Louvain, Informap, Label Propagation and Walktrap.
An important feature of Fig. 1 is the nonuniform distribution of ions among several communities that represent the most closely associated groups of ions. To determine the ion community structure in Fig. 1 we employed Louvain community detection analysis (Girvan and Newman, Reference Girvan and Newman2002; Fortunato, Reference Fortunato2010), which is based on modularity optimisation (Blondel et al., Reference Blondel, Guillaume, Lambiotte and Lefebvre2008). This method identifies members of a group in two steps: (1) form small communities, starting with each node as its own community, followed by maximising modularity of certain nodes; (2) aggregate each small community into ‘super nodes’ to form a new ‘super node network’. These two steps are repeated until the network is optimised.
The Louvain modularity approach is unsupervised with respect to the number of resultant communities. Based on the network with P ≥ 18% and excluding eight ions as noted above, we observe two principal communities of 31 and 23 mineral-forming ions, two smaller communities of 13 and eight ions, as well as two binary communities, each with two ions (Table 1). Each community represents a distinct suite of ions that commonly co-occur.
Community 1 (Louvain)
Thirty one orange-coloured ion nodes dominate the left half of Fig. 1 as a well-defined Community 1. These ions include several of the most common lithophile cations in igneous and metamorphic oxides and silicates—e.g. Al3+, Ca2+, Fe2+, K+, Mg2+, Na+ and Si4+ found in feldspar, mica, pyroxene, amphibole, garnet and clay mineral groups, as well as many other rock-forming phases. Some of these common ions adopt central positions with high network centrality, which anchor an arcuate arrangement of other Community 1 nodes in the lower left. Community 1 features the defining ions of carbonates (C4+) and phosphates (P5+), as well as ions of primary minerals in complex pegmatites (B3+, Be2+, Cs+, F–, Li+, Rb+, Sc3+, Sn4+, Th4+, U4+; Černý, Reference Černý1991; London, Reference London2008), in carbonatites and/or agpaitic rocks (REE3+, Ce4+, Nb5+, Ti3+, Ti4+, Y3+, Zr4+; Mitchell, Reference Mitchell1996a, Reference Mitchell1996b; Marks and Markl, Reference Marks and Markl2017), and in layered mafic intrusions (Cr3+; O’Driscoll and Van Tongeren, Reference O’Driscoll and Van Tongeren2017). Note that Community 1 also includes most of the ions that are characteristic of stellar minerals (Hazen and Morrison, Reference Hazen and Morrison2020) and the earliest nebular condensates found in chondrite meteorites (Morrison and Hazen, Reference Morrison and Hazen2020). Most of the ions in Community 1 occur in only one oxidation state—exceptions being Cr, I, S, Ti, U and V. Notably absent from Community 1 ions are semi-metal chalcophile elements. As a result, the average electronegativity of 30 Community 1 cations is 1.45—the lowest of any community, while the only anion (F–) has the highest known electronegativity, 3.98. Consequently, the average bonding character among minerals with Community 1 ions, all of which contain O2– and/or F–, tends to be more ionic than in the other communities.
Community 2 (Louvain)
The 23 red-coloured nodes in Fig. 1 are concentrated in a circular region of the upper half of Fig. 1. A halo of smaller nodes represents less common ions, most of which are connected to the centrally located nodes for relatively common Cu2+ and Pb2+. Community 2 is notable for hosting five of the eight anions in this study (Br–, Cl–, I–, N3– and S2–). The halogen anions are common constituents of hydrothermal brines and evaporite minerals (Guilbert and Park, Reference Guilbert and Parks1986; Pirajno, Reference Pirajno2009; Poot et al., Reference Poot, Buelens, Dekoninck, Rochez and Yans2024); whereas N3– is found in 95 ammonium-bearing biominerals, many of which are associated with Phanerozoic urine, guano and plant decay (Hazen and Morrison, Reference Hazen and Morrison2022). The sulfide anion, S2–, occurs in 22 of the minerals recorded in Supplementary Table 1, all of which also incorporate O2– and/or a halogen anion. Community 2 includes monovalent and divalent transition metal cations (Ag+, Cu+, Cu2+, Hg+ and Hg2+) typical of near-surface hydrothermal settings. Of special note are the close associations among Ag+, Cu+ and Hg+ with the halogens Br–, Cl– and I– (Dunning et al., Reference Dunning, Hadley, Magnasco, Christy and Cooper2005; Hazen et al., Reference Hazen, Golden, Downs, Hysted, Grew, Azzolini and Sverjensky2012). Of the remaining ions, ten are of metalloids (Ge, Sb and Te) or other cations with relatively high electronegativity (Pb, S, Se and Tl). Community 2 also hosts Cr6+, which is found in 26 chromate minerals, both as alteration phases via oxidation of other Cr-bearing minerals in soils, and as hydrothermal alteration products (Liu et al., Reference Liu, Hystad, Golden, Hummer, Downs, Morrison, Grew and Hazen2017; Morrison et al., Reference Morrison, Liu, Eleish, Prabhu, Li, Ralph, Downs, Golden, Fox, Hummer, Meyer and Hazen2017, their figure 4). Additionally, Community 2 features the nitrate (N5+) ion, which invariably forms the (N5+O2–3)-anionic group and is found in 20 minerals typically derived from urine or guano. Most of the Community 2 ions are relatively rare; 16 of the 23 ions are found in fewer than 30 of the 4834 minerals recorded in Supplementary Table 1. The average electronegativities of Community 2’s 18 cations and five anions are 2.12 and 2.88, respectively—a substantially smaller difference than for Community 1, suggesting a comparatively more covalent average character. Of note is the strong discrimination between ions of Communities 1 and 2 in Fig. 1—a topology that suggests these two contrasting groups of ions do not frequently co-occur.
Community 3 (Louvain)
The 13 green nodes of Community 3, located in the centre-right of Fig. 1, include primarily ions of siderophile and chalcophile transition metals (Cd2+, Co2+, Co3+, Mo5+, Mo6+, Ni2+, V4+, V5+ and Zn2+). Note that this community has no anions; rather, this community links the anionic clusters (As5+O2–4)3– and (S6+O2–4)2– that are closely associated with hundreds of arsenate and sulfate minerals, most of which form through alteration/oxidation of primary chalcogenide deposits (Chang et al., Reference Chang, Howie and Zussman1996; Hazen and Morrison, Reference Hazen and Morrison2022). In this respect, Community 3 is more closely tied to (and partially overlaps with) Community 2 in Fig. 1. The average electronegativity of Community 3’s 13 cations is 1.92, which is intermediate between those of Communities 1 and 2.
Community 4 (Louvain)
The eight blue nodes of Community 4 represent cations of two commonly associated first-row siderophile transition elements (Fe3+, Mn2+, Mn3+ and Mn4+), as well as the defining cations of antimonates (Sb5+), arsenites (As3+), tellurites (Te4+), and tungstates (W6+), all of which have fewer than 80 representative minerals. Community 4 nodes are anchored by the relatively common Fe3+ ion, which is linked to the seven other Community 4 cations, as well as to ions in Communities 1, 2 and 3. In particular, these ions are interspersed with those of Community 3; both groups of ions are most commonly associated with alteration/oxidation of primary chalcogenide minerals (Hazen and Morrison, Reference Hazen and Morrison2022). The average electronegativity of Community 4 cations is 1.90, similar to that of Community 3.
Community 5 (Louvain)
Two ions, Sn2+ and Ta5+, form their own Community 5 as a pair of brown nodes on the extreme bottom of Fig. 1—close to the nodes of Community 1, with which Ta5+ is also linked. These two ions are both relatively scarce in minerals, with six and 58 examples, respectively. Only two minerals, oxystannomicrolite (Sn2+2Ta5+2O6O) and thoreaulite (Sn2+Ta5+2O6), contain both ions. However, because O2– is the only other ion in these minerals, these two cations are particularly strongly linked (hence forming a separate community). However, these ions are also closely associated with Ca2+, Mg2+, Nb5+ and Y3+—all cations in Community 1.
Community 6 (Louvain)
The two ions of uranium with highest oxidation states, U5+ and U6+, lie at the upper-left of Fig. 1, adjacent to several high field strength cations of both Community 1 and Community 2. These cations are associated with late-stage igneous fluids (e.g. with Ce4+, Th4+, Y3+ and REE3+). U5+ occurs in only three minerals, but all three of those minerals also incorporate the more common U6+, which occurs in 283 minerals. This 100% association of U5+ with U6+ is sufficient to create a separate community, even though U6+ is frequently found with Ca, K, Si and other lithophile cations of Community 1.
With 85 nodes representing mineral-forming ions (excluding H+ and O2–), the maximum number of ion pairs possible in Fig. 1 is [(852 – 85)/2] = 3570 edges. In the case of mineral-forming ions, far fewer pairs are observed. The network ‘density’, D, is the fraction of the possible edges that are observed—a number that typically decreases with increasing P. One conclusion of this study is that the great majority of possible ion pairs are not observed. For example, at P = 0% with all observed ion pairs represented by network edges, the network density is (1222/3570) = 0.342.
Results of Louvain community detection depend on the selection of network nodes as well as the percentage cutoff, P, for network edges. Therefore, we also performed Louvain community detection analysis for P ≥ 25 and P ≥ 50% (based on percentages recorded in Supplementary Table 3). At P = 0%, the centre of the graph is occupied by Na+, Ca2+, Fe2+, Fe3+, Mg2+, Mn2+, Al3+, Si4+, P5+ and Ti4+, the majority of constituents needed to form most of the rocks in the Earth’s crust. By removing these essential ions, the underlying trends of co-occurrences among less common ions become more evident, with patterns that are dependent on relatively restrictive paragenetic environments in which specific P–T–X conditions are met.
At P ≥ 20%, W6+ becomes the first of the 85 ions that is no longer connected to the network. At P ≥ 25%, 79 nodes remain (the cations Mo4+, Sb5+, Sn2+, Ta5+, W6+ and Zn2+ are no longer connected), with 190 of 3081 possible edges (D = 0.062). At P ≥ 50%, only 35 ion nodes remain connected to each other with a sparse 38 edges (out of 595 possibilities) between ion pairs (D = 0.064). It is intriguing that as nodes become disconnected with increasing P above 18%, the network becomes smaller but the density appears to remain relatively constant at ∼0.06.
As P increases, and as ions and edges are consequently removed, the interactive graphics package (https://observablehq.com/d/726edcffdbc8fecc) automatically recalculates the community structure of the network. Therefore, the detailed distribution of ions among communities differs with P. Nevertheless, Community 1 always appears as the dominant feature, with strong connections among the most common mineral-forming ions, including Si4+, Al3+, Ca2+ and Na+.
Walktrap community detection
We applied multiple methods for community detection to the ion co-occurrence data. In contrast to Louvain community detection, which is based on modularity optimisation, the Walktrap algorithm analyses connections among nodes based on the lengths of random walks. The Walktrap algorithm assumes that random walks tend to spend more time within the same community due to the denser internal connections within that community (Pons and Latapy, Reference Pons and Latapy2005). In detail, the Walktrap algorithm clusters communities by simulating multiple random walks, calculating their path similarities, and then merging nodes using hierarchical clustering. Due to the nature of random walks, community detection in Walktrap is influenced by local relationships, leading to the formation of closely connected communities.
Though both Louvain and Walktrap algorithms aim to optimise community structure, they differ in their approaches. The Louvain algorithm follows a bottom-up greedy strategy, initially treating each node as an individual community and iteratively merging them to maximise modularity (Blondel et al., Reference Blondel, Guillaume, Lambiotte and Lefebvre2008). With its nearly linear time complexity, the Louvain algorithm is highly efficient for processing large-scale networks. In contrast, the Walktrap algorithm is better suited for capturing finer local community structures.
Figure 2 is a unipartite network with initial parameters P = 20%. The two most common ions (H+ and O2–) and seven of the least connected rare ions (C4–, Hf4+, I3+, In3+, Mo4+, S2+, V2+ and W6+) are excluded from the network, leaving 78 nodes. In Fig. 2 we employed the Walktrap algorithm instead of Louvain community detection (see https://observablehq.com/d/726edcffdbc8fecc). Figure 2 displays many of the same features as Fig. 1. Both networks display two dominant communities that are significantly separated from each other, as recorded in Table 1.

Figure 2. A unipartite network of 78 mineral-forming ions (coloured circles), with links between pairs of ions when at least 20% of the less common ion also occurs with the more common ion. Node sizes indicate the relative abundances of ions, while colours indicate communities determined using the Walktrap method (see text). The resulting analysis reveals two major communities of 40 and 27 nodes representing mineral-forming ions plus five minor communities, each with two or three nodes. Each of the two larger communities corresponds to groups of ions that co-occur relatively frequently. The 40 orange nodes of Community 1 include all 31 Community 1 ions from Fig. 1. Community 2 includes 27 ions represented by red nodes, 21 of which also correspond to Community 2 in Fig. 1. This figure is a static Image of an Interactive web-based graphic (see https://observablehq.com/d/726edcffdbc8fecc).
Community 1 (Walktrap)
The 40 orange-coloured nodes that populate the righthand side of Fig. 2 include all 31 ions of Louvain Community 1 in Fig. 1, as well as I5+ from Louvain Community 2; As3+, V4+ and V5+ from Louvain Community 3; and Fe3+, Mn2+, Mn3+, Mn4+ and Sb5+ from Louvain Community 4. As in Fig. 1, the most common lithophile ions (Al3+, Ca2+, K+, Na+ and Si4+) assume central positions, in this case connected to an arc of less common ions to the lower right.
Community 2 (Walktrap)
Community 2 links 27 red-coloured nodes, of which 21 are also in Louvain Community 2 of Fig. 1. The other six ions in the Walktrap version of Community 2 are Co2+, S6+, Tl+ and Zn2+, which are in Louvain Community 3, and As5+ and Te4+ in Louvain Community 4. As in Fig. 1, the separation of Walktrap Communities 1 and 2 is striking.
Community 3 (Walktrap)
Three green nodes representing Bi3+, Mo5+ and Mo6+, form a separate Walktrap community at the bottom left. These ions, all of which are found in Louvain Community 3 of Fig. 1, are closely associated because Mo5+ occurs in only three minerals, two of which also incorporate the more common Bi3+ {gelosaite [Bi3+Mo6+(2–5x)Mo5+6xO7(OH)·H2O (0≤x≤0.4)] and mambertiite [Bi3+Mo5+2.80O8(OH)]}, while two incorporate Mo6+ {gelosaite and novikovite [(N3–H4)4Mo6+2Mo5+2O8(S6+O4)5]}.
Community 4 (Walktrap)
Two blue nodes in Fig. 2 represent S2– and Sb3+—a community that reflects the co-occurrence of these ions in six of the 22 rare S2– minerals recorded in Supplementary Table 1. Both ions occur in Louvain Community 2 in Fig. 1.
Community 5 (Walktrap)
Two brown nodes connect the transition metal ions Co3+ and Ni2+. These two siderophile ions are closely linked because four of the six Co3+ minerals recorded in Supplementary Table 1 also contain Ni2+. These two ions occur in Louvain Community 3 in Fig. 1.
Community 6 (Walktrap)
The two purple ions of Walktrap Community 6 (U5+ and U6+) are the same as Community 6 determined by Louvain community detection in Fig. 1. As noted above, all three U5+ minerals in Supplementary Table 1 also contain U6+, thus creating a strong association.
Community 7 (Walktrap)
Community 7, which holds two nodes representing Sn2+ and Ta5+, is identical to Louvain Community 5 in Fig. 1.
Comparisons between Fig. 1 based on Louvain community detection and Fig. 2 based on the Walktrap algorithm reveal many small differences, but significant overall similarities in the network structures. In particular, in both approaches most mineral-forming ions separate into two large communities, one of which (Community 1 in both cases) includes ions of most lithophile elements found in common rock-forming igneous and metamorphic rocks, whereas the other (Community 2 in both Figs 1 and 2) includes primarily ions of chalcophile elements found in minerals associated with near-surface processes of weathering, oxidation, aqueous alteration and evaporation.
Of the smaller communities, nine of 13 ions in Louvain Community 3 match ions distributed among Walktrap Communities 2, 3 and 5, while five of eight Louvain Community 4 ions are located in Walktrap Community 1. These distributions underscore similarities in the visual distributions of ion nodes in Figs 1 and 2. In both networks, Communities 1 and 2 are well separated, whereas ions of smaller communities display greater overlaps with adjacent communities.
Heatmap and agglomerative hierarchical cluster analysis of co-occurring ions
Heatmaps coupled with hierarchical cluster analysis provide a complementary approach to visualising patterns of co-occurrence among mineral-forming ions (Fig. 3). Building upon the mineral data heatmap visualisation demonstrated by Que et al. (Reference Que, Huang, Ralph, Zhang, Prabhu, Morrison, Hazen and Ma2024) and Zhang et al. (Reference Zhang, Que, Madhikarmi, Hazen, Ralph, Prabhu, Morrison and Ma2024), we utilised the Plotly Package of Python (Plotly Technologies Inc., 2015) and determined the sequence of ions with agglomerative hierarchical clustering (Maimon and Rokach, Reference Maimon and Rokarch2006).

Figure 3. This 87 × 87 heatmap (in this case including the two most common ions, H+ and O2–) displays a diagonally symmetrical matrix of coexisting mineral-forming ions, with 3741 unique non-diagonal matrix elements. Each matrix element represents the percentage of the rarer ion that coexists with the more common ion (Supplementary Table 3) in a list of 4834 minerals (Supplementary Table 1; see text), as defined by the colour scale. Ions that co-occur most frequently are indicated by darker coloured matrix elements. The order of ions from both top to bottom and from right to left was determined by agglomerative hierarchical clustering using the heatmaply package of R (Galili et al., Reference Galili, O’Callaghan, Sidi and Sievert2018), which grouped ions according to their most frequent associations. Adjacent pairs of ions are most closely related, with larger groups arranged hierarchically, as indicated by the hierarchical tree on the top (and, equivalently, the righthand side) of the matrix. We highlight seven clusters, labelled A to G, as well as four sparsely-populated regions labelled AB, AD, AE and BE (see text).
We applied the 87 × 87 diagonally symmetric data matrix that lists percentages of co-occurrences between pairs of mineral-forming ions (see Supplementary Table 3) to the heatmaply function. This method performed agglomerative hierarchical clustering with a complete linkage approach to map ions in the heatmap so that ions with higher percentages of co-occurrences are placed closer to one another. Thus, for example, the two most commonly co-occurring ions, O2– and H+, are grouped next to each other and form the bottom two rows and two lefthand columns of the matrix. Similarly, U5+ appears adjacent to U6+ because all three U5+ minerals also contain U6+. Each matrix element represents the co-occurrence of two ions, with more intense colours indicating higher percentages of co-occurrence. Figure 3 thus provides both an alternative approach to visualising the most strongly associated ions, while having the valuable ability to highlight ion antipathies—those systematic absences of large groups of ion pairs in the mineral kingdom, as indicated by regions AB, AD, AE and BE. In this way, heatmap analysis is complementary to the network visualisation and community detection algorithms applied to ion co-occurrence data in Figs 1 and 2.
A particularly useful aspect of hierarchical cluster analysis is the ‘tree diagram’ that appears on both the bottom and, equivalently, on the lefthand side of Fig. 3. This branching representation reveals pairs, triads, and larger clusters of ions that most frequently co-occur. Accordingly, one can select any desired number of clusters, from merging all ions into a single global cluster of 87 ions, to 87 individual clusters, each of which is a different ion. To compare this approach to the unipartite networks of Figs 1 and 2, we identified and examined seven major clusters of minerals arranged along the diagonal axis and designated Clusters A to G (Table 1), with O2– and H+ forming an eighth cluster of only two strongly associated ions.
Comparisons between Network Communities and Heatmap Clusters
The minerals in each Cluster A to G, and their correspondences to Louvain Communities 1 to 6 in Fig. 1 and Walktrap Communities 1 to 7 in Fig. 2, are identified in Table 1. As might be expected, significant similarities emerge, while there are also important differences.
Cluster A
The upper righthand area of Fig. 3 features a cluster of 13 ions, most of which are scarce and only one of which is an anion (N3–). Most of the ions are siderophile or chalcophile transition metals and semi-metals (As5+, Bi3+, Co2+, Co3+, Ga3+, Ge4+, In3+, Ni2+, Tl+, V4+ and V5+). Of these ions, 11 come from Louvain Communities 2 and 3 in the upper right of Fig. 1 or, equivalently, Walktrap Communities 2, 3, and 5 from the left side of Fig. 2. This arrangement provides additional support for the close association of Louvain Communities 2 and 3.
Cluster B
Cluster B, with 23 ions, is the largest cluster in Fig. 3. Of those ions, 15 are also in Louvain Community 1, while an additional six are in Louvain Community 4—a distribution of 21 ions that underscores the close association between Louvain Communities 1 and 4 in Fig. 1. Indeed, those same 21 ions are all in Walktrap Community 1 in Fig. 2. Of the 21 different elements represented in Cluster B, 13 are lithophiles according to Goldschmidt classification.
Cluster C
The 12 cations of Cluster C are predominantly from Louvain and Walktrap Communities 2 and 3, occurring in the upper right of Fig. 1 and the left side of Fig. 2. Seven of these 12 ions are high field strength cations (Hf4+, Mo5+, Mo6+, Se4+, Se6+, U5+ and U6+).
Cluster D
Cluster D features only five ions (Cs+, F–, Li+, Rb+ and Ti3+), all of which are ions of lithophile elements that occur in Community 1 of both Figs 1 and 2.
Cluster E
Most of Cluster E’s 15 ions (Ag+, Br–, Cr6+, Cu+, Hg+, I–, I5+, Mo4+, Pb4+, S2–, Sb3+, Sn4+, Ta5+, Te4+ and Te6+) are relatively uncommon constituents of near-surface alteration and aqueous processes (Hazen and Morrison, Reference Hazen and Morrison2022). These ions, mostly of chalcophile elements, are closely associated with Louvain Communities 2 and 5 (13 of 15 ions) and Walktrap Communities 2, 4 and 7 (the same 13 of 15 ions).
Cluster F
The eight ions of Cluster F include an eclectic mix of relatively common ions from Louvain/Walktrap Communities 1 (Al3+) and 2 (Cl–, Hg2+ and Pb2+), as well as some of the rarest ions—S2+ is recorded from two minerals in Supplementary Table 1, whereas C4-, I3+, and V2+ are recorded from only one mineral. Indeed, the latter three ions have been excluded from the networks of Figs 1 and 2. Aluminium joins this cluster because the only V2+ mineral is a vanadium aluminate, dellagiustaite [V2+Al2O4], which results in a 100% association of V2+ with Al3+. Similarly, the only I3+ mineral, schwartzembergite [Pb2+5H2I3+O6Cl3] has Pb2+and Cl– (Welch et al., Reference Welch, Hawthorne, Cooper and Kyser2001), whereas the only C4– mineral, mikecoxite [(C4–Hg2+4)OCl2], has Hg2+ and Cl– (Cooper et al., Reference Cooper, Dunning, Hawthorne, Ma, Kampf, Spratt, Stanley and Christy2023). In the case of S2+, known only in boojumite [Pb2+8O4(OH)2(S2+2O3)2–3] and fassinaite [Pb2+2(CO3)(S2+2O3)2–], both minerals also incorporate Pb2+. These 100% associations with the rarest ions illustrate how agglomerative cluster distributions may be distorted by mineral idiosyncrasies.
Cluster G
Cluster G, with nine cations (Ca2+, Fe2+, Na+, Nb5+, REE3+, Si4+, Sr2+, Ti4+ and Zr4+), includes several of the most abundant mineral-forming ions that occur in primary oxide and silicate minerals from igneous and metamorphic rocks (Hazen and Morrison, Reference Hazen and Morrison2022; Hazen et al., Reference Hazen, Morrison, Prabhu, Walter and Williams2023a; Morrison et al., Reference Morrison, Prabhu and Hazen2024). All these cations occur in both the Louvain and Walktrap Communities 1 of Figs 1 and 2. This result demonstrates that the strong associations among these ions are independent of the community/cluster detection method employed.
The patterns observed with agglomerative clustering underscore that the different community/cluster detection methods employed in this study lead to similar (though in detail not identical) distributions of ions among two major groups, with important subgroups. The most obvious clusters of ions, shared by Figs 1, 2 and 3, feature the most common cations in igneous rocks, high field strength cations in oxidised ore zones, and ions associated with hydrothermal and near-surface brines. At the same time, 29 (33.3%) of the 87 mineral-forming ions occur in 20 or fewer minerals; consequently, patterns of their co-occurrence may be skewed by rare-ion distributions in ways that do not easily fall into well-defined communities or clusters. Consequently, the co-occurrence of a rare ion with more common ions often overrides geochemical association trends in arranging ions by these community detection and agglomerative clustering methods. Nevertheless, most clusters in Fig. 3 have strong overlaps with a single community in Figs 1 and 2. This result highlights the strong geochemical and paragenetic affinities that cause some groups of ions to seldom co-occur with other groups.
Missing ions and ion pairs
Our list of 87 mineral-forming ions represents fewer than half of all the known oxidation states of elements studied and identified in chemistry (Emsley, Reference Emsley1991; Greenwood and Earnshaw, Reference Greenwood and Earnshaw1997). We suggest two principal underlying reasons for these ‘missing’ mineralogical ions, as well as their ion pairs.
Missing ions
A principal reason for the absence of an ion in our tabulations is element rarity. The number of observed minerals for each essential chemical element is a function of that element’s crustal abundances (Christy, Reference Christy2015; Hazen et al., Reference Hazen, Grew, Downs, Golden and Hystad2015) and is consequently also closely tied to mineral rarity (Hazen and Ausubel, Reference Hazen and Ausubel2016; Gavryliv et al., Reference Gavryliv, Ponomar, Bermanec and Putiš2022). In many instances, ions of relatively rare elements have not yet been found as essential elements in any oxygen-based minerals. Notable examples are well-documented ions of siderophile noble metals (Au+, Au3+, Ir4+, Os2+, Os3+, Os4+, Pt2+, Pt4+, Re4+, Re7+, Rh3+, Ru3+ and Ru4+), which are all known in synthetic oxide phases, yet rarely bond with oxygen in terrestrial settings.
In addition, many ions, including a few of relatively common elements, are not found as essential constituents of minerals because they require either extremely reducing environments (Cr2+, Ga+, Ge2+, Nb2+, Nb4+, P3+, Ta4+, Ti2+ and W4+) or unusually oxidised conditions (Ag3+, Bi5+, I7+, Ni3+ and Pd4+). In some instances, as in polydymite [Ni2+Ni3+2S4] and siegenite [Co2+Ni3+2S4], these ions occur in chalcogenide minerals and probably await discovery in secondary oxide minerals. For example, synthetic Ni3+ compounds such as the oxide (Ni3+2O2–3; Aggarwal and Goswami Reference Aggarwal and Goswami1961) and other compounds (e.g. Y3+Sr2+5Ni3+3O2–11; James and Attfield, Reference James and Attfield1993), point to the possibility of as yet undocumented Ni3+–O2– minerals. A similar case can be made for Ag3+–O2– minerals: the synthetic reagents Ag3+2O2–3 and Ag2+Ag3+2O2–4 (Greenwood and Earnshaw, Reference Greenwood and Earnshaw1997) are well known mineral-like compounds that might be formed in extremely oxidising, silver-rich terrestrial environments.
In at least three examples, such missing ions are known to exist as minor constituents of rock-forming minerals, but they are not yet known to occur in sufficient concentrations to form their own mineral species. For example, the reduced Cr2+ ion, well known to chemists in the reagent chromium monoxide (Cr2+O2–), has been documented by spectroscopic evidence as a minor constituent of lunar olivine (Hazen et al., Reference Hazen, Mao and Bell1977), though it is not yet known as an essential mineral-forming ion in oxides. Eu2+ is another well-known example of a reduced ion occurring only as a trace element. The fractionation of minor amounts of Eu2+ and Eu3+ between feldspar and magmatic liquids leads to the so-called ‘europium anomaly’ in patterns of rare earth element distributions (Weill and Drake, Reference Weill and Drake1973). Furthermore, the combination of the reducing conditions required and low concentrations of Eu in crustal rocks results in a low likelihood of Eu2+ minerals. In magmas [Eu3+/(total Eu)] ranges from 50% at the Iron-Wüstite buffer to 90% at Quartz-Fayalite-Magnetite (QFM) buffered rocks. Most magmas are at QFM or higher fO2, with < 10% Eu2+ (Burnham et al., Reference Burnham, Berry, Halse, Schofield, Cibin and Mosselmans2015). Other interesting cases are Nb and Ta, which only occur as pentavalent cations in our tabulations of oxide and halide minerals, yet are also known as trace divalent and tetravalent species in minerals (Martin and Wüsler, Reference Martin and Wüsler2014, and references therein).
In a similar situation, the rarity of some mineral-forming ions in IMA-approved chemical formulas is often a consequence of masking by much more common ion of similar radius and valence. Thus Rb+ is commonly masked by K+, Ga3+ by Al3+, and V4+ by Ti4+. These and many other rare ions are commonly present in minerals, yet rarely appear in their chemical formulas.
These examples highlight a significant geochemical limitation of our dataset: we analysed only IMA-approved species-defining ‘essential’ ions in 4834 minerals. Consequently, significant information about the geochemistry of minor and trace elements and their oxidation states has been excluded from the list. Furthermore, the associations of ions change dramatically as a function of element concentrations. At trace levels, a variety of different substitution mechanisms or incorporation of ions in different structural defects is possible, however there is an upper limit to the incorporated trace element concentration possible with these mechanisms. Even so, numerous minerals incorporate 100s to 1000s of ppm of relatively rare elements, though no IMA-approved formula with that element is known.
The expansion of mineral definitions and descriptions to include trace elements, ions, and isotopes is an important objective of mineral informatics. Further quantification of the distribution of trace ions among minerals would significantly expand the 87 ion list of this study. Such added data would also significantly affect the structure of the networks in Figs 1 and 2, both because different oxidation states of the same element can often have different geochemical affinities despite co-occurring in the same mineral species and because a given ion can occur in a much wider range of minerals at trace levels than as a minor or major constituent.
Ion antipathies and ‘missing’ ion pairs
Figure 3 reveals several largely blank, off-diagonal regions that represent groups of ions that rarely co-occur. These patterns of ‘ion antipathies’ [borrowing a term from Bowen (Reference Bowen1928)] suggest systematic strong avoidances among many groups of mineral-forming ions. A question arises: are these systematic trends the result of crystal chemical constraints (i.e. ions that can never occur together because of structural/bonding limitations), or rather do they arise from the idiosyncrasies of geochemical sorting of elements and their ions?
To address this question we consider four particularly ‘vacant’ areas, or ‘anticlusters’, of Fig. 3, designated AB, AD, AE and BE. Recall that 1392 (37.2%) of 3741 possible co-occurring ion pairs are observed in at least one mineral species. Therefore, any region of Fig. 3 with significantly less than 37% occupancy represents an area of statistically significant ion antipathies.
Anticluster AB
The area labelled AB in Fig. 3, representing the 13 ions of Cluster A versus the 23 ions of Cluster B, is a relatively sparse region of the heatmap, as only 43 of 299 possible ion pairs (14.4%) have non-zero values. Most of these absences reflect the geochemical antipathies between the ions of both Louvain and Walktrap Communities 1 (represented by almost all ions in Cluster B) and Communities 2 (represented by Cluster A).
Anticluster AD
A similar situation pertains to a second markedly sparse area of Fig. 2 that is formed by pairs of the 13 ions in Cluster A versus the five ions in Cluster D, all of which occur in Louvain and Walktrap Community 1. Most of the Cluster D ions (Cs+, F–, Li+ and Rb+) are commonly associated with late-stage igneous fluids and rarely if ever associate with the transition elements and high field strength semimetals of Cluster A. Of the 5 × 13 = 65 possible pairs, only five (7.7%) have non-zero values, and none of the observed pairs has greater than 4% co-occurrence.
Anticluster AE
The sparse area of Fig. 3 represented by the 13 ions of Cluster A versus 15 ions of Cluster E presents a different situation. Both clusters include ions that are predominantly from Louvain Communities 2 and 3 and Walktrap Community 2 in Figs 1 and 2. Accordingly, one might expect significant co-occurrences among these ions. However, of the 195 possible ion pairs, only 21 pairs (10.8%) have non-zero values. This significant absence of coexisting ion pairs within the well-defined Community 2 reveals that there are important substructures within this community. The Louvain and Walktrap algorithms identify communities in different ways, but both methods will lump ions that do not themselves co-occur if they are commonly connected through a third shared node. Thus, for example, while Community 2 ions Ga3+ and Ge4+ never co-occur in minerals with Pb4+ or Te4+, all four ions are connected through Pb2+.
Anticluster BE
A fourth poorly populated region of the heatmap occurs for Cluster B versus Cluster E—a 23 × 15 section in which only 52 (15.1%) of 345 possible ion pairs are nonzero. This situation is of special interest because Cluster B incorporates 11 relatively common mineral-forming ions, including B3+, Ba2+, Be2+, C4+, Cr3+, Fe3+, Mg2+, Mn4+, P5+, Sr2+ and Zr4+, all represented by more than 100 mineral species in Supplementary Table 1. Admittedly, most of the Cluster E ions are rare, with the exceptions of Cr6+, Sb3+, Sn2+, Ta5+, Te4+ and Te6+ (recorded in 61, 39, 41, 58, 54 and 52 mineral species, respectively). None of the other Cluster E ions occurs in more than 19 minerals. Nevertheless, we find that no known minerals combine pairs of relatively common ions, such as Be2+, Cr3+, or Sr2+ with Sb3+, Ta5+, Te4+, or Te6+. Similarly, Zr4+ does not co-occur with Sb3+, Te4+, or Te6+; neither does Mg2+ co-occur with Sb3+; and the pairs Mg2+/Ta5+ and Zr4+/Ta5+ are each known from only one mineral species. We suggest that these absent or rare ion pairs reflect geochemical rather than crystal chemical constraints. For example, the Crystallography Open Database (https://www.crystallography.net/cod/search.html, accessed 24 December 2024) lists two synthetic inorganic oxide phases with Be2+ and Te6+; 30 phases with Mg2+ and Ta5+; 34 phases with Cr3+ and Sb3+ or Sb5+; and 37 phases with Sr2+ and Te4+ or Te6+.
A conclusion from these observations of ion antipathies is that the list of co-occurring ion pairs in minerals reflects limitations in the chemical compositions of mineral-forming fluids and the idiosyncrasies of near-surface paragenetic modes, more than crystal chemical restrictions. For example, ions in Cluster D that are commonly associated with late-stage igneous fluids (Cs+, F–, Li+ and Rb+) rarely if ever associate with Cluster E’s low field strength ions found concentrated in brines (Ag+, Br–, Cu+, Hg+ and I–) or with high field strength ions from weathered primary oxide or sulfide deposits (Cr6+, Pb4+, Mo4+, Te4+ and Te6+).
Note, however, that such combinations of ions are not precluded on strictly crystal chemical grounds, as evidenced by such synthetic compounds as BTe3O3F15 (Sawyer and Schrobilgen, Reference Sawyer and Schrobilgen1982), RbMo2P3O12 (Leclaire and Raveau, Reference Leclaire and Raveau1988), LiAg2N3O6 (Ishihara et al., Reference Ishihara, Saito, Matsumoto and Ohba1986), CdCs2I4 (Touchard et al., Reference Touchard, Louer, Auffredic and Louer1987), and scores of other examples recorded in the Crystallography Open Database (see https://www.crystallography.net/cod/result.php, accessed 24 December 2024). Mineral occurrences depend on their P–T–X conditions of formation. For example, the formation of a mineral such as BTe3O3F15 requires relatively high oxygen and fluorine fugacities, in addition to high activity of B—conditions that are geochemically unrealistic. Furthermore, in most natural environments if Te is sufficiently concentrated it will form a telluride or will be incorporated into a chalcogenide mineral. We conclude that the absence of such synthetic compounds in the mineral realm, and the corresponding ‘missing’ ion pairs, is a consequence of geochemical, not crystal chemical, restrictions.
Conclusions
Our analysis of 87 co-occurring ions in 4834 minerals with oxygen and/or halide anions reveals that 1392 different pairs of ions (out of 3741 possible combinations) coexist in minerals. Employing varied methods of community detection and agglomerative clustering, we demonstrate that both element abundances and geochemical affinities among elements in mineral-forming environments play key roles. A key finding is that the majority of these ions fall in one of two contrasting communities/clusters—one corresponding to the most common ions in rock-forming igneous and metamorphic rocks, and the other with ions primarily found in near-surface zones of weathering, oxidation, aqueous alteration and evaporation. Most ions in Community 1 rarely, if ever, co-occur with ions in Community 2.
Why do many ions never co-occur?
Detailed analysis of all 2349 pairs of ions that are not observed—more than 60% of the 4834 possible combinations in Supplementary Table 1—is beyond the scope of this contribution. However, it is evident that most missing ion pairs arise primarily because of a combination of three factors: (1) the rarity of many elements and therefore the statistical improbability of many pairs to occur in high enough concentration/activity to be an essential constituent of a mineral phase rather than just a trace to minor element; (2) geochemical antipathies among ions, for example of contrasting oxidation states; and (3) the idiosyncrasies of many mineral-forming environments, rather than any systematic crystal chemical limitations.
As we have seen, in many cases geochemical processes tend to separate disparate groups of elements: the most abundant elements in the late-stage fluids that form complex pegmatites differ from elements in sulfide-rich melts or halogen-rich brines. Indeed, many of the ‘nonintuitive’ chemical combinations represented by the extensive areas of rare to nonexistent ion pairs might point to fruitful grounds for future synthetic chemical exploration. Large compilations of the structures and properties of synthetic inorganic compounds (e.g. https://www.crystallography.net/cod/result.php and https://icsd.fiz-karlsruhe.de/search/basic.xhtml, accessed 24 December 2024) reveal that many of the ion combinations not observed in minerals are readily incorporated into synthetics. However, it is important to note that geochemical and crystal chemical conditions are sometimes intertwined. The fact that two ions coexist in synthetic compounds, but are not found together in any known mineral may be in some cases a consequence of high bonding energies requiring special synthesis conditions not likely to occur naturally.
In other instances, two elements may commonly co-occur yet their ions of extreme oxidation states cannot. For example, Mn and Ti co-occur in 62 minerals, Mn and V co-occur in 32 minerals, U and V co-occur in 22 minerals, and Ti and U co-occur in 18 minerals (https://rruff.info/ima; accessed 15 February 2025), yet the relatively oxidised cations Mn4+ or U6+ are unlikely to co-occur with reduced ions such as Ti3+ or V2+ in any stable phase.
The roles of electronegativity and paragenetic modes
The groupings of ions, such as those illustrated in Figs 1, 2 and 3, have been highly influenced by evolving near-surface redox environments, which have changed systematically and significantly through Earth history (Hazen et al., Reference Hazen, Papineau, Bleeker, Downs, Ferry, McCoy, Sverjensky and Yang2008, Reference Hazen, Morrison, Prabhu, Williams, Wong, Krivovichev and Bermanec2023b; Hazen and Morrison, Reference Hazen and Morrison2022; Zhuang et al., Reference Zhuang, Zhang, Li, Yin, Li, Lu, Lai, Bai, Wang and Jia2024). Of note, Moore et al. (Reference Moore, Golden, Morrison, Hao and Spielman2022) documented systematic changes in average electronegativity of minerals through Earth history—changes that reflect how the range of redox conditions on Earth’s surface have expanded significantly over the past 4 billion years, while greatly increasing Earth’s mineral diversity and chemical complexity (e.g. Hazen et al., Reference Hazen, Papineau, Bleeker, Downs, Ferry, McCoy, Sverjensky and Yang2008; Krivovichev et al., Reference Krivovichev, Krivovichev, Hazen, Aksenov, Avdontceva, Banaru, Gorelova, Ismagilova, Kornyakov, Kuporev, Morrison, Panikorovskii and Starova2022; Moore et al., Reference Moore, Diedolf, Morrison and Hummer2024a; Reference Moore, Li, Zhang, Hao, Morrison, Hummer and Yee2024b). The mineral evolution of several redox-sensitive elements, including uranium (Hazen et al., Reference Hazen, Ewing and Sverjensky2009), mercury (Hazen et al., Reference Hazen, Golden, Downs, Hysted, Grew, Azzolini and Sverjensky2012), carbon (Hazen et al., Reference Hazen, Jones, Kah, Sverjensky, Hazen, Jones and Baross2013), chromium (Liu et al., Reference Liu, Hystad, Golden, Hummer, Downs, Morrison, Grew and Hazen2017), and vanadium (Liu et al., Reference Liu, Eliesh, Hystad, Golden, Downs, Morrison, Hummer, Ralph, Fox and Hazen2018) demonstrate details of the systematic increases in the average oxidation state of those elements, and thus the possibilities of new co-occurring ion pairs, especially since the Great Oxidation Event of the Paleoproterozoic Era (Hazen, Reference Hazen2015).
Our study also suggests that changes in mineral-forming conditions on Earth have led to the expansion and emergence of new communities of mineral-forming ions in conjunction with new paragenetic modes. Earth’s earliest mineral-forming ions occur predominantly in the primary igneous minerals, and subsequently metamorphic minerals, reflected in Community 1 of this study. Subsequent introduction of new near-surface paragenetic modes are reflected in Community 2, which is dominated by co-occurring ions in minerals formed by near-surface processes since the Great Oxidation Event. Future studies will examine the chronological appearance of new mineral-forming ions, based on the sequence of new paragenetic modes (Hazen and Morrison, Reference Hazen and Morrison2022; Hazen et al., Reference Hazen, Morrison, Prabhu, Williams, Wong, Krivovichev and Bermanec2023b). This effort will rely on the capabilities of graph theory and community detection analyses to visualise complex datasets influenced by several variables such as redox states, P–T–X conditions, time and other geochemical and crystal chemical constraints.
Goldschmidt’s classification revisited
This study demonstrates the ongoing relevance of Goldschmidt geochemical classification of the elements (Goldschmidt, Reference Goldschmidt1923, Reference Goldschmidt1937, Reference Goldschmidt1954), while suggesting an important modification. The division of mineral-forming elements into siderophile, chalcophile and lithophile groups is a valuable framing of the periodic table in the context of planetary evolution. In one sense, our study of the distribution of mineral-forming ions underscores this division, as the ions of most elements in this study appear to separate into two main groups—the predominantly lithophile elements of Community 1 and the chalcophile elements of Community 2. We find that of the 31 ions in Louvain Community 1, 27 are Goldschmidt lithophile elements. By contrast, 16 of 23 ions in Louvain Community 2 are chalcophile. The community/cluster structure of Figs 1, 2 and 3 are thus in large measure a reflection of contrasting element types. Furthermore, only six of Goldschmidt’s 14 siderophile elements (Mn, Fe, Co, Ni, Mo and W) are represented by our studies of oxygen- and/or halide-bearing minerals.
However, discrepancies appear when different oxidation states of an element are considered. For example, in both Louvain and Walktrap analyses, the lithophile element chromium appears in Community 1 in its Cr3+ state, but in Community 2 in the more oxidised Cr6+ state. Similarly, As3+ occurs in Walktrap Community 1, but As5+ is in Walktrap Community 2. The abundant ions Fe2+ and Fe3+ fall in Louvain Communities 1 and 4, respectively; Sb3+ and Sb5+ occur in Louvain Communities 2 and 4, respectively; and the different ions of thallium, tin, uranium and vanadium also span two different Louvain and Walktrap communities.
Goldschmidt recognised this aspect of elements with variable oxidation states. For example, he states: “chromium is a strongly lithophile element under ordinary terrestrial conditions. If, however, oxygen is deficient, as in iron meteorites, chromium is decidedly chalcophile, entering almost exclusively into the sulpho-spinel daubréelite, FeCr2S4” (Goldschmidt, Reference Goldschmidt1937; p. 659). Similarly, iron is a siderophile element, preferentially found in the metallic state in planetary cores. However, in its Fe2+ state iron often behaves as a chalcophile, while Fe3+ is usually bonded to oxygen as a lithophile element. In general, the more oxidised the ion, the more lithophile is its character.
Our study suggests an intriguing wrinkle in this trend. Here we consider the subset of minerals with cations bonded to O2– and/or halogen anions. Therefore, by definition, we are limiting our analysis to lithophile contexts, albeit with the participation of many ions of chalcophile or siderophile character in the Goldschmidt sense. At a planetary scale, the more oxidised state of an element is typically more lithophile in character—a significantly greater fraction of As5+, Cr6+, and Fe3+ minerals are bonded to O2– than As3+, Cr3+, or Fe2+ minerals, which frequently form chalcogenide minerals. This tendency has been formalised by Christy (Reference Christy2018a, Reference Christy2018b), who derived energetic parameters that distinguish among siderophile, chalcophile, ‘soft lithophile’ (e.g. large-ion lithophiles), and ‘hard lithophiles’ (high field strength elements).
Nevertheless, in this study, we find that ions in their lower oxidation states, such As3+, Cr3+ and Fe2+, are typically grouped with the unambiguously lithophile elements of Community 1, including Al, Ba, Be, Ca, K, Li, Mg, Na, P, REE, Rb, Si, Ti and Zr. By contrast, the more oxidised As5+, Cr6+ and Fe3+ ions fall into Community 2, where they are most strongly linked to chalcophile elements, including Ag, Cu, Ga, Ge, Hg, Pb, Sb, Sn, Te and Zn. This unanticipated result reflects the fundamental role of evolving paragenetic modes in the distribution of mineral-forming elements.
At the time of planetary formation, siderophile elements preferentially concentrated in the dense metal fraction that would become Earth’s core. The contrasting relatively low densities of many oxides and silicates concentrated lithophile elements in the crust and chalcophile elements initially formed primarily sulfides, arsenides and other chalcogenide minerals of intermediate average density. Subsequent tectonic processes led to new volcanic and metamorphic mechanisms that concentrated chalcophile and, to a lesser extent, rare siderophile elements in localised crustal deposits. The oxidation of these deposits, notably following the Great Oxidation Event, led to an explosion of mineral diversity, with more than 2000 new oxidised mineral species, many of which incorporated chalcophile and siderophile elements.
These deep-time trends in the nature and distribution of mineral-forming ions suggest a plausible paragenetic classification of the elements—a complement to Goldschmidt’s geochemical approach that clearly delineated the initial planetary starting point for element classification. Such an effort, placing element and ion distributions in the context of Earth’s diverse paragenetic modes, will be the topic of a forthcoming contribution.
In conclusion, minerals continue to provide us with a vivid, information-rich record of Earth’s deep-time history. Emerging studies of evolving average fundamental mineral properties such as average symmetry (Bermanec et al., Reference Bermanec, Vidović, Gavryliv, Morrison and Hazen2024a; Reference Bermanec, Vidović, Ma and Hazen2024b), structural and chemical complexity (Krivovichev et al., Reference Krivovichev, Krivovichev, Hazen, Aksenov, Avdontceva, Banaru, Gorelova, Ismagilova, Kornyakov, Kuporev, Morrison, Panikorovskii and Starova2022), mineral hardness (Bermanec et al., Reference Bermanec, Eleish, Morrison, Prabhu, Wong and Hazen2023), and chemical composition and oxidation states of mineral-forming ions (e.g. Hummer et al., Reference Hummer, Golden, Hystad, Downs, Eleish, Liu, Ralph, Morrison, Meyer and Hazen2022; Moore et al., Reference Moore, Diedolf, Morrison and Hummer2024a; Reference Moore, Li, Zhang, Hao, Morrison, Hummer and Yee2024b), embed a record of global crustal changes tracked through geological time. Thus, minerals represent an invaluable resource when reconstructing the geological history of our planet.
Dedication
We dedicate this contribution to Professor Edward Grew on the occasion of his 80th birthday. We continue to be inspired by Ed’s passion for mineralogy, the integrity and rigor of his research, his tireless advocacy for international cooperation in science, his recognition of the power of informatics to explore complex systems, and his outsized original scientific contributions to our field.
Hallimond Lecture contribution
This publication is submitted in conjunction with the Hallimond Lecture of the Mineralogical Society of Great Britain and Ireland, which was delivered by Robert M. Hazen on 20 August 2024 in Dublin, Ireland at the Fourth European Mineralogical Conference. In a departure from tradition, the awardee is the corresponding author of a contribution with several other scholars, rather than the sole author. This decision reflects both the interdisciplinary nature of the research and the vital role of exceptional early-career scientists in the emerging field of mineral informatics.
Supplementary material
The supplementary material for this article can be found at https://doi.org/10.1180/mgm.2025.10105.
Acknowledgements
We are grateful to Jolyon Ralph for his efforts in developing and disseminating mineral data on https://mindat.org, without which research in mineral informatics would not be possible. We thank two anonymous reviewers, who provided insightful and constructive comments on the penultimate version of this contribution.
Funding statement
Studies of mineral evolution and mineral ecology have been supported by the Deep-time Digital Earth program, the John Templeton Foundation, the NASA Astrobiology Institute ENIGMA team, a private foundation, and the Carnegie Institution for Science. Any opinions, findings, or recommendations expressed herein are those of the authors and do not necessarily reflect the views of the National Aeronautics and Space Administration.
Competing interests
The authors declare none.




