Towards equitable governance of human genomic data sharing: guided by genomic contextualism

Gang Wang

doi:10.1017/dap.2025.10046

Published online by Cambridge University Press: 11 December 2025

Gang Wang*: Affiliation:
Faculty of Law, University of Macau, Macao Intellectual Property Research Institute, University of Science and Technology of China, Hefei, China
*: Email: yc37241@connect.um.edu.mo

Abstract

This article examines the governance challenges of human genomic data sharing. The analysis builds upon the unique characteristics that distinguish genomic data from other forms of personal data, particularly its dual nature as both uniquely identifiable to individuals and inherently collective, reflecting familial and ethnic group characteristics. This duality informs a tripartite risk taxonomy: individual privacy violations, group-level harms, and bioterrorism threats. Examining regulatory frameworks in the European Union (EU) and China, the article demonstrates how current data protection mechanisms—primarily anonymisation and informed consent—prove inadequate for genomic data governance due to the impossibility of true anonymisation and the limitations of consent-based models in addressing the risks of such sharing. Drawing on the concept of “genomic contextualism,” the article proposes a nuanced framework that incorporates interest balancing, comprehensive data lifecycle management, and tailored technical safeguards. The objective is to protect individuals and underrepresented groups while maximising the scientific and clinical value of genomic data.

Keywords

data protection; genomic data; genomic contextualism; data anonymisation; informed consent

Information

Type: Research Article
Information: Data & Policy, Volume 7, 2025, e83

DOI: https://doi.org/10.1017/dap.2025.10046
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press or the rights holder(s) must be obtained prior to any commercial use and/or adaptation of the article.
Copyright: © The Author(s), 2025. Published by Cambridge University Press

Policy Significance Statement

This research highlights the urgent need for enhanced governance frameworks for human genomic data sharing. Existing regulatory mechanisms, like anonymisation and informed consent, fall short in addressing the unique risks associated with genomic data. Policymakers must consider the dual nature of genomic data—both personal and collective—when developing regulations. This article proposes practical measures for genomic data governance informed by the concept of “genomic contextualism,” including the integration of fair interest balancing and comprehensive data lifecycle management. These recommendations aim to protect individuals and underrepresented groups while maximising the scientific and clinical benefits of genomic data.

1. Introduction

Genomic data (specifically, human genomic data, as the term is used throughout this article) are a valuable asset for advancing genomic research and scientific understanding. They play a crucial role in unravelling the complex mechanisms of diseases and biological processes (Gürsoy, 2020). Genomic analyses capture emergent properties and interactions absent in discrete genetic assessments (Gallagher and Chen-Plotkin, 2018). High-resolution genomic datasets facilitate population-level analyses of evolutionary patterns and genetic adaptations while allowing examination of molecular processes at cellular levels. Unlike single-gene studies, genomic data reveal complex gene–gene interactions and regulatory networks, providing a comprehensive account of how genomic variation relates to human phenotypes (Ritchie et al., 2015). In addition, the temporal stability of genomic data, when combined with other omics data, deepens our understanding of cellular ageing and disease mechanisms (Unger Avila et al., 2024).

For individuals, genomic data are deepening the understanding of disease care and health management. Notably, data from whole-genome sequencing (WGS) deliver more accurate results in the molecular genetic diagnosis of rare and unknown diseases, as well as in the identification of actionable cancer drivers (Bagger et al., 2024). Due to the complexity of gene regulatory networks, WGS data outperform exome sequencing in diagnosing rare diseases, establishing WGS as the preferred first-line resource for this purpose (Wojcik et al., 2024). Genomic data also enable comprehensive identification of genetic variation and catalogue how such variation contributes to health and disease when combined with environmental and lifestyle factors (Bick et al., 2024). Beyond clinical applications, genomic data may also help inform critical life-course decisions, such as reproductive planning (Bilkey et al., 2019). This democratisation of personal health information derived from genomic data has the potential to transform individuals from passive healthcare recipients to active participants in their health trajectories.

The significance of genomic data in driving scientific advancements and benefiting individuals, coupled with the improved efficiency and precision of WGS (Park and Kim, 2016; Satam et al., 2023), has been a catalyst for the expansion of the genomic sequencing industry and the accumulation of vast genomic datasets. In turn, industry growth and data proliferation have spurred greater use and development of platforms designed to enable genomic data sharing. Such platforms aim to advance genomic research and maximise the utility and value of existing datasets (Kumuthini et al., 2023).

However, genomic data sharing also gives rise to significant ethical considerations, which have become a subject of debate. This is because it is the data controllers, such as researchers and commercial companies, rather than the individuals who have undergone WGS, that share genomic data with third parties (Gil and Guerreiro, 2024). For instance, 23andMe (www.23andme.com) utilises anonymised data from its substantial customer base to collaborate with research partners, including pharmaceutical companies (Majumder et al., 2021). This practice is contentious because the advantages derived from technological advancements and product innovations based on shared data primarily benefit data controllers or users, while the associated risks and potential harms predominantly fall on individuals and communities (Garner and Kim, 2018; Costello, 2022). A vast literature has examined the diverse concerns associated with genomic data sharing, including privacy risks (Bonomi et al., 2020; Gürsoy, 2022; Wan et al., 2022; Myers et al., 2025) and discriminatory practices (Kaiser et al., 2024; Joly et al., 2020).

This article aims to contribute to addressing governance challenges in genomic data sharing. Given that existing legal and regulatory mechanisms for genomic data sharing are insufficient, how can a more equitable governance framework be developed to mitigate the risks of such sharing while balancing its benefits? Specifically, Section 2 analyses the historical development of human genome projects (HGPs) and the increasing accumulation of genomic data, highlighting the importance that nations attach to such data. Section 3 explores the distinctive characteristics of genomic data and justifies the concept of genomic contextualism. Section 4 presents a taxonomy of the diverse risks linked to genomic data sharing, including violations of individual privacy, group-level harms, and bioterrorism threats. Section 5 examines the regulatory frameworks of the European Union (EU) and China, demonstrating that their current data protection mechanisms are insufficient for governing genomic data. Section 6 proposes nuanced policy recommendations grounded in genomic contextualism. Finally, Section 7 summarises the article’s key findings and discusses future research directions related to genomic data sharing.

2. Defining genomic data and tracing the historical accumulation of genomic datasets

In this section, the concept of genomic data and its rapid accumulation are examined. Genomic data refer to human WGS data, whose emergence traces back to the Human Genome Project (HGP) and whose accumulation is inseparable from numerous transnational and national human genome initiatives.

2.1. Defining genomic data

The discovery of the structure of deoxyribonucleic acid (DNA) by James Watson and Francis Crick in 1953 laid the foundation for modern genomics (Mersha, 2024). In 1977, the advent of DNA sequencing technologies paved the way for obtaining complete human genomic data (Sanger et al., 1977). Since that milestone, advancements in sequencing techniques, particularly next-generation sequencing, have revolutionised the field by enabling rapid and cost-effective analysis of entire genomes (Bentley et al., 2008). To date, the final hard-to-sequence segments of the human genome have been mapped, and hundreds of thousands of individuals have undergone WGS (Kaiser, 2021). Archived genomic data also have the potential to act as a lifelong resource for data subjects, supporting repeated reanalysis and reinterpretation over time.

Building on these technological advances, genomic data are obtained through WGS to offer individuals insights into their genetic composition, including predispositions to diseases, ancestry information, and pharmacogenomic insights affecting medication responses (Bonomi et al., 2020). Genome sequencing entails individuals providing biological samples, such as saliva (Martins et al., 2022), and involves the generation of various types of data, including “sequence read data” comprising WGS and whole-exome sequencing (WES) data, as well as data related to single-nucleotide polymorphisms (SNPs) (Belkadi et al., 2015). It is essential to note that raw personal WGS data alone cannot be meaningfully interpreted; they must be analysed to derive comprehensible genomic information. The process of interpreting genomic data involves aligning sequences with a reference genome, identifying variations compared to the reference, and documenting these variants in a variant call format (Paltiel et al., 2023). Consequently, genomic information can be deduced from genomic data in conjunction with external reference data or information (El Emam, 2011). This implies that the more extensive and accurate the external reference data, the more comprehensive and precise the personal genomic information revealed by WGS will be. The value of WGS data will therefore continue to increase as our understanding of the genome deepens. Within the scope of our research, WGS data hold particular significance as a foundational form of genomic data and serve as the primary focus of our investigation.
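The interpretation steps described above (comparison with a reference, identification of differences, and recording them in a variant call format) can be illustrated with a deliberately simplified sketch. It assumes two short, already aligned, equal-length sequences and detects only single-nucleotide differences; real pipelines use dedicated aligners and variant callers, and every name below (`Variant`, `call_snvs`, `to_vcf_lines`, `chrToy`) is hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    chrom: str
    pos: int  # 1-based position, following the VCF convention
    ref: str
    alt: str


def call_snvs(chrom, reference, sample):
    """Report positions where a pre-aligned sample differs from the reference."""
    if len(reference) != len(sample):
        raise ValueError("toy example expects pre-aligned, equal-length sequences")
    return [
        Variant(chrom, i + 1, r, s)
        for i, (r, s) in enumerate(zip(reference, sample))
        if r != s
    ]


def to_vcf_lines(variants):
    """Render variants as tab-separated rows using the eight fixed VCF columns."""
    header = "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO"
    # "." marks a missing value in VCF; this toy has no IDs, qualities, or filters.
    rows = [f"{v.chrom}\t{v.pos}\t.\t{v.ref}\t{v.alt}\t.\t.\t." for v in variants]
    return [header] + rows


# Invented sequences, not real genomic data.
reference = "ACGTACGTAC"
sample = "ACGTTCGTAA"

variants = call_snvs("chrToy", reference, sample)
for line in to_vcf_lines(variants):
    print(line)
```

The sketch also makes the dependence on external reference data concrete: the same sample yields a different variant list when compared against a different reference.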

2.2. Tracing human genome projects and genomic data accumulation

To better understand human genomic data, its accumulation, and the significance of its sharing, it is necessary to review the historical development of HGPs.

The famous HGP, launched in October 1990, is a foundational initiative for human genomic research. It required global collaboration and accelerated biomedical research worldwide. To deliver a key component of the HGP, the International Human Genome Sequencing Consortium (2004) was formed, an open partnership involving 20 centres across six countries. This consortium ultimately produced a reference human genomic sequence, providing a basis for human genomic research. Notably, the initial reference data contained gaps and errors, which were addressed in refinements released in 2013 and 2019. Most recently, in 2022, the Telomere-to-Telomere (T2T) Consortium released the T2T-CHM13 reference: a complete sequence of a human genome comprising 3.055 billion base pairs (Nurk et al., 2022).

Following the release of the reference human genomic sequence, understanding the relationship between genotype and phenotype became a central goal in biology and medicine. To deepen knowledge of genetic contributions to human health and disease, the International 1000 Genomes Project was established in 2007. Its aim was to sequence the genomes of at least 1000 volunteers from diverse global populations (Devuyst, 2015). The project reconstructed the genomes of 2504 individuals from 26 populations, using a combination of low-coverage WGS, deep exome sequencing, and dense microarray genotyping (Auton et al., 2015). It characterised a broad range of genetic variation: over 88 million variants in total, including 84.7 million SNPs, 3.6 million short insertions/deletions, and 60,000 structural variants, all phased onto high-quality haplotypes (Auton et al., 2015). This resource serves as a benchmark for surveys of human genetic variation and remains a key component of human genomic studies.

With the cost of WGS having fallen more than a million-fold (Satam et al., 2023) and with significant public investment in genomic research, many countries have launched their own HGPs. As of 2019, over 96 major genomic programmes had been initiated to collect, store, share, and use human genomic data and related health data for diverse objectives (Nunn et al., 2019). Key large-scale national and international initiatives include the US All of Us Research Program (The All of Us Research Program Investigators, 2019) and the European “1+ Million Genomes” Initiative (Saunders et al., 2019), each aiming to sequence at least 1 million individuals to inform evidence-based precision medicine (Howley et al., 2025). Moreover, there are several notable projects aimed at non-European populations, such as the GenomeAsia 100K Project (Wall et al., 2019), China’s Precision Medicine Initiative (Liu et al., 2020), Singapore’s Health for Life in Singapore Study (Wang et al., 2024a), and Nigeria’s 100K Genome Project (Fatumo et al., 2022).

Alongside these HGPs, derivative human genomic data, often stored in national biobanks, have emerged as a transformative resource for understanding human genetic variation and its links to health and disease. These projects and biobanks now serve as critical platforms for advancing genomic research. By integrating high-resolution human genomic data with comprehensive phenotypic, environmental, and clinical datasets, they enable researchers to uncover the genetic basis of diseases, identify novel biomarkers, and develop precision medicine strategies tailored to diverse populations (Lee et al., 2025). For instance, the UK Biobank, a large-scale biomedical database, has recruited approximately 500,000 participants, with over 200,000 whole genomes made available for global access (J. Kaiser, 2021). Another example is the All of Us Research Program, which had released genomic data for 245,388 participants as of February 2024, with plans to sequence over 1 million individuals (Bick et al., 2024).

Beyond the growing volume of human genomic data collected through public initiatives, private genomic databases from commercial sources are also substantial. Notably, the WGS sector has grown rapidly, particularly since the rise of direct-to-consumer (DTC) genome sequencing enterprises (McGuire et al., 2009). By early 2019, it was documented that over 26 million individuals globally had contributed their personal human genomic information to the databases of four leading testing firms (Majumder et al., 2021).

Human genomic data generated by public and private entities are progressively accumulating. They hold the potential to serve diverse purposes and deliver significant value, profoundly shaping scientific research, medical practice, and individuals’ health care. Meanwhile, many genomic researchers, healthcare practitioners, and other stakeholders support human genomic data sharing. Their goal is to deliver the full benefits of genomic science to the wider human population. A key example is the Global Alliance for Genomics and Health (GA4GH), a global alliance aimed at enabling the responsible sharing of human genomic data (Rehm et al., 2021).

3. The key features of genomic data and genomic contextualism

Having traced the historical development of HGPs and the growing accumulation of human genomic data from both public initiatives and private sources, it is now critical to explore the inherent characteristics of these data and lay the foundation for their regulation.

3.1. The key features of genomic data

While the commercial use of DTC genome sequencing has commodified both the sequencing process and the information it yields, genomic data are far from ordinary. They are uniquely identifiable and possess distinct attributes such as predictive capability, immutability, and group impact (Chapman et al., 2023). More specifically, genomic data have a dual nature. On the one hand, they constitute a form of unique personal data, even more unique than genetic data. Genomic data encompass an individual’s complete genetic makeup, specifically referring to the DNA found in normal reproductive cells. Each individual’s genomic data are unique; even the germline genomes of monozygotic twins exhibit distinctions due to early developmental mutations (Jonsson et al., 2021). Consequently, as intact genetic data, genomic data can reveal unique genetic characteristics with a level of specificity not typically found in other forms of biological substances (Tigard, 2019), such as blood and internal organs, or even in other forms of data, including certain personal genetic data (Rahnasto, 2023).

On the other hand, genomic data not only reflect individual characteristics but also serve as collective data, revealing shared familial and ethnic traits (McGonigle, 2019, p. 3). It is noteworthy that the genomic sequences of any two individuals exhibit approximately 99.9 per cent similarity at the nucleotide level (Hartl and Cochrane, 2017, p. 189). Nevertheless, given the approximately 3 billion base pairs in the human genome within a reproductive cell, the 0.1 per cent of the human DNA sequence that varies between genomes, equating to around 3 million base pairs, remains a substantial number. Genomic similarity is often more pronounced within ethnic groups, which may display shared genetic traits (Shriver et al., 1997; Lowe et al., 2001; Spielman et al., 2007), while relatives typically exhibit even greater genetic resemblance (Guo, 2008). Consequently, the disclosure of an individual’s genomic data invariably reveals portions of the genetic data of other individuals to whom they are genetically related, including ancestors (Costello, 2022).
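The arithmetic behind the figure of around 3 million differing base pairs can be checked directly. The short sketch below uses the approximate values quoted above (3 billion base pairs, 99.9 per cent similarity); the variable names are illustrative, and integer arithmetic in tenths of a per cent is used only to avoid floating-point rounding.

```python
# Approximate figures quoted in the text, not precise measurements.
GENOME_LENGTH_BP = 3_000_000_000  # about 3 billion base pairs
SIMILARITY_PER_MILLE = 999        # 99.9 per cent, expressed in tenths of a per cent

# Share of positions that differ between two genomes: 1000 - 999 = 1 per mille (0.1 per cent).
differing_bp = GENOME_LENGTH_BP * (1000 - SIMILARITY_PER_MILLE) // 1000

print(f"{differing_bp:,} base pairs differ")
```

Under these rounded inputs the result is 3,000,000 base pairs, which is why a 0.1 per cent difference is far from negligible in absolute terms.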

In light of these two characteristics, a parallel can be drawn with the concept of “relational privacy,” which recognises the interconnectedness of individuals within social and familial networks, especially regarding genetic data (Entrikin, 2019; Costello, 2022). I propose to understand genomic data as “collective personal data,” reflecting both individual and shared genetic traits within families or populations. This concept helps us capture the dual nature of genomic data as both personal and collective, challenging traditional binary perspectives on privacy and blurring the distinctions between the individual and the collective. This duality may initially seem paradoxical, aligning with the notion of “essentially oxymoronic concepts” (Neuwirth, 2013), but it underscores the need for a more nuanced understanding of genomic data’s privacy implications and the best governance mechanisms that should apply to it.

Furthermore, this inherent complexity of genomic data is also one of the fundamental reasons why genomic data cannot be fully anonymised. The very nature of these data ties individuals to their familial and communal genetic identities, making it difficult to separate personal data from collective implications. Even when an individual’s genomic information is de-identified, its collective attributes can still allow others to recognise their data through group databases (Ohm, 2009). Specific personal details like family names and observable characteristics such as skin and eye colour are publicly accessible and can be linked to genomic data (Bonomi et al., 2020). Moreover, genomic data remain constant throughout a person’s lifetime. This enduring uniqueness establishes a strong correlation between genomic data and individual identities, making them susceptible to re-identification through identification and phenotype inference attacks (Altman et al., 2013; Rocher et al., 2019; Bonomi et al., 2020). Consequently, genomic data constitute personal data that cannot be anonymised.
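Why de-identification offers limited protection can be illustrated with a toy linkage example. The sketch below assumes that a surname and an eye colour have somehow been inferred from each de-identified genomic record and that a public directory lists the same attributes; all records, attribute names, and the `link` helper are invented for illustration and do not describe any real dataset or inference method.

```python
# Invented "de-identified" records: names removed, inferred attributes retained.
deidentified = [
    {"record_id": "g1", "inferred_surname": "Hale", "eye_colour": "blue"},
    {"record_id": "g2", "inferred_surname": "Ortiz", "eye_colour": "brown"},
    {"record_id": "g3", "inferred_surname": "Hale", "eye_colour": "brown"},
]

# Invented public directory containing the same kinds of attributes.
public_directory = [
    {"name": "A. Hale", "surname": "Hale", "eye_colour": "blue"},
    {"name": "B. Hale", "surname": "Hale", "eye_colour": "brown"},
    {"name": "C. Ortiz", "surname": "Ortiz", "eye_colour": "brown"},
    {"name": "D. Ortiz", "surname": "Ortiz", "eye_colour": "brown"},
]


def link(records, directory, keys):
    """For each record, list directory entries that agree on every key pair."""
    matches = {}
    for rec in records:
        matches[rec["record_id"]] = [
            person["name"]
            for person in directory
            if all(person[dir_key] == rec[rec_key] for rec_key, dir_key in keys)
        ]
    return matches


keys = [("inferred_surname", "surname"), ("eye_colour", "eye_colour")]
matches = link(deidentified, public_directory, keys)

# A record is re-identified when exactly one candidate remains.
unique = {rid: names[0] for rid, names in matches.items() if len(names) == 1}
print(unique)
```

In this toy case two of the three records resolve to a single person, while the third remains ambiguous between two candidates. Each additional linkable attribute narrows the candidate set further, which is the intuition behind the claim that removing direct identifiers does not amount to anonymisation.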

3.2. Genomic contextualism

There has been ongoing debate about whether genetic data are unique and require special treatment, a position known as “genetic exceptionalism” (Green and Botkin, 2003). Proponents of this idea argue that genetic data possess distinct characteristics, such as heritability, the potential for incidental findings, and complexity, which set them apart from other types of medical data. For instance, a single inconsequential sequence linked to an individual’s identity could potentially reveal genetic information that the person prefers to keep private (Evans et al., 2010). According to this view, such features of genomic data warrant special policies and protections (Green and Botkin, 2003; Evans et al., 2010).

However, other types of medical and biometric data can be equally sensitive and merit similar safeguards (Price and Cohen, Reference Price and Cohen2019; Migliorini, Reference Migliorini2023). In response to this critique, the concept of “genomic contextualism” has been proposed as a more nuanced framework for addressing the ethical and policy challenges surrounding genomic data (Garrison et al., Reference Garrison, Hudson, Ballantyne, Garba, Martinez, Taualii, Arbour, Caron and Rainie2019b). Genomic contextualism is grounded in a key characteristic of genomic data: its nature as collective personal data.

To fully grasp this framework, it is first necessary to resolve a common conflation: the distinction between genetic data and genomic data. Genetic data pertain to discrete genes or markers and their variants, focusing on isolated DNA segments linked to specific phenotypic expressions (Hartl and Cochrane, Reference Hartl and Cochrane2017, p. 189). The fundamental difference lies in scope and analytical power: genetic data offer targeted insights into specific biological mechanisms, whereas human genomic data provide a holistic context for understanding the integrative functions of an individual’s complete genetic architecture.

Genomic contextualism posits that the significance of genomic data depends on the specific context in which it is used. Rather than applying blanket policies to genomic data as a whole, this approach advocates for policies tailored to the unique circumstances in which such data are utilised, whether in clinical, research, or societal settings (Garrison et al., Reference Garrison, Brothers, Goldenberg and Lynch2019a; Reference Garrison, Hudson, Ballantyne, Garba, Martinez, Taualii, Arbour, Caron and Rainie2019b). This perspective recognises the uniqueness of genomic data but also emphasises the need for flexibility to account for their varying relevance across different contexts and populations, particularly minority and Indigenous groups whose cultural values and ethical concerns might clash with mainstream approaches to the processing of genomic data (Garrison et al., Reference Garrison, Hudson, Ballantyne, Garba, Martinez, Taualii, Arbour, Caron and Rainie2019b).

One key area of application for genomic contextualism is data sharing in genomic research. Large-scale genomic research projects and biobanks routinely generate vast amounts of data, prompting discussions about whether genomic data require special protections compared to other types of research data (Murray, Reference Murray2019). Genomic data share similarities with other sensitive data, such as medical data, in that privacy breaches can cause significant harm. However, genomic data are distinct in that they can be re-identified using demographic information or by cross-referencing other datasets (Rocher et al., Reference Rocher, Hendrickx and de Montjoye2019). They can also contain sensitive health predictions or genetic ancestry information, raising privacy and ethical concerns (Garrison et al., Reference Garrison, Hudson, Ballantyne, Garba, Martinez, Taualii, Arbour, Caron and Rainie2019b). Therefore, legal data protection regulations that rely on a one-size-fits-all model will fail to address the uniqueness of human genomic data.

4. Tripartite risk taxonomy of genomic data sharing

As mentioned in the previous section, the technological evolution facilitating genomic data proliferation has rendered de-identification measures increasingly vulnerable to reversal, compromising the presumed anonymity of genomic information. In addition, genomic data’s distinctive capacity to reveal temporally extensive and communally diffuse information creates vectors for sensitive disclosure. This architecture of vulnerability engenders a tripartite risk taxonomy: individual privacy violations, group-level harms, and bioterrorism threats.

4.1. Individual privacy violations

Sharing genomic data exposes individuals to a wide range of privacy harms, including physical, psychological, autonomy, and discrimination harms (Citron and Solove, Reference Citron and Solove2022). Privacy harms have both subjective and objective dimensions (Calo, Reference Calo2011). Subjective privacy harm relates to the sense of being monitored without consent, leading to distressing mental states, whereas objective privacy harm involves external actions that exploit personal information against an individual’s wishes (Citron and Solove, Reference Citron and Solove2022).

In the context of genomic data, sharing such data can result in physical, psychological, and autonomy harms. For example, sharing genomic data may expose individuals to potential future misuse, leading to a loss of control over their personal data. This loss of control represents a form of autonomy harm (Citron and Solove, Reference Citron and Solove2022). A notable study illustrates this: researchers combined Y-chromosome haplotype analysis with genealogical registry data to predict the surnames of anonymised participants, directly undermining data control (Gitschier, Reference Gitschier2009). Additionally, discrimination harms occur when individuals face unjust differential treatment based on actual or perceived characteristics inferred from their genomic data (Berndt Rasmussen, Reference Berndt Rasmussen2019). These unjust practices restrict individuals’ access to employment, affordable insurance, housing, and other crucial life opportunities (Citron and Solove, Reference Citron and Solove2022). A famous case, Xie v. Human Resources and Social Security Bureau in Foshan City, was reported in China (Kim et al., Reference Kim, Ho, Ho, Athira, Kato, De Castro, Kang, Huxtable, Zwart, Ives, Lee, Joly and Kim2021). In 2009, 31 applicants to the Foshan local government were denied civil service roles solely because they were thalassemia gene carriers (Qiu, Reference Qiu2010). Three of these applicants later filed a lawsuit alleging discrimination. However, in 2010, the Foshan Intermediate People’s Court ruled that rejecting candidates with the thalassemia gene for civil service positions was legal (Kim et al., Reference Kim, Ho, Ho, Athira, Kato, De Castro, Kang, Huxtable, Zwart, Ives, Lee, Joly and Kim2021). Importantly, thalassemia gene carriers are not equivalent to anaemia patients. This case thus clearly demonstrates how genetic discrimination—rooted in inferences from genomic data—can directly undermine individuals’ interests and access to opportunities.

Genomic data are vulnerable to access, sharing, and use by various entities for a range of purposes, which exacerbates the associated risks and complicates mitigation efforts (Haeusermann et al., Reference Haeusermann, Fadda, Blasimme, Tzovaras and Vayena2018; Bonomi et al., Reference Bonomi, Huang and Ohno-Machado2020; Gürsoy, Reference Gürsoy2022). In the private sector, commercial actors often exploit these data for financial gain. For example, Nutrigenomix (https://nutrigenomix.com) uses genetic profiles to develop personalised nutrition services, promoting these via targeted channels like podcasts and health blogs (Gil and Guerreiro, Reference Gil and Guerreiro2024). Such practices contribute to “DNA data marketplaces,” where companies access genomic data to drive research, develop products, and market these to individuals with relevant genetic predispositions (Ahmed and Shabani, Reference Ahmed and Shabani2019). This sensitive information, when exposed publicly, may precipitate social stigmatisation and personal embarrassment. Concurrently, governmental access to genomic data raises significant concerns regarding privacy infringement (Haag, Reference Haag2019), discriminatory practices, surveillance capabilities, and potential abuse of institutional authority (Ram et al., Reference Ram, Guerrini and McGuire2018).

4.2. Group-level harms

Genomic information transcends individual boundaries, generating cascading implications for biological relatives and broader ethnocultural communities, thereby constituting a collective dimension of genomic identity (McGonigle, Reference McGonigle2016). Presently, there is a mounting concern surrounding the concept of “relational privacy” (Entrikin, Reference Entrikin2019; Costello, Reference Costello2022). The sharing of genomic data can potentially unveil sensitive information about relatives without their explicit consent (McGonigle, Reference McGonigle2019), expanding privacy risks beyond the individual and influencing familial relationships. For example, comparing genomic data among family members can reveal details about their familial ties. A significant event in 2018 saw law enforcement authorities in the U.S. utilising consumer genomic databases (e.g. GEDmatch) to identify suspects by tracing distant familial relatives (Erlich et al., Reference Erlich, Shor, Pe’er and Carmi2018; Ram et al., Reference Ram, Guerrini and McGuire2018). The “Golden State Killer,” for example, never submitted his DNA to GEDmatch but was identified through a distant cousin’s genomic profile. This case highlights how sharing one individual’s data can compromise relatives’ privacy without their input, underscoring the need to protect both individual and relational privacy in genomic data practices. Consequently, the repercussions of genomic data extend beyond the individual to encompass relational aspects, impacting all parties involved, even in the absence of explicit consent (Costello, Reference Costello2022).

The concept of group risks associated with genomic data focuses on the potential adverse implications that genomic data can have on specific groups. There is a growing apprehension concerning the collective interests and harms linked to genomic data. The sharing of data from a subset of individuals within a group can impinge on the legitimate interests of other group members (Costello, Reference Costello2022). These groups, defined by shared inherited characteristics, may consist of individuals with particular disease susceptibilities or common physical attributes. When integrated with machine learning (ML) or artificial intelligence (AI) analysis, the sharing of genomic data can endanger group interests, resulting in biases, discrimination, and the establishment and perpetuation of disparities within these specific groups (Chapman et al., Reference Chapman, Quinn, Natri, Berrios, Dwyer, Owens, Heraty and Caplan2023). Furthermore, these harms have the potential to inflict cultural and dignitary risks on these groups (Garrison et al., Reference Garrison, Hudson, Ballantyne, Garba, Martinez, Taualii, Arbour, Caron and Rainie2019b). The Havasupai Tribe case exemplifies this. In 2003, research on the Tribe’s donated blood samples—originally intended to study diabetes—was expanded without consent to investigate its ancestry and familial connections, prompting a legal dispute (Garrison et al., Reference Garrison, Hudson, Ballantyne, Garba, Martinez, Taualii, Arbour, Caron and Rainie2019b). The Tribe contended that these additional studies exceeded their initial agreement, causing cultural, dignitary, and group harm. Subsequently, a settlement was reached in 2010, awarding Tribal members $700,000 in compensation (Garrison, Reference Garrison2013).

Group-based harms manifest as stigmatisation and marginalisation, subjecting affected individuals to systemic disadvantages (Chapman et al., Reference Chapman, Quinn, Natri, Berrios, Dwyer, Owens, Heraty and Caplan2023; Rahnasto, Reference Rahnasto2023). Governmental entities may amplify these vulnerabilities through institutional practices informed by implicit biases and structural prejudices. Historical precedents illustrate this phenomenon, as with Cesare Lombroso’s “born criminal” theory, rejected for its racist underpinnings and biological determinism (Sirgiovanni, Reference Sirgiovanni2017). Should officials gain access to genomic data, such information might serve as justification for discriminatory judgments against individuals with specific genetic variations, thereby intensifying societal stratification and interethnic tensions.

4.3. Bioterrorism threats

The widespread sharing of genomic data presents potential risks of bioterrorism, impacting both national and global security due to the universal nature of the human genome. A primary concern is the potential for genetic modification to facilitate the development of biological weapons, a threat that may be intensified by the extensive dissemination of, and unlimited access to, genomic data. Although the current development and application of genomic technology present a limited immediate threat, the consequences of such an attack would be extremely serious, so regulation cannot wait until the risk actually materialises. Moreover, bioterrorism threats are magnified by the intersection of modern genomic technologies with advanced AI, ML, automation, and robotic capabilities (Hendrycks et al., Reference Hendrycks, Mazeika and Woodside2023; Brent et al., Reference Brent, McKelvey and Matheny2024). This convergence could empower private biotech platforms or research communities to craft biological weapons targeting specific groups or populations (Lentzos, Reference Lentzos2020; Painter and Bastian, Reference Painter and Bastian2021). In addition, the absence of robust cybersecurity measures within the synthetic biology sector exposes it to the potential for unauthorised synthesis of harmful biological agents by malicious actors (Puzis et al., Reference Puzis, Farbiash, Brodt, Elovici and Greenbaum2020). Unlike conventional biological weapons, which rely on naturally occurring microorganisms to inflict harm (Pal et al., Reference Pal, Tsegaye, Girzaw, Bedada, Godishala and Kandi2017), genetically modified biological agents can target specific populations with highly infectious and pathogenic organisms, thereby increasing the likelihood of severe harm (Brockmann et al., Reference Brockmann, Bauer and Boulanin2019; Ristanovic, Reference Ristanovic, Dishovsky and Pivovarov2009, p. 124).

This is not alarmist rhetoric. All technologies possess dual uses: while human genomic data can drive advancements in health-related genetic technologies, it also has the potential to enable harmful applications. A notable example from 2018 highlights the risks associated with genomics: a member of a three-person team utilised recombinant DNA, polymerase chain reaction (PCR), and synthetic DNA to recreate horsepox, a close relative of smallpox (Brent et al., Reference Brent, McKelvey and Matheny2024). Another group further developed this research, using the same tools, along with clustered regularly interspaced short palindromic repeats (CRISPR) technology, to engineer a different smallpox-related virus. Such studies underscore the ease with which this research could be repurposed to produce lethal pathogens. In 2022, a team of researchers modified an AI system initially designed to create non-toxic therapeutic molecules. They altered its parameters to reward toxicity rather than penalise it (Urbina et al., Reference Urbina, Lentzos, Invernizzi and Ekins2022). Following this adjustment, the system independently generated 40,000 candidate chemical warfare agents within just six hours. While the destructive impact of biotechnology has not yet matched that of nuclear armaments, the pace of technological progress may surpass individual nations’ regulatory capacities. Without adequate regulation, these advancements could lead to a resurgence of bioterrorism, posing a severe threat to the security and welfare of particular ethnic groups or humanity at large. The risk of malicious actors creating tools to harm humans raises critical ethical and security questions, posing major challenges to national, transnational, and global bioterrorism prevention efforts.

The sharing of genomic data generates interconnected potential risks of bioterrorism at both national and global levels, each with varying implications. These risks may arise from individual states pursuing their national interests or from the actions of terrorist groups, extremists, or other malicious entities, all of which pose substantial threats. On a national level, bioterrorism risks are heightened by the tailored development of pathogens designed to exploit the susceptibilities or vulnerabilities of specific populations within countries (Dieuliis, Reference Dieuliis2018). At a global scale, exposure varies between countries: in larger nations with greater racial and genetic diversity, it is harder for an attacker to identify shared genetic traits and formulate targeted biological threats (Wang and Liu, Reference Wang and Liu2025, p. 220). In contrast, smaller and more ethnically homogeneous countries may be more susceptible to biological weapons.

5. Rules for genomic data sharing: a comparison of China and the EU

The efficacy of protection strategies against the risks associated with genomic data sharing is a subject of ongoing debate (Joly et al., Reference Joly, Dupras, Pinkesz, Tovino and Rothstein2020; Gürsoy, Reference Gürsoy2022), with different countries adopting varying approaches (Harbord, Reference Harbord2019; Du and Wang, Reference Du and Wang2020; Paltiel et al., Reference Paltiel, Taylor and Newson2023; Solove, Reference Solove2024). This section examines the data protection frameworks of the EU and China and assesses their suitability for the effective prevention of the risks associated with genomic data sharing. The choice of these two jurisdictions is motivated by two reasons. Firstly, both frameworks are generally very protective of personal data (Ding, Reference Ding2024; Fuster, Reference Fuster2014, p. 1; Peng et al., Reference Peng, Shao and Zheng2022) and both jurisdictions recognise genomic data as a special category of data that requires heightened privacy protection (Rahnasto, Reference Rahnasto2023; Zhang, Reference Zhang2015, p. 51). Secondly, the data protection laws of both jurisdictions have an area of geographical influence that extends beyond the borders of the jurisdiction (Bradford, Reference Bradford2020, p. 27; Erie and Streinz, Reference Erie and Streinz2021). This section first provides an overview of the personal data protection laws in each jurisdiction, before then delving into two key mechanisms: technical security mechanisms and informed consent mechanisms. It analyses these mechanisms from both normative and practical perspectives.

5.1. Overview of relevant personal data protection laws

At the constitutional level, the protection of genomic data is fundamentally linked to the safeguarding of basic human rights. Within the EU, data protection is upheld as a fundamental right by primary law, with a particular emphasis on the protection of sensitive data, encompassing genomic data. Article 8 of the Charter of Fundamental Rights of the European Union (2007) stipulates that “everyone has the right to the protection of personal data concerning him or her.” Beyond the EU’s own legal order, the European Court of Human Rights (ECtHR), a Council of Europe court, has underscored the necessity for heightened protection of genetic data, recognising its unique sensitivity compared to other categories of sensitive data. In S. and Marper v. The United Kingdom (2008), the ECtHR highlighted the deeply personal and sensitive nature of genetic data, emphasising its exceptional status. In China, Articles 33 and 38 of the Constitution of the People’s Republic of China (2018) establish a foundation for the right to personal information and personal data (Zhang, Reference Zhang2015, p. 48), thereby providing constitutional grounds for safeguarding genomic data. This has led to the enactment of the Personal Information Protection Law of the People’s Republic of China (2021) (PIPL), which specifically addresses the protection of personal data. Furthermore, Article 28 of the Constitution offers a constitutional basis for the protection of national security. This, in turn, underpins the Biosecurity Law of the People’s Republic of China (2024) (Biosecurity Law), which also relates to the protection of genomic data (Wang, Reference Wang2013, p. 67).

At the legislative level, genomic data receive classification as sensitive data under both EU and Chinese regulatory frameworks. In the EU, Article 9 of the General Data Protection Regulation (2016) (GDPR) establishes “special categories of personal data”—commonly termed sensitive data (Quinn and Malgieri, Reference Quinn and Malgieri2021)—encompassing genetic, health, and biometric data. Genomic data’s inherent capacity to reveal detailed genetic compositions, disease susceptibilities, and distinctive individual and community characteristics substantiates its sensitive categorisation. This position finds additional support through a fortiori reasoning (d’Almeida, Reference d’Almeida2017): if subordinate categories like genetic data warrant sensitive classification, then genomic data, representing a more comprehensive category, merit equivalent or superior protection. While some have noted Article 9(1) GDPR presents an exhaustive enumeration of “special categories” (Quinn and Malgieri, Reference Quinn and Malgieri2021), potentially excluding genomic data under strict interpretation, genomic data’s reducibility to genetic data effectively secures its designation as sensitive data.

While the GDPR was a groundbreaking data protection law, a growing body of legal, socio-political, ethical, and policy research has drawn attention to its shortcomings. For health data—including human genomic data—these shortcomings highlight four broad areas: the limited scope of traditional data protection principles in the face of emerging big data practices, the blurring of key regulatory categories, flaws in the informed consent model, and the Regulation’s narrow focus on harms and discrimination arising from data processing (Marelli et al., Reference Marelli, Lievevrouw and Van Hoyweghen2020). To address these gaps, the EU has advanced a series of legislative measures, including the Regulation (EU) 2022/868 (2022) (Data Governance Act), the Regulation (EU) 2023/2854 (2023) (Data Act), and Regulation (EU) 2025/327 (2025). The first two are cross-sectoral governance frameworks, introduced to ensure better access to data and more responsible use (Casolari et al., Reference Casolari, Buttaboni and Floridi2023). The third, the Regulation (EU) 2025/327 (2025), establishes the European Health Data Space (EHDS), which enables the reuse of health data in healthcare, as well as for research and innovation.

The EHDS has two primary objectives: to enhance individuals’ access to and control over their health data within a healthcare context and to promote societal benefits from data utilisation, such as advancing healthcare delivery and research. Under Article 51.1(f) of Regulation (EU) 2025/327 (2025), health data explicitly include “human genetic, epigenomic, and genomic data.” Beyond aligning with GDPR requirements, this Regulation gives health data access bodies broad discretion to grant data access permits, alongside principles that outline when permits should and should not be issued (Quinn et al., Reference Quinn, Ellyne and Yao2024).

The EHDS’s rules on secondary health data use share similarities with third-party use of shared human genomic data, meaning the Regulation offers useful safeguards for genomic data sharing. Yet, its scope is mainly limited to healthcare and related research contexts; it does not cover commercial scenarios. This is significant because some genomic sequencing is classified as non-health-related, providing services including paternity testing, ancestral origin analysis, athletic ability assessments, matchmaking, and tests for “fun” traits, such as earwax type and eye colour (Hoxhaj et al., Reference Hoxhaj, Stojanovic, Sassano, Acampora and Boccia2020). Thus, while the EHDS effectively protects health-related human genomic data in healthcare settings, it does not offer comprehensive coverage for all human genomic data.

China adopts two primary strategies for the protection of genomic data: first, it aligns with the EU by treating genomic data as sensitive data; second, it implements the Biosecurity Law to address the national security risks that may arise from such data. Similar to the EU, China recognises the concept of sensitive personal information. Article 28 of the PIPL defines sensitive personal information as information that, if exposed or improperly utilised, could potentially infringe upon an individual’s personal dignity or threaten their safety or possessions. It further specifies that this category includes information related to biometric identification, religious beliefs, specific identities, healthcare, financial accounts, personal location, and details concerning minors under the age of fourteen. Consequently, personal genomic sequencing information is appropriately classified as sensitive personal information (Liu et al., Reference Liu, Peng, Wu, Tian and Tian2021; Wang et al., Reference Wang, Wang and Du2024b). Unlike the GDPR, which employs a closed list of sensitive data categories, China’s PIPL does not impose barriers to classifying genomic data as sensitive information.

Article 55 of the Biosecurity Law requires that the use and export of China’s human genetic information comply with ethical principles and not harm public health, national security, or the public interest. Article 56(4) mandates that transporting or mailing this information requires approval from the health department of the State Council. Additionally, the Detailed Rules for the Implementation of the Regulation on the Administration of Human Genetic Resources (2023) stipulates in Article 37(3) that foreign entities providing genome sequencing information resources with over 500 cases must undergo a security review by the Ministry of Science and Technology. This underscores the special considerations given to genomic data.

5.2. Technical security mechanisms

Technical and organisational measures (TOMs) play a crucial role in mitigating the multifaceted risks of human genomic data sharing, and legal frameworks are designed to align with such technological advancements (Staunton et al., Reference Staunton, Slokenberga and Mascalzoni2019). Article 32 of the GDPR, for instance, obliges data controllers and processors to implement TOMs to protect personal data. Anonymisation is often treated as a minimum requirement for enabling data sharing in this context.

Traditional privacy frameworks establish a dichotomy between personal and anonymised data, with the latter excluded from regulatory protection. Article 4(1) of the GDPR defines personal data as information relating to an identified or identifiable natural person, and Recital 26 confirms that anonymised data fall outside the Regulation’s protective scope. Similarly, Article 4 of the PIPL withholds protection from anonymised information.

This categorical exclusion significantly compromises safeguards for genomic data (Bonomi et al., Reference Bonomi, Huang and Ohno-Machado2020). Data anonymisation regimes operate on the premise that data unlinked to personal identity fall outside the classification of “personal data” (Elliot et al., Reference Elliot, O’Hara, Raab, O’Keefe, Mackey, Dibben, Gowans, Purdam and McCullagh2018), thereby permitting unregulated collection, use, and dissemination without the data subject’s consent. While anonymisation measures represent fundamental protective mechanisms of a technical nature, they prove inadequate in mitigating genomic data risks. As established previously, genomic data resist effective anonymisation, with techniques like de-identification and pseudonymisation demonstrating insufficient protective capacity (Rocher et al., Reference Rocher, Hendrickx and de Montjoye2019).

Anonymisation of genomic data involves the removal of protected health information, such as name, and semi-identifiable information, such as postcode (Bonomi et al., Reference Bonomi, Huang and Ohno-Machado2020). However, there is a consensus that genomic data cannot be truly anonymised (O’Doherty et al., Reference O’Doherty, Shabani, Dove, Bentzen, Borry, Burgess, Chalmers, De Vries, Eckstein, Fullerton, Juengst, Kato, Kaye, Knoppers, Koenig, Manson, McGrail, McGuire, Meslin, Nicol, Prainsack, Terry, Thorogood and Burke2021), although researchers have debated the varying levels of identifiability associated with different types of genetic data (Lowrance and Collins, Reference Lowrance and Collins2007). The anonymisation paradigm faces escalating challenges from evolving analytics and re-identification methodologies within genomic contexts (Purtova, Reference Purtova2018). Individual genetic uniqueness creates robust correlations between genomic data and personal identity, rendering such information particularly susceptible to re-identification through identification attacks and phenotype inference attacks (Altman et al., Reference Altman, Clayton, Kohane, Malin and Roden2013; Rocher et al., Reference Rocher, Hendrickx and de Montjoye2019; Bonomi et al., Reference Bonomi, Huang and Ohno-Machado2020).
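The linkage problem described above can be made concrete with a small sketch. It is purely illustrative and is not drawn from the cited studies: the records, the field names (`postcode`, `birth_year`, `sex`, `sample_id`), and the helper `link_records` are all hypothetical. It assumes a dataset from which names have been removed but some semi-identifiable fields remain, and a separate public registry that holds the same fields alongside names.

```python
from collections import defaultdict

def link_records(deidentified, registry, keys):
    """Return {sample_id: name} for every record whose quasi-identifiers
    match exactly one person in the registry (hypothetical helper)."""
    # Index the named registry by the chosen quasi-identifier combination.
    index = defaultdict(list)
    for person in registry:
        index[tuple(person[k] for k in keys)].append(person["name"])

    matches = {}
    for record in deidentified:
        candidates = index.get(tuple(record[k] for k in keys), [])
        # A single candidate means the "anonymous" record is re-identified.
        if len(candidates) == 1:
            matches[record["sample_id"]] = candidates[0]
    return matches

# Toy data, invented for illustration only.
registry = [
    {"name": "Person A", "postcode": "1000", "birth_year": 1980, "sex": "F"},
    {"name": "Person B", "postcode": "1000", "birth_year": 1980, "sex": "M"},
    {"name": "Person C", "postcode": "2000", "birth_year": 1975, "sex": "M"},
    {"name": "Person D", "postcode": "2000", "birth_year": 1975, "sex": "M"},
]
deidentified = [
    {"sample_id": "S1", "postcode": "1000", "birth_year": 1980, "sex": "F"},
    {"sample_id": "S2", "postcode": "2000", "birth_year": 1975, "sex": "M"},
]

reidentified = link_records(deidentified, registry, ["postcode", "birth_year", "sex"])
print(reidentified)
```

In this toy case, sample S1 is linked to a single named person, while S2 remains ambiguous because two registry entries share its attributes. Genomic data aggravate the problem because the genome itself can act as an additional, highly distinctive quasi-identifier.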

This raises two significant issues. First, data protection frameworks may fail to safeguard individuals’ rights to their genomic data when they rely only on data anonymisation measures. Second, anonymisation provisions can enable data controllers to circumvent the protections established by regulations such as the GDPR and the PIPL. In the context of genomic data sharing, data controllers are required to de-identify or pseudonymise genomic information obtained from clinical medicine, scientific research, and commercial testing. This so-called anonymised data can subsequently be shared without adequately considering the potential risks faced by individuals, groups, and societies involved.

Notably, some argue that data anonymisation could be strengthened by adopting advanced technical measures. As technology and research methodologies evolve, several approaches have emerged to enhance the protection of human genomic data (Bonomi et al., Reference Bonomi, Huang and Ohno-Machado2020), including access control (Erlich et al., Reference Erlich, Williams, Glazer, Yocum, Farahany, Olson, Narayanan, Stein, Witkowski and Kain2014), homomorphic encryption (Deuber et al., Reference Deuber, Egger, Fech, Malavolta, Schröder, Thyagarajan, Battke and Durand2019), secure multiparty computation (Cho et al., Reference Cho, Wu and Berger2018), and differential privacy (Tramèr et al., Reference Tramèr, Huang, Hubaux and Ayday2015). The combined use of multiple such methods is also becoming more common (Raisaro et al., Reference Raisaro, Choi, Pradervand, Colsenet, Jacquemont, Rosat, Mooser and Hubaux2018). Beyond these discrete technical tools, the EU’s EHDS provides a comprehensive platform for genomic data sharing, which helps address risks within healthcare and related research contexts.
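To give a sense of how one of these measures operates, the following is a minimal sketch of differential privacy applied to a single aggregate query, namely the count of minor alleles at one genomic position in a cohort. It is an illustration under stated assumptions and not the method of the cited works: the function name `dp_allele_count`, the genotype encoding (0, 1, or 2 copies per participant), and the choice of the Laplace mechanism are assumptions made here for exposition.

```python
import random

def dp_allele_count(genotypes, epsilon, rng=None):
    """Release a minor-allele count with Laplace noise (illustrative sketch).

    genotypes: iterable of 0, 1, or 2 (minor-allele copies per participant).
    epsilon:   privacy budget; smaller values add more noise.
    """
    genotypes = list(genotypes)
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    if any(g not in (0, 1, 2) for g in genotypes):
        raise ValueError("each genotype must be 0, 1, or 2")
    if rng is None:
        rng = random.Random()

    true_count = sum(genotypes)
    # One participant can change the count by at most 2 alleles.
    sensitivity = 2
    scale = sensitivity / epsilon
    # The difference of two exponential draws with mean `scale`
    # follows a Laplace distribution centred on 0 with that scale.
    noise = rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)
    return true_count + noise

# Hypothetical cohort of ten participants.
cohort = [0, 1, 2, 0, 1, 0, 0, 2, 1, 0]
print(dp_allele_count(cohort, epsilon=0.5, rng=random.Random(7)))
```

The released figure is deliberately imprecise: the noise masks the contribution of any single participant, at the cost of some accuracy for researchers. That trade-off between utility and protection is the kind of balance that the governance debate has to settle, since the privacy budget is a policy choice as much as a technical one.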

Nevertheless, ongoing advances in technology and cyberattack methods present persistent challenges to human genomic data protection. A high-profile example is the 2023 data breach at 23andMe. In October 2023, the company suffered a significant breach organised by a cybercriminal known as Golem (Holthouse et al., Reference Holthouse, Owens and Bhunia2025). While official statements from 23andMe (2023) claimed that only 14,000 accounts were directly compromised, the attack spread via the platform’s DNA relative feature, expanding its impact to expose over 5.5 million customer records. This cybersecurity incident triggered widespread legal action in the US and other jurisdictions, which remains unresolved. On 23 March 2025, 23andMe also filed a voluntary petition in a US bankruptcy court to facilitate a rapid sale of the company (Gerke et al., Reference Gerke, Jacoby and Cohen2025).

23andMe had implemented a certain level of technical security measures, but cyberattacks continue to evolve. In this case, the techniques used were relatively unsophisticated yet highly effective, focusing on brute-force attacks and credential stuffing (Holthouse et al., Reference Holthouse, Owens and Bhunia2025). That even simple techniques succeeded underscores the need for stronger security safeguards, a lesson relevant to other direct-to-consumer (DTC) genomic testing companies. While the cybercriminal bears responsibility for the attack, the incident also highlights a critical systemic issue: cost considerations often deter companies from adopting enhanced or multiple technical protections. As profit-driven entities, many companies seek to minimise costs while meeting only the minimum requirements of data protection regulations, which leaves human genomic data vulnerable to emerging threats.

Therefore, whether technical measures can fully address the risks of human genomic data sharing is not the focus of this article. What is clear is that some innovative technical approaches already mitigate certain risks associated with such sharing, and technical professionals will likely develop further solutions to tackle its multifaceted threats. As a product of science and technology, human genomic data inherently require technical measures for their protection.

As established earlier, risk assessments of human genomic data sharing confirm that current data anonymisation measures are insufficient, meaning the level of technical protection for these data must be elevated. The EU’s EHDS illustrates this need: it aims to build a secure environment for data access and reuse, which explicitly requires the implementation of multiple technical and organisational safeguards.

Yet, a core question remains unresolved: Who should bear the cost of these technical and/or organisational measures in human genomic data sharing practices? Unlimited free access to genomic data can deliver significant benefits, but the associated risks cannot be shouldered solely by data subjects. For this reason, regulations governing genomic data sharing must go beyond setting basic requirements; they must also allocate clear obligations and responsibilities to the various stakeholders involved.

5.3. Informed consent mechanisms

Besides technical security mechanisms, data protection laws often rely on informed consent mechanisms to legitimise data processors’ activities and avoid establishing complex interest-balancing frameworks. Some may contend that a stringent interpretation of data protection law, particularly the principle of informed consent, could enhance the safeguarding of genomic data. However, informed consent may not provide genuine protection; rather, it can legitimise the data provider’s exploitation of genomic data, often prioritising the provider’s benefits over the individual’s wishes.

In numerous jurisdictions, data processors are required to obtain consent from data subjects before collecting, sharing, or using their genomic data. For instance, Article 6 of the GDPR serves as the primary legal foundation for data collection and processing, with consent being a key element. When it comes to sensitive data, Article 9.2(a) of the GDPR includes provisions for cases where “the data subject has given explicit consent to the processing of those personal data for one or more specified purposes.” Similarly, the PIPL incorporates a comparable informed consent principle aimed at safeguarding sensitive information. Informed consent embodies the concept of individual autonomy (Beauchamp, 2011) and stands as a fundamental legal principle in relevant legislation. Lawful consent depends on the individual’s decision-making capacity, voluntariness, and a comprehensive grasp of relevant information (Bunnik et al., 2013).

The principle of informed consent is crucial in the realm of genomic data sharing, yet it often lacks effectiveness, enabling data controllers to manipulate the process for diverse motives (Kaye, 2012; Bietti, 2019; Oliva et al., 2024). This phenomenon is not new. Rights related to personal data primarily aim to empower individuals with control over their personal information, a concept that has been termed “privacy self-management” (Solove, 2013). Under such a framework, several shortcomings of consent are identified (Solove, 2013), including (a) cognitive limitations, which suggest that individuals often struggle to make informed and rational decisions regarding consent due to cognitive biases and a lack of understanding of complex privacy issues; (b) meaningless consent, where many individuals consent to data practices without fully grasping the implications, resulting in a scenario where consent fails to provide genuine control over personal information; and (c) structural problems, wherein the sheer volume of entities collecting personal data renders it impractical for individuals to manage their privacy effectively.

Furthermore, privacy harms frequently arise from the aggregation of data over time, complicating individuals’ ability to assess risks and benefits. These issues are particularly relevant in the context of genomic data sharing. For example, regarding cognitive limitations, research indicates that 67% of DTC testing companies fail to provide sufficient information to consumers about the use of their genomic data (Christofides and O’Doherty, 2016), with issues attributed to ambiguous language and a lack of transparency (Laestadius et al., 2017). When considering genomic data collected in a research context, or data obtained in a clinical setting and intended for future research sharing, the expansive nature of such data sharing poses substantial challenges. It becomes virtually impossible to comprehensively describe, or indeed foresee, all potential future research applications at the time of data collection (McGuire and Beskow, 2010).

The complexity of genomic data further complicates matters, making it challenging for data subjects to grasp the implications fully (Majumder et al., 2021). This underscores the need for enhanced education among healthcare professionals so that they can convey these complexities effectively and help address individuals’ cognitive limitations (Martins et al., 2022). In addition, and irrespective of the level of informed consent that is given upon the first processing, individuals undergoing WGS frequently lack awareness of how their genomic data will be utilised post-collection (McGuire and Beskow, 2010; Niemiec and Howard, 2016; Rego et al., 2020). Once they realise this, they often express dissatisfaction with companies profiting from their genomic data and perceive a lack of clarity in the consent process (Allyse, 2013).

Moreover, data providers can readily obtain consent from data subjects, either by framing it as a prerequisite in commercial testing environments or by leveraging subjects’ goodwill to advance scientific progress. In the realm of DTC testing, a pressing concern resides in the practice of conditioning access to testing services on consent to data sharing, thereby effectively coercing individuals into acquiescence. For example, while 23andMe does not explicitly detail the future uses of customers’ genomic data, its terms and conditions state: “You understand that by providing any sample, having your information processed, accessing your information, or providing information, you acquire no rights in any research or commercial products that may be developed by 23andMe or its collaborators” (23andMe, 2025). This approach raises profound ethical and legal dilemmas (Raz et al., 2020). In the EU, Article 4 of the GDPR defines “processing” in a manner that obliges companies to obtain informed consent before anonymising, pseudonymising, or sharing data (Shabani and Borry, 2018). Despite this strict requirement, compliance often amounts to little more than a procedural checkbox: companies make consent a precondition for the use of services, leaving users with no meaningful choice but to accept the terms. In scientific research, by contrast, data subjects frequently donate genomic data voluntarily, motivated by a desire to support technological advancement. Yet, even when driven by altruism, this goodwill does not guarantee that adequate safeguards will be in place when researchers share the data or third parties use the data. While many subjects donate selflessly, the diverse risks inherent in genomic data sharing cannot be dismissed.

A more complex challenge in genomic data governance relates to group consent and the secondary use of such data. Genomic research frequently relies on individual-based consent, even when working with tribal members who reside outside their communities (Tsosie et al., 2019). Yet, this model fails to account for the unique risks faced by small, cohesive groups like Indigenous tribes, where group-level harms can affect the entire community. Analysing genomic data at a collective level may compromise group privacy (Gusareva et al., 2025), creating impacts that extend far beyond individual data subjects. For this reason, group-level consent, especially for Indigenous communities, is necessary to address these broader risks. The Havasupai Tribe case, referenced earlier, illustrates this clearly: secondary use of their genomic data uncovered information not covered by initial consent clauses (such as details about ancestry and familial connections) that conflicted with the Tribe’s cultural beliefs (Garrison et al., 2019b). In addition, under the current informed consent framework, the distribution of risks and benefits of genomic research is uneven for Indigenous communities. These groups often bear substantial risks from genomic research but gain few of its associated benefits (Hudson et al., 2020).

As Professor Solove (2013) concluded, the framework of “privacy self-management” through consent is fundamentally flawed. In the context of genomic data sharing, the principle of informed consent is often reduced to a performative gesture, prioritising data providers’ interests over individual autonomy (Bonomi et al., 2020). This aligns with broader critiques of consent as a mechanism, highlighting cognitive limitations, ambiguous language, the inability to foresee future data uses, and the neglect of group interests, factors that collectively render consent an ineffective safeguard.

6. Proposals for governance reform of genomic data sharing

While the individual and societal benefits of data sharing are significant, the associated risks cannot be overlooked. The above discussion has highlighted the unique characteristics of genomic data and shown that the existing legal mechanisms governing data sharing require enhancement to address the diverse risks involved. To strengthen governance practices for genomic data, this section advocates for a rethinking of the governance mechanisms for genomic data under the concept of genomic contextualism. The proposals put forward here aim to balance two core objectives: safeguarding stakeholders’ interests and ensuring the benefits of genomic data sharing are distributed equitably. This balance is particularly critical for underrepresented communities, which have historically been excluded from reaping the rewards of such research (Fullerton, 2011). Below, I provide a detailed introduction to these proposals, which are summarised and illustrated in Figure 1. These recommendations can be adapted to both European and Chinese contexts, while also laying the groundwork for enhanced governance of genomic data in other jurisdictions.

Figure 1. Equitable governance of genomic data sharing. Data providers may share human genomic data with third parties for utilisation only if (1) data subjects give informed consent for both the acquisition and subsequent activities; (2) additional group consent is obtained when data subjects belong to a group that may face risks of harm from utilisation. Regardless of the acquisition context (clinical, research, or commercial), subsequent data activities must safeguard stakeholders’ interests through effective risk prevention and equitable distribution of derived benefits.

6.1. Supplementing informed consent with an interest-balancing principle

As discussed earlier, informed consent is ineffective for genomic data sharing: it obscures the unfair power dynamics inherent in such transactions and overlooks the risks and interests of affected groups. Moreover, the informed consent model struggles to apply at the group level, largely because reaching group consensus is inherently challenging. Civic epistemology, a framework for understanding how societies engage with science, helps explain this: different individuals, ethnic groups, and nations hold distinct perspectives on science and technology, shaped by their unique contexts (Jasanoff, 2005, p. 250). When considering the wide range of ethnic groups across nations, each with its own cultural background, this diversity of views becomes even more pronounced, further complicating efforts to secure meaningful group consent for human genomic data sharing.

This neglect of group interests is not new: Indigenous communities have been the focus of Western scientific research for centuries. For Indigenous peoples and minority groups, however, genomic data are often perceived as more sensitive than other types of health data. This sensitivity is particularly pronounced in genealogy and ancestry research, which can challenge traditionally held beliefs, reshape cultural histories, and impact claims to identity, land, and other resources (Hudson et al., 2020). A history of unethical behaviour, poor communication, disregard for cultural and spiritual beliefs, and failure to prioritise Indigenous communities’ interests has fostered deep mistrust between researchers and these groups (Garrison et al., 2019b). Beyond this mistrust, Indigenous peoples also express hesitancy to participate in genomic research due to years of being studied without seeing benefits, receiving results, or being able to prevent exploitation of their potentially patentable genetic material. Compounding this, private research entities primarily prioritise profit. They are reluctant to invest in developing healthcare products for Indigenous or minority groups when such work offers little financial return. Adding to the challenge, policymakers already struggle to design policy frameworks that advance orphan medicinal products (Aartsma-Rus et al., 2021). This pre-existing gap makes it even harder to incentivise genomic research focused on creating new healthcare solutions for Indigenous peoples or minority groups under current policy structures.

Indigenous peoples also express concerns about protecting their rights and interests. An illustrative example is the aforementioned All of Us project, sponsored by the US National Institutes of Health (USNIH). The National Congress of American Indians called on the USNIH to “assess consultation input to date, and immediately develop clear processes and guidelines: these should require individual sovereign Tribal Nations to provide prior consent before data and specimens are collected from their members, and grant Tribal Nations oversight—including local control and storage of any data or biospecimens linked to or identified as belonging to a citizen of their nation” (National Indian Health Board, 2020). Article 31 of the United Nations Declaration on the Rights of Indigenous Peoples (2007) provides the legal basis for this stance. It states: “Indigenous peoples have the right to maintain, control, protect and develop their cultural heritage, traditional knowledge and traditional cultural expressions, […] including human and genetic resources.”

Therefore, more equitable human genomic data governance must respect the sovereignty and interests of ethnic groups. This goal can be advanced through actions like community-engaged research, clear guidelines, and policies that guarantee Indigenous communities that their interests are protected, steps that may encourage greater participation from Indigenous leaders, communities, and individuals (Garrison et al., 2019b).

In practice, however, genomic research projects often fail to meet this standard: they frequently recruit Indigenous individuals living in urban centres without establishing formal partnerships or consulting the individuals’ home tribes. To address this gap and achieve genuine fairness, robust due process in legislation and policy development is essential. Scholars advocate for deliberative democratic methods as a solution (Koenig, 2014), which prioritise inclusive dialogue and collective decision-making among all stakeholders. These methods bring diverse community members into discussions about the ethical, social, and practical impacts of genomic research (Lemke et al., 2022), ensuring perspectives from data subjects, providers, and users all shape equitable genomic data management, ultimately supporting fair distribution of benefits and responsibilities.

Yet, these deliberative approaches remain more theoretical than practical for much genomic research. A key challenge is the geographic reality of many tribal communities: many members live in remote areas and have high mobility, making it difficult to implement critical protocols such as recruitment, initial consent, reconsent, and long-term follow-up (Tsosie et al., 2019). Compounding this, given that genomic research may pose greater harm than benefit to Indigenous peoples, many tribal nations are left questioning whether the value of their involvement outweighs the associated risks.

The interest-balancing mechanism goes further than securing prior consent from Indigenous peoples and further than personal self-determination; it also has the potential to enhance or complement the informed consent framework. Importantly, interest balancing focuses on the justice of human genomic data sharing activities themselves, rather than solely prioritising personal autonomy or group sovereignty. Before exploring this further, it is critical to clarify the relationship between interest balancing and risk mitigation. Risk prevention and benefit distribution are two interrelated aspects of the same challenge. When genomic data sharing does not generate sufficient benefits to cover its costs, data providers and users must assume responsibility for risk prevention, including bearing necessary financial obligations. Conversely, when sharing activities yield substantial benefits for providers and users, these gains must not be exclusively appropriated; instead, benefits should be redistributed to ensure data subjects receive equitable returns. While the fair distribution of responsibilities and the fair distribution of benefits are equally important in human genomic data sharing, current practices often impose greater risks than benefits, particularly for data subjects and related groups. For this reason, a heightened focus on risk prevention is imperative, with clear emphasis on the obligations of data providers and users.

That said, human genomic data sharing could and should deliver benefits to individuals and their communities. Personal genomic companies have already explored models to compensate individuals for contributing their genomic data to research (Grishin et al., 2018). A notable example is the data dividend model (Kudva and Aswani, 2023), where platforms compensate data subjects for data use. In genomic data sharing, many firms also use similar strategies to encourage customers to sequence their data, offering future rewards for sharing, such as helping them sell their data to researchers (Molteni, 2016) and compensating genomic data contributors with company stock (Grishin et al., 2018). Mechanisms such as data user fees (Gillette and Hopkins, 1987) can further promote fairness among all stakeholders, ensuring data subjects and related groups receive compensation. Besides financial incentives, data providers and users should also be encouraged or required to share research findings with data subjects and related groups (Ormond et al., 2025, p. 1). Concurrently, data users must act responsibly to ensure scientific benefits are equitably shared (Gil and Guerreiro, 2024, p. 1).

The interest-balancing mechanism not only improves the current informed consent framework but also holds independent value. The existing informed consent model is built around individual autonomy or self-management. Yet, even when data controllers legally obtain consent from data subjects, the fairness of these transactions may still be questioned, regardless of who covers the costs of sequencing (Hawkins and Emanuel, 2008). This issue also applies to groups: even if group consent is secured, the resulting data sharing may still be unfair. For these reasons, there is a need to pursue a more just form of informed consent, one rooted in the principles of interest balancing.

6.2. Enhancing TOMs within a data lifecycle management framework

This subsection examines the practical implementation of informed consent based on interest balancing, focusing on how TOMs can fulfil risk mitigation objectives. TOMs do not hold independent value; their purpose is derived from the foundational mechanism they support—in this case, the interest-balanced informed consent framework. For TOMs specifically, a key principle applies: they must be tailored to the unique characteristics of human genomic data (the subject matter they protect) and aligned with the principle of interest balancing (the core protection goals). In practice, this means effective technical measures should be cost-efficient, with adjustments made to keep pace with evolving technologies and societal needs. To operationalise informed consent based on interest balancing, comprehensive TOMs are required. This mechanism is built around three core components: a data lifecycle management system, human genomic data sharing platforms or spaces with registered access models, and effective protection tools. Together, these elements help mitigate diverse risks and ensure that derived benefits are distributed fairly.

Effective governance of genomic data necessitates a robust lifecycle management system, underscoring the imperative for secure, transparent, and inclusive practices across all stages of data processing. Modelled on product lifecycle frameworks in management science (Stark, 2022), genomic data lifecycle management entails a structured methodology encompassing data acquisition, secure storage, ethical sharing, responsible utilisation, and timely erasure in accordance with data subjects’ preferences. The primary objective of this system is to mitigate diverse risks and safeguard the interests of all stakeholders in the genomic data lifecycle. In China, regulatory provisions complement informed consent to govern this lifecycle. Pursuant to Article 56 of China’s Biosecurity Law, collecting, storing, sharing, or utilising genomic resources above a certain quantity requires approval from the health department of the State Council, with exceptions for routine activities like clinical diagnosis and treatment. Here, “genomic resources” include both biospecimens that generate genomic data and the genomic data themselves. Notably, however, China’s current legislation does not stipulate requirements for data erasure, a critical gap in the lifecycle management process.
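The lifecycle stages named above can be pictured as a small state machine. The following Python sketch is purely illustrative and rests on assumptions not found in any cited regulation: the `Stage` names simply mirror the five stages listed in the text, and the `ALLOWED_NEXT` table (including the choice to let erasure be reached from every earlier stage, reflecting data subjects’ preferences) is a hypothetical design.

```python
from enum import Enum


class Stage(Enum):
    """The five lifecycle stages named in the text."""
    ACQUISITION = "acquisition"
    STORAGE = "storage"
    SHARING = "sharing"
    UTILISATION = "utilisation"
    ERASURE = "erasure"


# Hypothetical transition table (an assumption for illustration only):
# erasure is reachable from every earlier stage and is terminal.
ALLOWED_NEXT = {
    Stage.ACQUISITION: {Stage.STORAGE, Stage.ERASURE},
    Stage.STORAGE: {Stage.SHARING, Stage.ERASURE},
    Stage.SHARING: {Stage.UTILISATION, Stage.STORAGE, Stage.ERASURE},
    Stage.UTILISATION: {Stage.STORAGE, Stage.ERASURE},
    Stage.ERASURE: set(),
}


def advance(current: Stage, target: Stage) -> Stage:
    """Return the target stage if the transition is allowed, else raise."""
    if target not in ALLOWED_NEXT[current]:
        raise ValueError(f"{current.value} -> {target.value} is not permitted")
    return target
```

Under this sketch, a legal regime that omits erasure requirements corresponds to a table with no path into `Stage.ERASURE`, which makes the gap in the lifecycle easy to see.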

Regulatory attention has so far concentrated on data acquisition: when individuals consent to WGS, whether for medical, research, or commercial purposes, organisations often employ measures like anonymisation or granular consent to acquire data, yet the subsequent stages of storage, sharing, and utilisation remain inadequately regulated. Even at the acquisition stage, whose significance has been recognised, the protection measures remain ineffective. As highlighted earlier, the current individual consent framework lacks provisions for group consent or consultation, and even where such input is needed, it is difficult to enforce. To address this, group consent should be established as a prerequisite for data acquisition in relevant cases. Consequently, human genomic data acquisition and subsequent activities must be preceded by separate informed consent from individuals and, when necessary, from associated groups. For comprehensive protection of human genomic data, the definition of “data acquisition” should encompass biospecimen acquisition. Additionally, regulations should specify the conditions under which individuals can collect and submit their own or others’ biospecimens for WGS and provide informed consent for the acquisition and subsequent sharing of genomic data.

Data storage is managed by data providers, and it represents their most significant contribution to genomic data governance, forming the foundation for how their interests are allocated. Storage begins once data acquisition ends, when providers take actual control of the human genomic data. Regulation (EU) 2025/327 (2025) includes provisions on data storage that offer valuable lessons. Article 72 mandates that trusted health data holders store data in a secure processing environment and comply with all requirements of the Regulation. One key compliance obligation, outlined in Article 77, is that reused health data must be publicly available via standardised machine-readable dataset catalogues. Additionally, Article 62 allows these holders to charge fees for making electronic health data available for secondary use. These rules provide a robust model for governing human genomic data storage. Providers should store human genomic data in a unified format (e.g. a standardised metadata structure) to facilitate seamless access, sharing, and usage (Byrd et al., 2020). They must also implement effective technical security measures to prevent unauthorised access and data leaks, such as the breach that affected 23andMe. Given the costs of maintaining secure, standardised storage, data providers are entitled to fair compensation.

Data sharing is central to risk prevention and interest distribution. Currently, two prominent models dominate research data sharing: the controlled-access model and the registered-access model, both designed to mitigate specific risks (Byrd et al., 2020; Dyke, 2020). The controlled-access model restricts data sharing to approved researchers for specific purposes (Ramos et al., 2013), exemplified by databases such as the US Genotypes and Phenotypes database (Mailman et al., 2007) and the EU’s European Genome–Phenome Archive (Lappalainen et al., 2015). Regulation (EU) 2025/327 (2025), which establishes the EHDS, also adopts a controlled-access model for health data reuse. A key feature of this model is the requirement for rigorous review by dedicated data access committees; however, this process can lead to delays (Dyke, 2020). In the EHDS, health data access bodies perform a similar role to these committees, meaning they face the same dilemma of balancing thoroughness with efficiency.

For ethical and efficient data sharing, the registered-access model proposed by GA4GH offers a viable alternative. This model builds on the well-established role-based access control framework used in information technology (IT) security. Unlike controlled-access models (which typically require approval for specific research projects), registered access grants users online access based on their role and a prior risk analysis (Dyke et al., 2018). In theory, this model could enable access to all shared human genomic data via a unified general registration process, eliminating the need for individualised data access committee reviews. To operate, it would require funding (either public or from contributions by data providers or users) and mandate user registration, identity verification, and declaration of intended data use.
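To make the contrast with committee review concrete, the registration steps described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the GA4GH specification: the role names, the numeric sensitivity tiers in `ROLE_MAX_TIER`, and the `may_access` function are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical mapping from role to the most sensitive dataset tier that
# role may open; a real platform would derive it from a prior risk analysis.
ROLE_MAX_TIER = {"bona_fide_researcher": 2, "clinician": 1}


@dataclass(frozen=True)
class Registration:
    """One registered user, as produced by a general registration process."""
    user_id: str
    role: str
    identity_verified: bool
    declared_use: str


def may_access(reg: Registration, dataset_tier: int) -> bool:
    """Grant access by role alone, with no per-project committee review."""
    # Registration is only valid with a verified identity and a declared use.
    if not reg.identity_verified or not reg.declared_use.strip():
        return False
    # Unknown roles default to tier 0 (assumed here to mean open data only).
    return dataset_tier <= ROLE_MAX_TIER.get(reg.role, 0)
```

The design choice worth noting is that the decision depends only on the registration record and the dataset tier, which is what removes the per-request delay associated with data access committees.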

Beyond risk mitigation, fair benefit allocation must also be considered in human genomic data sharing. The EHDS falls short here, as it requires electronic health data to be anonymised before secondary reuse, effectively excluding data subjects from compensation. Article 62 of Regulation (EU) 2025/327 (2025) allows health data access bodies to charge fees for providing electronic health data for secondary use, but these data are pseudonymised or anonymised. As argued earlier, such anonymisation is ineffective for genomic data, yet it still bars data subjects (and related groups) from receiving financial benefits from their data. Even if effective anonymisation could be achieved, fair benefit allocation would still be needed. That said, the EHDS does include one benefit for data subjects: it states that “[a] healthcare provider or a third party shall not directly or indirectly charge data subjects a fee or costs, or require compensation, for sharing or accessing data.” This is reasonable given individuals’ need for primary access to their own health data. However, when applied to human genomic data, which carry unique value and risks, this approach requires further assessment through a rigorous benefit–cost analysis to ensure equity.

Data utilisation is the core objective of human genomic data sharing, encompassing two key forms: use by data subjects themselves (primary use) and reuse by third parties (secondary use). The EU’s EHDS offers valuable insights here, as it is designed to facilitate access to electronic health data for both primary and secondary uses. For primary use, the EHDS outlines a set of rights in Chapter II of Regulation (EU) 2025/327 (2025) to support individuals and their representatives in accessing electronic health data. However, a critical gap remains: Regulation (EU) 2025/327 (2025) does not explicitly clarify whether individuals who undergo commercial genomic testing can require testing companies to share their genomic data with healthcare providers. This matters because integrating such data into electronic health records (EHRs) could enable both primary use (for personal healthcare) and secondary use (for research). While genomic data integration into EHRs is still in its early stages, it holds significant potential, including improving personal health outcomes, enabling effective clinical application of genomic data, and advancing genomic research (Walton et al., 2019).

Third-party secondary reuse of health data under the EHDS is tightly constrained: it is limited to specific purposes and explicitly excludes harmful activities. Article 53 of Regulation (EU) 2025/327 (2025) permits reuse for scientific research in health or care, policymaking and regulatory work, education and teaching, and activities to improve public or occupational health. In contrast, Article 54 prohibits reuse for three key purposes: making detrimental or discriminatory decisions about individuals or groups, conducting advertising or marketing, and developing products or services that could harm individuals, public health, or society. These detailed rules on permitted and prohibited third-party reuse create strict safeguards, helping to control risks linked to data sharing. Genomic data sharing can draw direct lessons from this framework: imposing clear limits on third-party secondary use would similarly mitigate risks while preserving the value of genomic data for beneficial purposes.
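The permitted and prohibited purposes summarised above lend themselves to a simple rule check when a data access request is screened. The sketch below is illustrative only: the purpose labels paraphrase this section's summary of Articles 53 and 54 rather than the legal text, and the `Purpose` enum, the `UNLISTED` category, and the `review_request` function are hypothetical names introduced here.

```python
from enum import Enum, auto
from typing import Iterable


class Purpose(Enum):
    # Labels paraphrase the purposes summarised in the text above.
    HEALTH_RESEARCH = auto()
    POLICYMAKING_REGULATION = auto()
    EDUCATION_TEACHING = auto()
    PUBLIC_OCCUPATIONAL_HEALTH = auto()
    DISCRIMINATORY_DECISIONS = auto()
    ADVERTISING_MARKETING = auto()
    HARMFUL_PRODUCTS = auto()
    UNLISTED = auto()  # hypothetical catch-all for anything else


PERMITTED = frozenset({
    Purpose.HEALTH_RESEARCH,
    Purpose.POLICYMAKING_REGULATION,
    Purpose.EDUCATION_TEACHING,
    Purpose.PUBLIC_OCCUPATIONAL_HEALTH,
})

PROHIBITED = frozenset({
    Purpose.DISCRIMINATORY_DECISIONS,
    Purpose.ADVERTISING_MARKETING,
    Purpose.HARMFUL_PRODUCTS,
})


def review_request(purposes: Iterable[Purpose]) -> str:
    """Return 'reject', 'approve', or 'refer' for a set of declared purposes."""
    declared = frozenset(purposes)
    if not declared:
        return "reject"   # no purpose declared
    if declared & PROHIBITED:
        return "reject"   # any prohibited purpose blocks the request
    if declared <= PERMITTED:
        return "approve"  # every declared purpose is on the permitted list
    return "refer"        # unlisted purposes go to human review
```

A prohibited purpose overrides any permitted one in the same request, which mirrors the idea that explicit exclusions act as a hard limit on reuse; a comparable rule set for genomic data would need its own, more specific purpose lists.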

Data erasure is essential for enabling individuals whose genomic data have been collected to regain control over their data. To achieve comprehensive control, genomic data erasure must include the deletion of entire genomic and genetic datasets, along with associated information and biospecimens. The permanent nature of genomic data and the lack of legal mandates for its erasure in many jurisdictions underscore the need to enshrine a legally enforceable right to data deletion as a cornerstone of data subject autonomy (Gassner, 2021). By embedding erasure as a default mechanism, this framework ensures that individuals retain control over the duration of their data’s existence and mitigates long-term privacy risks associated with indefinite data retention. Regulation (EU) 2025/327 (2025) grants individuals a range of rights that overlap in function with data erasure, such as the right to data portability, the right to restrict access, and the right to opt out of primary data use. However, these rights are not equivalent to a full right to deletion. Notably, the right to opt out only applies to primary use and excludes secondary reuse of health data. This gap stems from the Regulation’s classification of secondary use data as anonymised (and thus non-personal), which also explains why the text does not grant individuals an explicit right to delete their data. A right to delete is nonetheless essential: data subjects can only truly retain control if they can monitor data use, request erasure, and withdraw consent effectively. Without this right, these other protections remain incomplete. Ultimately, incorporating a right to delete into human genomic data sharing systems would balance progress in genomic research with the protection of stakeholder interests.

In summary, interest balancing and data lifecycle management function as enhancement mechanisms for genomic data protection, grounded in the principle of genomic contextualism. Their underlying logic differs from the individual-centric informed consent mechanism, with the proposals in this article requiring sui generis protection or specific provisions within general or health data protection laws.

7. Conclusion and future work

This article highlights the need for enhanced governance of human genomic data sharing. It reviews the concept of genomic data and the historical development of HGPs and emphasises, first, that genomic data constitute collective personal data. Drawing on this uniqueness, the article argues that the concept of “genomic contextualism” should be applied to govern these distinct data. It also outlines a tripartite taxonomy of risks in genomic data sharing. To improve governance and mitigate associated risks, the article compares and analyses data protection frameworks in the EU and China, highlighting that current systems may be insufficient to address all the risks posed by genomic data sharing.

The article further puts forward governance reform recommendations to promote responsible data sharing practices. It stresses that group consent is required where genomic data sharing and related activities impact group interests. Additionally, it proposes that data protection systems should be built on the principle of interest balancing, moving beyond over-reliance on informed consent alone. To implement this principle and comprehensively mitigate the risks of genomic data sharing, the article suggests establishing a data lifecycle management framework supported by effective TOMs. Collectively, these recommendations aim to foster responsible genomic data sharing, reduce diverse risks, and ensure equitable benefits for all involved stakeholders.

Beyond the analytical content of this article, this work seeks to drive both practical and critical research on the global landscape of genomic data governance. In particular, cross-border genomic data sharing poses unique challenges, as it involves balancing transnational interests and even intersects with national competition. We must also address the rapid development of AI, ML, synthetic biology, and other related technologies. These fields are converging with genomics and possess enormous potential, capable of delivering profound benefits to humanity or posing severe risks to its well-being.

Data availability statement

No publicly available datasets were utilised or generated in this research.

Acknowledgments

I am grateful to everyone who helped improve the quality of the article at different stages of publication, especially the reviewers and the editor for their comments/suggestions.

Author contribution

G.W.: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing—original draft, Writing—review and editing. The author alone carried out the research and was responsible for drafting and editing the manuscript.

Competing interests

The author declares none.

References

23andMe (2023) Addressing Data Security Concerns—Action Plan. Available at https://blog.23andme.com/articles/addressing-data-security-concerns (accessed 5 December 2024).

23andMe (2025) Terms of Service. Available at https://www.23andme.com/legal/terms-of-service/ (accessed 14 March 2025).

Aartsma-Rus, A, Dooms, M and Le Cam, Y (2021) Orphan medicine incentives: How to address the unmet needs of rare disease patients by optimizing the European orphan medicinal product landscape guiding principles and policy proposals by the European expert Group for Orphan Drug Incentives (OD expert group). Frontiers in Pharmacology 12. https://doi.org/10.3389/fphar.2021.744532.

Ahmed, E and Shabani, M (2019) DNA data marketplace: An analysis of the ethical concerns regarding the participation of the individuals. Frontiers in Genetics 10. https://doi.org/10.3389/fgene.2019.01107.

Allyse, M (2013) 23 and me, we, and you: Direct-to-consumer genetics, intellectual property, and informed consent. Trends in Biotechnology 31(2), 68–69. https://doi.org/10.1016/j.tibtech.2012.11.007.

Altman, RB, Clayton, EW, Kohane, IS, Malin, BA and Roden, DM (2013) Data re-identification: Societal safeguards. Science 339(6123), 1032–1033. https://doi.org/10.1126/science.339.6123.1032-c.

Auton, A, Abecasis, GR, Altshuler, DM, Durbin, RM, Abecasis, GR, Bentley, DR, Chakravarti, A, Clark, AG, Donnelly, P, Eichler, EE, Flicek, P, Gabriel, SB, Gibbs, RA, Green, ED, Hurles, ME, Knoppers, BM, Korbel, JO, Lander, ES, Lee, C and National Eye Institute, NIH (2015) A global reference for human genetic variation. Nature 526(7571), 68–74. https://doi.org/10.1038/nature15393.

Bagger, FO, Borgwardt, L, Jespersen, AS, Hansen, AR, Bertelsen, B, Kodama, M and Nielsen, FC (2024) Whole genome sequencing in clinical practice. BMC Medical Genomics 17(1), 39. https://doi.org/10.1186/s12920-024-01795-w.

Beauchamp, TL (2011) Informed consent: Its history, meaning, and present challenges. Cambridge Quarterly of Healthcare Ethics 20(4), 515–523. https://doi.org/10.1017/S0963180111000259.

Belkadi, A, Bolze, A, Itan, Y, Cobat, A, Vincent, QB, Antipenko, A, Shang, L, Boisson, B, Casanova, J-L and Abel, L (2015) Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants. Proceedings of the National Academy of Sciences 112(17), 5473–5478. https://doi.org/10.1073/pnas.1418631112.

Bentley, DR, Balasubramanian, S, Swerdlow, HP, Smith, GP, Milton, J, Brown, CG, Hall, KP, Evers, DJ, Barnes, CL, Bignell, HR, Boutell, JM, Bryant, J, Carter, RJ, Keira Cheetham, R, Cox, AJ, Ellis, DJ, Flatbush, MR, Gormley, NA, Humphray, SJ, Irving, LJ, Karbelashvili, MS, Kirk, SM, Li, H, Liu, X, Maisinger, KS, Murray, LJ, Obradovic, B, Ost, T, Parkinson, ML, Pratt, MR, Rasolonjatovo, IMJ, Reed, MT, Rigatti, R, Rodighiero, C, Ross, MT, Sabot, A, Sankar, SV, Scally, A, Schroth, GP, Smith, ME, Smith, VP, Spiridou, A, Torrance, PE, Tzonev, SS, Vermaas, EH, Walter, K, Wu, X, Zhang, L, Alam, MD, Anastasi, C, Aniebo, IC, Bailey, DMD, Bancarz, IR, Banerjee, S, Barbour, SG, Baybayan, PA, Benoit, VA, Benson, KF, Bevis, C, Black, PJ, Boodhun, A, Brennan, JS, Bridgham, JA, Brown, RC, Brown, AA, Buermann, DH, Bundu, AA, Burrows, JC, Carter, NP, Castillo, N, Chiara, E, Catenazzi, M, Chang, S, Neil Cooley, R, Crake, NR, Dada, OO, Diakoumakos, KD, Dominguez-Fernandez, B, Earnshaw, DJ, Egbujor, UC, Elmore, DW, Etchin, SS, Ewan, MR, Fedurco, M, Fraser, LJ, Fuentes Fajardo, KV, Scott Furey, W, George, D, Gietzen, KJ, Goddard, CP, Golda, GS, Granieri, PA, Green, DE, Gustafson, DL, Hansen, NF, Harnish, K, Haudenschild, CD, Heyer, NI, Hims, MM, Ho, JT, Horgan, AM, Hoschler, K, Hurwitz, S, Ivanov, DV, Johnson, MQ, James, T, Huw Jones, TA, Kang, G-D, Kerelska, TH, Kersey, AD, Khrebtukova, I, Kindwall, AP, Kingsbury, Z, Kokko-Gonzales, PI, Kumar, A, Laurent, MA, Lawley, CT, Lee, SE, Lee, X, Liao, AK, Loch, JA, Lok, M, Luo, S, Mammen, RM, Martin, JW, McCauley, PG, McNitt, P, Mehta, P, Moon, KW, Mullens, JW, Newington, T, Ning, Z, Ling Ng, B, Novo, SM, O’Neill, MJ, Osborne, MA, Osnowski, A, Ostadan, O, Paraschos, LL, Pickering, L, Pike, AC, Pike, AC, Chris Pinkard, D, Pliskin, DP, Podhasky, J, Quijano, VJ, Raczy, C, Rae, VH, Rawlings, SR, Chiva Rodriguez, A, Roe, PM, Rogers, J, Rogert Bacigalupo, MC, Romanov, N, Romieu, A, Roth, RK, Rourke, NJ, Ruediger, ST, Rusman, E, Sanches-Kuiper, RM, Schenker, MR, Seoane, JM, Shaw, RJ, Shiver, MK, Short, SW, Sizto, NL, Sluis, JP, Smith, MA, Ernest Sohna Sohna, J, Spence, EJ, Stevens, K, Sutton, N, Szajkowski, L, Tregidgo, CL, Turcatti, G, vandeVondele, S, Verhovsky, Y, Virk, SM, Wakelin, S, Walcott, GC, Wang, J, Worsley, GJ, Yan, J, Yau, L, Zuerlein, M, Rogers, J, Mullikin, JC, Hurles, ME, McCooke, NJ, West, JS, Oaks, FL, Lundberg, PL, Klenerman, D, Durbin, R and Smith, AJ (2008) Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456(7218), 53–59. https://doi.org/10.1038/nature07517.

Berndt Rasmussen, K (2019) Harm and discrimination. Ethical Theory and Moral Practice 22(4), 873–891. https://doi.org/10.1007/s10677-018-9908-4.

Bick, AG, Metcalf, GA, Mayo, KR, Lichtenstein, L, Rura, S, Carroll, RJ, Musick, A, Linder, JE, Jordan, IK, Nagar, SD, Sharma, S, Meller, R, Basford, M, Boerwinkle, E, Cicek, MS, Doheny, KF, Eichler, EE, Gabriel, S, Gibbs, RA and NIH All of Us Research Program Staff (2024) Genomic data in the all of us research program. Nature 627(8003), 340–346. https://doi.org/10.1038/s41586-023-06957-x.

Bietti, E (2019) Consent as a free pass: Platform power and the limits of the informational turn. Pace Law Review 40(1), 310–398. https://doi.org/10.58948/2331-3528.2013.

Bilkey, GA, Burns, BL, Coles, EP, Bowman, FL, Beilby, JP, Pachter, NS, Baynam, G, JS Dawkins, H, Nowak, KJ and Weeramanthri, TS (2019) Genomic testing for human health and disease across the life cycle: Applications and ethical, legal, and social challenges. Frontiers in Public Health 7, 40. https://doi.org/10.3389/fpubh.2019.00040.

Biosecurity Law of the People’s Republic of China (2024 Amendment) (2024) Order No. 24 of the President of the People’s Republic of China (in Chinese). Available at https://flk.npc.gov.cn/detail?id=ff8081818d6a424b01905e5e6b702e66&fileId=&type=&title=%E4%B8%AD%E5%8D%8E%E4%BA%BA%E6%B0%91%E5%85%B1%E5%92%8C%E5%9B%BD%E7%94%9F%E7%89%A9%E5%AE%89%E5%85%A8%E6%B3%95 (accessed 5 March 2025).

Bonomi, L, Huang, Y and Ohno-Machado, L (2020) Privacy challenges and research opportunities for genomic data sharing. Nature Genetics 52(7), 646–654. https://doi.org/10.1038/s41588-020-0651-0.

Bradford, A (2020) The Brussels Effect: How the European Union Rules the World. New York, New York, United States: Oxford University Press. https://doi.org/10.1093/oso/9780190088583.001.0001.

Brent, R, McKelvey, TG and Matheny, J (2024) The new bioweapons: How synthetic biology could destabilize the world. Foreign Affairs 103, 148–159.

Brockmann, K, Bauer, S and Boulanin, V (2019) Bio Plus X: Arms Control and the Convergence of Biology and Emerging Technologies. Stockholm International Peace Research Institute. https://www.sipri.org/publications/2019/policy-reports/bio-plus-x-arms-control-and-convergence-biology-and-emerging-technologies.

Bunnik, EM, de Jong, A, Nijsingh, N and de Wert, GMWR (2013) The new genetics and informed consent: Differentiating choice to preserve autonomy. Bioethics 27(6), 348–355. https://doi.org/10.1111/bioe.12030.

Byrd, JB, Greene, AC, Prasad, DV, Jiang, X and Greene, CS (2020) Responsible, practical genomic data sharing that accelerates research. Nature Reviews Genetics 21(10), 615–629. https://doi.org/10.1038/s41576-020-0257-5.

Calo, R (2011) The boundaries of privacy harm. Indiana Law Journal 86(3), 1131–1162.

Casolari, F, Buttaboni, C and Floridi, L (2023) The EU data act in context: A legal assessment. International Journal of Law and Information Technology 31(4), 399–412. https://doi.org/10.1093/ijlit/eaae005.

Chapman, CR, Quinn, GP, Natri, HM, Berrios, C, Dwyer, P, Owens, K, Heraty, S and Caplan, AL (2023) Consideration and disclosure of group risks in genomics and other data-centric research: Does the common rule need revision? The American Journal of Bioethics, 1–14. https://doi.org/10.1080/15265161.2023.2276161.

Charter of Fundamental Rights of the European Union (2007) OJ C 303. Available at http://data.europa.eu/eli/treaty/char_2007/oj/eng (accessed 5 March 2025).

Cho, H, Wu, DJ and Berger, B (2018) Secure genome-wide association analysis using multiparty computation. Nature Biotechnology 36(6). https://doi.org/10.1038/nbt.4108.

Christofides, E and O’Doherty, K (2016) Company disclosure and consumer perceptions of the privacy implications of direct-to-consumer genetic testing. New Genetics and Society 35(2), 101–123. https://doi.org/10.1080/14636778.2016.1162092.

Citron, DK and Solove, DJ (2022) Privacy harms. Boston University Law Review 102(3), 793–864.

Constitution of the People’s Republic of China (2018 Amendment) (2018) Announcement No. 1 of the National People’s Congress of the People’s Republic of China (in Chinese). Available at http://www.npc.gov.cn/npc/c191/c505/201905/t20190521_263492.html (accessed 5 March 2025).

Costello, RÁ (2022) Genetic data and the right to privacy: Towards a relational theory of privacy? Human Rights Law Review 22(1), ngab031. https://doi.org/10.1093/hrlr/ngab031.

d’Almeida, LD (2017) Arguing a fortiori. Modern Law Review 80(2), 202–237. https://doi.org/10.1111/1468-2230.12252.

Detailed Rules for the Implementation of the Regulation on the Administration of Human Genetic Resources (2023) Order No. 21 of the Ministry of Science and Technology (in Chinese). Available at https://www.most.gov.cn/xxgk/xinxifenlei/fdzdgknr/fgzc/bmgz/202306/t20230601_186416.html (accessed 5 March 2025).

Deuber, D, Egger, C, Fech, K, Malavolta, G, Schröder, D, Thyagarajan, SAK, Battke, F and Durand, C (2019) My genome belongs to me: Controlling third party computation on genomic data. Proceedings on Privacy Enhancing Technologies 2019(1), 108–132. https://doi.org/10.2478/popets-2019-0007.

Devuyst, O (2015) The 1000 genomes project: Welcome to a New World. Peritoneal Dialysis International: Journal of the International Society for Peritoneal Dialysis 35(7), 676–677. https://doi.org/10.3747/pdi.2015.00261.

Dieuliis, D (2018) Biodata risks and synthetic biology: A critical juncture. Journal of Bioterrorism & Biodefense 09. https://doi.org/10.4172/2157-2526.1000159.

Ding, X (2024) China’s approach to legal protection on publicly available personal information. Law Science 3(2), 287–314.

Du, L and Wang, M (2020) Genetic privacy and data protection: A review of Chinese direct-to-consumer genetic test services. Frontiers in Genetics 11. https://doi.org/10.3389/fgene.2020.00416.

Dyke, SOM (2020) Chapter 2—Genomic data access policy models. In Jiang, X and Tang, H (eds), Responsible Genomic Data Sharing. San Diego, California, United States: Academic Press, pp. 19–32. https://doi.org/10.1016/B978-0-12-816197-5.00002-4.

Dyke, SOM, Linden, M, Lappalainen, I, De Argila, JR, Carey, K, Lloyd, D, Spalding, JD, Cabili, MN, Kerry, G, Foreman, J, Cutts, T, Shabani, M, Rodriguez, LL, Haeussler, M, Walsh, B, Jiang, X, Wang, S, Perrett, D, Boughtwood, T, Matern, A, Brookes, AJ, Cupak, M, Fiume, M, Pandya, R, Tulchinsky, I, Scollen, S, Törnroos, J, Das, S, Evans, AC, Malin, BA, Beck, S, Brenner, SE, Nyrönen, T, Blomberg, N, Firth, HV, Hurles, M, Philippakis, AA, Rätsch, G, Brudno, M, Boycott, KM, Rehm, HL, Baudis, M, Sherry, ST, Kato, K, Knoppers, BM, Baker, D and Flicek, P (2018) Registered access: Authorizing data access. European Journal of Human Genetics 26(12), 1721–1731. https://doi.org/10.1038/s41431-018-0219-y.

El Emam, K (2011) Methods for the de-identification of electronic health records for genomic research. Genome Medicine 3(4), 25. https://doi.org/10.1186/gm239.

Elliot, M, O’Hara, K, Raab, C, O’Keefe, CM, Mackey, E, Dibben, C, Gowans, H, Purdam, K and McCullagh, K (2018) Functional anonymisation: Personal data and the data environment. Computer Law & Security Review 34(2), 204–221. https://doi.org/10.1016/j.clsr.2018.02.001.

Entrikin, JL (2019) Family secrets and relational privacy: Protecting not-so-personal, sensitive information from public disclosure. University of Miami Law Review 74(3), 781–897.

Erie, MS and Streinz, T (2021) The Beijing effect: China’s digital silk road as transnational data governance. New York University Journal of International Law and Politics 54(1), 1–92.

Erlich, Y, Shor, T, Pe’er, I and Carmi, S (2018) Identity inference of genomic data using long-range familial searches. Science 362(6415), 690–694. https://doi.org/10.1126/science.aau4832.

Erlich, Y, Williams, JB, Glazer, D, Yocum, K, Farahany, N, Olson, M, Narayanan, A, Stein, LD, Witkowski, JA and Kain, RC (2014) Redefining genomic privacy: Trust and empowerment. PLoS Biology 12(11), e1001983. https://doi.org/10.1371/journal.pbio.1001983.

Evans, JP, Burke, W and Khoury, M (2010) The rules remain the same for genomic medicine: The case against “reverse genetic exceptionalism”. Genetics in Medicine 12(6), 342–343. https://doi.org/10.1097/GIM.0b013e3181deb308.

Fatumo, S, Yakubu, A, Oyedele, O, Popoola, J, Attipoe, DA, Eze-Echesi, G, Modibbo, FZ, Ado-Wanka, N, Salako, O, Nashiru, O, Salako, BL, O’Dushlaine, C and Ene-Obong, A (2022) Promoting the genomic revolution in Africa through the Nigerian 100K genome project. Nature Genetics 54(5), 531–536. https://doi.org/10.1038/s41588-022-01071-6.

Fullerton, SM (2011) The input-output problem: Whose DNA do we study, and why does it matter. In Achieving Justice in Genomic Translation: Re-Thinking the Pathway to Benefit (pp. 40–53). USA: Oxford University Press.

Fuster, GG (2014) The Emergence of Personal Data Protection as a Fundamental Right of the EU. Cham, Switzerland: Springer International Publishing. https://doi.org/10.1007/978-3-319-05023-2.

Gallagher, MD and Chen-Plotkin, AS (2018) The post-GWAS era: From association to function. The American Journal of Human Genetics 102(5), 717–730. https://doi.org/10.1016/j.ajhg.2018.04.002.

Garner, SA and Kim, J (2018) The privacy risks of direct-to-consumer genetic testing: A case study of 23andMe and ancestry. Washington University Law Review 96(6), 1219–1266.

Garrison, NA (2013) Genomic justice for native Americans: Impact of the Havasupai case on genetic research. Science, Technology, & Human Values 38(2), 201–223. https://doi.org/10.1177/0162243912470009.

Garrison, NA, Brothers, KB, Goldenberg, AJ and Lynch, JA (2019a) Genomic Contextualism: Shifting the rhetoric of genetic exceptionalism. The American Journal of Bioethics 19(1), 51–63. https://doi.org/10.1080/15265161.2018.1544304.

Garrison, NA, Hudson, M, Ballantyne, LL, Garba, I, Martinez, A, Taualii, M, Arbour, L, Caron, NR and Rainie, SC (2019b) Genomic research through an indigenous lens: Understanding the expectations. Annual Review of Genomics and Human Genetics 20(1), 495–517. https://doi.org/10.1146/annurev-genom-083118-015434.

Gassner, AS (2021) The right to delete: Protecting consumer autonomy in direct-to-consumer genetic testing notes. UC Irvine Law Review 12(1), 267–314.

Gerke, S, Jacoby, MB and Cohen, IG (2025) 23andMe’s bankruptcy raises concerns about privacy in the era of big data. BMJ 389, r1071. https://doi.org/10.1136/bmj.r1071.

Gil, JC and Guerreiro, J (2024) The consumer genome: Willingness to share and accept genetic data in marketing. Electronic Markets 35(1), 1. https://doi.org/10.1007/s12525-024-00744-w.

Gillette, CP and Hopkins, TD (1987) Federal User Fees: A legal and economic analysis. Boston University Law Review 67(5), 795–876.

Gitschier, J (2009) Inferential genotyping of Y chromosomes in latter-day saints founders and comparison to Utah samples in the HapMap project. The American Journal of Human Genetics 84(2), 251–258. https://doi.org/10.1016/j.ajhg.2009.01.018.

Green, MJ and Botkin, JR (2003) ‘Genetic exceptionalism’ in medicine: Clarifying the differences between genetic and nongenetic tests. Annals of Internal Medicine 138(7), 571–575. https://doi.org/10.7326/0003-4819-138-7-200304010-00013.

Grishin, D, Obbad, K, Estep, P, Quinn, K, Zaranek, SW, Zaranek, AW, Vandewege, W, Clegg, T, César, N, Cifric, M and Church, G (2018) Accelerating genomic data generation and facilitating genomic data access using decentralization, privacy-preserving technologies and equitable compensation. Blockchain in Healthcare Today 1. https://doi.org/10.30953/bhty.v1.34.

Guo, S-W (2008) Variation in genetic identity among relatives. Human Heredity 46(2), 61–70. https://doi.org/10.1159/000154328.

Gürsoy, G (2020) Chapter 1—Criticality of data sharing in genomic research and public views of genomic data sharing. In Jiang, X and Tang, H (eds), Responsible Genomic Data Sharing. San Diego, California, United States: Academic Press, pp. 3–18. https://doi.org/10.1016/B978-0-12-816197-5.00001-2.

Gürsoy, G (2022) Genome privacy and trust. Annual Review of Biomedical Data Science 5(1), 163–181. https://doi.org/10.1146/annurev-biodatasci-122120-021311.

Gusareva, ES, Ghosh, AG, Kharkov, VN, Khor, S-S, Zarubin, A, Moshkov, N, Kalsi, N, Ratan, A, Heinle, CE, Cooke, N, Bravi, CM, Smolnikova, MV, Tereshchenko, SYu, Kasparov, EW, Khitrinskaya, I, Marusin, A, Razhabov, MO, Golubenko, MV, Swarovskaya, M, Kolesnikov, NA, Vagaitseva, KV, Eremina, ER, Sukhomyasova, A, Shtygasheva, O, Panicker, D, Ang, PN, Lee, CF, Koh, Y, Leong, ST, Park, C, Lohar, SR, Yap, ZH, Ng, SG, Dacanay, J, Drautz-Moses, DI, Ramli, NAB, Tokunaga, K, McGonigle, I, Danjoh, I, Moreno-Estrada, A, Tajima, A, Tanabe, H, Nakamura, Y, Nakagome, S, Tatarinova, TV, Stepanov, VA, Schuster, SC and Kim, HL (2025) From North Asia to South America: Tracing the longest human migration through genomic sequencing. Science 388(6748), eadk5081. https://doi.org/10.1126/science.adk5081.

Haag, M (2019, February 4) FamilyTreeDNA Admits to Sharing Genetic Data With F.B.I. The New York Times. Available at https://www.nytimes.com/2019/02/04/business/family-tree-dna-fbi.html.

Haeusermann, T, Fadda, M, Blasimme, A, Tzovaras, BG and Vayena, E (2018) Genes wide open: Data sharing and the social gradient of genomic privacy. AJOB Empirical Bioethics 9(4), 207–221. https://doi.org/10.1080/23294515.2018.1550123.

Harbord, K (2019) Genetic data privacy solutions in the GDPR comment. Texas A&M Law Review 7(1), 269–298. https://doi.org/10.37419/LR.V7.I1.6.

Hartl, DL and Cochrane, BJ (2017) Genetics—Analysis of Genes and Genomes, 9th Edn. Burlington, Massachusetts, United States: Jones & Bartlett Learning.

Hawkins, JS and Emanuel, EJ (2008) Exploitation and Developing Countries: The Ethics of Clinical Research. Princeton, New Jersey, United States: Princeton University Press. https://doi.org/10.1515/9781400837328.

Hendrycks, D, Mazeika, M and Woodside, T (2023) An overview of catastrophic AI risks. Preprint, arXiv:2306.12001. https://doi.org/10.48550/arXiv.2306.12001.

Holthouse, R, Owens, S and Bhunia, S (2025) The 23andMe data breach: Analyzing credential stuffing attacks, security vulnerabilities, and mitigation strategies. Preprint, arXiv:2502.04303. https://doi.org/10.48550/arXiv.2502.04303.

Howley, C, Haas, MA, Muftah, WAA, Annan, RB, Green, ED, Lundgren, B, Scott, RH, Stark, Z, Tan, P, North, KN and Boughtwood, T (2025) The expanding global genomics landscape: Converging priorities from national genomics programs. The American Journal of Human Genetics 112(4), 751–763. https://doi.org/10.1016/j.ajhg.2025.02.008.

Hoxhaj, I, Stojanovic, J, Sassano, M, Acampora, A and Boccia, S (2020) A review of the legislation of direct-to-consumer genetic testing in EU member states. European Journal of Medical Genetics 63(4), 103841. https://doi.org/10.1016/j.ejmg.2020.103841.

Hudson, M, Garrison, NA, Sterling, R, Caron, NR, Fox, K, Yracheta, J, Anderson, J, Wilcox, P, Arbour, L, Brown, A, Taualii, M, Kukutai, T, Haring, R, Te Aika, B, Baynam, GS, Dearden, PK, Chagné, D, Malhi, RS, Garba, I, Tiffin, N, Bolnick, D, Stott, M, Rolleston, AK, Ballantyne, LL, Lovett, R, David-Chavez, D, Martinez, A, Sporle, A, Walter, M, Reading, J and Carroll, SR (2020) Rights, interests and expectations: Indigenous perspectives on unrestricted access to genomic data. Nature Reviews Genetics 21(6). https://doi.org/10.1038/s41576-020-0228-x.

International Human Genome Sequencing Consortium (2004) Finishing the euchromatic sequence of the human genome. Nature 431(7011), 931–945. https://doi.org/10.1038/nature03001.

Jasanoff, S (2005) Designs on Nature: Science and Democracy in Europe and the United States. Princeton, New Jersey, United States: Princeton University Press. https://doi.org/10.1515/9781400837311.

Joly, Y, Dupras, C, Pinkesz, M, Tovino, SA and Rothstein, MA (2020) Looking beyond GINA: Policy approaches to address genetic discrimination. Annual Review of Genomics and Human Genetics 21(1), 491–507. https://doi.org/10.1146/annurev-genom-111119-011436.

Jonsson, H, Magnusdottir, E, Eggertsson, HP, Stefansson, OA, Arnadottir, GA, Eiriksson, O, Zink, F, Helgason, EA, Jonsdottir, I, Gylfason, A, Jonasdottir, A, Jonasdottir, A, Beyter, D, Steingrimsdottir, T, Norddahl, GL, Magnusson, OT, Masson, G, Halldorsson, BV, Thorsteinsdottir, U, Helgason, A, Sulem, P, Gudbjartsson, DF and Stefansson, K (2021) Differences between germline genomes of monozygotic twins. Nature Genetics 53(1). https://doi.org/10.1038/s41588-020-00755-1.

Kaiser, B, Uberoi, D, Raven-Adams, MC, Cheung, K, Bruns, A, Chandrasekharan, S, Otlowski, M, Prince, AER, Tiller, J, Ahmed, A, Bombard, Y, Dupras, C, Moreno, PG, Ryan, R, Valderrama-Aguirre, A and Joly, Y (2024) A proposal for an inclusive working definition of genetic discrimination to promote a more coherent debate. Nature Genetics 56(7), 1339–1345. https://doi.org/10.1038/s41588-024-01786-8.

Kaiser, J (2021) 200,000 whole genomes made available for biomedical studies. Science 374(6571), 1036–1036. https://doi.org/10.1126/science.acx9689.

Kaye, J (2012) The tension between data sharing and the protection of privacy in genomics research. Annual Review of Genomics and Human Genetics 13, 415–431. https://doi.org/10.1146/annurev-genom-082410-101454.

Kim, H, Ho, CWL, Ho, C-H, Athira, PS, Kato, K, De Castro, L, Kang, H, Huxtable, R, Zwart, H, Ives, J, Lee, I, Joly, Y and Kim, SY (2021) Genetic discrimination: Introducing the Asian perspective to the debate. NPJ Genomic Medicine 6(1), 54. https://doi.org/10.1038/s41525-021-00218-4.

Koenig, BA (2014) Have we asked too much of consent? Hastings Center Report 44(4), 33–34. https://doi.org/10.1002/hast.329.

Kudva, S and Aswani, A (2023) When would online platforms pay data dividends? In 2023 American Control Conference, ACC. San Diego, CA: IEEE, pp. 1692–1697. https://doi.org/10.23919/ACC55779.2023.10156068.

Kumuthini, J, Zass, L, Chaouch, M, Fadlelmola, FM, Mulder, N, Radouani, F, Ras, V, Samtal, C, Tchamga, MSS, Sathan, D, Ghoorah, A, Sangeda, RZ, Mwita, LA, Masamu, U, Kassim, SK, Gill, Z, Mungloo-Dilmohamud, Z and Wells, G (2023) 7--Genomics data sharing. In Mccormick, J and Pathak, J (eds), Genomic Data Sharing. San Diego, California, United States: Academic Press, pp. 111–135. https://doi.org/10.1016/B978-0-12-819803-2.00003-1.

Laestadius, LI, Rich, JR and Auer, PL (2017) All your data (effectively) belong to us: Data practices among direct-to-consumer genetic testing firms. Genetics in Medicine 19(5), 513–520. https://doi.org/10.1038/gim.2016.136.

Lappalainen, I, Almeida-King, J, Kumanduri, V, Senf, A, Spalding, JD, ur-Rehman, S, Saunders, G, Kandasamy, J, Caccamo, M, Leinonen, R, Vaughan, B, Laurent, T, Rowland, F, Marin-Garcia, P, Barker, J, Jokinen, P, Torres, AC, de Argila, JR, Llobet, OM, Medina, I, Puy, MS, Alberich, M, de la Torre, S, Navarro, A, Paschall, J and Flicek, P (2015) The European genome-phenome archive of human data consented for biomedical research. Nature Genetics 47(7), 692–695. https://doi.org/10.1038/ng.3312.

Lee, H, Kim, W, Kwon, N, Kim, C, Kim, S and An, J-Y (2025) Lessons from national biobank projects utilizing whole-genome sequencing for population-scale genomics. Genomics & Informatics 23(1), 8. https://doi.org/10.1186/s44342-025-00040-9.

Lemke, AA, Esplin, ED, Goldenberg, AJ, Gonzaga-Jauregui, C, Hanchard, NA, Harris-Wai, J, Ideozu, JE, Isasi, R, Landstrom, AP, Prince, AER, Turbitt, E, Sabatello, M, Vergano, SAS, Taylor, MRG, Yu, J-H, Brothers, KB and Garrison, NA (2022) Addressing underrepresentation in genomics research through community engagement. The American Journal of Human Genetics 109(9), 1563–1571. https://doi.org/10.1016/j.ajhg.2022.08.005.

Lentzos, F (2020) How to protect the world from ultra-targeted biological weapons. Bulletin of the Atomic Scientists 76(6), 302–308. https://doi.org/10.1080/00963402.2020.1846412.

Liu, H, Peng, C-G, Wu, Z-Q, Tian, Y-L and Tian, F (2021) A survey of the theories and methods of privacy preserving of genome data. Chinese Journal of Computers 44(7), 1430–1480. CNKI:SUN:JSJX.0.2021-07-009.Google Scholar

Liu, J, Hui, R-T and Song, L (2020) Precision cardiovascular medicine in China. Journal of Geriatric Cardiology 17(10), 638–641. https://doi.org/10.11909/j.issn.1671-5411.2020.10.005.Google Scholar PubMed

Lowe, AL, Urquhart, A, Foreman, LA and Evett, IW (2001) Inferring ethnic origin by means of an STR profile. Forensic Science International 119(1), 17–22. https://doi.org/10.1016/S0379-0738(00)00387-X.CrossRef Google Scholar PubMed

Lowrance, WW and Collins, FS (2007) Identifiability in genomic research. Science 317(5838), 600–602. https://doi.org/10.1126/science.1147699.CrossRef Google Scholar PubMed

Mailman, MD, Feolo, M, Jin, Y, Kimura, M, Tryka, K, Bagoutdinov, R, Hao, L, Kiang, A, Paschall, J, Phan, L, Popova, N, Pretel, S, Ziyabari, L, Lee, M, Shao, Y, Wang, ZY, Sirotkin, K, Ward, M, Kholodov, M, Zbicz, K, Beck, J, Kimelman, M, Shevelev, S, Preuss, D, Yaschenko, E, Graeff, A, Ostell, J, and Sherry, ST. (2007) The NCBI dbGaP database of genotypes and phenotypes. Nature Genetics 39(10), 1181–1186. https://doi.org/10.1038/ng1007-1181.CrossRef Google Scholar PubMed

Majumder, MA, Guerrini, CJ and McGuire, AL (2021) Direct-to-consumer genetic testing: Value and risk. Annual Review of Medicine 72(1), 151–166. https://doi.org/10.1146/annurev-med-070119-114727.CrossRef Google Scholar

Marelli, L, Lievevrouw, E and Van Hoyweghen, I (2020) Fit for purpose? The GDPR and the governance of European digital health. Policy Studies 41(5), 447–467. https://doi.org/10.1080/01442872.2020.1724929.CrossRef Google Scholar

Martins, MF, Murry, LT, Telford, L and Moriarty, F (2022) Direct-to-consumer genetic testing: An updated systematic review of healthcare professionals’ knowledge and views, and ethical and legal concerns. European Journal of Human Genetics 30(12), 12. https://doi.org/10.1038/s41431-022-01205-8.CrossRef Google Scholar PubMed

McGonigle, I (2019) Genomic data and the dividual self. Genetics Research 101, e12. https://doi.org/10.1017/S0016672319000107.CrossRef Google Scholar PubMed

McGonigle, IV (2016) The collective nature of personalized medicine. Genetics Research 98, e3. https://doi.org/10.1017/S0016672315000270.CrossRef Google Scholar PubMed

McGuire, AL and Beskow, LM (2010) Informed consent in genomics and genetic research. Annual Review of Genomics and Human Genetics 11, 361–381. https://doi.org/10.1146/annurev-genom-082509-141711.CrossRef Google Scholar PubMed

McGuire, AL, Diaz, CM, Wang, T and Hilsenbeck, SG (2009) Social networkers’ attitudes toward direct-to-consumer personal genome testing. The American Journal of Bioethics 9(6–7), 3–10. https://doi.org/10.1080/15265160902928209.CrossRef Google Scholar PubMed

Mersha, TB (2024) From Mendel to multi-omics: Shifting paradigms. European Journal of Human Genetics 32(2), 139–142. https://doi.org/10.1038/s41431-023-01420-x.CrossRef Google Scholar PubMed

Migliorini, S (2023) Biometric harm. Law, Technology and Humans 5(2), 238–251. https://doi.org/10.5204/lthj.2830.CrossRef Google Scholar

Molteni, M (2016, December 15) Genos Will Sequence Your Genes—And Help You Sell Them to Science. Wired. Available at https://www.wired.com/2016/12/genos-will-sequence-genes-help-sell-science/.Google Scholar

Murray, TH (2019) Is genetic exceptionalism past its sell-by date? On genomic diaries, context, and content. The American Journal of Bioethics 19(1), 13–15. https://doi.org/10.1080/15265161.2018.1552038.CrossRef Google Scholar PubMed

Myers, CT, Kumar, RD, Pilgram, L, Bonomi, L, Thomas, M, Griffith, OL, Fullerton, SM and Gibbs, RA (2025) Genomic data and privacy. Clinical Chemistry 71(1), 10–17. https://doi.org/10.1093/clinchem/hvae184.CrossRef Google Scholar PubMed

National Indian Health Board (2020) Resolution 20–04: Resolution to call upon the National Institutes of Health to consult with Tribal Nations and establish policies and guidance for Tribal oversight of data on Tribal citizens enrolled in the All of Us Research Program. Available at https://www.nihb.org/wp-content/uploads/2025/01/20-04-NIHB-Resolution.pdf.Google Scholar

Neuwirth, RJ (2013) Essentially oxymoronic concepts. Global Journal of Comparative Law 2(2), 147–166.10.1163/2211906X-00202002CrossRef Google Scholar

Niemiec, E and Howard, HC (2016) Ethical issues in consumer genome sequencing: Use of consumers’ samples and data. Applied & Translational Genomics 8, 23–30. https://doi.org/10.1016/j.atg.2016.01.005.CrossRef Google Scholar

Nunn, JS, Tiller, J, Fransquet, P and Lacaze, P (2019) Public involvement in global genomics research: A scoping review. Frontiers in Public Health 7. https://doi.org/10.3389/fpubh.2019.00079.CrossRef Google Scholar PubMed

Nurk, S, Koren, S, Rhie, A, Rautiainen, M, Bzikadze, AV, Mikheenko, A, Vollger, MR, Altemose, N, Uralsky, L, Gershman, A, Aganezov, S, Hoyt, SJ, Diekhans, M, Logsdon, GA, Alonge, M, Antonarakis, SE, Borchers, M, Bouffard, GG, Brooks, SY, Caldas, GV, Chen, N-C, Cheng, H, Chin, C-S, Chow, W, de Lima, LG, Dishuck, PC, Durbin, R, Dvorkina, T, Fiddes, IT, Formenti, G, Fulton, RS, Fungtammasan, A, Garrison, E, PGS, Grady, Graves-Lindsay, TA, Hall, IM, Hansen, NF, Hartley, GA, Haukness, M, Howe, K, Hunkapiller, MW, Jain, C, Jain, M, Jarvis, ED, Kerpedjiev, P, Kirsche, M, Kolmogorov, M, Korlach, J, Kremitzki, M, Li, H, Maduro, VV, Marschall, T, McCartney, AM, McDaniel, J, Miller, DE, Mullikin, JC, Myers, EW, Olson, ND, Paten, B, Peluso, P, Pevzner, PA, Porubsky, D, Potapova, T, Rogaev, EI, Rosenfeld, JA, Salzberg, SL, Schneider, VA, Sedlazeck, FJ, Shafin, K, Shew, CJ, Shumate, A, Sims, Y, AFA, Smit, Soto, DC, Sović, I, Storer, JM, Streets, A, Sullivan, BA, Thibaud-Nissen, F, Torrance, J, Wagner, J, Walenz, BP, Wenger, A, JMD, Wood, Xiao, C, Yan, SM, Young, AC, Zarate, S, Surti, U, RC, McCoy, Dennis, MY, Alexandrov, IA, Gerton, JL, O’Neill, RJ, Timp, W, Zook, JM, Schatz, MC, Eichler, EE, Miga, KH, and Phillippy, AM. (2022) The complete sequence of a human genome. Science 376(6588), 44–53. https://doi.org/10.1126/science.abj6987.CrossRef Google Scholar PubMed

O’Doherty, KC, Shabani, M, Dove, ES, Bentzen, HB, Borry, P, Burgess, MM, Chalmers, D, De Vries, J, Eckstein, L, Fullerton, SM, Juengst, E, Kato, K, Kaye, J, Knoppers, BM, Koenig, BA, Manson, SM, McGrail, KM, McGuire, AL, Meslin, EM, Nicol, D, Prainsack, B, Terry, SF, Thorogood, A, and Burke, W. (2021) Toward better governance of human genomic data. Nature Genetics 53(1), 2–8. https://doi.org/10.1038/s41588-020-00742-6.CrossRef Google Scholar PubMed

Ohm, P (2009) Broken promises of privacy: Responding to the surprising failure of anonymization. UCLA Law Review 57(6), 1701–1778.Google Scholar

Oliva, A, Kaphle, A, Reguant, R, Sng, LMF, Twine, NA, Malakar, Y, Wickramarachchi, A, Keller, M, Ranbaduge, T, Chan, EKF, Breen, J, Buckberry, S, Guennewig, B, Haas, M, Brown, A, Cowley, MJ, Thorne, N, Jain, Y and Bauer, DC (2024) Future-proofing genomic data and consent management: A comprehensive review of technology innovations. GigaScience, 13, giae021. https://doi.org/10.1093/gigascience/giae021CrossRef Google Scholar PubMed

Ormond, KE, Stanclift, C, Reuter, CM, Carter, JN, Murphy, KE, Lindholm, ME and Wheeler, MT (2025) Researcher views on returning results from multi-omics data to research participants: Insights from the molecular transducers of physical activity consortium (MoTrPAC) study. BMC Medical Ethics 26(1), 22. https://doi.org/10.1186/s12910-025-01174-9.CrossRef Google Scholar PubMed

Painter, C and Bastian, ND (2021) Generating genetic engineering linked indicator datasets for machine learning classifier training in biosecurity. Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications III 11746, 517–522. https://doi.org/10.1117/12.2587844.Google Scholar

Pal, M, Tsegaye, M, Girzaw, F, Bedada, H, Godishala, V and Kandi, V (2017) An overview on biological weapons and bioterrorism. American Journal of Biomedical Research 5(2), 24–34. https://doi.org/10.12691/ajbr-5-2-2.CrossRef Google Scholar

Paltiel, M, Taylor, M and Newson, A (2023) Protection of genomic data and the Australian privacy act: When are genomic data ‘personal information’? International Data Privacy Law 13(1), 47–62. https://doi.org/10.1093/idpl/ipad002.CrossRef Google Scholar

Park, ST and Kim, J (2016) Trends in next-generation sequencing and a new era for whole genome sequencing. International Neurourology Journal 20(Suppl 2), S76–S83. https://doi.org/10.5213/inj.1632742.371.CrossRef Google Scholar

Peng, C, Shao, G and Zheng, W (2022) China’s emerging legal regime for privacy and personal information protection. Tsinghua China Law Review 15(2), 191–221.Google Scholar

Personal Information Protection Law of the People’s Republic of China (2021) Order No. 91 of the President of the People’s Republic of China. Available at http://www.npc.gov.cn/npc/c2/c30834/202108/t20210820_313088.html (accessed 5 March 2025).Google Scholar

Price, WN and Cohen, IG (2019) Privacy in the age of medical big data. Nature Medicine 25(1), 37–43. https://doi.org/10.1038/s41591-018-0272-7.CrossRef Google Scholar PubMed

Purtova, N (2018) The law of everything. Broad concept of personal data and future of EU data protection law. Law, Innovation and Technology 10(1), 40–81. https://doi.org/10.1080/17579961.2018.1452176.CrossRef Google Scholar

Puzis, R, Farbiash, D, Brodt, O, Elovici, Y and Greenbaum, D (2020) Increased cyber-biosecurity for DNA synthesis. Nature Biotechnology 38(12). https://doi.org/10.1038/s41587-020-00761-y.CrossRef Google Scholar PubMed

Qiu, Q (2010, January 14) Thalassemia gene carriers denied government jobs. China Daily. https://english.cctv.com/20100114/102525.shtml.Google Scholar

Quinn, P, Ellyne, E and Yao, C (2024) Will the GDPR restrain health data access bodies under the European health data space (EHDS)? Computer Law & Security Review 54, 105993. https://doi.org/10.1016/j.clsr.2024.105993.CrossRef Google Scholar

Quinn, P and Malgieri, G (2021) The difficulty of defining sensitive data—The concept of sensitive data in the EU data protection framework. German Law Journal 22(8), 1583–1612. https://doi.org/10.1017/glj.2021.79.CrossRef Google Scholar

Rahnasto, J (2023) Genetic data are not always personal—Disaggregating the identifiability and sensitivity of genetic data. Journal of Law and the Biosciences 10(2), lsad029. https://doi.org/10.1093/jlb/lsad029.CrossRef Google Scholar

Raisaro, JL, Choi, G, Pradervand, S, Colsenet, R, Jacquemont, N, Rosat, N, Mooser, V and Hubaux, J-P (2018) Protecting privacy and security of genomic data in i2b2 with homomorphic encryption and differential privacy. IEEE/ACM Transactions on Computational Biology and Bioinformatics 15(5), 1413–1426. https://doi.org/10.1109/TCBB.2018.2854782.CrossRef Google Scholar PubMed

Ram, N, Guerrini, CJ and McGuire, AL (2018) Genealogy databases and the future of criminal investigation. Science 360(6393), 1078–1079. https://doi.org/10.1126/science.aau1083.CrossRef Google Scholar PubMed

Ramos, EM, Din-Lovinescu, C, Bookman, EB, McNeil, LJ, Baker, CC, Godynskiy, G, Harris, EL, Lehner, T, McKeon, C, Moss, J, Starks, VL, Sherry, ST, Manolio, TA and Rodriguez, LL (2013) A mechanism for controlled access to GWAS data: Experience of the GAIN data access committee. The American Journal of Human Genetics 92(4), 479–488. https://doi.org/10.1016/j.ajhg.2012.08.034.CrossRef Google Scholar PubMed

Raz, AE, Niemiec, E, Howard, HC, Sterckx, S, Cockbain, J and Prainsack, B (2020) Transparency, consent and trust in the use of customers’ data by an online genetic testing company: An exploratory survey among 23andMe users. New Genetics and Society 39(4), 459–482. https://doi.org/10.1080/14636778.2020.1755636.CrossRef Google Scholar

Rego, S, Grove, ME, Cho, MK and Ormond, KE (2020) Informed consent in the genomics era. Cold Spring Harbor Perspectives in Medicine 10(8), a036582. https://doi.org/10.1101/cshperspect.a036582.CrossRef Google Scholar PubMed

Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the Protection of Natural Persons with Regard to the Processing of Personal Data and on the Free Movement of Such Data, and Repealing Directive 95/46/EC (General Data Protection Regulation) (2016) OJ L 119. Available at https://eur-lex.europa.eu/eli/reg/2016/679/oj (accessed 5 March 2025).Google Scholar

Regulation (EU) 2022/868 of the European Parliament and of the Council of 30 May 2022 on European Data Governance and Amending Regulation (EU) 2018/1724 (Data Governance Act) (Text with EEA Relevance) (2022) OJ L 152. Available at http://data.europa.eu/eli/reg/2022/868/oj/eng (accessed 5 March 2025).Google Scholar

Regulation (EU) 2023/2854 of the European Parliament and of the Council of 13 December 2023 on Harmonised Rules on Fair Access to and Use of Data and Amending Regulation (EU) 2017/2394 and Directive (EU) 2020/1828 (Data Act) (2023) OJ L, 2023/2854. Available at http://data.europa.eu/eli/reg/2023/2854/oj/eng (accessed 5 March 2025).Google Scholar

Regulation (EU) 2025/327 of the European Parliament and of the Council of 11 February 2025 on the European Health Data Space and Amending Directive 2011/24/EU and Regulation (EU) 2024/2847 (Text with EEA Relevance) (2025) OJ L, 2025/327. Available at http://data.europa.eu/eli/reg/2025/327/oj/eng (accessed 5 March 2025).Google Scholar

Rehm, HL, AJH, Page, Smith, L, Adams, JB, Alterovitz, G, Babb, LJ, Barkley, MP, Baudis, M, MJS, Beauvais, Beck, T, Beckmann, JS, Beltran, S, Bernick, D, Bernier, A, Bonfield, JK, Boughtwood, TF, Bourque, G, Bowers, SR, Brookes, AJ, Brudno, M, Brush, MH, Bujold, D, Burdett, T, Buske, OJ, Cabili, MN, Cameron, DL, Carroll, RJ, Casas-Silva, E, Chakravarty, D, Chaudhari, BP, Chen, SH, Cherry, JM, Chung, J, Cline, M, Clissold, HL, Cook-Deegan, RM, Courtot, M, Cunningham, F, Cupak, M, Davies, RM, Denisko, D, Doerr, MJ, Dolman, LI, Dove, ES, Dursi, LJ, SOM, Dyke, Eddy, JA, Eilbeck, K, Ellrott, KP, Fairley, S, Fakhro, KA, Firth, HV, Fitzsimons, MS, Fiume, M, Flicek, P, Fore, IM, Freeberg, MA, Freimuth, RR, Fromont, LA, Fuerth, J, Gaff, CL, Gan, W, Ghanaim, EM, Glazer, D, Green, RC, Griffith, M, Griffith, OL, Grossman, RL, Groza, T, Auvil, JMG, Guigó, R, Gupta, D, Haendel, MA, Hamosh, A, Hansen, DP, Hart, RK, Hartley, DM, Haussler, D, Hendricks-Sturrup, RM, Ho, CWL, Hobb, AE, Hoffman, MM, Hofmann, OM, Holub, P, Hsu, JS, Hubaux, J-P, Hunt, SE, Husami, A, Jacobsen, JO, Jamuar, SS, Janes, EL, Jeanson, F, Jené, A, Johns, AL, Joly, Y, SJM, Jones, Kanitz, A, Kato, K, Keane, TM, Kekesi-Lafrance, K, Kelleher, J, Kerry, G, Khor, S-S, Knoppers, BM, Konopko, MA, Kosaki, K, Kuba, M, Lawson, J, Leinonen, R, Li, S, Lin, MF, Linden, M, Liu, X, Liyanage, IU, Lopez, J, Lucassen, AM, Lukowski, M, Mann, AL, Marshall, J, Mattioni, M., Metke-Jimenez, A, Middleton, A, Milne, RJ, Molnár-Gábor, F, Mulder, N, Munoz-Torres, MC, Nag, R, Nakagawa, H, Nasir, J, Navarro, A, Nelson, TH, Niewielska, A, Nisselle, A, Niu, J, Nyrönen, TH, O’Connor, BD, Oesterle, S, Ogishima, S, Wang, VO, Paglione, LAD, Palumbo, E, Parkinson, HE, Philippakis, AA, Pizarro, AD, Prlic, A, Rambla, J, Rendon, A, Rider, RA, Robinson, PN, Rodarmer, KW, Rodriguez, LL, Rubin, AF, Rueda, M, Rushton, GA, Ryan, RS, Saunders, GI, Schuilenburg, H, Schwede, T, Scollen, S, Senf, A, Sheffield, NC, Skantharajah, N, Smith, AV, Sofia, HJ, 
Spalding, D, Spurdle, AB, Stark, Z, Stein, LD, Suematsu, M, Tan, P, Tedds, JA, Thomson, AA, Thorogood, A, Tickle, TL, Tokunaga, K, Törnroos, J, Torrents, D, Upchurch, S, Valencia, A, Guimera, RV, Vamathevan, J, Varma, S, Vears, DF, Viner, C, Voisin, C, Wagner, AH, Wallace, SE, Walsh, BP, Williams, MS, Winkler, EC, Wold, BJ, Wood, GM, Woolley, JP, Yamasaki, C, Yates, AD, Yung, CK, Zass, LJ, Zaytseva, K, Zhang, J, Goodhand, P, North, K, and Birney, E. (2021) GA4GH: International policies and standards for data sharing across genomic research and healthcare. Cell Genomics 1(2). https://doi.org/10.1016/j.xgen.2021.100029.CrossRef Google Scholar PubMed

Ristanovic, E (2009) Bioterrorism — Risk and threat: The misuse of science. In Dishovsky, C and Pivovarov, A (eds), Counteraction to Chemical and Biological Terrorism in East European Countries. Dordrecht, Netherlands: Springer, pp. 121–125. https://doi.org/10.1007/978-90-481-2342-1_16.CrossRef Google Scholar

Ritchie, MD, Holzinger, ER, Li, R, Pendergrass, SA and Kim, D (2015) Methods of integrating data to uncover genotype–phenotype interactions. Nature Reviews Genetics 16(2), 85–97. https://doi.org/10.1038/nrg3868.CrossRef Google Scholar PubMed

Rocher, L, Hendrickx, JM and de Montjoye, Y-A (2019) Estimating the success of re-identifications in incomplete datasets using generative models. Nature Communications 10(1). https://doi.org/10.1038/s41467-019-10933-3.CrossRef Google Scholar PubMed

S. and Marper v. the United Kingdom (2008) No. 30562/04, 30566/04 (ECtHR [GC] 4 December 2008). Available at https://hudoc.echr.coe.int/fre?i=001-90051 (accessed 5 March 2025).Google Scholar

Sanger, F, Nicklen, S and Coulson, AR (1977) DNA sequencing with chain-terminating inhibitors. Proceedings of the National Academy of Sciences 74(12), 5463–5467. https://doi.org/10.1073/pnas.74.12.5463.CrossRef Google Scholar PubMed

Satam, H, Joshi, K, Mangrolia, U, Waghoo, S, Zaidi, G, Rawool, S, Thakare, RP, Banday, S, Mishra, AK, Das, G and Malonia, SK (2023) Next-generation sequencing technology: Current trends and advancements. Biology 12(7), 997. https://doi.org/10.3390/biology12070997.CrossRef Google Scholar PubMed

Saunders, G, Baudis, M, Becker, R, Beltran, S, Béroud, C, Birney, E, Brooksbank, C, Brunak, S, Van den Bulcke, M, Drysdale, R, Capella-Gutierrez, S, Flicek, P, Florindi, F, Goodhand, P, Gut, I, Heringa, J, Holub, P, Hooyberghs, J, Juty, N, Keane, TM, Korbel, JO, Lappalainen, I, Leskosek, B, Matthijs, G, Mayrhofer, MT, Metspalu, A, Navarro, A, Newhouse, S, Nyrönen, T, Page, A, Persson, B, Palotie, A, Parkinson, H, Rambla, J, Salgado, D, Steinfelder, E, Swertz, MA, Valencia, A, Varma, S, Blomberg, N, and Scollen, S. (2019) Leveraging European infrastructures to access 1 million human genomes by 2022. Nature Reviews Genetics 20(11), 693–701. https://doi.org/10.1038/s41576-019-0156-9.CrossRef Google Scholar PubMed

Shabani, M and Borry, P (2018) Rules for processing genetic data for research purposes in view of the new EU general data protection regulation. European Journal of Human Genetics 26(2). https://doi.org/10.1038/s41431-017-0045-7.CrossRef Google Scholar

Shriver, MD, Smith, MW, Jin, L, Marcini, A, Akey, JM, Deka, R and Ferrell, RE (1997) Ethnic-affiliation estimation by use of population-specific DNA markers. American Journal of Human Genetics 60(4), 957–964.Google Scholar PubMed

Sirgiovanni, E (2017) Criminal heredity: The influence of Cesare Lombroso’s concept of the “born criminal” on contemporary neurogenetics and its forensic applications. Medicina Nei Secoli: Journal of History of Medicine and Medical Humanities 29(1).Google Scholar

Solove, DJ (2013) Introduction: Privacy self-management and the consent dilemma symposium: Privacy and technology. Harvard Law Review 126(7), 1880–1903.Google Scholar

Solove, DJ (2024) Data is what data does: Regulating based on harm and risk instead of sensitive data. Northwestern University Law Review 118(4), 1081–1138.Google Scholar

Spielman, RS, Bastone, LA, Burdick, JT, Morley, M, Ewens, WJ and Cheung, VG (2007) Common genetic variants account for differences in gene expression among ethnic groups. Nature Genetics 39(2), 226–231. https://doi.org/10.1038/ng1955.CrossRef Google Scholar PubMed

Stark, J (2022) Product lifecycle management (PLM). In Stark, J (ed), Product Lifecycle Management (Volume 1): 21st Century Paradigm for Product Realisation. Cham, Switzerland: Springer International Publishing, pp. 1–32. https://doi.org/10.1007/978-3-030-98578-3_1.CrossRef Google Scholar

Staunton, C, Slokenberga, S and Mascalzoni, D (2019) The GDPR and the research exemption: Considerations on the necessary safeguards for research biobanks. European Journal of Human Genetics 27(8), 1159–1167. https://doi.org/10.1038/s41431-019-0386-5.CrossRef Google Scholar PubMed

The All of Us Research Program Investigators (2019) The “all of us” research program. New England Journal of Medicine 381(7), 668–676. https://doi.org/10.1056/NEJMsr1809937.CrossRef Google Scholar

Tigard, D (2019) Changing the mindset for precision medicine: From incentivized biobanking models to genomic data. Genetics Research 101, e10. https://doi.org/10.1017/S0016672319000077.CrossRef Google Scholar PubMed

Tramèr, F, Huang, Z, Hubaux, J-P and Ayday, E (2015) Differential privacy with bounded priors: Reconciling utility and privacy in genome-wide association studies. In Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security. New York, New York, United States: Association for Computing Machinery, pp. 1286–1297. https://doi.org/10.1145/2810103.2813610CrossRef Google Scholar

Tsosie, KS, Yracheta, JM and Dickenson, D (2019) Overvaluing individual consent ignores risks to tribal participants. Nature Reviews Genetics 20(9), 497–498. https://doi.org/10.1038/s41576-019-0161-z.CrossRef Google Scholar PubMed

Unger Avila, P, Padvitski, T, Leote, AC, Chen, H, Saez-Rodriguez, J, Kann, M and Beyer, A (2024) Gene regulatory networks in disease and ageing. Nature Reviews Nephrology 20(9), 616–633. https://doi.org/10.1038/s41581-024-00849-7.CrossRef Google Scholar PubMed

United Nations Declaration on the Rights of Indigenous Peoples (2007) Available at https://www.un.org/esa/socdev/unpfii/documents/DRIPS_en.pdf (accessed 5 March 2025).Google Scholar

Urbina, F, Lentzos, F, Invernizzi, C and Ekins, S (2022) Dual use of artificial-intelligence-powered drug discovery. Nature Machine Intelligence 4(3), 189–191. https://doi.org/10.1038/s42256-022-00465-9.CrossRef Google Scholar PubMed

Wall, JD, Stawiski, EW, Ratan, A, Kim, HL, Kim, C, Gupta, R, Suryamohan, K, Gusareva, ES, Purbojati, RW, Bhangale, T, Stepanov, V, Kharkov, V, Schröder, MS, Ramprasad, V, Tom, J, Durinck, S, Bei, Q, Li, J, Guillory, J, Phalke, S, Basu, A, Stinson, J, Nair, S, Malaichamy, S, Biswas, NK, Chambers, JC, Cheng, KC, George, JT, Khor, SS, Kim, J-I, Cho, B, Menon, R, Sattibabu, T, Bassi, A, Deshmukh, M, Verma, A, Gopalan, V, Shin, J-Y, Pratapneni, M, Santhosh, S, Tokunaga, K, Md-Zain, BM, Chan, KG, Parani, M, Natarajan, P, Hauser, M, Allingham, RR, Santiago-Turla, C, Ghosh, A, Gadde, SGK, Fuchsberger, C, Forer, L, Schoenherr, S, Sudoyo, H, Lansing, JS, Friedlaender, J, Koki, G, Cox, MP, Hammer, M, Karafet, T, Ang, KC, Mehdi, SQ, Radha, V, Mohan, V, Majumder, PP, Seshagiri, S, Seo, J-S, Schuster, SC, Peterson, AS, and GenomeAsia100K Consortium. (2019) The GenomeAsia 100K project enables genetic discoveries across Asia. Nature 576(7785), 106–111. https://doi.org/10.1038/s41586-019-1793-z.Google Scholar

Walton, NA, Johnson, DK, Person, TN and Chamala, S (2019) Genomic data in the electronic health record. Advances in Molecular Pathology 2(1), 21–33. https://doi.org/10.1016/j.yamp.2019.07.001.CrossRef Google Scholar

Wan, Z, Hazel, JW, Clayton, EW, Vorobeychik, Y, Kantarcioglu, M and Malin, BA (2022) Sociotechnical safeguards for genomic data privacy. Nature Reviews Genetics 23(7), 429–445. https://doi.org/10.1038/s41576-022-00455-y.CrossRef Google Scholar PubMed

Wang, G and Liu, Y (2025) Regulatory failures and improvement pathways for sharing personal genome data in China. Science and Technology Management Research 45(06), 217–225. https://doi.org/10.3969/j.issn.1000-7695.2025.6.023.Google Scholar

Wang, L (2013) Legal protection of personal information: Centered on the line between personal information and privacy. Modern Law Science 35(4), 62–72. https://doi.org/10.3969/j.issn.1001-2397.2013.04.08.Google Scholar

Wang, X, Mina, T, Sadhu, N, Jain, PR, Ng, HK, Low, DY, Tay, D, Tong, TYY, Choo, W-L, Kerk, SK, Low, GL, Team, THS, Lam, BCC, Dalan, R, Wanseicheong, G, Yew, YW, Leow, E-J, Brage, S, Michelotti, GA, Wong, KE, Sheridan, PA, Yan, LP, Xuan, YZ, Bertin, N, Bellis, C, Hebrard, M, Goy, P-A, Tsilidis, K, Sanikini, H, Li, GX, Han, LT, Lee, L, Best, JD, Tan, P, Elliott, P, Sing, LE, Lee, J, Ngeow, J, Riboli, E, Lam, M, Loh, M and Chambers, JC (2024a) The Health for Life in Singapore (HELIOS) study: Delivering precision medicine research for Asian populations. medRxiv. https://doi.org/10.1101/2024.05.14.24307259CrossRef Google Scholar

Wang, Z, Wang, M and Du, L (2024b) Public perceptions of international genetic information sharing for biomedical research in China: A case study of the social media debate on the article “a Pangenome reference of 36 Chinese populations” published in nature. Human Genomics 18(1), 86. https://doi.org/10.1186/s40246-024-00650-4.CrossRef Google Scholar

Wojcik, MH, Lemire, G, Berger, E, Zaki, MS, Wissmann, M, Win, W, White, SM, Weisburd, B, Wieczorek, D, Waddell, LB, Verboon, JM, VanNoy, GE, Töpf, A, Tan, TY, Syrbe, S, Strehlow, V, Straub, V, Stenton, SL, Snow, H, Singer-Berk, M, Silver, J, Shril, S, Seaby, EG, Schneider, R, Sankaran, VG, Sanchis-Juan, A, Russell, KA, Reinson, K, Ravenscroft, G, Radtke, M, Popp, D, Polster, T, Platzer, K, Pierce, EA, Place, EM, Pajusalu, S, Pais, L, Õunap, K, Osei-Owusu, I, Opperman, H, Okur, V, Oja, KT, O’Leary, M, O’Heir, E, Morel, CF, Merkenschlager, A, Marchant, RG, Mangilog, BE, Madden, JA, MacArthur, D, Lovgren, A, Lerner-Ellis, JP, Lin, J, Laing, N, Hildebrandt, F, Hentschel, J, Groopman, E, Goodrich, J, Gleeson, JG, Ghaoui, R, Genetti, CA, Gburek-Augustat, J, Gazda, HT, Ganesh, VS, Ganapathi, M, Gallacher, L, Fu, JM, Evangelista, E, England, E, Donkervoort, S, DiTroia, S, Cooper, ST, Chung, WK, Christodoulou, J, Chao, KR, Cato, LD, Bujakowska, KM, Bryen, SJ, Brand, H, Bönnemann, CG, Beggs, AH, Baxter, SM, Bartolomaeus, T, Agrawal, PB, Talkowski, M, Austin-Tse, C, Jamra, RA, Rehm, HL, and O’Donnell-Luria, A. (2024) Genome sequencing for diagnosing rare diseases. New England Journal of Medicine 390(21), 1985–1997. https://doi.org/10.1056/NEJMoa2314761.CrossRef Google Scholar PubMed

Zhang, X (2015) From privacy to personal information: Theoretical and institutional arrangements for rebalancing interests. China Legal Science 3, 38–59. https://doi.org/10.14111/j.cnki.zgfx.2015.03.003.Google Scholar
