We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings.
To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
The GABA type A receptor (GABAAR) belongs to the family of pentameric ligand-gated ion channels and plays a key role in inhibition in adult mammalian brains. Dysfunction of this macromolecule may lead to epilepsy, anxiety disorders, autism, depression, and schizophrenia. GABAAR is also a target for multiple physiologically and clinically relevant modulators, such as benzodiazepines (BDZs), general anesthetics, and neurosteroids. The first GABAAR structure appeared in 2014, but the past years have brought a particularly abundant surge in structural data for these receptors with various ligands and modulators. Although the open conformation remains elusive, this novel information has pushed the structure–function studies to an unprecedented level. Electrophysiology, mutagenesis, photolabeling, and in silico simulations, guided by novel structural information, shed new light on the molecular mechanisms of receptor functioning. The main goal of this review is to present the current knowledge of GABAAR functional and structural properties. The review begins with an outline of the functional and structural studies of GABAAR, accompanied by some methodological considerations, especially biophysical methods, enabling the reader to follow how major breakthroughs in characterizing GABAAR features have been achieved. The main section provides a comprehensive analysis of the functional significance of specific structural elements in GABAARs. We additionally summarize the current knowledge on the binding sites for major GABAAR modulators, referring to the molecular underpinnings of their action. The final chapter of the review moves beyond examining GABAAR as an isolated macromolecule and describes the interactions of the receptor with other proteins in a broader context of inhibitory plasticity. In the final section, we propose a general conclusion that agonist binding to the orthosteric binding sites appears to rely on local interactions, whereas conformational transitions of bound macromolecule (gating) and allosteric modulation seem to reflect more global phenomena involving vast portions of the macromolecule.
Toxoplasmosis is a significant public health concern with limited therapeutic options. The medicines for malaria venture (MMV) developed the pandemic response box (PRB) containing 400 drug-like molecules with broad pathogen activity. The aim of this work is to evaluate PRB compounds for their anti-Toxoplasma gondii activity and identify promising candidates for further evaluation. Screening identified 42 selective compounds with half effective concentration (EC50) ranging from 2.4 to 913.1 nm and half cytotoxic concentration (CC50) ranging from 6 μm to >50 μm. Selectivity index (SI) values (CC50/EC50) ranged from 11 to 17 708. Based on its in silico and in vitro profile and its commercial availability, RWJ-67657 was selected for further studies. Molecular docking analysis showed RWJ-67657 is predicted to bind to T. gondii p38 mitogen-activated protein kinase (TgMAPK). Oral administration of RWJ-67657 (20 mg kg day−1/10 days) significantly reduced parasite burden in chronically infected mice compared to mock-treated group (P < 0.01). These findings highlight the PRB as a promising source for anti-T. gondii compounds, with several showing favourable drug properties, including MMV1634492, MMV002731, MMV1634491, MMV1581551, MMV011565, MMV1581558, MMV1578577, MMV233495 and MMV1580482, firstly described here as anti-T. gondii agents. RWJ-67657 emerges as a valuable drug candidate for experimental chronic cerebral toxoplasmosis therapy.
Knowledge graphs have become a common approach for knowledge representation. Yet, the application of graph methodology is elusive due to the sheer number and complexity of knowledge sources. In addition, semantic incompatibilities hinder efforts to harmonize and integrate across these diverse sources. As part of The Biomedical Translator Consortium, we have developed a knowledge graph–based question-answering system designed to augment human reasoning and accelerate translational scientific discovery: the Translator system. We have applied the Translator system to answer biomedical questions in the context of a broad array of diseases and syndromes, including Fanconi anemia, primary ciliary dyskinesia, multiple sclerosis, and others. A variety of collaborative approaches have been used to research and develop the Translator system. One recent approach involved the establishment of a monthly “Question-of-the-Month (QotM) Challenge” series. Herein, we describe the structure of the QotM Challenge; the six challenges that have been conducted to date on drug-induced liver injury, cannabidiol toxicity, coronavirus infection, diabetes, psoriatic arthritis, and ATP1A3-related phenotypes; the scientific insights that have been gleaned during the challenges; and the technical issues that were identified over the course of the challenges and that can now be addressed to foster further development of the prototype Translator system. We close with a discussion on Large Language Models such as ChatGPT and highlight differences between those models and the Translator system.
In the years following FDA approval of direct-to-consumer, genetic-health-risk/DTCGHR testing, millions of people in the US have sent their DNA to companies to receive personal genome health risk information without physician or other learned medical professional involvement. In Personal Genome Medicine, Michael J. Malinowski examines the ethical, legal, and social implications of this development. Drawing from the past and present of medicine in the US, Malinowski applies law, policy, public and private sector practices, and governing norms to analyze the commercial personal genome sequencing and testing sectors and to assess their impact on the future of US medicine. Written in relatable and accessible language, the book also proposes regulatory reforms for government and medical professionals that will enable technological advancements while maintaining personal and public health standards.
We tested the ability of our natural language processing (NLP) algorithm to identify delirium episodes in a large-scale study using real-world clinical notes.
Methods:
We used the Rochester Epidemiology Project to identify persons ≥ 65 years who were hospitalized between 2011 and 2017. We identified all persons with an International Classification of Diseases code for delirium within ±14 days of a hospitalization. We independently applied our NLP algorithm to all clinical notes for this same population. We calculated rates using number of delirium episodes as the numerator and number of hospitalizations as the denominator. Rates were estimated overall, by demographic characteristics, and by year of episode, and differences were tested using Poisson regression.
Results:
In total, 14,255 persons had 37,554 hospitalizations between 2011 and 2017. The code-based delirium rate was 3.02 per 100 hospitalizations (95% CI: 2.85, 3.20). The NLP-based rate was 7.36 per 100 (95% CI: 7.09, 7.64). Rates increased with age (both p < 0.0001). Code-based rates were higher in men compared to women (p = 0.03), but NLP-based rates were similar by sex (p = 0.89). Code-based rates were similar by race and ethnicity, but NLP-based rates were higher in the White population compared to the Black and Asian populations (p = 0.001). Both types of rates increased significantly over time (both p values < 0.001).
Conclusions:
The NLP algorithm identified more delirium episodes compared to the ICD code method. However, NLP may still underestimate delirium cases because of limitations in real-world clinical notes, including incomplete documentation, practice changes over time, and missing clinical notes in some time periods.
Genomic epidemiology is routinely used worldwide to interrogate infectious disease dynamics. Multiple computational tools exist that reconstruct transmission networks by coupling genomic data with epidemiological models. Resulting inferences can improve our understanding of pathogen transmission dynamics, and yet the performance of these tools has not been evaluated for tuberculosis (TB), a disease process with complex epidemiology including variable latency and within-host heterogeneity. Here, we performed a systematic comparison of six publicly available transmission reconstruction models, evaluating their accuracy when predicting transmission events in simulated and real-world Mycobacterium tuberculosis outbreaks. We observed variability in the number of transmission links that were predicted with high probability (P ≥ 0.5) and low accuracy of these predictions against known transmission in simulated outbreaks. We also found a low proportion of epidemiologically supported case–contact pairs were identified in our real-world TB clusters. The specificity of all models was high, and a relatively high proportion of the total transmission events predicted by some models were true links, notably with TransPhylo, Outbreaker2, and Phybreak. Our findings may inform the choice of tools in TB transmission analyses and underscore the need for caution when interpreting transmission networks produced using probabilistic approaches.
Precision Medicine is an emerging approach for disease treatment and prevention that takes into account individual variability in genes, environment, and lifestyle. Autoimmune diseases are those in which the body’s natural defense system loses discriminating power between its own cells and foreign cells, causing the body to mistakenly attack healthy tissues. These conditions are very heterogeneous in their presentation and therefore difficult to diagnose and treat. Achieving precision medicine in autoimmune diseases has been challenging due to the complex etiologies of these conditions, involving an interplay between genetic, epigenetic, and environmental factors. However, recent technological and computational advances in molecular profiling have helped identify patient subtypes and molecular pathways which can be used to improve diagnostics and therapeutics. This review discusses the current understanding of the disease mechanisms, heterogeneity, and pathogenic autoantigens in autoimmune diseases gained from genomic and transcriptomic studies and highlights how these findings can be applied to better understand disease heterogeneity in the context of disease diagnostics and therapeutics.
Bioinformatics is discussed in Chapter 11. The complex nature of the subject and its interaction with other disciplines are outlined, and the inter-dependence of bioinformatics, the development of computer hardware and the internet is stressed. The nature and range of biological databases are outlined, from the inception of nucleic acid databases in the 1970s to the present breadth of primary and secondary databases that are repositories for information on nucleic acid and protein sequences, interactions between cellular components, biochemical pathways, pharmacological targets and many other data sets derived from existing information. Genome sequence databases are used to illustrate the tools needed to assemble, collate, annotate and interrogate the data, and the impact of bioinformatics in enabling experiments and protocols to to be conducted in silico is discussed.
To introduce the subject, the history of genetics since Mendel’s work which was rediscovered in 1900 is outlined. The discovery of the structure of DNA in 1953 marked the start of the molecular genetics era. When restriction enzymes and DNA ligase were discovered, DNA fragments could be cut and joined, with the first recombinant DNA molecules generated in 1972. Rapid methods for sequencing DNA were developed in the late 1970s and eventually were improved to the level needed to enable the Human Genome Project to be undertaken. The completion of this in 2003 marked the start of the ‘post-genomic era’ that led to further development of the technology and a reduction in time and cost of genome sequencing. We are now firmly in the post-genomic era, where DNA technology is having a major impact in areas such as transgenic plants and animals, genome editing, diagnosis and treatment of disease, forensic analysis and personalised medicine.
Genomic studies have elucidated some molecular underpinnings for adaptations during the early history of snakes, but studies of dietary adaptations remain sparse. Snakes differ from most other squamates by tending towards diets of vertebrate prey (carnivory), whereas arthropods are common in diets of most other squamates (insectivory). To test whether a shift from insectivory to carnivory occurred early in snake history, I examined chitinase genes (CHIAs) in 19 squamates. Previous studies on mammals found that contraction in the number of CHIAs, which enzymatically digest arthropod chitinous exoskeletons, correlates with transitions from insectivory to carnivory or herbivory. I found evidence that CHIAs have a long history in Squamata, with at least seven paralogs inferred in their last common ancestor. Retention of these CHIAs seems to be commonplace for arthropod-eating squamates, but snakes likely lost six CHIAs between diverging from other toxicoferans and the origin of afrophidian snakes. This genomic signal corresponds with an inferred major shift towards carnivory during the origin and evolution of early snakes, which may have contributed to their successful radiation.
Tetraspanins are a superfamily of transmembrane proteins that in flatworms have structural roles in the development, maturation or stability of the tegument. Several tetraspanins are considered as potential candidates for vaccines or drugs against helminths. Monopisthocotylean monogeneans are ectoparasites of fish that are health hazards for farmed fish. The aim of this study was to identify in silico putative tetraspanins in the genomic datasets of four monopisthocotylean species. The analysis predicted and classified 40 tetraspanins in Rhabdosynochus viridisi, 39 in Scutogyrus longicornis, 22 in Gyrodactylus salaris and 13 in Neobenedenia melleni, belonging to 13 orthologous groups. The high divergence of tetraspanins made it difficult to annotate their function. However, a conserved group was identified in different metazoan taxa. According to this study, metazoan tetraspanins can be divided into 17 monophyletic groups. Of the 114 monogenean tetraspanins, only seven were phylogenetically close to tetraspanins from non-platyhelminth metazoans, which suggests that this group of proteins shows rapid sequence divergence. The similarity of the monopisthocotylean tetraspanins was highest with trematodes, followed by cestodes and then free-living platyhelminths. In total, 27 monopisthocotylean-specific and 34 flatworm-specific tetraspanins were identified. Four monogenean tetraspanins were orthologous to TSP-1, which is a candidate for the development of vaccines and a potential pharmacological target in trematodes and cestodes. Although studies of tetraspanins in parasitic flatworms are scarce, this is an interesting group of proteins for the development of new methods to control monogeneans.
Maize is one of the three staple foods in the world. The white variety represents 60% of the maize importation with a world consumption of 1125 million tons in 2019/2020. Currently, new technologies could contribute to the analysis of this seed, supporting quality control and improvement. This study aims to carry out the morphological and proteomic comparison between the hybrid MR2008 and its parental lines LUG03 and CML491 through mass spectrometry and bioinformatics analysis. Herein, we identified that 34.8% of the hybrid proteome differs from the parental proteome. Also, ontological and morphological analyses determined that the hybrid exhibits more characteristics related to CML491 than LUG03, for example, metabolic pathways and enzymes, such as anthocyanidin 3-O-glucosyltransferase (UniProt P16166). This analysis allowed the identification of dominant characters, metabolic pathways and confirms the utility of this methodology in agricultural practices, mainly in processes of selection and quality control of a crop.
Echinococcus granulosus sensu lato (s.l.) is a zoonotic parasite that causes cystic echinococcosis (CE) in humans. However, E. granulosus sensu stricto (s.s.) is considered the predominant species in CE infections worldwide. According to the population genetic diversity and structure of E. granulosus s.l., gene flow can explain the parasite drift among the neighbouring countries of Pakistan. The mitochondrial (mt) co1 (n = 47), nadh1 (n = 37) and cytb (n = 35) nucleotide sequences of E. granulosus s.l. isolates from Pakistan, Iran, China and India were retrieved from the National Centre for Biotechnology Information database to determine the genealogical relationships. The sequences were grouped as the mt-co1 (genotypes G1 and G3, G6-G7), mt-cytb (genotypes G1 and G3), and mt-nadh1(genotypes G1 and G3). The data were analysed using bioinformatic tools. A total of 19 polymorphic sites for the mt-co1 sequence (374 bp) were observed of which 31.6% (6/19) were parsimony-informative sites. Unique singleton haplotypes within the E. granulosus s.s. haplotype network based on the mt-co1 gene were highly prevalent (68.4%; 13/19) in Pakistani isolates followed by Chinese, Indian and Iranian isolates; four polymorphic sites were detected in the E. canadensis (G6/G7). In E. canadensis mt-co1 haplotype network, 75% (3/4) unique singleton haplotypes were from the Iranian isolates. Twelve polymorphic sites were found using the mt-cytb sequence (547 bp); 25% (3/12) were parsimony-informative and there were 66.7% (8/12) unique singleton haplotypes within the mt-cytb haplotype network in E. granulosus s.s. with the most reported from Pakistan followed by Iran and China. 20 polymorphic sites were detected in E. granulosus s.s. mt-nadh1 sequences (743 bp); 20% (4/20) were parsimony-informative. There were 66.7% (8/12) main single haplotypes within the mt-nadh1 haplotype network, with the most reported from Pakistan followed by that from India, Iran and China. The sequence analyses show low nucleotide diversity and high haplotype diversity in general.
Kinetoplastid parasites are responsible for both human and animal diseases across the globe where they have a great impact on health and economic well-being. Many species and life cycle stages are difficult to study due to limitations in isolation and culture, as well as to their existence as heterogeneous populations in hosts and vectors. Single-cell transcriptomics (scRNA-seq) has the capacity to overcome many of these difficulties, and can be leveraged to disentangle heterogeneous populations, highlight genes crucial for propagation through the life cycle, and enable detailed analysis of host–parasite interactions. Here, we provide a review of studies that have applied scRNA-seq to protozoan parasites so far. In addition, we provide an overview of sample preparation and technology choice considerations when planning scRNA-seq experiments, as well as challenges faced when analysing the large amounts of data generated. Finally, we highlight areas of kinetoplastid research that could benefit from scRNA-seq technologies.
The Expanded Program for Immunization Consortium – Human Immunology Project Consortium study aims to employ systems biology to identify and characterize vaccine-induced biomarkers that predict immunogenicity in newborns. Key to this effort is the establishment of the Data Management Core (DMC) to provide reliable data and bioinformatic infrastructure for centralized curation, storage, and analysis of multiple de-identified “omic” datasets. The DMC established a cloud-based architecture using Amazon Web Services to track, store, and share data according to National Institutes of Health standards. The DMC tracks biological samples during collection, shipping, and processing while capturing sample metadata and associated clinical data. Multi-omic datasets are stored in access-controlled Amazon Simple Storage Service (S3) for data security and file version control. All data undergo quality control processes at the generating site followed by DMC validation for quality assurance. The DMC maintains a controlled computing environment for data analysis and integration. Upon publication, the DMC deposits finalized datasets to public repositories. The DMC architecture provides resources and scientific expertise to accelerate translational discovery. Robust operations allow rapid sharing of results across the project team. Maintenance of data quality standards and public data deposition will further benefit the scientific community.
Africa plays a central importance role in the human origins, and disease susceptibility, agriculture and biodiversity conservation. Nigeria as the most populous and most diverse country in Africa, owing to its 250 ethnic groups and over 500 different native languages is imperative to any global genomic initiative. The newly inaugurated Nigerian Bioinformatics and Genomics Network (NBGN) becomes necessary to facilitate research collaborative activities and foster opportunities for skills’ development amongst Nigerian bioinformatics and genomics investigators. NBGN aims to advance and sustain the fields of genomics and bioinformatics in Nigeria by serving as a vehicle to foster collaboration, provision of new opportunities for interactions between various interdisciplinary subfields of genomics, computational biology and bioinformatics as this will provide opportunities for early career researchers. To provide the foundation for sustainable collaborations, the network organises conferences, workshops, trainings and create opportunities for collaborative research studies and internships, recognise excellence, openly share information and create opportunities for more Nigerians to develop the necessary skills to exceed in genomics and bioinformatics. NBGN currently has attracted more than 650 members around the world. Research collaborations between Nigeria, Africa and the West will grow and all stakeholders, including funding partners, African scientists, researchers across the globe, physicians and patients will be the eventual winners. The exponential membership growth and diversity of research interests of NBGN just within weeks of its establishment and the unanticipated attendance of its activities suggest the significant importance of the network to bioinformatics and genomics research in Nigeria.
In clinical and translational research, data science is often and fortuitously integrated with data collection. This contrasts to the typical position of data scientists in other settings, where they are isolated from data collectors. Because of this, effective use of data science techniques to resolve translational questions requires innovation in the organization and management of these data.
Methods:
We propose an operational framework that respects this important difference in how research teams are organized. To maximize the accuracy and speed of the clinical and translational data science enterprise under this framework, we define a set of eight best practices for data management.
Results:
In our own work at the University of Rochester, we have strived to utilize these practices in a customized version of the open source LabKey platform for integrated data management and collaboration. We have applied this platform to cohorts that longitudinally track multidomain data from over 3000 subjects.
Conclusions:
We argue that this has made analytical datasets more readily available and lowered the bar to interdisciplinary collaboration, enabling a team-based data science that is unique to the clinical and translational setting.
The storage root of alligatorweed [Alternanthera philoxeroides (Mart.) Griseb.] growing in terrestrial habitats is an important metamorphic organ for its propagation, overwintering, and spread. However, the regulatory mechanism adventitious root expansion to form storage roots is still unclear. To reveal the changes accompanying the root-swelling process, we quantified sugar, soluble protein, and phytohormone content in adventitious and storage roots. Results demonstrated that sucrose, fructose, and soluble protein increased in storage roots, whereas abscisic acid (ABA), indoleacetic acid (IAA), brassinosteroid (BR), gibberellin, jasmonic acid, and cytokinin (trans-zeatin [tZ] and isopentenyladenine [iP] and the corresponding ribosides tZR and iPR). tZ-type (tZR and tZ) content decreased, suggesting the involvement of sugars and hormones in the formation of storage roots. To further reveal the molecular basis of A. philoxeroides’s ability to form storage roots and provide candidate genes for molecular function analyses, we assembled a de novo transcriptome of A. philoxeroides based on four sets of RNA-sequencing data. According to functional annotation and expression profiling, 42 unigenes involved in sucrose synthesis and hydrolysis were identified, in addition to 70, 58, and 78 unigenes in ABA, BR, and IAA signal transduction, respectively. The quantitative reverse transcriptase polymerase chain reaction analysis revealed 21 unigenes involved in sugar metabolism and hormone signal transduction were differentially expressed during the formation of storage roots. These results revealed metabolic changes during the formation of storage roots and provide candidate genes involved in sugar and phytohormone metabolism in A. philoxeroides.
Two major outstanding questions in microbiome research ask what microbes are present in a community and how they interact with each other and their hosts. Recent, rapid improvements in nucleic acid (DNA and RNA) sequencing allow us to study the composition and function of microbiomes in unprecedented detail, leading to a step change in our understanding of host–microbe interactions. This chapter gives a broad overview of the basic toolkit available to modern microbiologists and microbial ecologists, exploring their application to key questions about microbiome structure and function. We cover tools based on nucleic acid sequencing (e.g. amplicon sequencing, metagenomics, metatranscriptomics) as well as approaches targeting larger molecules such as metabolomics and proteomics. We discuss the use of microbial culture as a means of measuring functional capacity of individual microbes, or building artificial communities to understand emergent properties of consortia. We emphasise the advantages of combining multiple techniques alongside robust experimental design to garner powerful quantitative estimates of microbiome structure, and how this relates to host–microbe interactions.
Through a long history of co-evolution, multicellular organisms form a complex of host cells plus many associated microorganism species. Consisting of algae, bacteria, archaea, fungi, protists and viruses, and collectively referred to as the microbiome, these microorganisms contribute to a range of important functions in their hosts, from nutrition, to behaviour and disease susceptibility. In this book, a diverse and international group of active researchers outline how multicellular organisms have become reliant on their microbiomes to function, and explore this vital interdependence across the breadth of soil, plant, animal and human hosts. They draw parallels and contrasts across hosts in different environments, and discuss how this invisible microbial ecosystem influences everything from the food we eat, to our health, to the correct functioning of ecosystems we depend on. This insightful read also pertinently encourages students and researchers in microbial ecology, ecology, and microbiology to consider how this interdependence may be key to mitigating environmental changes and developing microbial biotechnology to improve life on Earth.