Host gene expression in the Nasopharynx can discriminate microbiologically confirmed viral and bacterial lower respiratory tract infection

L. Gayani Tillekeratne; Nicholas O’Grady; Maria D. Iglesias-Ussel; Jack Anderson; Alana Brown; Armstrong Obale; Christina Nix; Champica K. Bodinayake; Ajith Nagahawatte; Robert Rolfe; E. Wilbur Woodhouse; Gaya B. Wijayaratne; Senali Weerasinghe; U.H.B.Y. Dilshan; Jayani Gamage; Ruvini Kurukulasooriya; Madureka Premamali; Himali S. Jayasinghearachchi; Bradly P. Nicholson; Emily R. Ko; Ephraim L. Tsalik; Micah T. McClain; Rachel A. Myers; Christopher W. Woods; Thomas W. Burke

doi:10.1017/cts.2025.10191

Published online by Cambridge University Press: 29 October 2025

L. Gayani Tillekeratne*: Duke University School of Medicine, Durham, NC, USA; Duke Global Health Institute, Durham, NC, USA; Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka; Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Nicholas O’Grady: Duke University School of Medicine, Durham, NC, USA
Maria D. Iglesias-Ussel: Duke University School of Medicine, Durham, NC, USA; Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Jack Anderson: Duke University School of Medicine, Durham, NC, USA; Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Alana Brown: Duke University School of Medicine, Durham, NC, USA
Armstrong Obale: Duke Global Health Institute, Durham, NC, USA; Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Christina Nix: Duke University School of Medicine, Durham, NC, USA
Champica K. Bodinayake: Duke Global Health Institute, Durham, NC, USA; Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka; Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Ajith Nagahawatte: Duke Global Health Institute, Durham, NC, USA; Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka; Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Robert Rolfe: Duke University School of Medicine, Durham, NC, USA; Duke Global Health Institute, Durham, NC, USA
E. Wilbur Woodhouse: Duke University School of Medicine, Durham, NC, USA
Gaya B. Wijayaratne: Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka; Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Senali Weerasinghe: Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
U.H.B.Y. Dilshan: Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Jayani Gamage: Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Ruvini Kurukulasooriya: Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Madureka Premamali: Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Himali S. Jayasinghearachchi: Faculty of Medicine, General Sir John Kotelawala Defence University, Ratmalana, Sri Lanka
Bradly P. Nicholson: Institute for Medical Research, Durham Veterans Affairs Medical Center, Durham, NC, USA
Emily R. Ko: Duke University School of Medicine, Durham, NC, USA
Ephraim L. Tsalik: Duke University School of Medicine, Durham, NC, USA; Danaher Corporation, Washington, DC, USA
Micah T. McClain: Duke University School of Medicine, Durham, NC, USA
Rachel A. Myers: Duke University School of Medicine, Durham, NC, USA
Christopher W. Woods: Duke University School of Medicine, Durham, NC, USA; Duke Global Health Institute, Durham, NC, USA; Duke-Ruhuna Collaborative Research Centre, Faculty of Medicine, University of Ruhuna, Karapitiya, Galle, Sri Lanka
Thomas W. Burke: Duke University School of Medicine, Durham, NC, USA
*: Corresponding author: L. G. Tillekeratne; Email: gayani.tillekeratne@duke.edu

Abstract

Introduction:

Distinguishing viral versus bacterial lower respiratory tract infection (LRTI) is challenging. We previously developed a rapid, host response-based test (Biomeme HR-B/V assay) using peripheral blood samples to identify viral versus bacterial infection. We assessed the performance of this assay when using nasopharyngeal (NP) samples.

Methods:

Patients with LRTI were enrolled, and a NP swab sample was run using the HR-B/V assay (assessing 24 gene targets) on the Franklin™ platform. The performance of the prior classifier at identifying viral versus bacterial infection was assessed. A novel predictive model was generated for NP samples using the same 24 targets. Results were validated using external datasets with nasal/NP RNA sequence data.

Results:

Nineteen patients with LRTI (median age 62 years, 52.1% male; 12 viral, 7 bacterial) were included. When the prior HR-B/V classifier was applied to their NP samples, the area under the receiver operating characteristic curve (AUC) for viral versus bacterial infection was 0.786 (95% CI 0.524–1), with accuracy 0.79 (95% CI 0.57–0.91), positive percent agreement (PPA) 0.43 (95% CI 0.16–0.75), and negative percent agreement (NPA) 1.00 (95% CI 0.76–1). The novel model had AUC 0.881 (95% CI 0.726–1), accuracy 0.84 (95% CI 0.62–0.94), PPA 0.86 (95% CI 0.49–0.97), and NPA 0.83 (95% CI 0.55–0.95) for bacterial infection. Validation in two external datasets showed AUC of 0.932 (95% CI 0.90–0.96) and 0.915 (95% CI 0.88–0.95).

Conclusions:

We show that host response in the nasopharynx can distinguish viral versus bacterial LRTI. These findings need to be replicated in larger cohorts with diverse LRTI etiologies.

Keywords

Lower respiratory tract infection; host response; antimicrobial stewardship; nasopharynx; rapid diagnostic

Information

Type: Research Article
Information: Journal of Clinical and Translational Science, Volume 9, Issue 1, 2025, e257

DOI: https://doi.org/10.1017/cts.2025.10191
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press on behalf of Association for Clinical and Translational Science

Introduction

Identifying the etiology of lower respiratory tract infection (LRTI), which includes syndromes such as bronchitis, pneumonia, and infectious exacerbations of asthma and chronic obstructive pulmonary disease (COPD), remains challenging. Viral and bacterial LRTI present with similar clinical signs and symptoms, leading clinicians to prescribe antibacterials for fear of missing an otherwise fatal bacterial infection [Reference Ieven, Coenen and Loens1].

LRTI diagnostics that are currently used in clinical care, such as sputum or blood cultures or multiplex polymerase chain reaction (PCR) of nasopharyngeal (NP) or sputum samples, are generally focused on identifying a specific viral or bacterial pathogen. However, such pathogen-based diagnostics can have low sensitivity (in the case of culture), may detect only a select set of pathogens (in the case of PCR), or fail to distinguish colonization from infection (in both cases) [Reference Zaas, Garner, Tsalik, Burke, Woods and Ginsburg2,Reference Dekker, Verheij and van der Velden3]. Moreover, identifying an organism from a non-invasive, upper respiratory sample such as a nasal or NP sample may not reflect the etiology of infection in the lower respiratory tract [Reference Robinson4]. Host response-based diagnostics, which assess the host’s response to infection and broadly classify infection as viral or bacterial, provide important adjunctive information to pathogen-based diagnostics, and can also help differentiate colonization from infection. However, traditional host response-based diagnostics, which include protein biomarkers such as C-reactive protein (CRP) and procalcitonin (PCT), have also been plagued by poor performance characteristics [Reference Ito and Ishida5]. Newer host response-based tests that assess multiple protein biomarkers, such as the FebriDx (Lumos Diagnostics) and the MeMed BV (MeMed) tests, may have improved performance characteristics, but are not yet widely used in clinical practice [Reference Shapiro, Self and Rosen6,Reference Bachur, Kaplan and Arias7].

In recent years, measuring host gene expression has emerged as a novel strategy for assessing host response [Reference Jeffrey, Denny, Lipman and Conway Morris8]. Transcriptomics-based tests may have superior performance characteristics to traditional protein-based host response tests. One such transcriptomics test, the TriVerity by Inflammatix, was recently cleared by the US Food and Drug Administration (FDA) for distinguishing acute viral from bacterial infection [9].

We have previously developed a blood-based gene expression classifier (Biomeme HR-B/V test) using 22 gene targets and 2 normalizing genes that is run on a rapid, real-time quantitative polymerase chain reaction (RT-qPCR) platform, the Biomeme Franklin™ [Reference Iglesias-Ussel, O’Grady and Anderson10]. The accuracy of the Biomeme HR-B/V test at distinguishing viral versus bacterial infection was 85%. For acute respiratory infections, assessing host response in the nasopharynx is an appealing strategy, as measuring localized host response may allow earlier detection of infection. In addition, a NP sample affords the possibility of developing an integrated diagnostic that can assess both pathogen and host response using a single, non-invasively collected sample [Reference Pandya, He, Sweeney, Hasin-Brumshtein and Khatri11]. Others have shown that gene expression classifiers using nasal or NP samples can differentiate viral respiratory infection from non-viral respiratory infection, healthy controls, or between different types of viral respiratory infection [Reference Pandya, He, Sweeney, Hasin-Brumshtein and Khatri11–Reference Barral-Arca, Gómez-Carballa, Cebey-López, Bello, Martinón-Torres and Salas14]. The performance of nasal or NP-based gene expression classifiers at distinguishing viral versus bacterial respiratory infection is just starting to be explored [Reference Andrew, Nooran and Max15].

In this study, we enrolled patients with LRTI and assessed the performance of our previously developed blood-based classifier at identifying viral versus bacterial infection when applied to NP samples.

Methods

Subject recruitment

Consecutive patients ≥ 1 year old admitted with acute LRTI to an 1800-bed, public tertiary care hospital in Southern Province, Sri Lanka were identified for enrollment within 48 hours of admission during the period of November 2019 to July 2020. Patients were eligible if they met an age-specific case definition for LRTI and had an acute illness, as described previously [Reference Medrano, Weerasinghe and Nagahawatte16]. Chest X-ray imaging within 48 hours of admission was required for eligibility for patients ≥ 5 years, but was not required in patients < 5 years of age. Patients were not eligible to participate in this study if they were outpatients, hospitalized within the past 28 days, or had known or suspected infections at other anatomic sites requiring antibacterial therapy. Written informed consent was obtained from all patients or their guardians for children < 18 years of age. Written assent was obtained from children 12–17 years of age.

Collection of clinical information and biological samples

At enrollment, a standardized questionnaire was administered by trained research assistants to collect demographic and clinical information. Laboratory test results obtained during hospitalization as part of routine clinical care, such as white blood cell count and CRP level, were also recorded. Two NP swab samples were obtained, with one placed in universal transport media (UTM) and the other placed in RNAlater® RNA stabilization solution (Thermo Fisher Scientific). A urine sample was also collected. All samples were stored at −80°C until used for testing. All patients also had blood and sputum samples collected for culture at enrollment, and these were processed immediately.

Etiological testing

The NP sample stored in UTM was tested for 3 bacterial and 18 viral pathogens using the Luminex NxTAG Respiratory Pathogen Panel (Luminex Corporation, Austin, TX, USA). Testing for SARS-Coronavirus-2 was conducted using the Centers for Disease Control and Prevention (CDC) SARS-CoV-2 assay on an AB7500 Fast DX (Applied Biosystems, Waltham, MA, USA). Urine antigen testing for Streptococcus pneumoniae was performed using the BinaxNOW (Abbott, Chicago, IL, USA). Sputum and blood cultures were processed manually using standard microbiological techniques according to the Clinical Laboratory Standards Institute [17,18].

Clinical adjudications

Clinical adjudication served as the comparator method to determine the etiology of illness. Adjudicators were physicians with experience in the diagnosis of infectious diseases, and used a combination of clinical history, results from laboratory and radiographic tests performed for clinical care, and results from etiological tests performed for research purposes to conduct adjudications. Two adjudicators independently determined the likelihood of bacterial and/or viral infection, non-infectious syndrome, or indeterminate diagnosis. Adjudicator discordance was resolved by a consensus panel of at least three experts, with simple majority determining final diagnosis [Reference Ko, Henao and Frankey19,Reference Tsalik, Henao and Montgomery20]. Among those who had a bacterial or viral infection identified as the primary cause, microbiological level of confidence was identified as being high confidence (positive microbiological data with supportive clinical history) versus low confidence (negative microbiological data but with supportive clinical history).

Selection of subjects into sub-analysis

Subjects were eligible for inclusion in this sub-analysis if they had a bacterial or viral infection identified as the primary cause of infection, and if the microbiological level of confidence was identified as being high (positive microbiological data with supportive clinical history). Of the patients who met these criteria, 32 were initially selected at random for testing, based on sample and resource availability.

Platform and classifier for assessing host gene expression

The Biomeme Franklin™ molecular diagnostic platform can detect up to 27 targets per sample through RT-qPCR. The Biomeme HR-B/V test on this platform includes 22 discriminating targets (BATF, CFAP45, CTBP1, DEFA3, DSC2, EXOG, FOLR3, GCAT, HLA-DRB1, IFI27, LAMP1, LAPTM4B, MCTP1, OAS3, PLAC8, RPS21, SIGLEC1, SIRPB1, SLC29A1, STAP1, TNFAIP2, USP18) along with two normalization controls (DECR1 and PPIB) and an RNA process control (RNA extraction and RT-PCR control utilizing MS2 bacteriophage) [Reference Iglesias-Ussel, O’Grady and Anderson10].

RNA extraction and RT-qPCR

Banked NP samples collected in RNAlater® were thawed, with 250–500 µL of solution warmed at 37°C to return the precipitated reagent into solution. This was then centrifuged for 5 minutes at 3000 × g to pellet nucleic acid-containing material. The RNAlater® supernatant was aspirated and the pellet was resuspended in 500 µL of Biomeme Lysis Buffer (Biomeme, Philadelphia, USA). The whole volume was then added to an M1 RNA 2.0 Sample Prep Cartridge (Biomeme, Philadelphia, USA) for extraction and was eluted into 400 µL TE buffer. The sample was pumped through the Biomeme M1 sample prep column, which contains silica membranes, a barbed tip, and a Luer lock for attachment to a 1 mL syringe. The column’s barbed tip pierces the foil-sealed cartridge chambers, which contain lysis buffer, protein, salt wash and drying buffers. For the final air-drying step, we transferred the column to a clean 20 mL syringe and dried it onto a clean low-lint wipe with 5–10 pumps, and eluted the RNA with 400 µL of 10 mM Tris-HCl, 0.1 mM EDTA buffer. Purified RNA samples were added to lyophilized HR-B/V assay reagents, then run on the Franklin™ three9 thermocycler (Biomeme, Philadelphia, USA). Primers/probes were multiplexed for triplex reactions.

Statistical analysis

Data processing

Raw relative fluorescent units (RFU) were exported from the Biomeme Franklin™ mobile RT-qPCR thermocyclers to a cloud database. Values were converted to cycle threshold (Ct) units and exported via XML worksheets. Samples that had greater than 33% of their target Ct values missing were removed from downstream analysis. After sample removal, gene targets with missing Ct values in 33% of all samples were removed for new model development. However, all targets were considered when using the existing blood-derived models that are used with the Biomeme Franklin™. These existing models include separate models for bacterial versus non-bacterial and viral versus non-viral infections [Reference Iglesias-Ussel, O’Grady and Anderson10]. Missing or non-detected values were imputed to the maximum observed value per target plus one cycle threshold, i.e., max(observed Ct) + 1. RT-PCR values were normalized with the delta Ct method, in which the normalized value is the target Ct value minus the mean Ct of the reference targets (DECR1 and PPIB) for that sample [Reference Vandesompele, De Preter and Pattyn21].
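The processing steps described above can be illustrated with a minimal Python sketch. This is not the authors' pipeline (the analyses were run in R); the function names, the dictionary layout, and all Ct values below are hypothetical, and the sketch assumes that imputation is applied after sparse samples are removed and that every retained target was detected in at least one sample.

```python
from statistics import mean

REFERENCE_TARGETS = ("DECR1", "PPIB")  # normalization controls named in the text
MAX_MISSING_FRACTION = 0.33            # samples with more missing Ct values are dropped


def drop_sparse_samples(ct_by_sample):
    """Keep samples whose fraction of missing (None) Ct values is at most 33%."""
    kept = {}
    for sample_id, cts in ct_by_sample.items():
        n_missing = sum(1 for value in cts.values() if value is None)
        if n_missing / len(cts) <= MAX_MISSING_FRACTION:
            kept[sample_id] = cts
    return kept


def impute_missing(ct_by_sample):
    """Replace each missing Ct with max(observed Ct for that target) + 1."""
    targets = sorted({t for cts in ct_by_sample.values() for t in cts})
    fill_value = {}
    for target in targets:
        observed = [cts[target] for cts in ct_by_sample.values()
                    if cts.get(target) is not None]
        # Assumption: every retained target was detected in at least one sample.
        fill_value[target] = max(observed) + 1
    return {
        sample_id: {t: (v if v is not None else fill_value[t]) for t, v in cts.items()}
        for sample_id, cts in ct_by_sample.items()
    }


def delta_ct(ct_by_sample):
    """Subtract each sample's mean reference Ct from its target Ct values."""
    normalized = {}
    for sample_id, cts in ct_by_sample.items():
        reference = mean(cts[r] for r in REFERENCE_TARGETS)
        normalized[sample_id] = {
            t: v - reference for t, v in cts.items() if t not in REFERENCE_TARGETS
        }
    return normalized


# Hypothetical Ct values: S3 is missing 3 of 4 targets and is removed.
raw = {
    "S1": {"DECR1": 24.0, "PPIB": 26.0, "IFI27": 20.0, "OAS3": 28.0},
    "S2": {"DECR1": 25.0, "PPIB": 25.0, "IFI27": None, "OAS3": 30.0},
    "S3": {"DECR1": None, "PPIB": None, "IFI27": 31.0, "OAS3": None},
}
normalized = delta_ct(impute_missing(drop_sparse_samples(raw)))
```

In this toy input, the undetected IFI27 value in S2 is imputed as 20.0 + 1 = 21.0 before normalization, so a non-detect is treated as slightly lower expression than the weakest observed signal for that target.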

Exploratory and differential expression analysis

Principal component analysis (PCA) plots were generated for dimensionality reduction, separated by preservation type, and further stratified by their clinical bacterial and viral adjudications. Differential expression between bacterial and viral samples for each target was assessed using a two-sample t-test. P-values were adjusted for multiple testing using the Benjamini-Hochberg procedure and targets with adjusted p-value ≤ 0.05 were considered significant [Reference Benjamini and Hochberg22].
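For readers unfamiliar with the Benjamini-Hochberg adjustment, the following is a minimal Python sketch of the standard step-up procedure. It is an illustration only, not the authors' R code, and the p-values are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical raw p-values for five targets.
raw_p = [0.001, 0.008, 0.039, 0.041, 0.6]
adj_p = benjamini_hochberg(raw_p)
significant = [p <= 0.05 for p in adj_p]
```

Here the third and fourth targets have raw p-values below 0.05 but adjusted values of about 0.051, so only the first two would be called significant at the threshold used in the text.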

Pathway enrichment analysis

Over-representation analysis was performed comparing these 22 genes from the Biomeme HR-B/V assay to a universe of all transcripts from the org.Hs.eg.db R object [23]. All genes were passed into clusterProfiler and run against the Gene Ontology biological processes (BP) database [Reference Xu, Hu and Cai24]. Enrichment results were limited to pathways that had three or more target genes in a pathway, and additionally restricted to a false discovery rate-corrected p-value ≤ 0.05.
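Over-representation analysis of this kind is commonly based on a one-sided hypergeometric test: given a universe of N genes of which K belong to a pathway, it asks how likely it is that a list of n genes contains at least k pathway members by chance. A minimal Python sketch follows; the universe and pathway sizes are hypothetical and are not taken from the study.

```python
from math import comb


def hypergeom_upper_tail(k, K, n, N):
    """P(X >= k) when n genes are drawn from N genes, K of which are in the pathway."""
    total = comb(N, n)
    upper = min(K, n)
    hits = sum(comb(K, x) * comb(N - K, n - x) for x in range(k, upper + 1))
    return hits / total


# Hypothetical example: 6 of 22 classifier genes fall in a 200-gene pathway,
# against an assumed universe of 20000 annotated genes.
p_value = hypergeom_upper_tail(k=6, K=200, n=22, N=20000)

# Small case that can be checked by hand: (C(4,2)*C(6,1) + C(4,3)) / C(10,3) = 40/120.
p_small = hypergeom_upper_tail(k=2, K=4, n=3, N=10)
```

The resulting p-values would then be corrected for multiple testing across pathways, as described in the text.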

Predictive modeling using existing blood-based Biomeme HR-B/V Franklin™ models

The blood-derived HR-B/V classifier comprises two previously developed models: a bacterial model that assesses bacterial versus non-bacterial infection, and a viral model that assesses viral versus non-viral infection. Including two distinct models allows for the possibility of assessing bacterial and viral co-infection. Methods for deriving these models are published elsewhere [Reference Iglesias-Ussel, O’Grady and Anderson10]. These sparse logistic regression models were used to generate predictions from the normalized NP sample data. Box plots and the area under the receiver operating characteristic curve (AUC) were used to assess performance.
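The AUC can be read as the probability that a randomly chosen positive sample receives a higher predicted probability than a randomly chosen negative sample. A minimal Python sketch of that pairwise definition is shown below; the predicted probabilities are hypothetical, and the study itself computed AUC in R.

```python
def auc(scores_positive, scores_negative):
    """Fraction of positive/negative pairs ranked correctly; ties count as half."""
    wins = 0.0
    for pos in scores_positive:
        for neg in scores_negative:
            if pos > neg:
                wins += 1.0
            elif pos == neg:
                wins += 0.5
    return wins / (len(scores_positive) * len(scores_negative))


# Hypothetical bacterial-model probabilities for adjudicated bacterial and viral cases.
bacterial_cases = [0.9, 0.7, 0.4]
viral_cases = [0.6, 0.3, 0.2, 0.1]
example_auc = auc(bacterial_cases, viral_cases)  # 11 of 12 pairs ordered correctly
```

With 7 bacterial and 12 viral samples there are only 84 such pairs, so the AUC can change only in steps of 1/84. A value of 0.786, for example, is consistent with 66 of 84 pairs being ordered correctly, which helps explain why confidence intervals are wide in a cohort of this size.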

Retraining a predictive model of viral versus bacterial infection

In addition to the imported Biomeme HR-B/V Franklin™ models, a new model was built on the NP data using sparse logistic regression. Specifically, a bacterial versus viral elastic net model favoring ridge regression (α = 0.1) was fitted with the glmnet R package [Reference Friedman, Hastie and Tibshirani25]. The optimal regularization parameter (λ) was obtained via leave-one-out cross-validation (LOOCV). Estimated performance metrics included AUC, accuracy, positive percent agreement (PPA), negative percent agreement (NPA), and box plots of predicted probabilities for bacterial versus viral infection. The threshold for bacterial classifiers for determining accuracy, PPA, and NPA was estimated via the Youden Index [Reference Youden26]. Confidence intervals were generated from confusion matrices using the Wilson method in epiR [Reference Carstensen, Plummer, Laara and Hills27]. Gene-specific model weights were averaged over all iterations. All statistical analyses were completed using R Statistical Software version 4.4.1 [28].
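Two of the steps above, threshold selection by the Youden Index and Wilson confidence intervals for proportions, can be sketched in a few lines of Python. This is an illustration rather than the authors' epiR-based code. The example scores are hypothetical, and the counts 6/7 and 10/12 are inferred here from the reported PPA (0.86) and NPA (0.83) with 7 bacterial and 12 viral cases; they are not stated explicitly in the text.

```python
from math import sqrt

Z_95 = 1.959964  # standard normal quantile for a two-sided 95% interval


def youden_threshold(scores, labels):
    """Return (threshold, J) maximizing J = sensitivity + specificity - 1.

    labels use 1 for bacterial and 0 for viral; a sample is called
    bacterial when its score is >= the threshold.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    best_threshold, best_j = None, -1.0
    for t in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= t)
        tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < t)
        j = tp / n_pos + tn / n_neg - 1
        if j > best_j:
            best_threshold, best_j = t, j
    return best_threshold, best_j


def wilson_interval(successes, n, z=Z_95):
    """Wilson score interval for a binomial proportion."""
    p = successes / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half_width = z * sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half_width, center + half_width


# Hypothetical predicted probabilities of bacterial infection.
scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
labels = [1, 1, 1, 0, 0, 0, 0]
threshold, j_stat = youden_threshold(scores, labels)

# Counts inferred from the reported PPA and NPA (an assumption, see above).
ppa_ci = wilson_interval(6, 7)
npa_ci = wilson_interval(10, 12)
```

Under the inferred counts, the Wilson intervals round to 0.49–0.97 for PPA and 0.55–0.95 for NPA, which agrees with the intervals reported for the new model.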

Performance of standard biomarker - CRP

The performance characteristics of the commonly used biomarker CRP at identifying viral versus bacterial infection in our cohort were determined using CRP test results that were obtained during routine clinical care. AUC, PPA, NPA, positive predictive value (PPV), and negative predictive value (NPV) were determined for CRP. The performance of the novel viral versus bacterial model was compared with that of CRP using the DeLong test to compare AUCs, and a test of two proportions for PPA, NPA, PPV and NPV.

External validation

Series matrix gene counts and associated phenotypic data from series GSE163151 and GSE188678 were downloaded from the Gene Expression Omnibus (GEO) database. Data were converted into edgeR objects for pre-processing quality control [Reference Robinson, McCarthy and Smyth29]. Lowly expressed genes were filtered using edgeR’s filterByExpr function, and expression counts were normalized using the trimmed mean of M-values method (TMM) [Reference Robinson and Oshlack30]. Density plots across raw, filtered, and log2-counts per million (cpm) normalized data, and PCA on log2-cpm data were generated for study design considerations and removal of outliers. Voom weights were estimated to control for mean-variance heteroscedasticity, and incorporated in subsequent differential expression analysis [Reference Law, Chen, Shi and Smyth31]. Log2 normalized transcriptomic data were then filtered down to the 22 genes present in the Biomeme HR-B/V assay.

Elastic net logistic regression models for each GEO set were built using the 22 Biomeme HR-B/V genes, excluding housekeeping genes DECR1 and PPIB, after quality control filtering. The models were set up to predict viral versus non-viral samples and built using the process described above. We report AUC and probabilities of viral infection summarized as box plots. Model coefficients (target weights) for all iterations were used to summarize (as box plots and LOOCV usage) the targets used by the model.

Ethical considerations

This study was approved by the Ethical Review Committee of the Faculty of Medicine, University of Ruhuna, Sri Lanka (application number 15.02.2018.3.13) and the Duke University Institutional Review Board (Pro00092502) and conducted in accordance with the principles outlined in the Declaration of Helsinki. All participants provided written informed consent prior to participating in the study.

Results

Expression of gene targets

A subset of 32 subjects who had an etiology of viral (23) or bacterial (9) infection with high level of microbiological confidence based on clinical adjudications was initially identified. A total of 13 NP samples (11 viral and 2 bacterial, 41% of total samples) had > 33% of their targets missing by the HR-B/V test and were excluded from subsequent analyses (Supplementary Figure 1). The analysis cohort thus included 19 subjects (12 viral and 7 bacterial infections), with specimens collected only at initial enrollment. The sociodemographic and clinical characteristics of the 19 subjects are shown in Table 1.

Table 1. Sociodemographic and clinical characteristics of subjects with viral or bacterial etiology of lower respiratory tract infection based on clinical adjudications. The frequency (percentage) or median (interquartile range) is displayed

Abbreviations: COPD = chronic obstructive pulmonary disease; HTN = hypertension; CHF = congestive heart failure.

* One participant had an admission clinical diagnosis of dengue versus an unspecified viral fever (both diagnoses were listed).

When comparing gene expression in viral versus bacterial infection, five classifier genes (OAS3, IFI27, USP18, DSC2, RPS21) were significantly differentially expressed (Figure 1). The three most differentially expressed genes (OAS3, IFI27, and USP18) were viral targets and showed higher expression in viral samples in comparison to bacterial samples. The remaining classifier genes and two normalizing genes (DECR1 and PPIB) were not significantly differentially expressed, all with adjusted p-values ≥ 0.25.

Figure 1. Normalized expression of genes in nasopharyngeal samples in subjects with lower respiratory tract infection, differentiated by viral (n = 12) versus bacterial (n = 7) infection. Expression values are qPCR cycle thresholds multiplied by negative one. The genes listed in red are the normalizing genes. Genes denoted with a single asterisk have an adjusted p-value for differential expression of ≤ 0.05, while genes denoted with a double asterisk have an adjusted p-value ≤ 0.01.

Pathway analysis

Of the pre-selected 22 genes from the HR-B/V test, pathways associated with viral life cycle (6 genes) and viral process (6 genes) were the most common. Four genes were represented in pathways associated with cell killing, biological process involved in symbiotic interaction, defense response to virus, and defense response to symbiont. Three genes were represented in pathways such as type I interferon-mediated signaling pathway, regulation of cell killing, regulation of leukocyte-mediated cytotoxicity, and viral genome replication. Figure 2 displays the 20 pathways in which the classifier genes were represented with the highest statistical significance. Supplementary Table 1 shows the total of 43 pathways in which these genes were represented with p-value less than 0.05.

Figure 2. Pathways in which the 22 genes represented in the Biomeme HR-B/V classifier were found at a statistically significant level compared to other pathways. The 20 pathways with highest statistical significance are displayed here.

Prediction of viral versus bacterial infection using existing blood-based Biomeme Franklin™ models

We first assessed discrimination of viral and bacterial infection using principal component analysis (PCA) (Figure 3).

Figure 3. Principal component analysis (PCA) of viral and bacterial infection among patients with lower respiratory tract infection.

We used the existing blood-based Biomeme HR-B/V Franklin™ models to assess performance at identifying viral versus bacterial infection. When using NP swab samples, the bacterial model AUC was 0.786 (95% CI 0.524–1), with accuracy of 0.79 (95% CI 0.57–0.91), PPA of 0.43 (95% CI 0.16–0.75), and NPA of 1.00 (95% CI 0.76–1) compared to clinical adjudication (Figure 4). The viral model showed similar performance with an AUC of 0.821 (95% CI 0.564–1), accuracy of 0.84 (95% CI 0.62–0.94), PPA of 0.92 (95% CI 0.65–0.99), and NPA of 0.71 (95% CI 0.36–0.92).

Figure 4. (A) The area under the curve (AUC) and discrimination of viral and bacterial lower respiratory tract infection when using nasopharyngeal swab samples and the existing blood-based Biomeme HR-B/V Franklin™ models. (B) Bacterial model. (C) Viral model. p stands for probability in the figures.

Prediction of viral versus bacterial infection using new model

All genes present in the Biomeme HR-B/V test were then used to build a novel predictive model based on NP-derived gene expression data. The AUC of this new model was 0.881 (95% CI 0.726–1). The model had accuracy of 0.84 (95% CI 0.62–0.94), PPA of 0.86 (95% CI 0.49–0.97), and NPA of 0.83 (95% CI 0.55–0.95) for bacterial infection. Figure 5 shows the AUC and the discrimination of viral and bacterial infection, and Supplementary Figure 2 displays the frequency of regression coefficients and the regression coefficient values.

Figure 5. (A) The area under the curve (AUC) and (B) discrimination of viral and bacterial lower respiratory tract infection when using nasopharyngeal samples and a newly derived model. p stands for probability in the figures.

Comparison to the biomarker CRP

We compared the performance of our newly developed model versus the standard biomarker, CRP (Table 2). Our model showed superior performance to CRP across all metrics, except in the case of PPA, for which it displayed equal performance (85.7%). Accuracy of the new model was 77.8% compared to 61.1% for CRP. A test of two proportions showed no statistically significant differences in performance; this is likely due to the small sample size.

Table 2. Performance metrics of the newly developed nasopharyngeal B/V model and C-reactive protein

95% confidence intervals are displayed in parentheses. Abbreviations: B/V = bacterial/viral; PPA = positive percent agreement; NPA = negative percent agreement; PPV = positive predictive value; NPV = negative predictive value.

Validation in external datasets

To establish the generalizability of our results, we validated the Biomeme HR-B/V classifier (gene targets) in NP samples using two external gene expression datasets (GSE163151 and GSE188678; Table 3). Few publicly available datasets with NP gene expression data in respiratory infection were found, and these particular datasets were selected because they included patients with both viral and non-viral acute respiratory illness, included RNA sequence data from nasal or NP specimens, and were thought to be most representative of the current dataset of adults with LRTI. GSE163151 included 340 NP samples (258 viral, 82 non-viral) from individuals with suspected respiratory infection. The cohort included 138 patients with COVID-19, 120 patients with other viral infections such as influenza A, influenza B, and rhinovirus, and 82 patients with no virus detected who were presumed to have non-viral respiratory illness. Mean age was 49 ± 20 years in patients with viral infection and 44 ± 16 years in patients who were viral negative. A total of 48% in the viral-positive group were male, compared with 26.5% in the viral-negative group. The majority of patients were ambulatory (70% in the viral-positive group and 80% in the viral-negative group).

Table 3. External datasets of patients with viral versus non-viral respiratory illness and RNA sequence data from nasal/nasopharyngeal samples. The Biomeme HR-B/V classifier was validated in these external datasets. 95% confidence intervals (CI) for the area under the curve (AUC) are given in parentheses

Abbreviations: AUC = area under the receiver operating characteristic curve.

GSE188678 consisted of adults with acute respiratory illness, of whom 137 (43.1%) were male, with ages ranging from 19 to 89 years. This cohort included 149 patients with viral infections (including 90 with COVID-19 and 59 with other viral respiratory infections consisting mostly of rhinovirus and influenza), and 169 with no virus detected who were presumed to have non-viral respiratory illness.

In dataset GSE163151, 23 of the 24 HR-B/V target genes were found, with HLA-DRB1 missing. Ten of the 23 HR-B/V target genes (SLC29A1, DEFA3, LAPTM4B, DSC2, GCAT, EXOG, FOLR3, BATF, SIRPB1, STAP1) were filtered out of the dataset due to low expression values. Eleven of the 13 remaining genes were expressed at statistically significantly different levels between the viral and non-viral groups. When applying this new RNA-seq NP-derived model, the AUC was 0.932 (95% CI 0.901–0.964) in this dataset (Figure 6a). In dataset GSE188678, all 24 HR-B/V target genes were found. Two of the 24 HR-B/V target genes (DEFA3, FOLR3) were filtered out of the dataset due to low expression values. Eleven of the 22 remaining genes were expressed at statistically significantly different levels between the viral and non-viral groups. The AUC of this RNA-seq NP-derived model was 0.915 (95% CI 0.880–0.950) in this dataset (Figure 6b). Regression coefficient frequency and values are also summarized as bar and box plots in Supplemental Figure 3.
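As an illustration of how an AUC and its 95% confidence interval of the kind reported above can be obtained, the following minimal Python sketch computes a rank-based AUC and a percentile bootstrap interval. It is not the analysis code used in this study: the scores are synthetic, the function names (`auc`, `bootstrap_auc_ci`) are hypothetical, and the bootstrap is an assumed choice of interval method.

```python
import random


def auc(pos_scores, neg_scores):
    """Rank-based AUC: probability that a randomly chosen positive sample
    scores higher than a randomly chosen negative sample (ties count 0.5)."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


def bootstrap_auc_ci(pos_scores, neg_scores, n_boot=500, alpha=0.05, seed=0):
    """Percentile bootstrap CI (assumed method), resampling each class
    separately with replacement."""
    rng = random.Random(seed)
    stats = []
    for _ in range(n_boot):
        pos = rng.choices(pos_scores, k=len(pos_scores))
        neg = rng.choices(neg_scores, k=len(neg_scores))
        stats.append(auc(pos, neg))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper


# Synthetic classifier scores (NOT study data): viral samples tend to score higher.
rng = random.Random(42)
viral_scores = [rng.gauss(0.7, 0.15) for _ in range(60)]
nonviral_scores = [rng.gauss(0.4, 0.15) for _ in range(40)]

point = auc(viral_scores, nonviral_scores)
ci_low, ci_high = bootstrap_auc_ci(viral_scores, nonviral_scores)
print(f"AUC = {point:.3f} (95% CI {ci_low:.3f}-{ci_high:.3f})")
```

Resampling within each class keeps the viral and non-viral group sizes fixed across bootstrap replicates; other interval methods (for example, DeLong) would be equally reasonable.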

Figure 6. Area under the curve (AUC) (left) and discrimination of viral and non-viral lower respiratory tract infection (right) of the novel NP-derived classifier in two external datasets with nasal or nasopharyngeal RNA sequence data: GSE163151 (A) and GSE188678 (B).

Discussion

Characterizing the host response to infection is emerging as an important strategy by which viral infection can be distinguished from bacterial infection, and by which infection can be distinguished from colonization. Traditionally, host-based diagnostics have utilized peripheral blood to characterize host response. However, using our prior blood-based classifier, we show that host response in the nasopharynx can also distinguish viral versus bacterial infection, specifically in the lower respiratory tract. Performance was further enhanced by training a classification model on NP-derived gene expression data. Being able to identify viral versus bacterial LRTI using a non-invasively collected sample has important implications for identifying LRTI etiology, which remains unknown in the majority of cases. In addition, using an NP sample affords the possibility of developing an integrated pathogen and host-based diagnostic that identifies infectious etiology using a single, non-invasively collected sample.

We showed that the median expression of certain genes (IFI27, HLA-DRB1, USP18, STAP1, and OAS3) in the Biomeme HR-B/V classifier was higher in viral versus bacterial infection. The pattern of expression for several of these genes is consistent with what has been shown previously and with what would be expected based on the known biological function of these genes. For example, IFI27, which is induced by interferon, has been shown to be upregulated in both blood and NP samples in response to viral respiratory infection [32,33]. USP18, STAP1, and OAS3 are involved in anti-viral response and have been shown to be upregulated in blood or lung tissue during viral infection [34–37]. The median expression of some genes (CTBP1, LAPTM4B, CFAP45, DSC2, LAMP1, EXOG, RPS21, and SIRPB1) was higher in bacterial compared to viral infection. While genes present in bacterial pathways are less clearly defined, SIRPB1 has been shown to be involved in the promotion of phagocytosis in macrophages [38]. Pathway analysis showed that the genes in the classifier were represented in pathways such as those associated with viral life cycle, cell killing, and regulation of leukocyte-mediated cytotoxicity.
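The direction-of-expression comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the expression values are invented placeholders rather than study data, and `median_direction` is a hypothetical helper, not part of the study's analysis pipeline.

```python
from statistics import median

# Hypothetical normalized expression values (NOT study data), by gene and group.
expression = {
    "IFI27": {"viral": [8.1, 7.6, 9.0, 8.4], "bacterial": [5.2, 4.9, 6.1]},
    "SIRPB1": {"viral": [3.0, 3.4, 2.8, 3.1], "bacterial": [4.6, 5.0, 4.2]},
}


def median_direction(groups):
    """Return a label and the viral-minus-bacterial difference in medians."""
    diff = median(groups["viral"]) - median(groups["bacterial"])
    if diff > 0:
        label = "higher in viral"
    elif diff < 0:
        label = "higher in bacterial"
    else:
        label = "no difference"
    return label, diff


results = {gene: median_direction(groups) for gene, groups in expression.items()}
for gene, (label, diff) in results.items():
    print(f"{gene}: {label} (median difference {diff:+.2f})")
```

A comparison of medians alone does not establish statistical significance; a formal test with multiple-testing correction would still be needed.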

The use of our existing blood-based Biomeme HR-B/V Franklin^TM bacterial model on NP samples showed moderate performance with AUC of 0.786 and accuracy of 0.79 at identifying viral versus bacterial infection. Training on NP-derived gene expression showed improved performance with AUC of 0.881 and accuracy of 0.84. These latter metrics are comparable to those observed in blood with the HR-B/V classifier [10]. Others have previously shown that NP-based classifiers can perform comparably to blood-based classifiers at identifying viral versus non-viral respiratory illness; however, none have directly compared the performance of a classifier on NP versus blood samples [12,39]. From a biological perspective, it is plausible that gene expression changes in the HR-B/V classifier's targets, many of which are related to immune function, would be similar in the blood versus NP spaces. The high performance of our HR-B/V signature (gene targets) in two external datasets with nasal/NP samples lends further weight to the biological importance of these genes and to the generalizability of our results. It must be noted that of 32 initial NP samples, 13 (11 viral and 2 bacterial, 41% of total samples) had >33% of their targets missing and were excluded from subsequent analyses. This level of missingness may be related to variations in sample quality due to collection methods, or may be related to underlying biological differences in gene expression in blood versus the nasopharynx. For example, the expression level of some genes may be too low in NP samples, resulting in the failure to detect them using PCR. Larger studies need to be conducted, and de novo classifiers for the NP space need to be explored. The level of missingness may also pose challenges in the future for developing a viable NP-based host response diagnostic that can be used clinically. It is possible that genes selected for such a classifier should be restricted to those with baseline high levels of expression.
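The exclusion rule described above (samples with more than 33% of targets missing are dropped) can be illustrated with a short Python sketch. The target names, sample identifiers, and values are hypothetical placeholders, and `split_by_missingness` is an illustrative helper rather than the study's actual processing code.

```python
def missing_fraction(sample):
    """Fraction of targets with no detected value (None) in one sample."""
    return sum(value is None for value in sample.values()) / len(sample)


def split_by_missingness(samples, max_missing=0.33):
    """Keep samples whose missing fraction is at most max_missing;
    exclude those exceeding it."""
    kept, excluded = {}, {}
    for sample_id, sample in samples.items():
        if missing_fraction(sample) > max_missing:
            excluded[sample_id] = sample
        else:
            kept[sample_id] = sample
    return kept, excluded


# Ten hypothetical targets; a detected target gets a placeholder value of 25.0.
targets = [f"target_{i:02d}" for i in range(1, 11)]


def make_sample(n_missing):
    """Build a sample in which the first n_missing targets were not detected."""
    return {t: (None if i < n_missing else 25.0) for i, t in enumerate(targets)}


samples = {"S1": make_sample(0), "S2": make_sample(3), "S3": make_sample(4)}
kept, excluded = split_by_missingness(samples)
print("kept:", sorted(kept), "excluded:", sorted(excluded))

# Arithmetic behind the reported exclusion rate: 13 of 32 samples.
print(f"excluded share: {100 * 13 / 32:.1f}%")
```

In this toy example, the sample with 30% of targets missing is retained and the sample with 40% missing is excluded, mirroring the threshold applied to the NP samples.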

The ability of the classifier to identify viral versus bacterial infection in the NP space is promising, as this provides an avenue for an integrated pathogen-host response diagnostic that utilizes a single, non-invasively collected patient sample. No such diagnostics currently exist in clinical care for the diagnosis of respiratory or other infectious syndromes. For respiratory infections, and LRTI in particular, identification of an organism from an upper respiratory sample does not necessarily indicate infection with that organism; such a diagnostic could therefore transform current clinical practice. Improved diagnostics may help decrease antibacterial overuse for respiratory viral infections, which has been documented in both inpatient and outpatient settings globally [40–43]. Antibacterial overuse is associated with downstream antimicrobial resistance, which at current rates is estimated to result in 39 million deaths by 2050 [44].

Some limitations must be noted. Our sample size was small. However, this pilot study is an initial proof-of-concept assessment and provides promising results suggesting that the HR-B/V assay may work well in NP samples. In addition, the replication of our findings using two external cohorts is a strength and suggests that our results are generalizable. Nevertheless, our findings need to be further validated with additional internal cohorts as well as multi-site cohorts. The reference standard based on clinical adjudication may have resulted in misclassification. We attempted to minimize the chance of misclassification by using a rigorous adjudication system and by only utilizing cases in which there was a high level of microbiological confirmation. Our results may thus not be applicable to other patients with less definite infection; however, we intend to study the performance of the HR-B/V classifier in the NP samples of patients with indeterminate etiology of infection in future work.

In conclusion, we show that our prior blood-based Biomeme HR-B/V classifier had high performance at identifying viral versus bacterial LRTI, particularly when trained on NP samples. Our findings need to be replicated in larger, multi-center cohorts with diverse etiologies of acute respiratory tract infection.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/cts.2025.10191.

Acknowledgements

The authors would like to thank the patients and research staff who were involved in this study.

Author contributions

L. Gayani Tillekeratne: Conceptualization, Data curation, Funding acquisition, Resources, Writing-original draft, Writing-review & editing; Nicholas O'Grady: Formal analysis, Investigation, Methodology, Validation, Writing-original draft, Writing-review & editing; Maria D. Iglesias-Ussel: Investigation, Project administration, Writing-review & editing; Jack Anderson: Investigation, Writing-review & editing; Alana Brown: Investigation, Writing-review & editing; Armstrong Obale: Data curation, Writing-review & editing; Christina Nix: Data curation, Writing-review & editing; Champica Bodinayake: Project administration, Resources, Supervision, Writing-review & editing; Ajith Nagahawatte: Project administration, Resources, Supervision, Writing-review & editing; Robert Rolfe: Investigation, Writing-review & editing; E. Wilbur Woodhouse: Investigation, Writing-review & editing; Gaya Wijayaratne: Project administration, Writing-review & editing; Senali Weerasinghe: Project administration, Writing-review & editing; U.H.B.Y. Dilshan: Investigation, Writing-review & editing; Jayani Gamage: Investigation, Writing-review & editing; Ruvini Kurukulasooriya: Project administration, Writing-review & editing; Madureka Premamali: Project administration, Writing-review & editing; Himali S. Jayasinghearachchi: Investigation, Writing-review & editing; Bradly P. Nicholson: Investigation, Writing-review & editing; Emily R. Ko: Investigation, Writing-review & editing; Ephraim L. Tsalik: Writing-review & editing; Micah T. McClain: Writing-review & editing; Rachel A. Myers: Formal analysis, Supervision, Writing-review & editing; Christopher W. Woods: Conceptualization, Resources, Writing-review & editing; Thomas W. Burke: Investigation, Supervision, Writing-review & editing.

Funding statement

This study was funded by a grant from the National Institute of Allergy and Infectious Diseases (R21 AI163548). Funding for the Duke-Ruhuna Collaborative Research Centre is provided by the Duke Global Health Institute and the Duke Hubert-Yeargan Center for Global Health.

Competing interests

C. W. W. and E. L. T. owned equity in Biomeme during the conduct of the study, and received personal fees from Biomeme. T. W. B. consulted for and owned equity in Biomeme during the conduct of the study. E. L. T., T. W. B., C. W. W., and M. T. M. are listed as inventors of a patent (WO 2017/004390 A1) for bacterial versus viral discrimination (licensed to Biomeme). C. W. W. serves as Biomeme’s Chief Medical Officer. E. L. T. is presently an employee of Danaher Corporation and owns equity. All other authors report no potential conflicts of interest.

Footnotes

Co-first authors.

Co-senior authors.

References

1. Ieven, M, Coenen, S, Loens, K, et al. Aetiology of lower respiratory tract infection in adults in primary care: a prospective study in 11 European countries. Clin Microbiol Infect. 2018;24:1158–1163. doi: 10.1016/j.cmi.2018.02.004.

2. Zaas, AK, Garner, BH, Tsalik, EL, Burke, T, Woods, CW, Ginsburg, GS. The current epidemiology and clinical decisions surrounding acute respiratory infections. Trends Mol Med. 2014;20:579–588. doi: 10.1016/j.molmed.2014.08.001.

3. Dekker, AR, Verheij, TJ, van der Velden, AW. Inappropriate antibiotic prescription for respiratory tract indications: most prominent in adult patients. Fam Pract. 2015;32:401–407. doi: 10.1093/fampra/cmv019.

4. Robinson, J. Colonization and infection of the respiratory tract: What do we know? Paediatr Child Health. 2004;9:21–24. doi: 10.1093/pch/9.1.21.

5. Ito, A, Ishida, T. Diagnostic markers for community-acquired pneumonia. Ann Transl Med. 2020;8:609.

6. Shapiro, NI, Self, WH, Rosen, J, et al. A prospective, multi-centre US clinical trial to determine accuracy of FebriDx point-of-care testing for acute upper respiratory infections with and without a confirmed fever. Ann Med. 2018;50:420–429. doi: 10.1080/07853890.2018.1474002.

7. Bachur, RG, Kaplan, SL, Arias, CA, et al. A rapid host-protein test for differentiating bacterial from viral infection: Apollo diagnostic accuracy study. J Am Coll Emerg Physicians Open. 2024;5:e13167. doi: 10.1002/emp2.13167.

8. Jeffrey, M, Denny, KJ, Lipman, J, Conway Morris, A. Differentiating infection, colonisation, and sterile inflammation in critical illness: the emerging role of host-response profiling. Intensive Care Med. 2023;49:760–771. doi: 10.1007/s00134-023-07108-6.

9. U.S. Food and Drug Administration. 510(k) Substantial Equivalence Determination Decision Summary - TriVerity. 2025.

10. Iglesias-Ussel, MD, O’Grady, N, Anderson, J, et al. A rapid host response blood test for bacterial/viral infection discrimination using a portable molecular diagnostic platform. Open Forum Infect Dis. 2024;12. doi: 10.1093/ofid/ofae729.

11. Pandya, R, He, YD, Sweeney, TE, Hasin-Brumshtein, Y, Khatri, P. A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples. Genome Med. 2023;15:64. doi: 10.1186/s13073-023-01216-0.

12. Yu, J, Peterson, DR, Baran, AM, et al. Host gene expression in nose and blood for the diagnosis of viral respiratory infection. J Infect Dis. 2019;219(7):1151–1161. doi: 10.1093/infdis/jiy608.

13. Landry, ML, Foxman, EF. Antiviral response in the nasopharynx identifies patients with respiratory virus infection. J Infect Dis. 2018;217:897–905. doi: 10.1093/infdis/jix648.

14. Barral-Arca, R, Gómez-Carballa, A, Cebey-López, M, Bello, X, Martinón-Torres, F, Salas, A. A meta-analysis of multiple whole blood gene expression data unveils a diagnostic host-response transcript signature for respiratory syncytial virus. Int J Mol Sci. 2020;21(5):1831. doi: 10.3390/ijms21051831.

15. Andrew, CD, Nooran, AM, Max, H, et al. Metatranscriptomic profiling reveals pathogen and host response signatures of pediatric acute sinusitis and upper respiratory infection. Genome Med. 2025;17(1):22. doi: 10.1186/s13073-025-01447-3.

16. Medrano, PG, Weerasinghe, N, Nagahawatte, A, et al. Prevalence and predictors of antibiotic prescription among patients hospitalized with viral lower respiratory tract infections in Southern Province, Sri Lanka. PLoS One. 2024;19:e0304690. doi: 10.1371/journal.pone.0304690.

17. Clinical and Laboratory Standards Institute. Principles and procedures for blood cultures. 2nd ed. 2022.

18. Clinical and Laboratory Standards Institute. Performance standards for antimicrobial susceptibility testing. 30th ed. 2020.

19. Ko, ER, Henao, R, Frankey, K, et al. Prospective validation of a rapid host gene expression test to discriminate bacterial from viral respiratory infection. JAMA Netw Open. 2022;5:e227299. doi: 10.1001/jamanetworkopen.2022.7299.

20. Tsalik, EL, Henao, R, Montgomery, JL, et al. Discriminating bacterial and viral infection using a rapid host gene expression test. Crit Care Med. 2021;49:1651–1663. doi: 10.1097/ccm.0000000000005085.

21. Vandesompele, J, De Preter, K, Pattyn, F, et al. Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol. 2002;3:research0034.1. doi: 10.1186/gb-2002-3-7-research0034.

22. Benjamini, Y, Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B (Methodol). 1995;57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x.

23. Bioconductor Core Team. Homo.sapiens: annotation package for the Homo.sapiens object. R package version 1.3.1. 2015.

24. Xu, S, Hu, E, Cai, Y, et al. Using clusterProfiler to characterize multiomics data. Nat Protoc. 2024. doi: 10.1038/s41596-024-01020-z.

25. Friedman, JH, Hastie, T, Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010;33:1–22. doi: 10.18637/jss.v033.i01.

26. Youden, WJ. Index for rating diagnostic tests. Cancer. 1950;3:32–35. doi: 10.1002/1097-0142(1950)3:13.0.co;2-3.

27. Carstensen, B, Plummer, M, Laara, E, Hills, M. Epi: A package for statistical analysis in epidemiology. 2022.

28. R Core Team (2021). R: A language and environment for statistical computing [program]. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/

29. Robinson, MD, McCarthy, DJ, Smyth, GK. edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616.

30. Robinson, MD, Oshlack, A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010;11:R25. doi: 10.1186/gb-2010-11-3-r25.

31. Law, CW, Chen, Y, Shi, W, Smyth, GK. voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 2014;15:R29. doi: 10.1186/gb-2014-15-2-r29.

32. Rosenheim, J, Gupta, RK, Thakker, C, et al. SARS-CoV-2 human challenge reveals biomarkers that discriminate early and late phases of respiratory viral infections. Nat Commun. 2024;15:10434. doi: 10.1038/s41467-024-54764-3.

33. Mick, E, Kamm, J, Pisco, AO, et al. Upper airway gene expression reveals suppressed immune responses to SARS-CoV-2 compared with other respiratory viruses. Nat Commun. 2020;11:5854. doi: 10.1038/s41467-020-19587-y.

34. Li, Y, Banerjee, S, Wang, Y, et al. Activation of RNase L is dependent on OAS3 expression during infection with diverse human viruses. Proc Natl Acad Sci U.S.A. 2016;113:2241–2246. doi: 10.1073/pnas.1519657113.

35. Tang, L, Liu, X, Wang, C, Shu, C. USP18 promotes innate immune responses and apoptosis in influenza A virus-infected A549 cells via cGAS-STING pathway. Virology. 2023;585:240–247. doi: 10.1016/j.virol.2023.06.012.

36. Ye, H, Duan, X, Yao, M, et al. USP18 mediates interferon resistance of dengue virus infection. Front Microbiol. 2021;12:682380. doi: 10.3389/fmicb.2021.682380.

37. Saxena, A, Chaudhary, A, Bharadwaj, A, et al. A lung transcriptomic analysis for exploring host response in COVID-19. J Pure Appl Microbiol. 2020;14:1077–1081. doi: 10.22207/JPAM.

38. Hayashi, A, Ohnishi, H, Okazawa, H, et al. Positive regulation of phagocytosis by SIRPβ and its signaling mechanism in macrophages. J Biol Chem. 2004;279:29450–29460. doi: 10.1074/jbc.M400950200.

39. Do, LAH, Pellet, J, van Doorn, HR, et al. Host transcription profile in nasal epithelium and whole blood of hospitalized children under 2 years of age with respiratory syncytial virus infection. J Infect Dis. 2017;217:134–146. doi: 10.1093/infdis/jix519.

40. Li, J, Song, X, Yang, T, et al. A systematic review of antibiotic prescription associated with upper respiratory tract infections in China. Medicine. 2016;95:e3587. doi: 10.1097/MD.0000000000003587.

41. van Houten, CB, Cohen, A, Engelhard, D, et al. Antibiotic misuse in respiratory tract infections in children and adults-a prospective, multicentre study (TAILORED Treatment). Eur J Clin Microbiol Infect Dis. 2019;38:505–514. doi: 10.1007/s10096-018-03454-2.

42. Cheysson, F, Brun-Buisson, C, Opatowski, L, et al. Outpatient antibiotic use attributable to viral acute lower respiratory tract infections during the cold season in France, 2010–2017. Int J Antimicrob Agents. 2021;57:106339. doi: 10.1016/j.ijantimicag.2021.106339.

43. Chanapal, A, Cheng, H-Y, Lambert, H, Cong, W. Antibiotic prescribing and bacterial infection in COVID-19 inpatients in Southeast Asia: a systematic review and meta-analysis. JAC Antimicrob Resist. 2024;6. doi: 10.1093/jacamr/dlae093.

44. Naghavi, M, Vollset, SE, Ikuta, KS, et al. Global burden of bacterial antimicrobial resistance 1990–2021: a systematic analysis with forecasts to 2050. Lancet. 2024. doi: 10.1016/S0140-6736(24)01867-1.

Figure 3. Principal component analysis (PCA) of viral and bacterial infection among patients with lower respiratory tract infection.

Figure 4. (A) Area under the curve (AUC) and discrimination of viral and bacterial lower respiratory tract infection when using nasopharyngeal swab samples and the existing blood-based Biomeme HR-B/V Franklin^TM models. (B) Bacterial model. (C) Viral model. p stands for probability in the figures.

Table 2. Performance metrics of the newly developed nasopharyngeal B/V model and C-reactive protein

Tillekeratne et al. supplementary material

DOI: https://doi.org/10.1017/cts.2025.10191.sm001

File 1.8 MB
