Introduction
Biofilm constitutes a complex and intriguing community of microorganisms characterized by adhesion to surfaces and the secretion of a self-generated extracellular matrix, denoted as a biofilm matrix (Flemming and Wingender Reference Flemming and Wingender2010). This matrix serves as a protective barrier, endowing the biofilm with resilience against antibiotics, immune system responses and various environmental factors (Dufour et al. Reference Dufour, Leung and Lévesque2010). Naturally widespread, biofilms can colonize diverse surfaces (Flemming and Wuertz Reference Flemming and Wuertz2019), including medical implants, water distribution systems and food processing equipment (Galie et al. Reference Galie, García-Gutiérrez, Miguélez, Villar and Lombó2018), thereby presenting challenges such as infections, biofouling and corrosion, making them a prominent subject of scientific investigation (Wang et al. Reference Wang, Christiansen, Mehraeen and Cheng2020).
In recent years, there has been an increasing interest in employing machine learning (ML) techniques in biofilm research (Artini et al. Reference Artini, Papa, Sapienza, Božović, Vrenna, Tuccio Guarna Assanti, Sabatino, Garzoli, Fiscarelli and Ragno2022; Artini et al. Reference Artini, Patsilinakos, Papa, Božović, Sabatino, Garzoli, Vrenna, Tilotta, Pepi and Ragno2018; Papa et al. Reference Papa, Garzoli, Vrenna, Sabatino, Sapienza, Relucenti, Donfrancesco, Fiscarelli, Artini and Selan2020; Patsilinakos et al. Reference Patsilinakos, Artini, Papa, Sabatino, Božović, Garzoli, Vrenna, Buzzi, Manfredini and Selan2019). ML, a subset of artificial intelligence, empowers computer systems to learn and enhance performance through experience, without explicit programming (Lavallin and Downs Reference Lavallin and Downs2021). Through ML algorithms, the copious and intricate datasets derived from biofilm research, encompassing genomics, proteomics and metabolomics data, can be scrutinized to identify pivotal genes, proteins and metabolites associated with biofilm formation and function (Johnson et al. Reference Johnson, Ivanisevic and Siuzdak2016).
The primary aim of this research is to conduct an exhaustive bibliometric analysis of ML’s application in biofilm research, offering insights into the current landscape of the field (Li et al. Reference Li, Tong, Wang, Wang, Zhang, Qian, Liao, Diao, Zhou and Wu2023). Bibliometrics entails a quantitative examination of scientific publications, providing valuable insights into the structure, dynamics and trends of a specific research area (Anwar et al. Reference Anwar, Zhang, Asmi, Hussain, Plantinga, Zafar and Sinha2022; Talafidaryani et al. Reference Talafidaryani, Jalali and Moro2023). By employing bibliographic analysis, one can discern influential publications, authors and institutions in the ML-biofilm domain, along with prevalent research topics and trends (Zhang et al. Reference Zhang, Mao, Crittenden, Liu and Du2017).
Such analysis can also unveil how ML has contributed to advancing our comprehension of biofilm formation, growth and function. The amalgamation of ML and biofilm research has enabled the analysis of extensive and intricate datasets, identifying critical factors influencing biofilm aspects such as formation, growth and function, including environmental influences, genetic composition and metabolic processes. Moreover, ML techniques have potential applications in devising novel strategies for biofilm detection, prevention and control, thereby serving as a valuable tool in addressing challenges associated with biofilm-related issues.
This study will entail a meticulous search of pertinent databases for articles published with the keywords “machine learning biofilm.” Focusing on this relatively recent timeframe will provide insights into the most current research trends. Subsequently, the use of bibliographic analysis tools, such as VOSviewer, will facilitate the examination of identified articles, generating bibliographic networks to unveil crucial information about the field’s structure and dynamics. The analysis will encompass co-authorship analysis, keyword co-occurrence analysis and citation analysis, among other aspects, to pinpoint influential authors, institutions and publications, as well as common research topics and trends.
In summary, this study aims to offer a comprehensive overview of the current status of ML in biofilm research. By identifying key research themes, trends and influential entities in the field, it seeks to enhance understanding of challenges and opportunities linked to the integration of ML and biofilm research. Furthermore, the analysis is poised to guide future research endeavors, contributing to the advancement of knowledge regarding the intricate biological processes involved in biofilm formation, growth and function.
Materials and methods
To comprehensively assess the contemporary research landscape concerning the utilization of ML in the domain of biofilm, an exhaustive search was conducted on the Web of Science. The aim was to ensure the timeliness and relevance of our analysis. In January 2024, the search query “machine learning biofilm” identified a total of 126 articles that met our specified inclusion criteria (AlRyalat et al. Reference AlRyalat, Malkawi and Momani2019; Archambault et al. Reference Archambault, Campbell, Gingras and Larivière2009; Gorraiz and Schloegl Reference Gorraiz and Schloegl2008). For an in-depth exploration of the predominant themes and trends within this body of literature, VOSviewer (version 1.6.17 and 1.6.20), a robust software tool for constructing and visualizing bibliographic networks, was employed (Ji et al. Reference Ji, Zhao, Vymazal, Mander, Lust and Tang2021; Ramírez-Malule et al. Reference Ramírez-Malule, Quinones-Murillo and Manotas-Duque2020; Shah et al. Reference Shah, Lei, Ali, Doronin and Hussain2020). Various bibliographic analyses, encompassing co-occurrence analysis, country/region analysis and institution analysis, were executed (Zhu et al. Reference Zhu, Li, Reng, Wang, Zhang and Wang2020).
The co-authorship analysis focused on discerning the most impactful authors in the intersection of ML and biofilm (Wang et al. Reference Wang, Zhang, Chen, Cao, Zhou and Wu2023). This scrutiny revealed several highly productive authors who have significantly shaped the field. In the co-occurrence analysis, the prevalent research topics and themes within the literature were identified (Xue et al. Reference Xue, Reniers, Li, Yang, Wu and van Gelder2021). The analysis underscored key themes, such as biofilm formation, biofilm detection and the application of ML for identifying and predicting bacterial growth patterns. Furthermore, subtopics within these overarching themes, such as the influence of various environmental factors on biofilm formation and the formulation of ML algorithms for precise prediction of bacterial growth, were also pinpointed (Hashemi et al. Reference Hashemi, Bak, Khan, Hawboldt, Lefsrud and Wolodko2018).
The citation analysis shed light on the most influential publications and highly cited articles in the ML and biofilm research domain (Rickert et al. Reference Rickert, Hayta, Selle, Kouroudis, Harth, Gagliardi and Lieleg2021). Numerous articles garnered substantial citations, indicating their noteworthy impact on the field. Notably, a considerable portion of these highly cited articles centered around the use of ML for predicting biofilm formation and the development of novel algorithms to enhance the comprehension of bacterial growth patterns.
Our bibliographic analysis furnishes valuable insights into the current status of ML applications in the realm of biofilm research (Li et al. Reference Li, Tong, Wang, Wang, Zhang, Qian, Liao, Diao, Zhou and Wu2023). By identifying influential authors, institutions and publications, as well as unveiling prevalent research themes and trends, we enhance our understanding of the challenges and opportunities in this dynamic field of study (Qi et al. Reference Qi, Chen, Hu, Song and Cui2019; Zhang et al. Reference Zhang, Yin, Yang, Man, He, Wu and Lu2020). Moreover, these findings can guide future research endeavors, contributing to the advancement of our comprehension of the intricate biological processes involved in biofilm formation and growth (Colares et al. Reference Colares, Dell’Osbel, Wiesel, Oliveira, Lemos, da Silva, Lutterbeck, Kist and Machado2020; Moura et al. Reference Moura, Tapety, Mobim, Lago, de LobÃ, Leal, Santos and Monte2016).
Results
The results of our analysis reveal a significant and accelerating interest in the application of ML techniques to biofilm research over the past decade. Notably, the number of publications in this intersection has shown a rapid increase, indicating a growing recognition of the potential of ML in advancing biofilm-related studies.
The primary focus of research in this area revolves around biofilm formation, prediction and control, showcasing the diverse applications of ML techniques in addressing crucial aspects of biofilm-related processes. Among the various ML techniques employed, artificial neural networks (de Ramón-Fernández et al. Reference de Ramón-Fernández, Salar-García, Fernández, Greenman and Ieropoulos2020; Lahiri et al. Reference Lahiri, Nag, Sarkar, Dutta and Ray2021; Lesnik and Liu Reference Lesnik and Liu2017) and support vector machines (Li et al. Reference Li, Chen, Zhang, Yu, Chen and Zhang2022; Modak et al. Reference Modak, Lahorkar and Valadi2022; Shengxian et al. Reference Shengxian, Yanhui, Jing and Dayu2012) emerge as the most frequently used, underscoring their effectiveness in biofilm research.
Authorship patterns demonstrate a diverse and influential collaboration across academic and research institutions, as well as industrial and commercial organizations. Similarly, influential institutions contributing to this field encompass universities, research centers and hospitals. Noteworthy publications with high citation rates are centered on the development of ML algorithms for predicting biofilm formation and identifying key genes associated with biofilm formation.
The central portion of Figure 1 visually encapsulates the most significant words in the field, highlighting “machine learning” and “biofilm” as central themes. Additionally, critical terms related to biofilm formation, including “Staphylococcus aureus,” “expression,” “adaptation,” “motility,” “growth” and “virulence,” provide insights into key bacterial processes and study targets in biofilm research. Words associated with machine learning, such as “classification,” “identification,” “algorithm” and “system,” underscore the focus on predicting biofilm formation – a central goal in ML applications in this domain. In summary, Figure 1 offers a comprehensive overview of the essential words in the field, emphasizing their interconnectedness and providing valuable insights into the prominent themes in biofilm research and ML applications.
As illustrated in Figure 2, there exists a widespread and robust global interest in research endeavors related to the subject matter under examination. This heightened global interest has culminated in extensive collaborations among various countries and regions, fostering a dynamic and interconnected landscape of knowledge exchange. The visualization notably underscores the substantial involvement of the United States and China, both emerging as pivotal players with a pronounced level of engagement and substantial contributions to the expanding body of knowledge in this field.
While the United States and China take center stage, it is crucial to acknowledge the active participation of several other countries in shaping the research landscape. Countries such as Canada, the United Kingdom, Japan, India, Spain, Germany, Switzerland, Italy, Sweden, Denmark and South Korea have emerged as significant contributors to the field, each making noteworthy strides in advancing research frontiers. The multifaceted contributions from this diverse set of countries have enriched the field by introducing varied perspectives and methodological approaches, resulting in a more nuanced and comprehensive understanding of the subject matter.
Collaborative endeavors and cross-cultural exchange of ideas have proven instrumental in propelling advancements within this domain. The contributions of each participating country, with its unique research strengths and approaches, have collectively contributed to a richer and more intricate tapestry of knowledge. This synergy has not only accelerated progress but has also fostered a more inclusive and globally informed research community.
As evidenced by the current landscape, the importance of global collaborations in driving innovation and addressing complex challenges is unmistakable. The ongoing trend of international cooperation in research activities is anticipated to persist and even intensify in the future. This collaborative spirit promises to further amplify the impact of research endeavors, offering a collective and diverse approach to tackling the multifaceted dimensions of the subject under investigation. In essence, the interconnected and collaborative nature of global research efforts is poised to play a pivotal role in shaping the future trajectory of advancements in this field.
As delineated by the insights gleaned from Figure 3, a comprehensive analysis of the institutions at the forefront of research in the intersection of ML and biofilm reveals a diverse and influential array of academic and medical research entities, each playing a significant role in shaping and advancing the discourse within this burgeoning field.
The University of Mississippi stands as a standout institution, positioned as a keystone in the research landscape of ML on biofilm. Renowned for its academic prowess and research excellence, the University of Mississippi has consistently demonstrated a commitment to pushing the boundaries of knowledge within this specific domain. Its contributions underscore a dedication to fostering innovation and breakthroughs, establishing it as a pivotal player in the ongoing pursuit of advancements.
Another prominent player in this research arena is the University of Illinois, celebrated for its dedication to cutting-edge research initiatives at the confluence of ML and biofilm studies. With a reputation for rigorous inquiry and a proclivity for innovative approaches, the University of Illinois has solidified its standing as a catalyst for advancements, creating an environment conducive to scholarly endeavors within this field.
The University of California at Los Angeles (UCLA) emerges as a beacon of academic and research excellence, contributing significantly to the vibrancy of research activities within the dynamic field of ML on biofilm. Renowned for its research acumen and unwavering commitment to scientific exploration, UCLA plays a crucial role in furthering our understanding of the nuanced intersections between ML and biofilm studies.
In the medical research sphere, the Southwest Regional Wound Care Center, Bispebjerg Hospital, Rigshospitalet and Emory Children’s Cystic Fibrosis Research and Testing Lab collectively represent a convergence of institutions, each offering a unique and specialized perspective on the dynamic landscape of ML on biofilm. These medical institutions contribute to the holistic understanding of the subject, providing valuable insights and advancements in the field of medical research, particularly in the context of biofilm studies.
The global reach of research is further highlighted by the inclusion of international institutions such as the University of Copenhagen. These institutions add an international dimension to the collaborative efforts, contributing diverse perspectives and methodologies that enrich the global discourse and contribute to the comprehensive understanding of ML applications in biofilm research.
Collectively, this assembly of diverse and influential institutions showcased in Figure 3 underscores the collaborative and multidisciplinary nature of research endeavors at the intersection of ML and biofilm. The inclusion of institutions from various geographical locations and academic disciplines not only enriches the global research community but also underscores the interconnectedness of efforts aimed at addressing the multifaceted challenges inherent in this specific research area.
The institutions highlighted in Figure 3 collectively constitute a nexus of academic and medical excellence in the dynamic realm of ML on biofilm. Their combined endeavors, marked by scholarly rigor and a collaborative spirit, contribute significantly to the vibrancy and dynamism of research activities in this burgeoning field. This collaborative ecosystem emphasizes the importance of a global network of institutions working synergistically to advance our understanding of the subject matter and collectively contribute to the overarching goals of scientific exploration in ML applications for biofilm research.
Discussion
Revolutionizing biofilm management: the integration of machine learning techniques
When studying bacterial biofilms, previous research has primarily relied on experiments and models. These models are categorized into traditional predictive models and the ML models discussed in this paper. In the quest to develop effective strategies for managing bacterial biofilms, scientists have consistently explored the potential of ML models. Table 1 presents a comparison between ML models and traditional models.
Upon scrutinizing Table 2, which delineates recent and seminal papers in the field, it becomes apparent that ML techniques have been extensively employed in the study of biofilms. Notably, a significant study conducted by a group of researchers utilized an ML model to predict the presence of biofilm inhibitory molecules (Srivastava et al. Reference Srivastava, Malwe, Sharma, Shastri, Hibare and Sharma2020). This investigation incorporated a combination of descriptor, fingerprint and hybrid models, achieving impressive accuracy rates of 93%, 88% and 90%, respectively. Furthermore, the software resulting from this study, Molib, has evolved into a widely utilized tool for predicting small molecules with biofilm inhibitory properties. The implications of the success of Molib are particularly promising, offering an opportunity for therapeutic intervention against bacteria capable of forming biofilms.
One notable investigation involved the use of Pseudomonas aeruginosa, a commonly studied model organism, to scrutinize the chemical components of essential oils (EOs) and their potential impact on biofilm formation (Artini et al. Reference Artini, Papa, Sapienza, Božović, Vrenna, Tuccio Guarna Assanti, Sabatino, Garzoli, Fiscarelli and Ragno2022; Artini et al. Reference Artini, Patsilinakos, Papa, Božović, Sabatino, Garzoli, Vrenna, Tilotta, Pepi and Ragno2018). In this study, the researchers employed 11 different classification models (F1–F11) to analyze the data and assess the accuracy of the ML predictions. The results demonstrated that the models achieved prediction accuracies ranging from 69% to 98%, underscoring the efficacy of ML in identifying EO chemical components that may impact biofilm formation. Through their analysis, the authors pinpointed specific EO chemical components potentially influencing bacterial biofilm formation in both positive and negative ways. This insight is crucial for scientists working on developing strategies for the effective management and prevention of potentially harmful biofilms.
Another article discussed the challenges in treating biofilm-associated infections caused by Staphylococcus aureus and Staphylococcus epidermidis (Patsilinakos et al. Reference Patsilinakos, Artini, Papa, Sabatino, Božović, Garzoli, Vrenna, Buzzi, Manfredini and Selan2019). The study investigated the potential of EOs as a treatment option and examined the ability of 89 EOs to influence biofilm production in various bacterial strains. ML algorithms analyzed the chemical compositions of the EOs to evaluate their anti-biofilm potencies and pinpoint the components responsible for biofilm production, inhibition or stimulation.
In another study, EOs were investigated as natural alternatives to chemotherapeutic drugs for inhibiting biofilm in chronic S. aureus infections (Papa et al. Reference Papa, Garzoli, Vrenna, Sabatino, Sapienza, Relucenti, Donfrancesco, Fiscarelli, Artini and Selan2020). A total of 61 EOs were evaluated for biofilm modulation and antibacterial activity. Their chemical composition was analyzed using GC/MS, and ML algorithms were employed to correlate potency with active components. Certain EOs inhibited biofilm growth at a 1.00% concentration and were further characterized for their effects on biofilm organization through scanning electron microscope studies.
Another paper presented a novel computational methodology that combines meta-analysis and ML to identify important genes and pathways in biofilm-forming bacteria (Subramanian and Natarajan Reference Subramanian and Natarajan2021). This approach analyzed gene expression profiles in multiple S. aureus strains and identified 36 potential genes, including 11 newly reported ones. These genes are considered essential for biofilm development and represent a signature target list for designing anti-biofilm therapeutics. The study underscores the value of combining meta-analysis and ML techniques to enhance understanding of biofilm mechanisms and advance effective therapeutic strategies.
Another study developed a machine-learning-aided cocktail assay (Wang et al. Reference Wang, Jiang, Wei, Wang, Wang, Yang, Song and Yuan2022). This study utilized lanthanide nanoparticles with varied properties, integrated into cocktail kits. The physicochemical diversity of biofilms was translated into luminescence intensity, enabling identification of unknown biofilms with an overall accuracy rate surpassing 80% via the random forest algorithm. Antibiotic-loaded cocktail nano-probes effectively eradicated biofilms, demonstrating the technique’s promise as a reliable diagnostic tool for biofilm infections. Moreover, this approach offers a framework for developing assays to detect biochemical compounds beyond biofilms.
Interdisciplinary synergy: biofilm researchers and ML experts shaping the future
Our bibliometric analysis offers a comprehensive panorama of the current landscape of ML in biofilm research. The examination underscores the escalating interest in applying ML techniques to biofilm research and underscores the pivotal role of interdisciplinary collaboration between biofilm researchers and ML experts in propelling innovation within this domain (Hashemi et al. Reference Hashemi, Bak, Khan, Hawboldt, Lefsrud and Wolodko2018).
The predominant research themes in this realm revolved around biofilm formation, prediction and control, signifying the pressing demand for novel strategies to address biofilm-related challenges (Alaoui Mdarhri et al. Reference Alaoui Mdarhri, Benmessaoud, Yacoubi, Seffar, Guennouni Assimi, Hamam, Boussettine, Filali-Ansari, Lahlou and Diawara2022). Notably, artificial neural networks (de Ramón-Fernández et al. Reference de Ramón-Fernández, Salar-García, Fernández, Greenman and Ieropoulos2020; Lahiri et al. Reference Lahiri, Nag, Sarkar, Dutta and Ray2021; Lesnik and Liu Reference Lesnik and Liu2017) and support vector machines (Li et al. Reference Li, Chen, Zhang, Yu, Chen and Zhang2022; Modak et al. Reference Modak, Lahorkar and Valadi2022; Shengxian et al. Reference Shengxian, Yanhui, Jing and Dayu2012) emerged as the most frequently employed ML techniques in biofilm research, suggesting their aptness for deciphering intricate biofilm-related data.
An exploration of authorship and institutional affiliations revealed a diverse array of influential authors and institutions spanning various fields, underscoring the interdisciplinary essence inherent in biofilm research. This diversity assumes paramount importance in fostering innovation within the field.
Future recommendation of using ML in bacterial and biofilm studies
The utilization of big data and ML techniques has witnessed a growing prevalence across diverse fields, encompassing species distribution (Chen and Ding Reference Chen and Ding2022; Gobeyn et al. Reference Gobeyn, Mouton, Cord, Kaim, Volk and Goethals2019), education (Chen and Ding Reference Chen and Ding2023) and cancer prediction (Cammarota et al. Reference Cammarota, Ianiro, Ahern, Carbone, Temko, Claesson, Gasbarrini and Tortora2020). ML offers a platform through which policymakers can adjust policies to better serve the populace. However, despite the abundance of studies on bacteria and biofilm, the application of ML in this domain remains limited, as elucidated in the preceding paragraphs.
Bacteria and biofilm stand as focal points of extensive investigation within various environmental and industrial contexts, such as heavy metal pollutant removal (Ding et al. Reference Ding, Peng, Du, Ji and Cao2014) and microbial fuel cells (Zhao et al. Reference Zhao, Wu, Ding, Wang, Zhang, Kjelleberg, Loo, Cao and Zhang2015). Scientists have developed methods to genetically modify bacterial genes in efforts to either enhance or weaken biological processes, resulting in the formation of stronger or weaker biofilms. Despite these advancements, the underlying mechanisms governing these biological processes remain shrouded in uncertainty (Tribedi et al. Reference Tribedi, Gupta and Sil2015).
Given the diverse applications and significance of bacteria and biofilm research, it becomes imperative to explore the potential advantages of incorporating ML techniques into this realm (Sadeghi et al. Reference Sadeghi, Panahi, Mazlumi, Hejazi, Komi and Nami2022). By harnessing the voluminous data generated through research endeavors, ML stands poised to unravel the intricate interactions and mechanisms inherent in bacterial processes, offering novel insights and predictions (Long et al. Reference Long, Fan, Xu and Liu2022). Furthermore, the development of new ML algorithms specifically tailored to the unique challenges posed by bacterial data holds the promise of delivering more accurate and efficient results (Cordier et al. Reference Cordier, Lanzén, Apothéloz-Perret-Gentil, Stoeck and Pawlowski2019; Long et al. Reference Long, Wang, Cai, Lesnik and Liu2021).
In essence, the integration of ML into bacteria and biofilm research carries the potential to propel our comprehension of these vital biological processes, potentially leading to groundbreaking discoveries and applications across various fields.
Conclusions
Our comprehensive investigation into the integration of ML methodologies in biofilm research reveals a significant surge in interest over the last decade. The bibliographic analysis, encompassing 126 pertinent articles from the Web of Science database, sheds light on prevailing trends, particularly focusing on biofilm formation, prediction and control. Artificial neural networks and support vector machines emerged as the predominant ML techniques. The study underscores the vital role of collaborative efforts between biofilm researchers and ML experts, emphasizing interdisciplinary synergy. The findings provide valuable insights into the current landscape and future trajectories of ML in biofilm research, guiding further exploration and innovation in this dynamic and crucial field.
Data availability statement
Data availability does not apply to this article.
Author contribution
YD conceived and designed the study. YD conducted data gathering. SC performed software. SC performed visualization. YD wrote the article.
Financial support
This research received no specific grant from any funding agency, commercial or not-for-profit sectors.
Competing interests
The authors have no conflicts of interest to declare for this publication.
Ethics statement
Ethical approval and consent are not relevant to this article type.
Comments
No accompanying comment.