Highlights
What is already known?
• Umbrella reviews (URs) aim to integrate and analyze findings from multiple systematic reviews on a given topic.
• URs are a popular type of evidence synthesis.
• Limited understanding exists of the methodologies used to analyze and present the findings of URs.
What is new?
• This cross-sectional study of a sample of 100 URs showed a strong emphasis on synthesizing quantitative data, with only limited examples of URs containing qualitative data located.
• The software used for analysis and presentation is often not reported.
• Many URs neither address primary study overlap nor assess the certainty of the evidence.
Potential impact for Research Synthesis Methods readers
• Reviewers conducting URs should assess primary study overlap, assess the certainty of the evidence, and report the software used for the analysis and presentation of data.
1 Introduction
An umbrella review (UR), also known as an “overview of reviews (ORs),” “systematic review of reviews,” or “meta-review,” aims to integrate and analyze the findings from multiple systematic reviews (SRs) on a given topic.Reference Aromataris, Fernandez, Godfrey, Holly, Khalil and Tungpunkom1 The distinction between a UR and an SR lies in the unit of analysis: SRs are focused at the primary study level, while URs are focused at the level of the SR or other secondary research synthesis.
Umbrella reviews are becoming popular given the explosion of SRs being published annuallyReference Hoffmann, Allers and Rombey2, Reference Smela, Toumi, Świerk, Gawlik, Clay and Boyer3 and the increasing need to synthesize their findings. URs provide a higher-level summary of the body of knowledge on a particular topic and also identify similarities and differences in the evidence base.Reference Aromataris, Fernandez, Godfrey, Holly, Khalil and Tungpunkom1 They can have a much broader scope, depending on the type of question(s) posed, and play a crucial role in informing clinical decision making, policy development, and evidence-based practice.
In terms of conduct, the steps for a UR are similar to those of an SR; however, there are some nuances that relate to the unit of analysis, some of which pertain to how data are analyzed and presented. Several methods for analyzing and presenting the results of URs have been proposed. For example, JBI (formerly the Joanna Briggs Institute) methodological guidance for quantitative URs does not currently recommend reanalysis of the results of SRs using meta-analysis, instead encouraging summary tables with sufficient descriptive detailReference Aromataris, Fernandez, Godfrey, Holly, Khalil and Tungpunkom4 and a narrative report. In contrast, Cochrane methodological guidanceReference Pollock, Fernandes, Becker, Pieper and Hartling5 recommends two methods of data analysis: (1) summarizing data and (2) reanalyzing outcome data in a way that differs from the original analyses conducted in the SRs (e.g., to analyze specific populations or subgroups).
By comparison, attempts to establish methodological guidance for qualitative evidence at the UR level are in their infancy. Although methods to analyze qualitative evidence, such as mega-aggregationReference Hendricks, Eshun-Wilson and Rohwer6 and mega-ethnography,Reference Toye, Seers, Hannink and Barker7 are recommended for answering different review questions, guidance on presentation is more limited, with JBI again recommending a tabular presentation of synthesized findings.Reference Aromataris, Fernandez, Godfrey, Holly, Khalil and Tungpunkom4
Other important issues related to the analysis of data in URs include assessing primary study overlap within the included SRs,Reference Pieper, Antoine, Mathes, Neugebauer and Eikermann8 assessing the methodological quality of SRs, assessing the certainty/confidence of the evidence in SRs, and meeting reporting requirements, with various methods and standards available.Reference Gates, Gates and Pieper9–Reference Whiting, Savović and Higgins12 Despite the increasing prevalence of URs in health research, there is limited understanding of the specific methodologies used for analysis and presentation and their impact on the interpretation of findings.
The JBI Umbrella Review Methodology Group is an international group of methodologists and researchers that reports to the JBI Scientific CommitteeReference Aromataris, Stern and Lockwood13 and is tasked with providing up-to-date guidance and materials on JBI’s approach to URs.Reference Aromataris, Fernandez, Godfrey, Holly, Khalil and Tungpunkom4 The group is currently undertaking a major update of its guidance, with the first piece of work focused on data analysis and presentation methods. To inform this methodological guidance, further investigation into the different methods and approaches used by UR authors is useful. This study aimed to examine the characteristics of the analysis and presentation approaches used in a sample of URs related to health care. By identifying common trends and methodological choices, this research seeks to assess the transparency, consistency, and utility of URs in the healthcare domain.
2 Research question
What are the characteristics of analysis and presentation methods used in URs informing health care?
3 Method
The study was undertaken in accordance with a protocol registered in Open Science Framework (OSF) (https://osf.io/a9eu6). It followed the methodology of epidemiological studies, where a random sample of studies was selected to address the research question.Reference Khalil and Munn14 The steps undertaken included defining inclusion and exclusion criteria, developing a search strategy to identify eligible URs, screening and selection of search results, and data extraction and analysis.
3.1 Search strategy
The search for URs was conducted in PubMed (JL), covering the period March 5, 2023 to March 5, 2024. PubMed was selected as the sole database to answer the research question because it provides extensive coverage of the biomedical literature relevant to this study, including the majority of key healthcare journals. This 1-year time period was chosen to ensure the feasibility and currency of the project. As there are no subject headings, subsets, or search filters for URs currently available in PubMed, the search strategy developed by Pieper and colleaguesReference Pieper, Antoine, Mathes, Neugebauer and Eikermann8 was adapted in consultation with a librarian (RJ) (see Appendix A).
3.2 Inclusion and exclusion criteria
Umbrella reviews with a focus on health were included. Reviews were restricted to the English language due to team capacity. For the purpose of this project, the focus was exclusively on URs that contained SRs or meta-analyses as their unit of analysis; reviews including scoping reviews, literature reviews, and primary studies in addition to SRs were excluded. Methodological reviews were also excluded.
3.3 Sample size
This cross-sectional study aimed to include 100 URs which were randomly selected from the search results using a random number generator.Reference Furey15 The sample size was agreed upon by the project team as it was considered sufficient to address the research question while remaining feasible with the available resources.
3.4 Screening and selection
Search results were imported into Excel and screened at title and abstract level by two independent screeners (JL and JD), with any disagreements resolved by a third reviewer (CS). A random number generator was then used to select 100 records, which were imported into Covidence (an online platform used to conduct systematic reviews). It was anticipated that some records might not meet the inclusion criteria at full-text review, so to reach the agreed sample size, an additional 20 records were randomly selected and set aside for use following full-text review if required. Full-text screening was undertaken by five screeners (CS, RF, JL, PB, and HK), and a pilot on five records was conducted to ensure consistency between screeners.
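As an illustration of this sampling step, a minimal sketch in Python is given below; the record identifiers and fixed seed are hypothetical (the study itself used an online random number generatorReference Furey15):

```python
import random

# Hypothetical identifiers for the 728 records that passed
# title and abstract screening.
screened_records = [f"PMID_{i:04d}" for i in range(1, 729)]

random.seed(2024)  # fixed seed for reproducibility (illustrative choice)

# Draw 120 records at once without replacement, then split into the main
# sample of 100 and the reserve of 20; a single draw guarantees the
# reserve never overlaps the main sample.
draw = random.sample(screened_records, k=120)
main_sample, reserve = draw[:100], draw[100:]

print(len(main_sample), len(reserve))  # -> 100 20
```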
3.5 Data extraction and analysis
A data extraction form (Appendix B) was created and revised following group discussion. A pilot test was undertaken to assess the robustness of the data extraction tool, and relevant changes were subsequently made. Eight URs were included in the pilot phase, two more than originally specified in the review protocol. This modification was introduced to accommodate additional study team members, with each pair piloting two different papers. Data focusing on bibliographical information, descriptive characteristics of the URs (e.g., methodological and reporting guidance followed, critical appraisal tool used), and analysis and presentation methods (e.g., certainty of evidence method used and software used) were extracted. While the protocol specified that all extractions would be undertaken in duplicate, it was decided that extractions would be conducted by one author, with a sample of extractions validated by another author (JL or CS), due to the type of data being extracted. Covidence was employed to manage the extraction process. In total, 20 extractions were verified, representing 20% of the included reviews. Any disagreements were addressed by discussion.
A descriptive analysis of the results, including tabular presentation and narrative, was conducted. Where appropriate, frequency distributions were presented, with percentages calculated and rounded to the nearest whole number. The mean, standard deviation, and median were calculated for the number of reviews included in the URs.
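As a worked illustration of these summary statistics, a minimal sketch using Python's standard library; the counts below are placeholders, not study data:

```python
import statistics

# Placeholder counts of SRs per UR (illustrative values only).
reviews_per_ur = [4, 9, 12, 16, 21, 38, 57, 175]

mean = statistics.mean(reviews_per_ur)
sd = statistics.stdev(reviews_per_ur)       # sample standard deviation
median = statistics.median(reviews_per_ur)
print(f"mean = {mean:.1f}, SD = {sd:.1f}, median = {median}")

# Percentages rounded to the nearest whole number, as in the tables.
n_with_feature, n_total = 30, 100
print(f"{round(100 * n_with_feature / n_total)}%")  # -> 30%
```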
4 Results
4.1 Study inclusion
The PubMed search was conducted on April 30, 2024, identifying 1183 records, whose titles and abstracts were screened for eligibility. Following screening, 728 records remained; using the random number generator,Reference Furey15 100 records were selected and screened at full text. Of those, 12 were excluded (11 did not exclusively include SRs or meta-analyses as their unit of analysis and one was not a UR), so a further 12 records were selected from the additional set of 20 records (described previously). One hundred URs were included in the study (see Figure 1). A table of UR characteristics is presented in Supplementary Material 1, a summary table of the results is presented in Supplementary Material 2, and the citation details of the reviews are included in Supplementary Material 3.

Figure 1 Study identification and inclusion flow chart.
4.2 Review nomenclature
Seventy-one umbrella reviews were published in 2023 and 29 in 2024. Topics were diverse across the health spectrum. Most included reviews referred to themselves as an “umbrella review” (n = 65, 65%), followed by an “overview” (n = 30, 30%). Seven reviews (7%) used variations of the term “systematic review” (e.g., systematic review of systematic reviews, systematic review of reviews, systematic review of meta-analyses), and two reviews (2%) used the term “rapid review” (e.g., rapid review of reviews, rapid review of systematic reviews). Some reviews used multiple terms within the article.
4.3 Framework for question formulation
The majority of URs (n = 66, 66%) referred either to a published or publicly available protocol or to the UR being registered. Over half of the included URs utilized a PICO format (Population, Intervention, Comparator, Outcome) for their question development and/or eligibility criteria (n = 43 [43%] PICO, n = 14 [14%] PICOS [Study design or Setting], n = 1 [1%] PICOTS [Time]). Eight percent of URs (n = 8) used a version of the Population, Exposure, Comparator, Outcome (PECO) format (PECO [n = 1, 1%], PECOS [Study design] [n = 1, 1%], PEO [n = 6, 6%]), while four URs (4%) used PICo (Population, phenomenon of Interest, Context). Two URs (2%) used PCC (Population, Concept, Context) and one UR (1%) used SPIDER (Sample, Phenomenon of Interest, Design, Evaluation, Research Type). Population, Outcomes, and Comparator were used once (1%), as were Population and Phenomena of interest (1%). A quarter of URs did not explicitly specify the framework used (n = 25, 25%).
4.4 Review composition
The number of SRs included in 99 URs ranged from 4 to 175 with a mean of 24.3 (standard deviation [SD] = 24.5) and a median of 16. One UR did not report the number of SRs but instead provided the total number of RCTs included in all SRs in the UR.
The majority of the URs included quantitative SRs (n = 98, 98%), either exclusively or alongside qualitative or mixed method SRs. Seven percent of URs (n = 7) also included meta-analyses, while one (1%) also included a network meta-analysis.
Of the URs that contained qualitative evidence, most drew it from mixed methods SRs (n = 7, 7%), with only two URs (2%) located that exclusively included qualitative SRs. Additionally, one UR (1%) included narrative reviews.
Considering the designs of primary studies included in the SRs, the most frequently mentioned were randomized controlled trials (RCTs) in 68% of URs (n = 68). This was followed by cohort studies and case–control studies, included in 36 (36%) and 26 (26%) of URs, respectively. Additionally, quasi-experimental studies and cross-sectional studies were reported in 24% (n = 24) and 20% (n = 20) of the URs, respectively. Other designs included in the SRs were case series/reports (n = 12, 12%), prevalence studies (n = 3, 3%), and mixed methods studies (n = 4, 4%). Six URs (6%) included qualitative studies. Notably, 12 URs (12%) did not mention what study designs were included, representing a lack of methodological detail.
4.5 Methodological and reporting guidance
In 55 URs (55%), there was no mention of methodological guidance followed. Cochrane methodological guidanceReference Pollock, Fernandes, Becker, Pieper and Hartling5 was referred to in 32 URs (32%), while 14 used JBI guidanceReference Aromataris, Fernandez, Godfrey, Holly, Khalil and Tungpunkom4 (14%). One UR cited both Cochrane and JBI methodological guidance (1%), while three reviews cited Cochrane guidance and other sources (3%).
In terms of reporting guidance, the majority of reviews stated they followed the Preferred Reporting Items for Systematic Reviews and Meta-AnalysesReference Page, McKenzie, Bossuyt, Boutron, Hoffmann and Mulrow16 (PRISMA) reporting standards (n = 73, 73%), followed by the Preferred Reporting Items for Overviews of ReviewsReference Gates, Gates and Pieper9 (PRIOR) (n = 15, 15%) and the Preferred Reporting Items for Overviews of systematic reviews including harms checklistReference Bougioukas, Liakos, Tsapas, Ntzani and Haidich17 (PRIO-harms) (n = 3, 3%). Other guidance mentioned included Enhancing Transparency in Reporting the Synthesis of Qualitative ResearchReference Tong, Flemming, McInnes, Oliver and Craig18 (ENTREQ), the PRISMA extension for diagnostic test accuracyReference McInnes, Moher and Thombs19 (PRISMA-DTA), the Brief Review Checklist (BRC), and the PRISMA extension for scoping reviewsReference Tricco, Lillie and Zarin20 (PRISMA-ScR), each mentioned by one review (1%). Sixteen percent of reviews (n = 16) made no mention of reporting guidance.
4.6 Primary study overlap
Primary study overlap refers to two or more systematic reviews or meta-analyses included in a UR sharing one or more of the same primary studies. Study overlap is a unique methodological challenge in URs and can impact validity and reliability. Primary study overlap was reported in 48 included URs (48%): 21 URs used corrected covered area calculationsReference Pieper, Antoine, Mathes, Neugebauer and Eikermann8 (21%), 16 provided a narrative description (16%), and 15 presented a tabular display (15%). The Graphical Representation of Overlap for OVErviewsReference Pérez-Bracchiglione, Meza and Bangdiwala21 (GROOVE) was mentioned in two URs, and 14 URs (14%) included multiple formats. Of those URs where the overlap of the primary studies was considered and where multiple outcomes were reported, 13 reviews (13%) reported the overlap for individual outcomes.
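To make the corrected covered area (CCA) calculation concrete, the following is a minimal sketch of the formula described by Pieper and colleagues, applied to a hypothetical citation matrix (rows are primary studies, columns are SRs); the matrix values are illustrative only:

```python
# Corrected covered area (CCA), following Pieper et al. (2014):
#   CCA = (N - r) / (r * c - r)
# where N = total primary-study occurrences across all SRs,
#       r = number of unique primary studies (matrix rows),
#       c = number of SRs (matrix columns).
# Commonly interpreted as: 0-5% slight, 6-10% moderate,
# 11-15% high, >15% very high overlap.

def corrected_covered_area(citation_matrix: list[list[int]]) -> float:
    r = len(citation_matrix)                      # unique primary studies
    c = len(citation_matrix[0])                   # systematic reviews
    n = sum(sum(row) for row in citation_matrix)  # total occurrences
    return (n - r) / (r * c - r)

# Hypothetical matrix: 4 primary studies across 3 SRs (1 = study included).
matrix = [
    [1, 1, 0],
    [1, 0, 1],
    [0, 1, 0],
    [1, 1, 1],
]
print(f"CCA = {corrected_covered_area(matrix):.1%}")  # (8-4)/(12-4) = 50.0%
```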
4.7 Critical appraisal
Critical appraisal is the process of assessing the methodological quality of the systematic reviews or meta-analyses included in a UR and is considered an essential step in the review process. Almost all URs (n = 97, 97%) undertook critical appraisal of the included SRs. The most common instrument used was AMSTAR-2 (A MeaSurement Tool to Assess systematic Reviews)Reference Shea, Reeves and Wells11 (n = 69, 69%), followed by the JBI toolReference Aromataris, Fernandez, Godfrey, Holly, Khalil and Tungpunkom4 (n = 10, 10%), ROBISReference Whiting, Savović and Higgins12 (n = 10, 10%), and the original AMSTARReference Shea, Grimshaw and Wells22 (n = 8, 8%). Other URs referred to various tools, including the SANRA (Scale for the Assessment of Narrative Review Articles)Reference Baethge, Goldbeck-Wood and Mertens23 criteria for narrative reviews, the Health Evidence checklist for systematic reviews,24 and the Cochrane Risk of Bias toolReference Jørgensen, Paludan-Müller and Laursen25 (n = 1, 1% each).
Only four URs (4%) excluded SRs based on methodological quality. The threshold or criteria used for this decision varied but related to excluding reviews considered to be of low or very low methodological quality. Of these, three used the JBI tool, while one used AMSTAR-2. For the three URs using the JBI tool, the following details were provided:
• One UR excluded low-quality reviews, with “low” defined as a score below 40%Reference Sufrate-Sorzano, Pérez and Juárez-Vela26
• One UR excluded two papers whose quality level was lower than 40%Reference Sufrate-Sorzano, Pérez and Juárez-Vela26
• One UR planned to exclude studies that did not meet a minimum score of 4Reference Reis, Gaspar, Paiva, Sousa and Machado27
The UR that used AMSTAR-2 stated that “3 were rejected due to very low confidence according to AMSTAR 2.”Reference Babot-Pereña and Blanco-Blanco28
4.8 Data analysis
All URs containing quantitative data analyzed those data narratively. Of the 98 URs (98%) that analyzed quantitative data, 30 (30%) also included meta-analysis, 27 (27%) utilized basic descriptive analysis, and 3 (3%) used some form of data categorization.
Of the seven URs (7%) that included qualitative data (from either qualitative or mixed methods reviews), all seven (7%) analyzed data narratively, three (3%) included a method of qualitative synthesis, and one each reported basic descriptive analysis (1%) and a process of categorization (1%).
Of the 100 included URs, 27 (27%) referred to using software to analyze or present data. Tools used included Stata (n = 12), GRADEpro GDT (n = 5), MS Excel (n = 5), RevMan (n = 5), R (n = 3), Meta-umbrella (n = 2), and Comprehensive Meta-Analysis (n = 2). Other software, including NVivo, SPSS, Cytoscape, MetaXL, VOSviewer, and STATISTICA, was mentioned once each (n = 1).
4.9 Data presentation
While all URs including quantitative data provided narrative text to describe their results, eight (8%) exclusively used text for presentation. To support the presentation of results, the most popular method was tabular display (n = 76, 78%), followed by forest plots (n = 29, 30%) and Summary of Findings tables (n = 18, 18%). Other approaches included figures (n = 5, 5%), the JBI stoplight table (n = 3, 3%), and bar charts (n = 3, 3%), and one UR each used line graphs, heat maps, evidence maps, funnel plots, and radar plots (1%). Over a third of URs used multiple presentation formats in addition to text (n = 41, 41%).
Of the eight URs containing qualitative data, five (63%) utilized tabular display to present their results, two used Summary of Findings tables (25%), and two used figures (25%).
The majority of URs presented between one and five tables or figures. A small number of URs (n = 2, 2%) reported no tables or figures, while 7% (n = 7) included only one. A larger portion (n = 14, 14%) included two tables or figures, and 19% (n = 19) included three. URs most commonly used four (n = 17) or five (n = 20) tables or figures (37% combined), with nine (9%) reporting six. Using more than six tables or figures was rare, with only seven URs (7%) including seven or more, including one UR (1%) with 32 tables or figures.
4.10 Data presented in supplementary materials
Of the 50 (50%) URs that included Supplementary Materials, 23 (46%) provided data directly related to the analysis of results. Among these, 16 (32%) were associated with the main analyses, encompassing meta-analyses of primary data (n = 3, 6%), unspecified main analyses (n = 1, 2%), robust variance estimation (RVE) techniques (n = 1, 2%), statistical test results and reported associations (n = 2, 4%), subgroup analyses (n = 2, 4%), and narrative synthesis (n = 2, 4%). Presentation of results was also common, with additional forest plots (n = 7, 14%), funnel plots (n = 1, 2%), and other visual outputs such as figures, charts, or detailed tables (n = 5, 10%). Additionally, seven (14%) URs included assessments of overlap within their Supplementary Materials, while five (10%) provided data related to supplementary analyses, such as sensitivity analyses (n = 2, 4%) and other forms of additional analysis not featured in the main text (n = 3, 6%).
4.11 Certainty assessment
Certainty assessment (confidence assessment is the term often used for qualitative research) refers to determining how trustworthy and applicable a body of evidence is. Just over half of the URs did not assess the certainty of the evidence (n = 52, 52%). Of those that did assess certainty (48%), most used the Grading of Recommendations Assessment, Development, and EvaluationReference Schünemann, Brożek, Guyatt and Oxman29 (GRADE) approach (n = 39, 39%). Other approaches included Confidence in the Evidence from Reviews of Qualitative researchReference Lewin, Booth and Glenton30 (CERQual) (n = 4, 4%) and NutriGradeReference Schwingshackl, Knüppel and Schwedhelm31 (n = 2, 2%). Eight URs (8%) referred to other individual methods, with some referencing other documents/criteria (e.g., the SIGN Scale), while others appeared to create their own criteria for assessment (e.g., The overall confidence [high, moderate, low, and critically low] was graded as follows—high: no or one noncritical weakness; moderate: more than one noncritical weakness; low: one critical flaw with or without noncritical weaknesses; and critically low: more than one critical flaw with or without noncritical weaknesses).
5 Discussion
The application and use of umbrella/overview methods by review authors are diverse, with many details related to key UR elements often not reported. In terms of the nomenclature used to describe the review methodology, “umbrella review” was by far the most common term. This reflects the growing recognition of URs as a distinct and important methodology within the evidence synthesis family.Reference Aromataris, Stern and Lockwood13
Our dataset predominantly consisted of URs that included quantitative evidence, which was evident in the designs of the primary studies (RCTs, cohort studies, and case–control studies being the most common) and the use of a PICO framework for formulating research questions. For URs containing quantitative data, some authors undertook meta-analyses; Cochrane guidance does support reanalyzing outcome data in a way that differs from the original analyses, however, it was not within the scope of this project to investigate the appropriateness of these analyses, and further research is required. Few URs included qualitative (or mixed methods) evidence, suggesting a continued reliance on statistical synthesis for evidence-based decision making. As URs evolve, integrating qualitative research to capture contextual and experiential insights will enhance the applicability of findings.
The use of narrative synthesis, meta-analysis, and basic descriptive analysis in URs highlights the importance of clear and structured methods for synthesizing quantitative evidence. Most URs analyzed data narratively and presented it in tabular form, which aligns with JBI guidanceReference Aromataris, Fernandez, Godfrey, Holly, Khalil and Tungpunkom4 and Cochrane guidance.Reference Pollock, Fernandes, Becker, Pieper and Hartling5 The prevalence of tabular displays, forest plots, and Summary of Findings tables that align with GRADE methodologyReference Schünemann, Higgins, Vist, Glasziou, Akl and Skoetz32 emphasizes the need for effective presentation of complex data in ways that enhance translation. A small proportion, however, presented results as text only. For qualitative and mixed methods data, the predominant use of narrative synthesis and categorization reflects standard practices in qualitative SRs. Importantly, less than a third of reviews mentioned the software used for analysis and presentation of results, with Stata the most commonly cited tool. While software tools are widely used, there is still a lack of consistent reporting on their application in URs.
More than half of the reviews did not address primary study overlap, highlighting a critical gap in methodological transparency; those that did used different approaches, such as the corrected covered area or citation matrices. Primary study overlap is a distinct challenge in URs, as it can lead to the overrepresentation of certain studies, thereby distorting effect estimates and reducing the reliability of synthesized findings.Reference Fernandez, Sharifnia and Khalil33 Ways to manage study overlap have been established (including frameworks such as Methods for Overviews of Reviews [MOoR],Reference Lunny, Brennan, McDonald and McKenzie34 which details the stages of a UR where overlap can be managed and outlines methods to do so); however, methods are still developing.Reference Bracchiglione, Meza and Pérez-Carrasco35, Reference Kirvalidze, Abbadi, Dahlberg, Sacco, Calderón-Larrañaga and Morin36 Of the URs that did consider overlap, most did not report it for individual outcomes, indicating that further standardization in how overlap is reported may be needed. Future methodological advancements should focus on developing standardized reporting frameworks and best-practice guidelines to ensure the reliability of data synthesis in URs.
Over half of the URs did not assess the certainty of the evidence, despite this being a key component of SR methodology as outlined in both the PRIOR statementReference Gates, Gates and Pieper9 and the PRISMA statement.Reference Page, McKenzie, Bossuyt, Boutron, Hoffmann and Mulrow16 For those that did assess certainty, the GRADE framework was most commonly used, however, it is important to note that at present there is no formal guidance from GRADE on how to assess certainty in URs.Reference Stern, Munn and Barker37 This finding indicates a gap in methodological guidance that could benefit from further development to ensure that URs consistently incorporate robust certainty assessments.
Over half of the included URs did not explicitly state which methodological conduct guidance they followed, but more than three quarters reported adhering to reporting standards, such as PRISMA.Reference Page, McKenzie, Bossuyt, Boutron, Hoffmann and Mulrow16 The lack of explicit adherence to methodological conduct guidance suggests ongoing variability in research practices, evident in this study, which potentially impacts the reliability and comparability of findings. This variability underscores the need for clearer implementation of methodological standards to enhance consistency and credibility in UR conduct and reporting.Reference Page, McKenzie, Bossuyt, Boutron, Hoffmann and Mulrow16
The substantial variability in methodologies, reporting practices, and analytical approaches highlights the need for clear, standardized guidance for researchers conducting URs. Establishing comprehensive methodological frameworks will enhance consistency, transparency, and the overall quality of evidence synthesis. The findings of this study provide a crucial foundation for identifying key gaps and areas for improvement, guiding the development of standardized protocols and best practices. By addressing these inconsistencies, future research can ensure more reliable and comparable results, ultimately strengthening the impact of URs in evidence-based decision making.
5.1 Study limitations
This study only included URs published in English, and there may be URs available in other languages that meet our inclusion criteria and were thus missed. The study was limited exclusively to URs related to health care, with only one database searched, which means there may be characteristics used across other disciplines that were not identified. We restricted inclusion to URs that exclusively contained SRs or meta-analyses as their unit of analysis, and it is important to note that the sample may not represent the diversity of URs across different disciplines in health. Not all extractions were undertaken in duplicate, which could have introduced errors in data extraction. Finally, the study did not explore the influence of potential conflicts of interest or funding sources, which could affect the design and reporting of URs.
6 Conclusion
This is the first in a series of methodological projects being undertaken to inform the update of the JBI UR methodological guidance. This study provides valuable insights into the methodologies, methods, and characteristics of URs within health care. The findings reveal a strong emphasis on synthesizing quantitative data, with a growing interest in integrating qualitative and mixed methods evidence. Adherence to established reporting frameworks, such as PRISMA, is generally high, though adherence to conduct guidance, such as that from Cochrane and JBI, is less consistently reported. The study also highlights that many elements of methodology are not being reported in URs, with key areas for improvement including standardized reporting of study overlap and more consistent certainty assessments. These results underscore the evolving nature of URs and the need for further refinement of their methodological practices.
Supplementary material
To view supplementary material for this article, please visit http://doi.org/10.1017/rsm.2025.10040.
Acknowledgments
The authors acknowledge Jian Du (JD) for his work in screening.
Author contributions
CS: conceptualization; methodology; data curation; project administration; writing—original draft preparation. JL and EA: conceptualization; methodology; writing—review and editing. JS: data curation; writing—review and editing. HK, KS, RJ, PB, and RF: methodology; writing—review and editing.
Competing interest
The authors declare that no competing interests exist.
Data availability statement
The authors confirm that the data supporting the findings of this study are available within the article. The raw data are provided as Supplementary Material.
Funding statement
The authors declare that no specific funding has been received for this article.
Appendix A: Search strategy
Adapted from Pieper et al. (2014) to fit the PubMed search method
A.1 PubMed
(((((((((((((((((((umbrella review[Title]) OR (meta review[Title])) OR (metareview[Title])) OR ((overview[Title]) AND ((reviews[Title/Abstract]) NOT (reviews[Title])))) OR ((overview[Title]) AND (reviews[Title]))) OR ((overview[Title]) AND ((meta analyses[Title/Abstract]) NOT (meta analyses[Title])))) OR ((overview[Title]) AND ((metaanalyses[Title/Abstract]) NOT (metaanalyses[Title])))) OR ((overview[Title]) AND (meta analyses[Title]))) OR ((overview[Title]) AND (metaanalyses[Title]))) OR (((meta analyses[Title/Abstract]) NOT (meta analyses[Title])) AND (review[Title]))) OR (((metaanalyses[Title/Abstract]) NOT (metaanalyses[Title])) AND (review[Title]))) OR (((meta analyses[Title])) AND (review[Title]))) OR ((metaanalyses[Title]) AND (review[Title]))) OR ((review[Title]) AND (reviews[Title]))) OR ((metaanalyses[Title]) AND (synthesis[Title]))) OR ((meta analyses[Title]) AND (synthesis[Title]))) OR ((reviews[Title]) AND (synthesis[Title]))) OR ((metaanalyses[Title]) AND (summary[Title]))) OR ((meta analyses[Title]) AND (summary[Title]))) OR ((reviews[Title]) AND (summary[Title])) Filters: from 2023/3/5–2024/3/5.
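For readers who wish to rerun this search programmatically rather than through the PubMed web interface, a sketch using Biopython's Entrez module is given below. This is an assumption of convenience (the study searched PubMed directly), and the abbreviated term stands in for the full strategy above:

```python
from Bio import Entrez  # pip install biopython

Entrez.email = "your.name@example.org"  # NCBI requires a contact address

# Abbreviated query for illustration; substitute the full strategy above.
term = "(umbrella review[Title]) OR (meta review[Title]) OR (metareview[Title])"

handle = Entrez.esearch(
    db="pubmed",
    term=term,
    datetype="pdat",      # filter on publication date (assumed here)
    mindate="2023/03/05",
    maxdate="2024/03/05",
    retmax=2000,          # large enough to return all matching IDs
)
result = Entrez.read(handle)
handle.close()

print(result["Count"], "records matched;", len(result["IdList"]), "IDs retrieved")
```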
Appendix B: Data extraction form
