Hostname: page-component-7857688df4-vhqbq Total loading time: 0 Render date: 2025-11-13T07:29:33.885Z Has data issue: false hasContentIssue false

Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents

Published online by Cambridge University Press:  27 October 2025

Enoch Kordjo Azasu*
Affiliation:
Social Work, University at Buffalo, Buffalo, NY, USA
Rights & Permissions [Opens in a new window]

Abstract

Background. Despite the growing recognition of adolescent suicide as a pressing concern, traditional methods for identifying suicide risk often fail to capture the complex interplay of socio-ecological and psychological factors. The advent of machine learning (ML) offers a transformative opportunity to improve suicide risk prediction and intervention strategies. Objective. This study aims to utilize ML techniques to analyze socio-ecological and psychological risk factors to predict suicide ideation, plans and attempts among a nationally representative sample of Ghanaian adolescents. Methods. A cross-sectional survey was conducted with 1,703 adolescents aged 12–18 years across Ghana measuring psychological factors (depression symptoms, anxiety symptoms etc) and socio-ecological factors (bullying, parental support etc) using validated measures. Descriptive statistics were conducted and random forest and logistic regression models were employed for suicide risk prediction, i.e., ‘ideation, plans and attempts’. Model performance was evaluated using accuracy, sensitivity, specificity and feature importance analysis. Results. Psychological factors such as depression symptoms (r = .42, p < .01), anxiety (r = .38, p < .01) and perceived stress (r = .35, p < .01) were the strongest predictors of suicide ideation, plans and attempts, while parental support emerged as a significant protective factor (r = −.34, p < .01). The random forest model demonstrated good predictive performance (accuracy = 78.3%, AUC = 0.81). Gender differences were observed. Conclusions. This study is the first to apply ML techniques to a nationally representative dataset of Ghanaian adolescents for suicide risk prediction, i.e., ‘ideation, plans and attempts’. The findings highlight the potential of ML to provide precise tools for early identification of at-risk individuals.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use and/or adaptation of the article.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

Impact statement

This article sheds light on the urgent issue of adolescent suicide in low- and middle-income countries (LMICs) like Ghana, where mental health support is often limited. By harnessing machine learning (ML), our research offers a groundbreaking approach to identifying young people at risk of suicide, moving beyond traditional methods that may miss critical warning signs. ML tools can help schools, community leaders and health workers quickly identify at-risk youth, even in areas with few trained professionals, enabling timely interventions that save lives. Moreover, this study advocates for the integration of technology with human expertise, ensuring that ML supports rather than replaces compassionate care. Our findings also emphasize the need for culturally tailored strategies, ensuring that solutions resonate with the unique social and emotional realities of Ghanaian adolescents. Ultimately, our work aims to reduce the tragic loss of adolescent lives to suicide, fostering hope and resilience in communities where such support is most needed.

Introduction

Suicide among adolescents is a pressing global public health issue, contributing significantly to mortality and morbidity during a critical developmental stage (Wasserman et al., Reference Wasserman, Cheng and Jiang2005). Adolescents are particularly vulnerable to suicidal ideation due to the interplay of psychosocial stressors, rapid developmental changes and limited access to mental health resources (Azasu et al., Reference Azasu, Quarshie, Messias, Larnyoh, Ali and Joe2024). Globally, suicide ranks as the second leading cause of death among individuals aged 10–24 years, underscoring the urgent need for evidence-based prevention strategies (WHO, 2020).

In Ghana, the burden of adolescent suicide is increasingly being recognized, with one out of five adolescents experiencing suicide ideation in the last 12 months (Azasu et al., Reference Azasu, Quarshie, Messias, Larnyoh, Ali and Joe2024). Ghana is faced with a high prevalence of suicidal behaviors among high school students, with 18.2%, 22.5% and 22.2% for suicidal ideation, suicidal plans and suicidal attempts, respectively (Oppong Asante et al., Reference Oppong Asante, Kugbey, Osafo, Quarshie and Sarfo2017; Azasu and Joe, Reference Azasu and Joe2023). Studies have found psychological and socio-ecological risk factors for suicide among these adolescents (Oppong Asante et al., Reference Oppong Asante, Kugbey, Osafo, Quarshie and Sarfo2017; Azasu et al., Reference Azasu, Quarshie, Messias, Larnyoh, Ali and Joe2024). These studies suggest that adolescents in Ghana face a complex interplay of psychological and socio-ecological challenges, including depression, anxiety, bullying and limited family support, which contribute to suicide ideation, plans and attempts. Despite the growing recognition of adolescent suicide as a critical issue, mental health resources in Ghana remain scarce, and traditional approaches to identifying suicide risk often fall short in addressing the multifaceted nature of this problem (Roberts et al., Reference Roberts, Mogan and Asare2014). These conventional methods, such as linear statistical models and manual clinical assessments, are not only limited in their ability to capture the complex interactions between risk factors but are also impractical for large-scale implementation in resource-constrained settings.

The advent of machine learning (ML) presents a transformative opportunity to address these limitations. Unlike traditional methods, ML techniques excel at analyzing large, complex datasets, identifying hidden patterns and modeling nonlinear relationships between variables (Rubinger et al., Reference Rubinger, Gazendam, Ekhtiari and Bhandari2023). These capabilities make ML particularly well suited for predicting suicide risk in Ghana, where multiple psychological and socio-ecological factors interact in dynamic and often nonlinear ways.

This study leverages a nationally representative dataset of nearly 2,000 Ghanaian adolescents to apply ML techniques for suicide risk prediction. By analyzing both psychological factors (e.g., depression symptoms, anxiety symptoms) and socio-ecological factors (e.g., bullying victimization, family support), this research aims to develop culturally relevant, data-driven insights to inform targeted and efficient interventions. This study is innovative in its application of ML to a nationally representative dataset of Ghanaian adolescents for suicide risk prediction, marking the first such effort in this context. By combining advanced computational methods with culturally specific data, the research addresses a critical gap in adolescent mental health assessment in LMICs.

Theoretical framework

This study is guided by the Socio-Ecological Model (SEM), which emphasizes the dynamic interplay between individual, relational, community and societal factors in shaping health behaviors and outcomes (Krug et al., Reference Krug, Mercy, Dahlberg and Zwi2002). The SEM provides a comprehensive framework for understanding adolescent suicidal ideation by considering how multiple layers of influence interact to increase or mitigate risk.

At the individual level, factors such as personal experiences of bullying victimization, food insecurity and emotional distress play a critical role in shaping mental health outcomes (Acquah et al., Reference Acquah, Wilson and Doku2014). These individual risk factors are compounded or alleviated by influences at the relational level, including parental support, peer interactions and family socio-economic status.

At the community level, the school environment, regional differences in access to resources and community attitudes toward mental health can either contribute to or protect against suicide risk. Finally, at the societal level, cultural norms, economic disparities and systemic inequities create a broader context within which individual and relational factors operate.

By applying this framework, the study seeks to identify not only the direct influences on suicidal ideation but also the interactions between various socio-ecological factors. This approach offers a holistic perspective, providing actionable insights for designing interventions that address risk at multiple levels (see Figure 1).

Figure 1. The socio-ecological model (Zollner et al., Reference Zollner, Fuchs and Fegert2014).

Aims of the study

The primary aim of this study is to utilize ML techniques to predict suicide risk among Ghanaian adolescents by analyzing a nationally representative dataset. The specific aims are as follows:

  1. 1. To identify key psychological and socio-ecological factors associated with suicide risk

  2. 2. To develop and evaluate ML models for suicide risk prediction

  3. 3. To determine the relative importance of psychological and socio-ecological risk factors

Methods

Study design and context

This study employed a cross-sectional survey design to examine the mental health and well-being of adolescents across Ghana. Data collection took place over a three-week period in October 2024, providing a comprehensive snapshot of adolescents’ emotional and psychological states. The study was conducted in junior high schools located in rural, peri-urban and urban areas and represented four major geographical regions of Ghana: the Northern, Eastern, Southern (Greater Accra and Eastern regions) and Middle (Ashanti region) belts. This regional sampling ensured that a broad spectrum of student experiences and educational contexts was captured.

Inclusion and exclusion criteria

The inclusion criteria for participation in this study required participants to be aged between 12 and 18 years, enrolled in a junior high school in Ghana, able to understand and provide informed assent or consent in English and available to participate during the designated data collection period in October 2024. Conversely, exclusion criteria encompassed students who were absent during data collection and those unable to provide informed assent or consent due to language barriers or cognitive limitations.

Participants and recruitment

A total of 1,850 students expressed interest in participating in the study. After screening for eligibility, 1,703 students completed the survey, yielding a participation rate of 92%. Students were recruited using stratified random sampling, ensuring that each region, as well as various socio-demographic groups (e.g., gender, socio-economic status), was adequately represented. Recruitment occurred through collaboration with school administrators and teachers, who helped identify eligible students from class rosters. Flyers and verbal presentations were used to provide information about the study’s goals and procedures. To prevent potential coercion by school authorities, participation was explicitly voluntary, and students were assured that their decision to participate or decline would not affect their academic standing or relationship with the school.

Informed consent and ethical approval

Prior to participation, written informed consent was obtained from parents or legal guardians, while students provided verbal or written assent. All participants were informed about the voluntary nature of participation, the right to withdraw at any time and the confidentiality of their responses. The study received approval from the Institutional Review Board (IRB) at the University at Buffalo. Ethical standards were maintained throughout the study to protect participants’ privacy, ensure data security and provide access to mental health resources in case of distress.

Incentives

No incentives, financial or otherwise, were provided to students before or during the consent process to ensure that participation was entirely voluntary and free from any potential financial coercion. Upon completion of measures, students were given notepads and pens.

Survey administration and instrumentation

Surveys were administered by trained research assistants during school hours in a controlled and quiet classroom setting. The average time to complete the survey was 30 min. The survey was presented in English, the primary language of instruction in Ghana’s junior high schools. The measures used in this study were tested and validated in a prior study conducted by the same research team in 2022 with 800 students from the Greater Accra region. The use of these pre-tested instruments ensured that the measures were culturally relevant, clear and reliable. To mitigate potential language barriers, research assistants were available to provide verbal explanations or clarifications in local languages as needed.

The survey included a range of measures assessing:

Demographics: Age, gender, living situation, educational level and ethnic group.

The World Health Organization Composite International Diagnostic Interview (WHO-CIDI) (Kessler and Üstün, Reference Kessler and Üstün2004) was used to assess suicidal behaviors. The scoring was binary (yes or no). The independent variables included psychological factors such as depression symptoms and anxiety symptoms, and socio-ecological factors such as perceived stress, social media addiction, trauma exposure and financial hardship (see Supplementary Appendix 1 for measurement details, scoring levels and citations).

Safety protocols

Given the sensitive nature of some of the survey questions, a comprehensive safety plan was in place. Research assistants were trained to recognize signs of emotional distress and provide immediate assistance. Students expressing significant psychological distress during or after the survey were referred to school counselors or mental health professionals. Additionally, participants were provided with contact information for national mental health hotlines and local support services. All participants were reassured about the voluntary nature of their participation and their right to withdraw at any time.

Data analysis

First, we performed descriptive statistics to summarize the characteristics of the study population. Chi-square tests were used to examine associations between categorical variables, while independent t-tests were used to compare means between groups.

For the ML analysis, we first cleaned the data and handled missing values. After evaluating the patterns of missingness, we opted to use listwise deletion to handle missing data, excluding cases with any missing values on the variables of interest from the analyses.

The data were then split into training (80%) and testing (20%) sets. Random forest captured complex, nonlinear relationships and predictor importance, while logistic regression provided interpretable odds ratios for clear associations. For logistic regression, suicide risk was coded as a binary outcome using the WHO-CIDI Suicidal Behaviors module (Kessler and Üstün, Reference Kessler and Üstün2004): ‘at risk’ (coded 1) for any ‘yes’ response to ideation, plans or attempts in the past 12 months and ‘not at risk’ (coded 0) for ‘no’ responses to all items. In the analysis, Likert scale measures were treated as continuous variables unless transformed into categorical or binary variables as specified, while the suicidal behavior variable from the WHO-CIDI was modeled as binary, with individual items (ideation, plans and attempts) combined into a single outcome where a ‘yes’ response was coded if any item was endorsed.

We applied both random forest and logistic regression models to identify factors associated with suicide risk. The relative importance of each risk factor was determined through feature importance analysis. Model performance was assessed using accuracy, sensitivity and specificity measures. All analyses were performed using Python’s scikit-learn library, with statistical significance set at p < .05.

Results

Descriptive statistics

A total of 1,736 adolescents participated in the study. The mean age of participants was 14.3 years (SD = 1.42, range = 12–18), with 58.7% identifying as female. Among the participants, 24.3% reported having suicide ideation, 18.2% reported making suicide plans and 12.5% reported suicide attempts in their lifetime.

Psychological and socio-ecological predictors of suicide risk

Correlations between key variables

Table 1 presents the correlations between independent variables and suicide risk indicators. Depression symptoms showed the strongest positive correlation with suicide risk (r = .42, p < .01), followed by anxiety (r = .38, p < .01) and perceived stress (r = .35, p < .01). Parental support demonstrated a significant negative correlation with suicide risk (r = −.34, p < .01).

Table 1. Correlations between independent variables and suicide risk (N = 1,703)

Note. Correlation is significant at the 0.01 level (2-tailed).

Figure 2 shows a heatmap of these correlations, illustrating the strength and direction of relationships among the variables.

Figure 2. Heatmap of correlations between independent variables and suicide risk.

Gender differences in risk factors

Gender differences were pronounced in psychological factors, with female participants consistently reporting significantly higher levels of depression symptoms (t = 4.89, p < .001), anxiety (t = 6.32, p < .001) and perceived stress (t = 4.15, p < .001) compared to male participants, suggesting a greater psychological burden among females. For socio-ecological factors, female participants also reported significantly higher social media addiction (t = 4.78, p < .001), potentially indicating greater exposure to online stressors, while male participants reported slightly higher parental support (t = −2.45, p = .014), which may suggest a protective factor. These findings highlight the nuanced interplay of individual and environmental influences on suicide risk across genders. Table 2. summarizes the differences between females and males.

Table 2. Gender differences in key risk factors (N = 1,703)

Note. *p < .05, **p < .01._.

Figure 3 visualizes the mean scores of key risk factors by gender.

Figure 3. Mean scores of key risk factors by gender.

Predictive model performance

A random forest classification model was developed to identify the most important predictors of suicide risk. The model demonstrated good predictive performance. Due to the superior predictive performance and ability to handle complex interactions, random forest results are presented in the main text, while the complete logistic regression outputs are available in the Supplementary Appendix for reference (see Table 3).

Table 3. Random forest model performance metrics

Feature importance analysis

The relative importance of predictors in the random forest model revealed that psychological factors were the strongest predictors of suicide risk. Figure 4 displays the feature importance scores (see Table 4).

Figure 4. Feature importance scores from the random forest model.

Table 4. Feature importance in predicting suicide risk

Note. Importance scores are normalized and sum to 1.

Receiver operating characteristic (ROC) curve

The ROC curve illustrates the model’s ability to discriminate between those with and without suicide risk. The area under the ROC curve (AUC) was 0.81 (95% CI [0.78, 0.84]), indicating good discrimination. Figure 5 presents the ROC curve.

Figure 5. ROC curve for the suicide risk prediction model.

Discussion

The results of this ML analysis provide crucial insights into suicide risk factors among Ghanaian adolescents, offering several important implications for intervention and policy development.

Summary of findings

The findings indicate that the random forest model effectively identified key predictors and demonstrated good performance in classifying individuals at risk. Psychological factors such as depression symptoms, anxiety symptoms and perceived stress are strong predictors of suicide risk among Ghanaian adolescents. Parental support serves as a protective factor, with higher levels of support associated with lower suicide risk. Female adolescents reported higher levels of psychological distress and suicide risk indicators compared to male participants.

Predictive model performance

The ROC curve demonstrates robust predictive capability, indicating that the model performs well in distinguishing between adolescents at high and low risk of suicide. This finding suggests that the model has practical utility and could be effectively deployed in Ghanaian school settings as part of a broader mental health strategy. In schools with limited access to trained mental health professionals, such a predictive tool could serve as a first-line screening mechanism, enabling educators and school counselors to identify at-risk adolescents more efficiently and allocate resources where they are most needed.

This is especially critical in the Ghanaian context, where mental health resources are scarce and stigma often prevents adolescents from seeking help (Azasu, Reference Azasu2024). A data-driven approach to early risk detection could overcome these barriers by providing an objective, non-invasive method for identifying adolescents in need of support. Furthermore, the scalability of ML-based tools makes them particularly suited for resource-constrained environments, where traditional one-on-one assessments may not be feasible due to time and personnel limitations (Atmakuru et al., Reference Atmakuru, Shahini, Chakraborty, Seoni, Salvi, Hafeez-Baig, Rashid, Tan, Barua, Molinari and Acharya2025).

In comparing model performance, random forest outperformed logistic regression in predictive accuracy, highlighting its strength in capturing complex relationships among suicide risk factors. However, logistic regression provided interpretable odds ratios, revealing key predictors like depression symptoms and the protective effect of parental support (OR = 0.88, p = .032). Together, these models offer complementary insights into risk assessment (see Supplementary Appendix A for details).

By integrating such predictive models into school-based mental health programs, there is significant potential for early intervention, reducing the likelihood of suicide attempts and improving overall mental health outcomes. However, further research is needed to ensure the model’s cultural relevance, ethical implementation and long-term efficacy in such settings.

Psychological and socio-ecological factors

The feature importance scores reveal that psychological factors, particularly depression, anxiety and perceived stress, are the strongest predictors of suicide risk among Ghanaian adolescents. This aligns with previous findings on suicide risk factors among adolescents in Ghana but provides more precise quantification of their relative importance (Oppong Asante et al., Reference Oppong Asante, Kugbey, Osafo, Quarshie and Sarfo2017; Azasu et al., Reference Azasu, Quarshie, Messias, Larnyoh, Ali and Joe2024). The significant weight of these psychological factors suggests that mental health screening programs in Ghanaian schools should prioritize depression and anxiety assessment.

The analysis reveals the substantial impact of socio-ecological factors like parental support and social media addiction, reflecting the socio-ecological framework outlined in the theoretical background. This finding is particularly relevant in the Ghanaian context, where traditional family structures are evolving due to urbanization (Yankson and Bertrand, Reference Yankson and Bertrand2012). Social media use is also rapidly increasing among adolescents in Ghana, while access to mental health resources remains scarce to address its potential adverse effects. Studies have shown the adverse effects of social media use on mental health and well-being among Ghanaian youth (Offei Ofosu, Reference Offei Ofosu2024).

Gender differences and cultural implications

The gender-based analysis reveals notable differences in risk factors between male and female adolescents. This finding is crucial given Ghana’s cultural context, where gender roles and expectations can significantly influence mental health outcomes and the literature supports significant gender differences in suicide risk across sub-Saharan Africa (Joe et al., Reference Joe, Stein, Seedat, Herman and Williams2008; Keugoung et al., Reference Keugoung, Tabah and Criel2013). While girls tend to experience suicidal thoughts more frequently than boys, their engagement in suicidal behaviors is comparable to that of boys (Azasu et al., Reference Azasu, Quarshie, Messias, Larnyoh, Ali and Joe2024). This raises critical questions about the impact of gender norms that place undue pressure on adolescent girls, heightening stress, exposure to abuse, gender-based violence and perpetuating a culture of silence. There is therefore the need for gender-sensitive intervention strategies, cultural competency in mental health services and targeted support systems that account for gender-specific risk factors.

Practical implications for Ghana’s educational and healthcare systems

The findings of this study offer valuable insights with significant practical implications for Ghana’s educational and healthcare systems. First, they highlight the need for strategic resource allocation in schools. By identifying key risk factors for adolescent suicide, stakeholders can prioritize limited mental health resources to address the most pressing needs. Schools can focus on providing targeted support for adolescents who display psychological distress, such as depression and anxiety, ensuring that interventions are directed where they will have the greatest impact.

Second, the predictive model developed in this study presents an opportunity to implement cost-effective screening programs in Ghanaian schools. Given the scarcity of mental health professionals and resources, particularly in rural and underserved areas, this model can serve as a scalable tool for early identification of at-risk adolescents. By integrating this screening tool into existing school systems, educators and counselors can proactively address mental health concerns before they escalate.

Finally, the findings underscore the importance of designing interventions that adopt a multi-level approach. Effective programs should address individual psychological factors, such as depression and anxiety, while also strengthening family support systems and fostering school-based mechanisms for mental health care. Additionally, raising community awareness and promoting engagement around adolescent mental health can help reduce stigma and create a supportive environment for young people. Together, these efforts can contribute to a more comprehensive and sustainable approach to improving mental health outcomes in Ghanaian schools.

Limitations

While this study provides valuable insights into adolescent suicide risk and the potential of predictive models in Ghanaian school settings, several limitations must be acknowledged. First, the cross-sectional design of the study limits the ability to establish causal relationships between identified risk factors and suicide risk. Longitudinal studies would be necessary to better understand the temporal dynamics of these relationships and to assess the long-term predictive accuracy of the model.

Second, the data were self-reported, which introduces the possibility of response bias. Adolescents may have underreported or overreported their psychological distress or experiences due to stigma, social desirability or misunderstanding of survey items. This could affect the reliability of the data and, consequently, the performance of the predictive model.

Third, while the sample was nationally representative, certain contextual factors unique to specific regions or subgroups may not have been fully captured. For instance, cultural differences, access to resources and variations in gender norms across rural, peri-urban and urban areas might influence risk factors differently.

Excluding absent students, those with language barriers or those with cognitive limitations may introduce selection bias and limit the generalizability of our findings to diverse and vulnerable populations.

Lastly, the predictive model, while demonstrating strong performance metrics, requires further validation in real-world settings. Implementation in schools may face challenges such as limited infrastructure, lack of trained personnel and ethical concerns regarding data privacy and stigmatization of at-risk adolescents. Future research should address these limitations by incorporating longitudinal designs, refining data collection methods and piloting the model in diverse school environments to ensure its feasibility and effectiveness.

Conclusion

This study highlights the critical need for proactive measures to address adolescent suicide risk in Ghanaian schools. By identifying key risk factors and demonstrating the potential of a predictive model for early detection, it provides a foundation for scalable, cost-effective mental health interventions in resource-limited settings.

Open peer review

To view the open peer review materials for this article, please visit http://doi.org/10.1017/gmh.2025.10083.

Supplementary material

The supplementary material for this article can be found at http://doi.org/10.1017/gmh.2025.10083.

Data availability statement

The data for this study cannot be shared as part of the approval policy by the University at Buffalo Human Research Ethics Committee.

Financial support

This research was funded by the State University of New York at Buffalo.

Competing interests

The authors have no financial interests or potential conflicts of interest to disclose as far as this article is concerned. This research did not receive any specific grant from funding agencies in the public, commercial or not-for-profit sectors. The sponsors were not involved in the design or completion of the study or manuscript.

Ethics approval

The questionnaire and methodology for this study were approved by the University at Buffalo Human Research Ethics Committee.

Consent to participate

Informed consent was obtained from all individual participants included in the study, including legal guardians.

Consent to publish

No identifying information was published in this article as this was just a cross-sectional study.

References

Acquah, EO, Wilson, ML and Doku, DT (2014) Patterns and correlates for bullying among young adolescents in Ghana. The Social Sciences 3, 114.Google Scholar
Atmakuru, A, Shahini, A, Chakraborty, S, Seoni, S, Salvi, M, Hafeez-Baig, A, Rashid, S, Tan, RS, Barua, PD, Molinari, F and Acharya, UR (2025) Artificial intelligence-based suicide prevention and prediction: A systematic review (2019–2023). Information Fusion 114, 102673. https://doi.org/10.1016/j.inffus.2024.102673.Google Scholar
Azasu, EK (2024) Exploring suicide stigma among adolescents in the greater Accra region of Ghana: The role of age, mental health stigma, and traditional beliefs. Mental Health & Prevention 33, 200314. https://doi.org/10.1016/j.mhp.2023.200314.Google Scholar
Azasu, EK and Joe, S (2023) Correlates of suicide among middle and high school students in Ghana. The Journal of Adolescent Health 72(5S), S59S63. https://doi.org/10.1016/j.jadohealth.2022.09.036.Google Scholar
Azasu, EK, Quarshie, EN, Messias, E, Larnyoh, M, Ali, E and Joe, S (2024) Psychological and socio-ecological correlates of 12-month suicide behavior among junior high school students in the greater Accra region of Ghana. Social Psychiatry and Psychiatric Epidemiology. https://doi.org/10.1007/s00127-024-02613-5.Google Scholar
Joe, S, Stein, DJ, Seedat, S, Herman, A and Williams, DR (2008) Non-fatal suicidal behavior among south Africans: Results from the South Africa stress and health study. Social Psychiatry and Psychiatric Epidemiology: The International Journal for Research in Social and Genetic Epidemiology and Mental Health Services 43(6), 454461. https://doi.org/10.1007/s00127-008-0348-7.Google Scholar
Kessler, RC and Üstün, TB (2004) The world mental health (WMH) survey initiative version of the World Health Organization (WHO) composite international diagnostic interview (CIDI). International Journal of Methods in Psychiatric Research 13(2), 93121. https://doi.org/10.1002/mpr.168.Google Scholar
Keugoung, B., Tabah, E., & Criel, B. (2013). Suicides in a rural health district of Cameroon: The iceberg of poor mental health care policies and services: A mixed method study [conference abstract]. Journal of Mental Health Policy and Economics, 16, S17S18. https://www.embase.com/search/results?subaction=viewrecord&id=L71383733&from=exportGoogle Scholar
Krug, EG, Mercy, JA, Dahlberg, LL and Zwi, AB (2002) The world report on violence and health. The Lancet 360(9339), 10831088. https://doi.org/10.1016/s0140-6736(02)11133-0.Google Scholar
Offei Ofosu, F (2024) Impact of Social Media Use on Adolescent Mental Health and Well-Being at Selected Private Schools in the Greater Accra Region Ghana. Ensign Global College, Eastern Region: Ghana, West Africa.Google Scholar
Oppong Asante, K, Kugbey, N, Osafo, J, Quarshie, EN-B and Sarfo, JO (2017) The prevalence and correlates of suicidal behaviours (ideation, plan and attempt) among adolescents in senior high schools in Ghana. SSM - Population Health 3, 427434. https://doi.org/10.1016/j.ssmph.2017.05.005.Google Scholar
Roberts, M, Mogan, C and Asare, JB (2014) An overview of Ghana’s mental health system: Results from an assessment using the World Health Organization’s assessment instrument for mental health systems (WHO-AIMS). International Journal of Mental Health Systems 8(1), 16. https://doi.org/10.1186/1752-4458-8-16.Google Scholar
Rubinger, L, Gazendam, A, Ekhtiari, S and Bhandari, M (2023) Machine learning and artificial intelligence in research and healthcare. Injury 54, S69S73. https://doi.org/10.1016/j.injury.2022.01.046.Google Scholar
Wasserman, D., Cheng, Q., & Jiang, G.-X. (2005). Global suicide rates among young people aged 15-19. World Psychiatry: Official Journal of the World Psychiatric Association (WPA), 4(2), 114120. https://pubmed.ncbi.nlm.nih.gov/16633527.Google Scholar
WHO. (2020). Global Health Estimates 2019: deaths by cause, age, sex, by country and by region, 2000–2019. https://www.who.int/publications/i/item/9789240026643.Google Scholar
Yankson, PW and Bertrand, M (2012) Challenges of urbanization in Ghana. The Mobile City of Accra, 125. https://horizon.documentation.ird.fr/exl-doc/pleins_textes/divers13-04/010056998.pdfGoogle Scholar
Zollner, H, Fuchs, K and Fegert, J (2014) Prevention of sexual abuse: Improved information is crucial. Child and Adolescent Psychiatry and Mental Health 8, 5. https://doi.org/10.1186/1753-2000-8-5.Google Scholar
Figure 0

Figure 1. The socio-ecological model (Zollner et al., 2014).

Figure 1

Table 1. Correlations between independent variables and suicide risk (N = 1,703)

Figure 2

Figure 2. Heatmap of correlations between independent variables and suicide risk.

Figure 3

Table 2. Gender differences in key risk factors (N = 1,703)

Figure 4

Figure 3. Mean scores of key risk factors by gender.

Figure 5

Table 3. Random forest model performance metrics

Figure 6

Figure 4. Feature importance scores from the random forest model.

Figure 7

Table 4. Feature importance in predicting suicide risk

Figure 8

Figure 5. ROC curve for the suicide risk prediction model.

Supplementary material: File

Azasu supplementary material 1

Azasu supplementary material
Download Azasu supplementary material 1(File)
File 35.6 KB
Supplementary material: File

Azasu supplementary material 2

Azasu supplementary material
Download Azasu supplementary material 2(File)
File 19.9 KB

Author comment: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R0/PR1

Comments

Dear Dr. Galea and Guest Editors,

I am pleased to submit my manuscript titled ‘A Machine Learning Analysis of Socio-Ecological and Psychological Risk Factors for Suicide Among a Nationally Representative Sample of Ghanaian Junior High School Students’ for consideration in the special issue Self-harm and Suicide: A Global Priority of the Cambridge Journal. My work aligns closely with the theme “Changing the Narrative” for World Suicide Prevention Day 2024 and contributes to the growing body of research aimed at addressing the global burden of suicide, particularly in low- and middle-income countries (LMICs).

The manuscript addresses key areas of focus outlined in your call for papers, including “the development of culturally appropriate suicide screening tools,” “interventions for adolescents at elevated risk,” or “the role of stigma reduction in suicide prevention efforts”. Drawing on data and insights from a nationally representative study of adolescents in Ghana, this work provides evidence-based recommendations for scalable and cost-effective detection and prediction of suicide risk tailored to resource-limited settings.

In line with the goals of the United Nations Sustainable Development Goal (UN SDG) 3.4.2 to reduce suicide mortality by one-third by 2030, this research offers innovative approaches to early identification of at-risk populations and highlights practical strategies for implementation in LMICs. It also underscores the importance of addressing the intersectionality of social, cultural, and psychological determinants of suicide, as well as the role of community engagement in reducing stigma and improving mental health outcomes.

I believe that this manuscript will make a valuable contribution to the special issue, offering both theoretical and practical insights to inform policy and practice. I have adhered to the journal’s submission guidelines and confirm that this manuscript has not been published elsewhere and is not under consideration by any other journal.

Thank you for considering my submission. I look forward to the possibility of contributing to this important special issue and am happy to provide any additional information or clarification if needed.

Sincerely,

Enoch Azasu

Review: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

Dear Dr. Azasu,

Thank you for sharing your work—it’s clearly an important contribution, and I appreciate the effort you’ve put into it. I do have a few remarks that I hope will help strengthen your study:

Incentives and Coercion:

Were there any incentives provided to students before the consent process? If not, it would be helpful to explicitly mention this to rule out potential financial coercion.

Additionally, how were students protected from potential coercion by school authorities to participate? Clarifying this would add to the ethical rigor of the study.

Language Considerations:

Did you consider translating the questionnaire into the local language for students who might not be proficient in English? This could ensure greater inclusivity and accuracy in responses.

Education as a Predictor:

Based on previous literature, is education (Y/N) a predictor for the outcomes of interest? If so, focusing your study on students might affect its validity, as the goal is to assess mental health among all adolescents in the country. This point might be worth addressing.

Support for Suicidal Ideation:

For the 18% of students who screened positive for suicidal ideation, was there any immediate support provided? If not, are there any plans underway to address this critical need?

Once again, thank you for your meaningful work. I look forward to hearing your thoughts!

Best regards,

Isaac.

Review: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R0/PR3

Conflict of interest statement

I have no conflicts of interests to declare.

Comments

Objective of the Paper

This study investigates the use of machine learning (ML) techniques to analyze psychological factors (e.g., depression, anxiety) and socio-ecological factors (e.g., bullying, parental support) associated with suicide risk among a representative sample of Ghanaian school-going students.

Key Points

• The study contributes valuable knowledge to the limited literature on suicide in low- and middle-income countries (LMICs).

• The work is highly significant and fills a critical gap, particularly in the emerging research on adolescent suicide in sub-Saharan Africa. It provides important insights into a vulnerable population using novel/emerging methods.

Major Issues and General Comments

• No major concerns.

• The manuscript contains many bullet points. It is recommended to convert these into paragraph format for better readability.

• The manuscript frequently references AI/ML techniques. Since AI methodologies are not used, it is advisable to only refer to using ML techniques.

• The term “suicide” is used broadly throughout the abstract and manuscript. Suicide encompasses various aspects, including suicidal ideation, planning, attempts, self-harm, and death by suicide. It is crucial to specify which aspect is being addressed or use more precise terminology (e.g., suicidal thoughts and behaviors).

Minor Issues

Abstract Clarity

• Clearly specify the aspects of suicide being predicted (e.g., suicidal thoughts and behaviors, self-harm, death by suicide).

• As previously mentioned, avoid implying the use of both AI and ML techniques.

• When discussing “depression, anxiety, and perceived stress,” specify that these are symptoms rather than diagnosed disorders.

Introduction

• Provide citations for the statement: “Adolescents are particularly vulnerable to suicidal ideation due to the interplay of psychosocial stressors, rapid developmental changes, and limited access to mental health resources.”

• Include statistics on suicidal planning and attempts among adolescents in Ghana or similar West African or sub-Saharan African settings, if available.

• Support the statement: “Despite the growing recognition of adolescent suicide as a critical issue, mental health resources in Ghana remain scarce, and traditional approaches to identifying suicide risk often fall short in addressing the multifaceted nature of this problem.”

• Provide a citation for the statement: “At the individual level, factors such as personal experiences of bullying victimization, food insecurity, and emotional distress play a critical role in shaping mental health outcomes.”

• Consider incorporating a visual adaptation of the SEM model to illustrate how it fits into the study’s framework.

Methods

• Convert bullet points into paragraphs for consistency.

• The abstract states that the survey included 1,703 students, while 1,702 students are mentioned on line 54. Ensure consistency in reporting.

• If the study referenced in “The measures used in this study were tested and validated in a prior study conducted by the same research team in 2022 with 800 students from the Greater Accra region” is published, provide a citation. If unpublished, describe its status.

• Clearly list all measurement scales used to assess outcomes (e.g., specify which scale was used to assess anxiety). Provide a brief summary of what each measure entailed within the categories of demographic, mental health, psychosocial factors, and social media/internet use along with response options. For example:

o Depression was assessed using the PHQ-4, an x-item measure where responses range from 0 (not at all) to 3 (nearly every day), with scores ≥X indicating moderate to severe depressive symptoms.

o Consider creating an appendix to present the full questions and response options.

• If suicidal ideation, plans, and attempts were assessed separately, clearly define how each was assessed, coded, and prepared for each type of analysis. Do the same if a global WHO CIDI score was used as well. This is important to clarify particularly for logistic regression models which typically involve categorical outcomes.

• Ensure consistent terminology from the introduction onward. Psychological and socio-ecological factors should remain distinct throughout. When reporting findings, avoid lumping them together to prevent confusion. For instance, even in the methods, when discussing measures, consider separating psychological and socio-ecological factors to remain consistent.

• In lines 45-47, expand on the data cleaning process, including the frequency, range, and handling of missing values (e.g., multiple imputation, listwise deletion).

• In lines 49-50, provide a rationale for applying both Random Forest and Logistic Regression models to identify suicide risk factors. Since logistic regression involves categorical outcomes, clearly define response options and how data were coded.

Results

• Convert bullet points into paragraphs.

• Maintain consistency in terminology for psychological and socio-ecological factors. When discussing gender differences, clarify both psychological and socio-ecological findings. Example, you have:

o “Significant gender differences were observed across multiple risk factors. Female students consistently reported higher levels of psychological distress compared to male students.” But what about the socio-ecological findings?

o In Table 2, separate psychological and socio-ecological factors for clarity while keeping them in the same table. Add a row to distinguish between the two categories and explicitly discuss conclusions related to each.

• Although logistic regression is mentioned in the methods, results from these models are not presented. If logistic regression models were conducted, consider including them in an appendix or if they were not used, remove mentions of logistic regression models. If included, compare their performance with Random Forest models.

Discussion

• Convert bullet points into paragraphs.

• Ensure consistent terminology when discussing psychological and socio-ecological factors.

• The limitations of cross-sectional data in causal inference are acknowledged in the discussion; consider mentioning this in the methods section as well, particularly in relation to machine learning analysis and the prediction of suicide risk. Right now, the language is a little ambiguous.

• Discuss the comparative performance of Random Forest and Logistic Regression models (if they were used).

• Discuss the limitations of excluding absent students, those unable to consent due to language barriers, or those with cognitive limitations.

Review: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R0/PR4

Conflict of interest statement

Reviewer declares none.

Comments

The researchers sought to demonstrate that the use of artificial intelligence and machine learning can prove more effective than traditional methods in identifying adolescents at higher risk of suicide using a cohort of teens from a junior high school in Ghana. There is a hefty amount of statistical jargon that is not explained sufficiently and difficult to interpret. Additionally, to someone unfamiliar with AI/ML, it is not clear what the artificial intelligence and machine learning aspects are by definition or what they entail in this context as well as how exactly they are used on the data in a way that adds to the findings already gathered by the surveys that the researchers administered. I would like to thank the authors for their effort and share a series of comments and recommendations for potential improvement of the manuscript.

INTRODUCTION

- There is repeated reference to “psychological and socioecological” factors that feels excessive, without much explanation as to what those factors are until further into the introduction.

- Would suggest explaining in this section what exactly artificial intelligence and machine learning mean

- The theoretical framework with the SEM was well-described

METHODS

- The recruitment and logistics of survey administration were overall well-described

- Unclear if the comprehensive safety plan was just for research assistants to recognize distress and refer to other mental health professionals or if there was more individualized planning involved

- What is Python programming language?

- What does “cleaned the data” mean?

- What is the difference between the “training” and “testing” sets?

- What is Random Forest? If this is the machine learning used, I would recommend describing it in much greater detail than what is given here.

RESULTS

- How is “suicide risk” defined? Is it based on the percentages of SI, suicide plans, attempts section? If so would explicitly state this and how the ultimate value of the risk is determined if it is being correlated with the other variables

- Consider combining the two titles below the descriptive statistics section: “correlations between key psychological and socioecological predictors of suicide risk.”

- The formatting within table one for the parental support line appears confusing as the negative symbol is in a line above the values.

- The note at the bottom of table one is not specifically labeled to refer to the double asterisks

- It is unclear what the values in table 2 represent. For example, what does a mean score of 8.2 for depression signify here?

DISCUSSION

- What is the machine learning/predictive models adding to the understanding of suicide risk past what the survey that was administered is already asking the students to identify themselves?

- Is this stating that the results of said surveys should be analyzed by AI rather than by humans and have the system then use the data to identify at-risk youth? Would make this explicitly clear if so and also who would be examining the findings of the AI programming in what is proposed as resource-scarce areas.

- The “practical implications” section repeats much of the assertions already made in the discussion and feels more like a concluding section. Would consider either condensing this or making this the “conclusion” section of the paper.

- Other limitations could include the lack of more detailed demographics including any existing other psychiatric comorbities, LGBTQ+ status, and substance use, which could all affect suicide risk especially in adolescents.

Recommendation: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R0/PR5

Comments

No accompanying comment.

Decision: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R0/PR6

Comments

No accompanying comment.

Author comment: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R1/PR7

Comments

Subject: Revision Submission for GMH-2025-0003 – Response to Editor

Dear Dr. Galea,

Thank you for the opportunity to revise my manuscript entitled “A Machine Learning Analysis of Socio-Ecological and Psychological Risk Factors for Suicide Among a Nationally Representative Sample of Ghanaian Junior High School Students” (GMH-2025-0003) for Cambridge Prisms: Global Mental Health. I appreciate the thoughtful feedback provided by the reviewers and am pleased to submit my revised manuscript, addressing each comment as detailed in the accompanying response document.

I have completed the revisions, incorporating the suggested major changes, and have included both clean and tracked changes versions of the manuscript for your review. Additionally, I have added the requested impact statement to highlight the significance of my findings. However, regarding the graphical abstract, I regret to inform you that I am not able to include a graphical abstract at this time. I would greatly appreciate any guidance or support the journal can provide in this regard.

Thank you once again for considering my work. I look forward to your feedback and am eager to ensure my submission meets the journal’s standards.

Sincerely,

Dr. Enoch Azasu

Review: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R1/PR8

Conflict of interest statement

Reviewer declares none.

Comments

Comments for the Author

Objective of the Paper

This study investigates the application of machine learning (ML) techniques to examine psychological (e.g., depression, anxiety) and socio-ecological (e.g., bullying, parental support) factors associated with suicide risk among a representative sample of school-going students in Ghana.

Key Points

• This study contributes valuable insights to the limited literature on suicide in low- and middle-income countries (LMICs).

• The work addresses a critical gap, particularly in the emerging research on adolescent suicide in sub-Saharan Africa, and leverages novel analytical methods to explore this issue in a vulnerable population.

Major Issues and General Comments

• No major concerns. This is a timely and important paper that makes a meaningful contribution.

Minor Issues and Specific Suggestions

Abstract Clarity

• As previously suggested, please clearly specify what aspect(s) of suicide were assessed (e.g., suicide risk, suicidal thoughts, behaviors, etc). If space allows, briefly mention how suicide risk was measured. Otherwise, the abstract reads well.

Introduction

• Thank you for incorporating the SEM framework. If the figure used was adapted or taken from another source, please cite both the original developer of the model—Urie Bronfenbrenner—and the source of the visual.

Methods

• Consider reducing the use of bullet points, especially in the inclusion/exclusion section, by integrating the content into narrative paragraphs for better readability.

• In lines 19–27: Thank you for the revisions—this section is much clearer. One area that still needs clarification is how variables were scored and coded. Many of the measures listed in the appendix are based on Likert scales; please specify whether these were analyzed as continuous variables or transformed into categorical or binary variables. For the suicidal behavior variable based on the WHOCIDI, it is described as “nominal (Yes or No),” but this appears to be binary. “Binary” would be a more precise term. Additionally, please clarify whether individual WHOCIDI items (e.g., ideation, plan, attempt) were modeled separately or combined into a single outcome. If combined, indicate the criteria used to code a “Yes” response. You touch on this in the data analysis section, but consider moving it up and expanding a bit for clarity.

• In the sentence, “The independent variables were depression symptoms, anxiety symptoms, perceived stress, social media addiction, trauma exposure, and financial hardship,” consider labeling which variables are psychological versus socio-ecological to align with your conceptual framework.

• In lines 22–27: Thank you for explaining your handling of missing data. To enhance transparency and allow readers to assess potential bias or information loss, please report the proportion of missingness (e.g., <5%, 10%, etc.) before listwise deletion was applied.

Results

• Although logistic regression is mentioned in the methods, the corresponding results are not presented. If you prefer not to include them in the main text, consider briefly justifying why the random forest results are prioritized and referring readers to the appendix for the logistic regression outputs. As of right now, a reader would expect to see both models in the results section.

Discussion

• This section is well written and clearly interprets the results. No changes needed here—thank you!

Overall

Excellent work incorporating previous feedback! This is a strong and impactful paper, and I look forward to seeing the final version!

Recommendation: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R1/PR9

Comments

No accompanying comment.

Decision: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R1/PR10

Comments

No accompanying comment.

Author comment: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R2/PR11

Comments

No accompanying comment.

Review: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R2/PR12

Conflict of interest statement

Reviewer declares none.

Comments

Thank you for addressing all of my previous comments—this manuscript is much improved and ready to move forward. I have just a few very minor edits to consider:

1. In the abstract (line 45), please add parentheses around “i.e. ideation, plan and attempts.”

2. In the second page of the abstract (in the conclusion section, line 10), similarly enclose “i.e. ideation, plan and attempts.” in parentheses.

3. Under “Survey Administration” (page 10) (lines 40–47), I suggest rephrasing the list of independent variables for clarity. For example:

“The independent variables included psychological factors such as depression symptoms and anxiety symptoms, and socio‐ecological factors such as perceived stress, social media addiction, trauma exposure, and financial hardship (see Appendix 1 for measurement details, scoring levels, and citations).”

4. In the same section (lines 38–40), change “The scoring was nominal (Yes or No)” to “The scoring was binary (Yes or No).”

Thank you again for your careful revisions. I’m happy to see this paper progressing and have no further concerns.

Recommendation: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R2/PR13

Comments

No accompanying comment.

Decision: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R2/PR14

Comments

No accompanying comment.

Author comment: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R3/PR15

Comments

No accompanying comment.

Recommendation: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R3/PR16

Comments

Thank you for addressing the reviewers' concerns. This is important work.

Decision: Using a machine learning analysis of socio-ecological and psychological factors to predict suicide risk among a nationally representative sample of Ghanaian adolescents — R3/PR17

Comments

No accompanying comment.