Factors Influencing Sexual Experiences in Adolescents Using a Random Forest Model: Secondary Data Analysis of the 2019~2021 Korea Youth Risk Behavior Web-based Survey Data
- Author:
Yoonseok YANG
1
;
Ju Won KWON
;
Youngran YANG
Author Information
- Publication Type:RESEARCH PAPER
- From:Journal of Korean Academy of Nursing 2024;54(2):193-210
- CountryRepublic of Korea
- Language:EN
-
Abstract:
Purpose:The objective of this study was to develop a predictive model for the sexual experiences of adolescents using the random forest method and to identify the “variable importance.”
Methods:The study utilized data from the 2019 to 2021 Korea Youth Risk Behavior Web-based Survey, which included 86,595 man and 80,504 woman participants. The number of independent variables stood at 44. SPSS was used to conduct Rao-Scott χ2 tests and complex sample t-tests. Modeling was performed using the random forest algorithm in Python. Performance evaluation of each model included assessments of precision, recall, F1-score, receiver operating characteristics curve, and area under the curve calculations derived from the confusion matrix.
Results:The prevalence of sexual experiences initially decreased during the COVID-19 pandemic, but later increased. “Variable importance” for predicting sexual experiences, ranked in the top six, included week and weekday sedentary time and internet usage time, followed by ease of cigarette purchase, age at first alcohol consumption, smoking initiation, breakfast consumption, and difficulty purchasing alcohol.
Conclusion:Education and support programs for promoting adolescent sexual health, based on the top-ranking important variables, should be integrated with health behavior intervention programs addressing internet usage, smoking, and alcohol consumption. We recommend active utilization of the random forest analysis method to develop high-performance predictive models for effective disease prevention, treatment, and nursing care.