1.Association of Age, Sex and Education With Access to the Intravenous Thrombolysis for Acute Ischemic Stroke
Yoona KO ; Beom Joon KIM ; Youngran KIM ; Jong-Moo PARK ; Kyusik KANG ; Jae Guk KIM ; Jae-Kwan CHA ; Tai Hwan PARK ; Kyungbok LEE ; Jun LEE ; Keun-Sik HONG ; Byung-Chul LEE ; Kyung-Ho YU ; Dong-Eog KIM ; Joon-Tae KIM ; Jay Chol CHOI ; Jee Hyun KWON ; Wook-Joo KIM ; Kyu Sun YUM ; Sung-Il SOHN ; Hyungjong PARK ; Sang-Hwa LEE ; Kwang-Yeol PARK ; Chi Kyung KIM ; Sung Hyuk HEO ; Moon-Ku HAN ; Anjail Z. SHARRIEF ; Sunil A. SHETH ; Hee-Joon BAE
Journal of Korean Medical Science 2025;40(13):e49
		                        		
		                        			 Background:
		                        			Barriers to treatment with intravenous thrombolysis (IVT) for patients with acute ischemic stroke (AIS) in South Korea remain incompletely characterized. We analyzed a nationwide prospective cohort to determine patient-level features associated with delayed presentation and non-treatment of potentially IVT-eligible patients. 
		                        		
		                        			Methods:
		                        			We identified consecutive patients with AIS from 01/2011 to 08/2023 in a multicenter, prospective acute stroke registry in Korea. Patients were defined as IVT candidates if they presented within 4.5 hours of the time last known well, had no laboratory evidence of coagulopathy, and had a National Institutes of Health Stroke Scale (NIHSS) score ≥ 4. Multivariable generalized linear mixed regression models were used to investigate the associations between patient characteristics and IVT candidacy, and between patient characteristics and the use of IVT among candidates. 
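The three candidate criteria in the Methods can be written as a simple predicate. The following is a minimal sketch, not the registry's actual code: the `Presentation` record, its field names, and the choice to treat "within 4.5 hours" as an inclusive bound are all assumptions made for illustration.

```python
from dataclasses import dataclass

# Hypothetical record; field names are illustrative, not taken from the registry.
@dataclass
class Presentation:
    hours_since_last_known_well: float
    has_lab_coagulopathy: bool
    nihss: int

IVT_WINDOW_HOURS = 4.5  # time window stated in the abstract
MIN_NIHSS = 4           # severity threshold stated in the abstract

def is_ivt_candidate(p: Presentation) -> bool:
    """Return True when all three criteria from the abstract are met.

    Assumption: the 4.5-hour bound is inclusive; the abstract does not say.
    """
    return (
        p.hours_since_last_known_well <= IVT_WINDOW_HOURS
        and not p.has_lab_coagulopathy
        and p.nihss >= MIN_NIHSS
    )
```

For example, `is_ivt_candidate(Presentation(2.0, False, 6))` evaluates to `True`, while the same patient arriving 5 hours after last known well would not qualify.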
		                        		
		                        			Results:
		                        			Among 84,103 AIS patients, 41.0% were female, with a mean age of 69 ± 13 years and presentation NIHSS of 4 [interquartile range, 1–8]. Out of these patients, 13,757 (16.4%) were eligible for IVT, of whom 8,179 (59.5%) received IVT. Female sex (adjusted risk ratio [RR], 0.90; 95% confidence interval [CI], 0.86–0.94) and lower years of education (adjusted RR, 0.90; 95% CI, 0.84–0.97 for 0–3 years, compared to ≥ 13 years) were associated with a decreased likelihood of presenting as eligible for IVT after AIS; meanwhile, young age (adjusted RR, 1.12; 95% CI, 1.01–1.24 for ≤ 44 years, compared to 75–84 years) was associated with an increased likelihood of being an IVT candidate. Among those who were eligible for IVT, only age was significantly associated with the use of IVT (adjusted RR, 1.09; 95% CI, 1.03–1.16 for age 65–74 and adjusted RR, 0.83; 95% CI, 0.76–0.90 for ≥ 85 years, respectively). 
		                        		
		                        			Conclusion:
		                        			Most patients with AIS present outside IVT eligibility in South Korea, and only 60% of eligible patients were ultimately treated. We identified increased age, female sex and lower education as key features on which to focus interventions for improving IVT utilization. 
		                        		
		                        		
		                        		
		                        	
2.Long-Term Incidence of Gastrointestinal Bleeding Following Ischemic Stroke
Jun Yup KIM ; Beom Joon KIM ; Jihoon KANG ; Do Yeon KIM ; Moon-Ku HAN ; Seong-Eun KIM ; Heeyoung LEE ; Jong-Moo PARK ; Kyusik KANG ; Soo Joo LEE ; Jae Guk KIM ; Jae-Kwan CHA ; Dae-Hyun KIM ; Tai Hwan PARK ; Kyungbok LEE ; Hong-Kyun PARK ; Yong-Jin CHO ; Keun-Sik HONG ; Kang-Ho CHOI ; Joon-Tae KIM ; Dong-Eog KIM ; Jay Chol CHOI ; Mi-Sun OH ; Kyung-Ho YU ; Byung-Chul LEE ; Kwang-Yeol PARK ; Ji Sung LEE ; Sujung JANG ; Jae Eun CHAE ; Juneyoung LEE ; Min-Surk KYE ; Philip B. GORELICK ; Hee-Joon BAE
Journal of Stroke 2025;27(1):102-112
		                        		
		                        			 Background and Purpose:
		                        			Previous research on patients with acute ischemic stroke (AIS) has shown a 0.5% incidence of major gastrointestinal bleeding (GIB) requiring blood transfusion during hospitalization. The existing literature has insufficiently explored the long-term incidence in this population despite the detrimental impact of GIB on stroke outcomes. 
		                        		
		                        			Methods:
		                        			We analyzed the data from a cohort of patients with AIS admitted to 14 hospitals as part of a nationwide multicenter prospective stroke registry between 2011 and 2013. These patients were followed up for up to 6 years. The occurrence of major GIB events, defined as GIB necessitating at least two units of blood transfusion, was tracked using the National Health Insurance Service claims data. 
		                        		
		                        			Results:
		                        			Among 10,818 patients with AIS (male, 59%; mean age, 68 ± 13 years), 947 (8.8%) experienced 1,224 episodes of major GIB over a median follow-up duration of 3.1 years. Remarkably, 20% of the 947 patients experienced multiple episodes of major GIB. The incidence peaked in the first month after AIS, reaching 19.2 per 100 person-years, and gradually decreased to approximately one-sixth of this rate by the 2nd year with subsequent stabilization. Multivariable analysis identified the following predictors of major GIB: anemia, estimated glomerular filtration rate < 60 mL/min/1.73 m², and a 3-month modified Rankin Scale score of ≥ 4. 
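The rate unit used in the Results, events per 100 person-years, can be illustrated with a short calculation. This is a sketch only: the function name is hypothetical, and the event count and person-time in the example are invented to reproduce the reported rate, since the abstract does not give the underlying person-time.

```python
def incidence_per_100_person_years(events: int, person_years: float) -> float:
    """Crude incidence rate: events divided by person-time, scaled to 100 person-years."""
    if person_years <= 0:
        raise ValueError("person_years must be positive")
    return 100.0 * events / person_years

# Hypothetical illustration (NOT study data): 16 events observed over
# 1,000 person-months of follow-up, i.e. 1000 / 12 person-years.
first_month_rate = incidence_per_100_person_years(16, 1000 / 12)

# "Approximately one-sixth of this rate" then corresponds to roughly
# 19.2 / 6 = 3.2 events per 100 person-years.
later_rate = first_month_rate / 6
```

Expressing the first-month peak as a rate per person-year, rather than as a one-month cumulative percentage, is what allows it to be compared directly with the rate in later years.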
		                        		
		                        			Conclusion:
		                        			Patients with AIS are susceptible to major GIB, particularly in the first month after the onset of AIS, with the risk decreasing thereafter. Implementing preventive strategies may be important, especially for patients with anemia and impaired renal function at stroke onset and those with a disabling stroke. 
		                        		
		                        		
		                        		
		                        	
3.Occupation classification model based on DistilKoBERT: using the 5th and 6th Korean Working Condition Surveys
Tae-Yeon KIM ; Seong-Uk BAEK ; Myeong-Hun LIM ; Byungyoon YUN ; Domyung PAEK ; Kyung Ehi ZOH ; Kanwoo YOUN ; Yun Keun LEE ; Yangho KIM ; Jungwon KIM ; Eunsuk CHOI ; Mo-Yeol KANG ; YoonHo CHO ; Kyung-Eun LEE ; Juho SIM ; Juyeon OH ; Heejoo PARK ; Jian LEE ; Jong-Uk WON ; Yu-Min LEE ; Jin-Ha YOON
Annals of Occupational and Environmental Medicine 2024;36(1):e19
		                        		
		                        			
		                        			  Accurate occupation classification is essential in various fields, including policy development and epidemiological studies. This study aimed to develop an occupation classification model based on DistilKoBERT. We used data from the 5th and 6th Korean Working Conditions Surveys, conducted in 2017 and 2020, respectively. A total of 99,665 survey participants, who were nationally representative of Korean workers, were included. We used natural language responses regarding their job responsibilities and occupational codes based on the Korean Standard Classification of Occupations (7th version, 3-digit codes). The dataset was randomly split into training and test datasets in a ratio of 7:3. The occupation classification model based on DistilKoBERT was fine-tuned using the training dataset and evaluated using the test dataset. Accuracy, precision, recall, and F1 score were calculated as evaluation metrics. The final model, which classified 28,996 survey participants in the test dataset into 142 occupational codes, exhibited an accuracy of 84.44%. The precision, recall, and F1 score of the model, weighted by sample size, were 0.83, 0.84, and 0.83, respectively. The model demonstrated high precision in the classification of service and sales workers yet exhibited low precision in the classification of managers. In addition, it displayed high precision in classifying occupations prominently represented in the training dataset. This study developed an occupation classification system based on DistilKoBERT, which demonstrated reasonable performance. Although further efforts are needed to enhance the classification accuracy, this automated occupation classification model holds promise for advancing epidemiological studies in the fields of occupational safety and health.
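The sample-size-weighted precision, recall, and F1 score mentioned above can be sketched in plain Python. This is an illustrative reading of "weighting based on the sample size" as weighting each class by its number of true instances; the authors' exact computation is not given in the abstract, and the function name and toy labels below are hypothetical.

```python
from collections import Counter

def weighted_metrics(y_true, y_pred):
    """Return (accuracy, precision, recall, f1), averaging per-class scores
    weighted by each class's share of the true labels (its support)."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and equal in length")
    n = len(y_true)
    support = Counter(y_true)        # true instances per class
    predicted = Counter(y_pred)      # predictions per class
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)

    precision = recall = f1 = 0.0
    for label, count in support.items():
        p = correct[label] / predicted[label] if predicted[label] else 0.0
        r = correct[label] / count
        f = 2 * p * r / (p + r) if (p + r) else 0.0
        weight = count / n
        precision += weight * p
        recall += weight * r
        f1 += weight * f
    accuracy = sum(correct.values()) / n
    return accuracy, precision, recall, f1

# Toy labels standing in for occupational codes (not survey data).
acc, prec, rec, f1 = weighted_metrics(["A", "A", "A", "B"], ["A", "A", "B", "B"])
```

Under this weighting, the weighted recall is algebraically equal to the overall accuracy, which is consistent with the reported accuracy of 84.44% and weighted recall of 0.84.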
		                        		
		                        	