Cancer-related Keywords in 2023: Insights from Text Mining of a Major Consumer Portal
	    		
		   		
		   			
		   		
	    	
    	 
    	10.4258/hir.2024.30.4.398
   		
        
        	
        	
        	
        		- Author:
	        		
		        		
		        		
			        		Wonjeong JEONG
			        		
			        		
			        		
			        			1
			        			
			        		
			        		
			        		
			        		
			        		;
		        		
		        		
		        		
			        		Eunkyoung SONG
			        		
			        		;
		        		
		        		
		        		
			        		Eunzi JEONG
			        		
			        		;
		        		
		        		
		        		
			        		Kyoung Hee OH
			        		
			        		;
		        		
		        		
		        		
			        		Hye-Sun LEE
			        		
			        		;
		        		
		        		
		        		
			        		Jae Kwan JUN
			        		
			        		
		        		
		        		
		        		
		        		
		        			
			        		
			        		Author Information
			        		
		        		
		        		
			        		
			        		
			        			1. Cancer Knowledge & Information Center, National Cancer Control Institute, National Cancer Center, Goyang, Korea
			        		
		        		
	        		
        		 
        	
        	
        	
        		- Publication Type:Original Article
 
        	
        	
            
            
            	- From:Healthcare Informatics Research
	            		
	            		 2024;30(4):398-408
	            	
            	
 
            
            
            	- CountryRepublic of Korea
 
            
            
            	- Language:English
 
            
            
            	- 
		        	Abstract:
			       	
			       		
				        
				        	 Objectives:With the growing importance of monitoring cancer patients’ internet usage, there is an increasing need for technology that expands access to relevant information through text mining. This study analyzed internet articles from portal sites in 2023 to identify trends in the information available to cancer patients and to derive meaningful insights. 
				        	
				        
				        	Methods:This study analyzed 19,578 news articles published on Naver, a major Korean portal site, from January 1, 2023, to December 31, 2023. Natural language processing, text mining, network analysis, and word cloud analysis were employed. The search term “am” (Korean for “cancer”) was used to identify keywords related to cancer. 
				        	
				        
				        	Results:In 2023, an average of 1,631 cancer-related articles were published monthly, with a peak of 1,946 in September and a low of 1,371 in February. A total of 132,456 keywords were extracted, with “cure” (2,218 occurrences), “lung cancer” (1,652), and “breast cancer” (1,235) being the most frequent. Term frequency-inverse document frequency analysis ranked “struggle” (1064.172) as the most significant keyword, followed by “lung cancer” (839.988) and “breast cancer” (744.840). Network analysis revealed four distinct clusters focusing on treatment, celebrity-related issues, major cancer types, and cancer-causing factors. 
				        	
				        
				        	Conclusions:The analysis of cancer-related keywords in 2023 indicates that news articles often prioritize gossip over essential information. These findings provide foundational data for future policy directions and strategies to address misinformation. This study underscores the importance of understanding the nature of cancer-related information consumed by the public and offers insights to guide official policies and healthcare practices.