1.NLUS-VQA: construction and evaluation of a visual question answering model for neonatal lung ultrasound diagnosis
Xuming TONG ; Jiangang CHEN ; Yiran WANG ; Xiqing ZHAO ; Yanhong YUAN ; Zishuo WANG ; Peng JIANG ; Qingyao XIONG ; Renxing LI ; Xueli WANG ; Jing LIU
Chinese Journal of Perinatal Medicine 2025;28(11):917-928
Objective:To develop and evaluate a medical visual question answering (VQA) model for neonatal lung ultrasound (LUS) images to enhance intelligent auxiliary diagnosis of neonatal pulmonary diseases.Methods:Using data from neonates admitted to Beijing Obstetrics and Gynecology Hospital, Capital Medical University (January 2023 to December 2024), an image-question-answer dataset comprising 251 LUS images was constructed [43 pneumonia (17.1%), 42 neonatal respiratory distress syndrome (16.7%), 83 transient tachypnea (33.1%), and 83 normal (33.1%) images] with a four-tier medical question-answer framework. Building upon the Qwen2.5-VL-7B base model and integrating LoRA fine-tuning with chain-of-thought prompting, we developed the NLUS-VQA model to enhance visual-language semantic alignment and enable stepwise clinical reasoning, achieving efficient small-sample adaptation. Model performance was comprehensively assessed through natural language generation metrics (BLEU-4, ROUGE-1/2/L), qualitative evaluation of characteristic recognition, and clinical consistency analysis.Results:(1) Quantitative evaluation demonstrated that NLUS-VQA achieved scores of 22.38 (BLEU-4), 48.26 (ROUGE-1), 22.40 (ROUGE-2), and 37.20 (ROUGE-L), representing significant improvements over baseline models. (2) Qualitatively, the model exhibited strong performance in identifying lung consolidation, coalescent B-lines, and snowflake signs, with its chain-of-thought strategy enhancing clinical interpretability and answer accuracy. (3) Clinically, NLUS-VQA achieved a Cohen's Kappa coefficient of 0.78 and diagnostic accuracy of 80.8% (21/26), indicating substantial agreement with clinical experts.Conclusion:The NLUS-VQA model demonstrates robust interpretability in recognizing key sonographic patterns (e.g. lung consolidation, confluent B-lines, and snowflake signs), providing a scalable framework for small-sample medical image analysis, though diagnostic performance on complex conditions remains limited by dataset scale and minority class representation.
2.Analysis of the application of VR and AR technologies in medical education
Jiaxian YUE ; Qingyao SHANG ; Jiaxiang LIU ; Xiyu KANG ; Xin WANG
China Medical Equipment 2025;22(7):172-176
VR technology can generate virtual,immersive,and interactive environments,allowing users to immerse themselves in these environments and interact with objects within them.AR technology,on the other hand,can accurately overlay virtual information onto real-world scenes,achieving a seamless integration of the virtual and the real.These two emerging technologies each possess unique advantages and exhibit broad development prospects.They have already begun to be applied in various aspects of medical education,such as basic theoretical teaching and skills training,with promising results.They can compensate for the shortcomings of traditional medical education,enhance students'learning enthusiasm and safety,and improve teaching effectiveness.However,limitations remain,such as the need for improved hardware infrastructure and a scarcity of teaching resources.Based on this,this paper systematically introduces the concepts of AR and VR technologies,reviews their application prospects,current status,advantages,and limitations in medical education,aiming to provide evidence-based support and feasible approaches for medical schools to develop digital teaching plans,promote educational reform,and drive research innovation.
3.A chest CT report conclusion generation system based on mT5 large language model for residency training
Yanfei HU ; Ai WANG ; Yaping ZHANG ; Keke ZHAO ; Zhijie PAN ; Qingyao LI ; Min XU ; Xifu WANG ; Xueqian XIE
Chinese Journal of Medical Education Research 2025;24(8):1016-1021
Objective:To fine-tune the mT5 (massively multilingual pre-trained text-to-text transformer) large language model, automatically generate report conclusions for teaching purposes from chest CT image descriptions, and assess the quality of automatically generated conclusions.Methods:The training set included 3 000 high-quality physical examination chest CT reports from one hospital, and the external validation set consisted of 600 physical examination chest CT reports from two other hospitals. Experienced radiology teaching physicians assessed the consistency between the generated conclusions and the original physician-written conclusions in the external validation set using a 5-point Likert scale across five linguistic indicators (correctness of examination information, correctness of lesion detection, standardization of terminology, applicability of the conclusions, and simplicity of conclusions). Using the original report conclusions as the reference, the accuracy of the conclusions generated based on the external validation set in describing four major thoracic conditions (pulmonary nodules, pneumonia, emphysema, pleural effusion) was evaluated. Perform chi square test using SPSS 25.0.Results:In the external validation set, the mean consistency score between the generated conclusions and the original conclusions given by the radiology teaching physicians was >4 points, indicating agreement with the original conclusions. In the generated conclusions, the description of the four major thoracic conditions demonstrated 0.95-1.00 (95% CI=0.91-1.00) accuracy, 0.76-1.00 (95% CI=0.59-1.00) sensitivity, and 0.97-1.00 (95% CI=0.91-1.00) specificity. Conclusions:The chest CT report conclusion generation system based on the mT5 large language model demonstrated high accuracy and is expected to provide immediate and efficient automated guidance for standardized residency training.
4.Swin2SR network for reconstructing chest super-resolution CT images
Qingyao LI ; Min XU ; Yaping ZHANG ; Lu ZHANG ; Lingyun WANG ; Zhijie PAN ; Xueqian XIE
Chinese Journal of Medical Imaging Technology 2025;41(5):739-743
Objective To observe the value of Swin2SR network based on Transformer architecture for reconstructing chest super-resolution CT images.Methods Chest CT data of 218 patients were retrospectively collected.Swin2SR model based on Transformer architecture was adopted to enhance standard 512 matrix(512 × 512)CT images(standard-512 group)into 1 024(SR-1 024 group)and 2 048(SR-2 048 group)matrix SR CT images,respectively.Subjective and objective evaluation of image quality were performed,and the results were compared among groups.Results The subjective scores of overall imaging quality and lesion clarity in SR-1 024 and SR-2 048 groups were both higher than those in standard-512 group(all P<0.05),while no significant difference was found between the former two(P>0.05).Meanwhile,no significant difference of objective indexes of imaging quality was observed among 3 groups(all P>0.05).Conclusion Swin2SR model could reconstruct chest SR CT images without increasing noise and improve imaging quality.
5.A chest CT report conclusion generation system based on mT5 large language model for residency training
Yanfei HU ; Ai WANG ; Yaping ZHANG ; Keke ZHAO ; Zhijie PAN ; Qingyao LI ; Min XU ; Xifu WANG ; Xueqian XIE
Chinese Journal of Medical Education Research 2025;24(8):1016-1021
Objective:To fine-tune the mT5 (massively multilingual pre-trained text-to-text transformer) large language model, automatically generate report conclusions for teaching purposes from chest CT image descriptions, and assess the quality of automatically generated conclusions.Methods:The training set included 3 000 high-quality physical examination chest CT reports from one hospital, and the external validation set consisted of 600 physical examination chest CT reports from two other hospitals. Experienced radiology teaching physicians assessed the consistency between the generated conclusions and the original physician-written conclusions in the external validation set using a 5-point Likert scale across five linguistic indicators (correctness of examination information, correctness of lesion detection, standardization of terminology, applicability of the conclusions, and simplicity of conclusions). Using the original report conclusions as the reference, the accuracy of the conclusions generated based on the external validation set in describing four major thoracic conditions (pulmonary nodules, pneumonia, emphysema, pleural effusion) was evaluated. Perform chi square test using SPSS 25.0.Results:In the external validation set, the mean consistency score between the generated conclusions and the original conclusions given by the radiology teaching physicians was >4 points, indicating agreement with the original conclusions. In the generated conclusions, the description of the four major thoracic conditions demonstrated 0.95-1.00 (95% CI=0.91-1.00) accuracy, 0.76-1.00 (95% CI=0.59-1.00) sensitivity, and 0.97-1.00 (95% CI=0.91-1.00) specificity. Conclusions:The chest CT report conclusion generation system based on the mT5 large language model demonstrated high accuracy and is expected to provide immediate and efficient automated guidance for standardized residency training.
6.Swin2SR network for reconstructing chest super-resolution CT images
Qingyao LI ; Min XU ; Yaping ZHANG ; Lu ZHANG ; Lingyun WANG ; Zhijie PAN ; Xueqian XIE
Chinese Journal of Medical Imaging Technology 2025;41(5):739-743
Objective To observe the value of Swin2SR network based on Transformer architecture for reconstructing chest super-resolution CT images.Methods Chest CT data of 218 patients were retrospectively collected.Swin2SR model based on Transformer architecture was adopted to enhance standard 512 matrix(512 × 512)CT images(standard-512 group)into 1 024(SR-1 024 group)and 2 048(SR-2 048 group)matrix SR CT images,respectively.Subjective and objective evaluation of image quality were performed,and the results were compared among groups.Results The subjective scores of overall imaging quality and lesion clarity in SR-1 024 and SR-2 048 groups were both higher than those in standard-512 group(all P<0.05),while no significant difference was found between the former two(P>0.05).Meanwhile,no significant difference of objective indexes of imaging quality was observed among 3 groups(all P>0.05).Conclusion Swin2SR model could reconstruct chest SR CT images without increasing noise and improve imaging quality.
7.Analysis of the application of VR and AR technologies in medical education
Jiaxian YUE ; Qingyao SHANG ; Jiaxiang LIU ; Xiyu KANG ; Xin WANG
China Medical Equipment 2025;22(7):172-176
VR technology can generate virtual,immersive,and interactive environments,allowing users to immerse themselves in these environments and interact with objects within them.AR technology,on the other hand,can accurately overlay virtual information onto real-world scenes,achieving a seamless integration of the virtual and the real.These two emerging technologies each possess unique advantages and exhibit broad development prospects.They have already begun to be applied in various aspects of medical education,such as basic theoretical teaching and skills training,with promising results.They can compensate for the shortcomings of traditional medical education,enhance students'learning enthusiasm and safety,and improve teaching effectiveness.However,limitations remain,such as the need for improved hardware infrastructure and a scarcity of teaching resources.Based on this,this paper systematically introduces the concepts of AR and VR technologies,reviews their application prospects,current status,advantages,and limitations in medical education,aiming to provide evidence-based support and feasible approaches for medical schools to develop digital teaching plans,promote educational reform,and drive research innovation.
8.NLUS-VQA: construction and evaluation of a visual question answering model for neonatal lung ultrasound diagnosis
Xuming TONG ; Jiangang CHEN ; Yiran WANG ; Xiqing ZHAO ; Yanhong YUAN ; Zishuo WANG ; Peng JIANG ; Qingyao XIONG ; Renxing LI ; Xueli WANG ; Jing LIU
Chinese Journal of Perinatal Medicine 2025;28(11):917-928
Objective:To develop and evaluate a medical visual question answering (VQA) model for neonatal lung ultrasound (LUS) images to enhance intelligent auxiliary diagnosis of neonatal pulmonary diseases.Methods:Using data from neonates admitted to Beijing Obstetrics and Gynecology Hospital, Capital Medical University (January 2023 to December 2024), an image-question-answer dataset comprising 251 LUS images was constructed [43 pneumonia (17.1%), 42 neonatal respiratory distress syndrome (16.7%), 83 transient tachypnea (33.1%), and 83 normal (33.1%) images] with a four-tier medical question-answer framework. Building upon the Qwen2.5-VL-7B base model and integrating LoRA fine-tuning with chain-of-thought prompting, we developed the NLUS-VQA model to enhance visual-language semantic alignment and enable stepwise clinical reasoning, achieving efficient small-sample adaptation. Model performance was comprehensively assessed through natural language generation metrics (BLEU-4, ROUGE-1/2/L), qualitative evaluation of characteristic recognition, and clinical consistency analysis.Results:(1) Quantitative evaluation demonstrated that NLUS-VQA achieved scores of 22.38 (BLEU-4), 48.26 (ROUGE-1), 22.40 (ROUGE-2), and 37.20 (ROUGE-L), representing significant improvements over baseline models. (2) Qualitatively, the model exhibited strong performance in identifying lung consolidation, coalescent B-lines, and snowflake signs, with its chain-of-thought strategy enhancing clinical interpretability and answer accuracy. (3) Clinically, NLUS-VQA achieved a Cohen's Kappa coefficient of 0.78 and diagnostic accuracy of 80.8% (21/26), indicating substantial agreement with clinical experts.Conclusion:The NLUS-VQA model demonstrates robust interpretability in recognizing key sonographic patterns (e.g. lung consolidation, confluent B-lines, and snowflake signs), providing a scalable framework for small-sample medical image analysis, though diagnostic performance on complex conditions remains limited by dataset scale and minority class representation.
9.A Case Report of Primary Hypertrophic Osteoarthropathy
Zongxuan ZHAO ; Liying SUN ; Jia CHEN ; Yanyuan WANG ; Dan CHEN ; Qingyao ZUO ; Wei DENG ; Wen TIAN
JOURNAL OF RARE DISEASES 2024;3(2):241-245
Primary hypertrophic osteoarthropathy(PHO)is a rare disease also known as pachydermo-periostosis.We reported a painless case whose diagnosis was confirmed by genetic test.A 24-year-old male presented a series of symptoms that first began at 14.He suffered from progressive clubbed-fingers accompa-nied by swelling of the wrist and ankle joints.Facial skin concentric thickening and alar nose broadening ap-peared simultaneously and increased progressively.He was also prone to acne and hyperhidrosis.X-rays showed thickening of the metacarpal and phalangeal bones,as well as symmetrical periosteal ossification of both the tibia and fibula.Clinical diagnosis of PHO is difficult because of the variable features.With acromeg-aly excluded,the diagnosis was confirmed by a genetic test.Whole exome sequencing revealed a heterozygous SLCO2A1 c.611C>T(p.Ser204Lue)and SLCO2A1 c.1602C>A(p.Asn534Lys)mutation from each par-ent.It suggests that primary hypertrophic osteoarthropathy should be considered for young limb hypertrophic patients especially when periosteal thickening signs were showed in X-ray.A confirmatory diagnosis can be made through the genetic test.
10.A meta-analysis of dose-response relationship between executive function and single exercise in children and adolescents
Qingyao SONG ; Ying YU ; Xiaofei FAN ; Ping SUN ; Wunian WANG
Chinese Mental Health Journal 2024;38(2):122-130
Objective:To examine the dose-response relationship between acute exercise and executive func-tion in children and adolescents.Methods:The experimental studies on the effect of acute exercise on the executive function of children and adolescents in CNKI,Weipu,PubMed,Scopus,Web of Science and EBSCO databases were searched,and meta-analysis was performed by using Review Manager 5.4 software.Results:A total of 14 articles containing 691 participants were included.Single exercise had a significant effect on improving the response of in-hibitory function[SMD=-0.78(-1.35,-0.25),P<0.01]and accuracy[SMD=0.91(0.27,1.55),P<0.01],and also had a significant effect on improving the refresh function response[SMD=-1.04(-2.01,-0.07),P<0.05]and the accuracy[SMD=1.16(0.39,1.93),P<0.01].The effect of static exercise,30 min and moderate intensity on improving the response of inhibition function in children and adolescents(-5.86,-1.41,-0.76),the effect of inhibition function accuracy(2.98,5.64,1.62)and the effect of refresh function accuracy(6.27,7.39,2.57)was the largest(Ps<0.05).Conclusion:Single exercise could improve inhibition and refresh function in the executive function in children and adolescents.

Result Analysis
Print
Save
E-mail