1.Development and validation of PhenoRAG: A visualization tool for automated human phenotype ontology term annotation based on large language models and retrieval-augmented generation technology.
Wei ZHONG ; Yousheng YAN ; Kai YANG ; Yan LIU ; Xinyu FU ; Zhengyang YAO ; Chenghong YIN
Chinese Journal of Medical Genetics 2026;43(1):36-43
OBJECTIVE:
To develop a user-friendly visualization application for the automatic annotation of Human Phenotype Ontology (HPO) terms based on large language models and retrieval-augmented generation (RAG) technology, and to validate its performance in an authoritative case dataset.
METHODS:
By integrating the domestic open-source large language model DeepSeek-V3 with RAG technology, an interactive web application was deployed on the Streamlit cloud platform. Using only the latest official HPO dataset as the data source, the lightweight sentence-embedding model BAAI/bge-small-en-v1.5 was employed to construct a FAISS vector index. During the online phase, a four-step closed-loop process is automatically completed: multilingual translation, phenotype phrase extraction, RAG candidate retrieval, term mapping, and official database validation. 121 English case reports publicly released by BMJ Case Reports and Oxford Medical Case Reports (with a gold-standard HPO set of 1 794 terms) were selected for application validation. Precision, recall, and F1 score were calculated and compared horizontally with traditional dictionary tools, standalone large language models, and the similar application "RAG-HPO". Finally, replace the model with the more advanced ChatGPT-5 and evaluate its performance on the newly extracted dataset.
RESULTS:
An HPO term automatic annotation visualization application named PhenoRAG, based on large language models and RAG technology, was successfully developed. Users can access it directly via a web link. Across the 112 cases, a total of 2 150 HPO terms were generated; 2,064 (96.0%) were fully validated by the official database, with a hallucination rate of 1.3% and an HPO ID-name mismatch rate of 2.7%. After deduplication, 1,906 terms remained for testing. The overall precision was 63.65%, recall was 67.34%, and F1 was 65.44%, significantly outperforming traditional annotation tools (F1: 0.45-0.49, P < 0.001). Although PhenoRAG's F1 was lower than that of RAG-HPO (F1 = 0.78, P < 0.001), which relies on a manually constructed synonym database of 54 000 entries plus the HPO dataset, it requires no additional dictionary maintenance and can be used without any background in computer programming. Moreover, after switching to the GPT-5 model, PhenoRAG exhibited no hallucination rate on the new dataset, and its F1 score significantly increased (P = 0.038).
CONCLUSION
Without constructing a synonym database, the PhenoRAG achieved high-accuracy automatic mapping from clinical text to standard HPO terms. It features a low usage threshold, free access, and a Chinese-language interface, and can directly serve rare disease diagnosis, genetic counseling, and research scenarios in China and worldwide, warranting further clinical promotion and multicenter validation.
Humans
;
Phenotype
;
Biological Ontologies
;
Language
;
Software
;
Large Language Models
2.Research on the screening efficiency of Thalassemia based on an automated evaluation software.
Jun HU ; Huan LIANG ; Limei DUAN ; Jianqiang GAO
Chinese Journal of Medical Genetics 2026;43(4):281-287
OBJECTIVE:
To explore the efficacy of a Thalassemia risk assessment software for the screening of thalassemia mutation carriers and distribution of thalassemia genotypes detected by screening.
METHODS:
A total of 6 040 individuals were evaluated at Leshan Maternal and Child Health Care Hospital between 2022 and 2024 using the commonly used clinical thalassemia risk assessment method and the thalassemia screening software, respectively, and the performance indicators of the two methods were compared and analyzed against the result of thalassemia gene testing. This study was approved by the Ethics Committee of our hospital (Ethics No.: LfyLL[2022]005).
RESULTS:
The high-risk rate by the thalassemia screening software was 11.19%, with a sensitivity of 95.12%, specificity of 93.28%, positive predictive value of 43.20%, negative predictive value of 99.72%, and the area under the ROC curve (AUC) was 0.942. The thalassemia gene detection rate of the high-risk samples screened was 4.83%. The high-risk screening rate of the conventional method was 2.50%, with a sensitivity of 51.22%, specificity of 93.28%, positive predictive value of 80.79%, negative predictive value of 97.40%, and the AUC was 0.754. The thalassemia gene detection rate of the high-risk samples was 2.02%.
CONCLUSION
The software can effectively detect thalassemia carriers and significantly reduce the missed detection compared with conventional method, thereby significantly improve the efficacy of screening.
Humans
;
Thalassemia/diagnosis*
;
Software
;
Female
;
Genetic Testing/methods*
;
Male
;
Mutation
;
Adult
;
Genotype
;
ROC Curve
;
Risk Assessment
3.Building an artificial intelligence and digital ecosystem: a smart hospital's data-driven path to healthcare excellence.
Weien CHOW ; Narayan VENKATARAMAN ; Hong Choon OH ; Sandhiya RAMANATHAN ; Srinath SRIDHARAN ; Sulaiman Mohamed ARISH ; Kok Cheong WONG ; Karen Kai Xin HAY ; Jong Fong HOO ; Wan Har Lydia TAN ; Charlene Jin Yee LIEW
Singapore medical journal 2025;66(Suppl 1):S75-S83
Hospitals worldwide recognise the importance of data and digital transformation in healthcare. We traced a smart hospital's data-driven journey to build an artificial intelligence and digital ecosystem (AIDE) to achieve healthcare excellence. We measured the impact of data and digital transformation on patient care and hospital operations, identifying key success factors, challenges, and opportunities. The use of data analytics and data science, robotic process automation, AI, cloud computing, Medical Internet of Things and robotics were stand-out areas for a hospital's data-driven journey. In the future, the adoption of a robust AI governance framework, enterprise risk management system, AI assurance and AI literacy are critical for success. Hospitals must adopt a digital-ready, digital-first strategy to build a thriving healthcare system and innovate care for tomorrow.
Artificial Intelligence
;
Humans
;
Delivery of Health Care
;
Hospitals
;
Cloud Computing
;
Robotics
;
Internet of Things
;
Data Science
4.Automatic brain segmentation in cognitive impairment: Validation of AI-based AQUA software in the Southeast Asian BIOCIS cohort.
Ashwati VIPIN ; Rasyiqah BINTE SHAIK MOHAMED SALIM ; Regina Ey KIM ; Minho LEE ; Hye Weon KIM ; ZunHyan RIEU ; Nagaendran KANDIAH
Annals of the Academy of Medicine, Singapore 2025;54(8):467-475
INTRODUCTION:
Interpretation and analysis of magnetic resonance imaging (MRI) scans in clinical settings comprise time-consuming visual ratings and complex neuroimage processing that require trained professionals. To combat these challenges, artificial intelligence (AI) techniques can aid clinicians in interpreting brain MRI for accurate diagnosis of neurodegenerative diseases but they require extensive validation. Thus, the aim of this study was to validate the use of AI-based AQUA (Neurophet Inc., Seoul, Republic of Korea) segmentation software in a Southeast Asian community-based cohort with normal cognition, mild cognitive impairment (MCI) and dementia.
METHOD:
Study participants belonged to the community-based Biomarker and Cognition Study in Singapore. Participants aged between 30 and 95 years, having cognitive concerns, with no diagnosis of major psychiatric, neurological or systemic disorders who were recruited consecutively between April 2022 and July 2023 were included. Participants underwent neuropsychological assessments and structural MRI, and were classified as cognitively normal, with MCI or with dementia. MRI pre-processing using automated pipelines, along with human-based visual ratings, were compared against AI-based automated AQUA output. Default mode network grey matter (GM) volumes were compared between cognitively normal, MCI and dementia groups.
RESULTS:
A total of 90 participants (mean age at visit was 63.32±10.96 years) were included in the study (30 cognitively normal, 40 MCI and 20 dementia). Non-parametric Spearman correlation analysis indicated that AQUA-based and human-based visual ratings were correlated with total (ρ=0.66; P<0.0001), periventricular (ρ=0.50; P<0.0001) and deep (ρ=0.57; P<0.0001) white matter hyperintensities (WMH). Additionally, volumetric WMH obtained from AQUA and automated pipelines was also strongly correlated (ρ=0.84; P<0.0001) and these correlations remained after controlling for age at visit, sex and diagnosis. Linear regression analyses illustrated significantly different AQUA-derived default mode network GM volumes between cognitively normal, MCI and dementia groups. Dementia participants had significant atrophy in the posterior cingulate cortex compared to cognitively normal participants (P=0.021; 95% confidence interval [CI] -1.25 to -0.08) and in the hippocampus compared to cognitively normal (P=0.0049; 95% CI -1.05 to -0.16) and MCI participants (P=0.0036; 95% CI -1.02 to -0.17).
CONCLUSION
Our findings demonstrate high concordance between human-based visual ratings and AQUA-based ratings of WMH. Additionally, the AQUA GM segmentation pipeline showed good differentiation in key regions between cognitively normal, MCI and dementia participants. Based on these findings, the automated AQUA software could aid clinicians in examining MRI scans of patients with cognitive impairment.
Humans
;
Cognitive Dysfunction/pathology*
;
Magnetic Resonance Imaging/methods*
;
Male
;
Middle Aged
;
Female
;
Aged
;
Artificial Intelligence
;
Software
;
Dementia/diagnostic imaging*
;
Aged, 80 and over
;
Adult
;
Singapore
;
Neuropsychological Tests
;
Brain/pathology*
;
Cohort Studies
;
Gray Matter/pathology*
;
Southeast Asian People
5.Research on software development and smart manufacturing platform incorporating near-infrared spectroscopy for measuring traditional Chinese medicine manufacturing process.
Yan-Fei WU ; Hui XU ; Kai-Yi WANG ; Hui-Min FENG ; Xiao-Yi LIU ; Nan LI ; Zhi-Jian ZHONG ; Ze-Xiu ZHANG ; Zhi-Sheng WU
China Journal of Chinese Materia Medica 2025;50(9):2324-2333
Process analytical technology(PAT) is a key means for digital transformation and upgrading of the traditional Chinese medicine(TCM) manufacturing process, serving as an important guarantee for consistent and controllable TCM product quality. Near-infrared(NIR) spectroscopy has become the core technology for measuring the TCM manufacturing process. By incorporating NIR spectroscopy into PAT and starting from the construction of a smart platform for the TCM manufacturing process, this paper systematically described the development history and innovative application of the combination of NIR spectroscopy with chemometrics in measuring the TCM manufacturing process by the research team over the past two decades. Additionally, it explored the application of a validation method based on accuracy profile(AP) in the practice of NIR spectroscopy. Furthermore, the software development progress driven by NIR spectroscopy supported by modeling technology was analyzed, and the prospect of integrating NIR spectroscopy in smart factory control platforms was exemplified with the construction practices of related platforms. By integrating with the smart platform, NIR spectroscopy could improve production efficiency and guarantee product quality. Finally, the prospect of the smart platform application in measuring the TCM manufacturing process was projected. It is believed that the software development for NIR spectroscopy and the smart manufacturing platform will provide strong technical support for TCM digitalization and industrialization.
Spectroscopy, Near-Infrared/methods*
;
Drugs, Chinese Herbal/analysis*
;
Software
;
Medicine, Chinese Traditional
;
Quality Control
6.Research on motor imagery recognition based on feature fusion and transfer adaptive boosting.
Yuxin ZHANG ; Chenrui ZHANG ; Shihao SUN ; Guizhi XU
Journal of Biomedical Engineering 2025;42(1):9-16
This paper proposes a motor imagery recognition algorithm based on feature fusion and transfer adaptive boosting (TrAdaboost) to address the issue of low accuracy in motor imagery (MI) recognition across subjects, thereby increasing the reliability of MI-based brain-computer interfaces (BCI) for cross-individual use. Using the autoregressive model, power spectral density and discrete wavelet transform, time-frequency domain features of MI can be obtained, while the filter bank common spatial pattern is used to extract spatial domain features, and multi-scale dispersion entropy is employed to extract nonlinear features. The IV-2a dataset from the 4 th International BCI Competition was used for the binary classification task, with the pattern recognition model constructed by combining the improved TrAdaboost integrated learning algorithm with support vector machine (SVM), k nearest neighbor (KNN), and mind evolutionary algorithm-based back propagation (MEA-BP) neural network. The results show that the SVM-based TrAdaboost integrated learning algorithm has the best performance when 30% of the target domain instance data is migrated, with an average classification accuracy of 86.17%, a Kappa value of 0.723 3, and an AUC value of 0.849 8. These results suggest that the algorithm can be used to recognize MI signals across individuals, providing a new way to improve the generalization capability of BCI recognition models.
Brain-Computer Interfaces
;
Humans
;
Support Vector Machine
;
Algorithms
;
Neural Networks, Computer
;
Imagination/physiology*
;
Pattern Recognition, Automated/methods*
;
Electroencephalography
;
Wavelet Analysis
7.Research on intelligent fetal heart monitoring model based on deep active learning.
Bin QUAN ; Yajing HUANG ; Yanfang LI ; Qinqun CHEN ; Honglai ZHANG ; Li LI ; Guiqing LIU ; Hang WEI
Journal of Biomedical Engineering 2025;42(1):57-64
Cardiotocography (CTG) is a non-invasive and important tool for diagnosing fetal distress during pregnancy. To meet the needs of intelligent fetal heart monitoring based on deep learning, this paper proposes a TWD-MOAL deep active learning algorithm based on the three-way decision (TWD) theory and multi-objective optimization Active Learning (MOAL). During the training process of a convolutional neural network (CNN) classification model, the algorithm incorporates the TWD theory to select high-confidence samples as pseudo-labeled samples in a fine-grained batch processing mode, meanwhile low-confidence samples annotated by obstetrics experts were also considered. The TWD-MOAL algorithm proposed in this paper was validated on a dataset of 16 355 prenatal CTG records collected by our group. Experimental results showed that the algorithm proposed in this paper achieved an accuracy of 80.63% using only 40% of the labeled samples, and in terms of various indicators, it performed better than the existing active learning algorithms under other frameworks. The study has shown that the intelligent fetal heart monitoring model based on TWD-MOAL proposed in this paper is reasonable and feasible. The algorithm significantly reduces the time and cost of labeling by obstetric experts and effectively solves the problem of data imbalance in CTG signal data in clinic, which is of great significance for assisting obstetrician in interpretations CTG signals and realizing intelligence fetal monitoring.
Humans
;
Pregnancy
;
Female
;
Cardiotocography/methods*
;
Deep Learning
;
Neural Networks, Computer
;
Algorithms
;
Fetal Monitoring/methods*
;
Heart Rate, Fetal
;
Fetal Distress/diagnosis*
;
Fetal Heart/physiology*
8.A portable steady-state visual evoked potential brain-computer interface system for smart healthcare.
Yisen ZHU ; Zhouyu JI ; Shuran LI ; Haicheng WANG ; Yunfa FU ; Hongtao WANG
Journal of Biomedical Engineering 2025;42(3):455-463
This paper realized a portable brain-computer interface (BCI) system tailored for smart healthcare. Through the decoding of steady-state visual evoked potential (SSVEP), this system can rapidly and accurately identify the intentions of subjects, thereby meeting the practical demands of daily medical scenarios. Firstly, an SSVEP stimulation interface and an electroencephalogram (EEG) signal acquisition software were designed, which enable the system to execute multi-target and multi-task operations while also incorporating data visualization functionality. Secondly, the EEG signals recorded from the occipital region were decomposed into eight sub-frequency bands using filter bank canonical correlation analysis (FBCCA). Subsequently, the similarity between each sub-band signal and the reference signals was computed to achieve efficient SSVEP decoding. Finally, 15 subjects were recruited to participate in the online evaluation of the system. The experimental results indicated that in real-world scenarios, the system achieved an average accuracy of 85.19% in identifying the intentions of the subjects, and an information transfer rate (ITR) of 37.52 bit/min. This system was awarded third prize in the Visual BCI Innovation Application Development competition at the 2024 World Robot Contest, validating its effectiveness. In conclusion, this study has developed a portable, multifunctional SSVEP online decoding system, providing an effective approach for human-computer interaction in smart healthcare.
Brain-Computer Interfaces
;
Humans
;
Evoked Potentials, Visual/physiology*
;
Electroencephalography
;
Signal Processing, Computer-Assisted
;
Software
;
Adult
;
Male
9.Evaluation method and system for aging effects of autonomic nervous system based on cross-wavelet transform cardiopulmonary coupling.
Juntong LYU ; Yining WANG ; Wenbin SHI ; Pengyan TAO ; Jianhong YE
Journal of Biomedical Engineering 2025;42(4):748-756
Heart rate variability time and frequency indices are widely used in functional assessment for autonomic nervous system (ANS). However, this method merely analyzes the effect of cardiac dynamics, overlooking the effect of cardio-pulmonary interplays. Given this, the present study proposes a novel cardiopulmonary coupling (CPC) algorithm based on cross-wavelet transform to quantify cardio-pulmonary interactions, and establish an assessment system for ANS aging effects using wearable electrocardiogram (ECG) and respiratory monitoring devices. To validate the superiority of the proposed method under nonstationary and low signal-to-noise ratio conditions, simulations were first conducted to demonstrate the performance strength of the proposed method to the traditional one. Next, the proposed CPC algorithm was applied to analyze cardiac and respiratory data from both elderly and young populations, revealing that young populations exhibited significantly stronger couplings in the high-frequency band compared with their elderly counterparts. Finally, a CPC assessment system was constructed by integrating wearable devices, and additional recordings from both elderly and young populations were collected by using the system, completing the validation and application of the aging effect assessment algorithm and the wearable system. In conclusion, this study may offers methodological and system support for assessing the aging effects on the ANS.
Humans
;
Autonomic Nervous System/physiology*
;
Algorithms
;
Aging/physiology*
;
Electrocardiography/methods*
;
Heart Rate/physiology*
;
Wavelet Analysis
;
Aged
;
Signal Processing, Computer-Assisted
;
Wearable Electronic Devices
10.Clinical application standard of dynamic navigation technology in implant surgery.
Chinese Journal of Stomatology 2025;60(2):105-108
Dynamic navigation technology can "real-time guide" the implantologist to place the implant in the alveolar bone of the missing tooth area according to the preoperative design of the optimal site and path, making the whole implant surgery process more safe and precise. In order to further promote the standardized application of oral implant dynamic navigation technology, China Association of Gerontology and Geriatrics has convened distinguished experts to engage in deliberations and develop the standard. This standard covers the basic requirements, indications and contraindications, operation procedures, common complications and treatment measures, and accuracy verification. This standard can be used as a reference for the use of dynamic navigation technology in implant surgery.
Humans
;
Dental Implantation, Endosseous/standards*
;
Surgery, Computer-Assisted/standards*
;
Dental Implants
;
Surgical Navigation Systems/standards*

Result Analysis
Print
Save
E-mail