1.Development and validation of PhenoRAG: A visualization tool for automated human phenotype ontology term annotation based on large language models and retrieval-augmented generation technology.
Wei ZHONG ; Yousheng YAN ; Kai YANG ; Yan LIU ; Xinyu FU ; Zhengyang YAO ; Chenghong YIN
Chinese Journal of Medical Genetics 2026;43(1):36-43
OBJECTIVE:
To develop a user-friendly visualization application for the automatic annotation of Human Phenotype Ontology (HPO) terms based on large language models and retrieval-augmented generation (RAG) technology, and to validate its performance in an authoritative case dataset.
METHODS:
By integrating the domestic open-source large language model DeepSeek-V3 with RAG technology, an interactive web application was deployed on the Streamlit cloud platform. Using only the latest official HPO dataset as the data source, the lightweight sentence-embedding model BAAI/bge-small-en-v1.5 was employed to construct a FAISS vector index. During the online phase, a four-step closed-loop process is automatically completed: multilingual translation, phenotype phrase extraction, RAG candidate retrieval, term mapping, and official database validation. 121 English case reports publicly released by BMJ Case Reports and Oxford Medical Case Reports (with a gold-standard HPO set of 1 794 terms) were selected for application validation. Precision, recall, and F1 score were calculated and compared horizontally with traditional dictionary tools, standalone large language models, and the similar application "RAG-HPO". Finally, replace the model with the more advanced ChatGPT-5 and evaluate its performance on the newly extracted dataset.
RESULTS:
An HPO term automatic annotation visualization application named PhenoRAG, based on large language models and RAG technology, was successfully developed. Users can access it directly via a web link. Across the 112 cases, a total of 2 150 HPO terms were generated; 2,064 (96.0%) were fully validated by the official database, with a hallucination rate of 1.3% and an HPO ID-name mismatch rate of 2.7%. After deduplication, 1,906 terms remained for testing. The overall precision was 63.65%, recall was 67.34%, and F1 was 65.44%, significantly outperforming traditional annotation tools (F1: 0.45-0.49, P < 0.001). Although PhenoRAG's F1 was lower than that of RAG-HPO (F1 = 0.78, P < 0.001), which relies on a manually constructed synonym database of 54 000 entries plus the HPO dataset, it requires no additional dictionary maintenance and can be used without any background in computer programming. Moreover, after switching to the GPT-5 model, PhenoRAG exhibited no hallucination rate on the new dataset, and its F1 score significantly increased (P = 0.038).
CONCLUSION
Without constructing a synonym database, the PhenoRAG achieved high-accuracy automatic mapping from clinical text to standard HPO terms. It features a low usage threshold, free access, and a Chinese-language interface, and can directly serve rare disease diagnosis, genetic counseling, and research scenarios in China and worldwide, warranting further clinical promotion and multicenter validation.
Humans
;
Phenotype
;
Biological Ontologies
;
Language
;
Software
;
Large Language Models
2.Research on the screening efficiency of Thalassemia based on an automated evaluation software.
Jun HU ; Huan LIANG ; Limei DUAN ; Jianqiang GAO
Chinese Journal of Medical Genetics 2026;43(4):281-287
OBJECTIVE:
To explore the efficacy of a Thalassemia risk assessment software for the screening of thalassemia mutation carriers and distribution of thalassemia genotypes detected by screening.
METHODS:
A total of 6 040 individuals were evaluated at Leshan Maternal and Child Health Care Hospital between 2022 and 2024 using the commonly used clinical thalassemia risk assessment method and the thalassemia screening software, respectively, and the performance indicators of the two methods were compared and analyzed against the result of thalassemia gene testing. This study was approved by the Ethics Committee of our hospital (Ethics No.: LfyLL[2022]005).
RESULTS:
The high-risk rate by the thalassemia screening software was 11.19%, with a sensitivity of 95.12%, specificity of 93.28%, positive predictive value of 43.20%, negative predictive value of 99.72%, and the area under the ROC curve (AUC) was 0.942. The thalassemia gene detection rate of the high-risk samples screened was 4.83%. The high-risk screening rate of the conventional method was 2.50%, with a sensitivity of 51.22%, specificity of 93.28%, positive predictive value of 80.79%, negative predictive value of 97.40%, and the AUC was 0.754. The thalassemia gene detection rate of the high-risk samples was 2.02%.
CONCLUSION
The software can effectively detect thalassemia carriers and significantly reduce the missed detection compared with conventional method, thereby significantly improve the efficacy of screening.
Humans
;
Thalassemia/diagnosis*
;
Software
;
Female
;
Genetic Testing/methods*
;
Male
;
Mutation
;
Adult
;
Genotype
;
ROC Curve
;
Risk Assessment
3.A bibliometric analysis of research productivity on Kawasaki disease in Southeast Asia: Trend and socioeconomic drivers.
Maria Llaine J. Callanta ; Karol Ann T. Baldo
Acta Medica Philippina 2026;60(2):33-40
OBJECTIVES
The increasing prevalence of Kawasaki disease in Southeast Asia (SEA) and its potential relation with Coronavirus Disease 2019 (COVID-19) infection resulted in heightened interest in KD in the region, thus, this paper aimed to determine the trend and the socioeconomic facilitators of scientific productivity of KD research within the region. Specifically, this article determined the number of publication and citations related to KD per country, institution, and journal. We also explored the networks of countries within the region to the rest of the world and the keywords mostly associated with KD research in the region. Lastly, correlation of these bibliometric indices with socioeconomic factors in the region was analyzed.
METHODSA literature search of KD papers in SEA was performed using Scopus database. We obtained bibliographic data from the available literature and visualized network of existing collaborations and keywords using VOSviewer software.
RESULTSA total of 196 papers were included in the study. Bibliometric analysis showed a rising trend in publication within the region, most of which were from institutions in Singapore and Thailand. The most common topics on KD studies included clinical features, complications, treatment, and comorbidities.
Country characteristics such as gross domestic product (GDP) per capita, research and development (R&D) expenditure (% GDP), and number of physician and R&D researchers were positively correlated with bibliometric indices of KD research in SEA. Moreover, number of international linkages was significantly associated with KD research productivity in the region.
CONCLUSIONIn summary, we showed an increasing trend of KD research in SEA. Funding allocation and capacity building are necessary to strengthen research productivity within the region.
Asia ; Asia, Southeastern ; Bibliometrics ; Capacity Building ; Coronavirus ; Covid-19 ; Database ; Disease ; Efficiency ; Gross Domestic Product ; Guanosine Diphosphate ; Infection ; Infections ; Literature ; Mucocutaneous Lymph Node Syndrome ; Paper ; Physicians ; Prevalence ; Publications ; Research ; Research Personnel ; Rest ; Singapore ; Socioeconomic Factors ; Software ; Thailand ; Therapeutics
4.Perspectives of University of Santo Tomas (UST) administrators toward the use of artificial intelligence (AI) in higher education: A study protocol.
Jose Ma. Rafael RAMOS ; Reinaluz MANALO ; Les CADUYAC ; Enya LUANSING ; Jazztine JORGE ; Fiona PEREZ ; Breanna SANTOS
Philippine Journal of Allied Health Sciences 2026;9(2):34-39
OBJECTIVES
This study aims to create a study protocol that will explore UST administrators’ perceptions of the benefits and risks of AI use in higher education learning environments.
METHODSA qualitative descriptive design will be employed, using semi-structured interviews with at least fifteen administrators selected through purposive sampling. Audio-recorded interviews will be transcribed verbatim and subjected to thematic analysis using NVivo software
RESULTSAdministrators from different college-level fields perceive and engage with AI across various academic contexts. Exploring these perceptions will allow guidance in the development of coherent, contextually grounded institutional policies that promote responsible GenAI use and support digital leadership in Philippine higher education.
Human ; Artificial Intelligence ; Universities ; Software ; Administrative Personnel ; Intelligence ; Risk ; Policy
5.Association between acupuncture and live birth rates after fresh embryo transfer: A cohort study based on different propensity score methods.
Xiao-Yan ZHENG ; Zi-Yi JIANG ; Yi-Ting LI ; Chao-Liang LI ; Hao ZHU ; Zheng YU ; Si-Yi YU ; Li-Li YANG ; Song-Yuan TANG ; Xing-Yu LÜ ; Fan-Rong LIANG ; Jie YANG
Journal of Integrative Medicine 2025;23(5):528-536
OBJECTIVE:
To explore the association between acupuncture during controlled ovarian hyperstimulation (COH) and the live birth rate (LBR) using different propensity score methods.
METHODS:
In this retrospective cohort study, eligible women who underwent a COH were divided into acupuncture and non-acupuncture groups. The primary outcome was LBR, as determined by propensity score matching (PSM). LBR was defined as the delivery of one or more living infants that reached a gestational age over 28 weeks after embryo transfer. The propensity score model encompassed 16 confounding variables. To validate the results, sensitivity analyses were conducted using three additional propensity score methods: propensity score adjustment, inverse probability weighting (IPW), and IPW with a "doubly robust" estimator.
RESULTS:
The primary cohort encompassed 9751 patients (1830 [18.76%] in the acupuncture group and 7921 [81.23%] in the non-acupuncture group). Following 1:1 PSM, a higher LBR was found in the acupuncture cohort (41.4% [755/1824] vs 36.4% [664/1824], with an odds ratio of 1.23 [95% confidence interval, 1.08-1.41]). Three additional propensity score methods produced essentially similar results. The risk of serious adverse events did not significantly differ between the two groups.
CONCLUSION
This retrospective study revealed an association between acupuncture and an increased LBR among patients undergoing COH, and that acupuncture is a safe and valuable treatment option. Please cite this article as: Zheng XY, Jiang ZY, Li YT, Li CL, Zhu H, Yu Z, Yu SY, Yang LL, Tang SY, Lü XY, Liang FR, Yang J. Association between acupuncture and live birth rates after fresh embryo transfer: A cohort study based on different propensity score methods. J Integr Med. 2025; 23(5):528-536.
Humans
;
Female
;
Propensity Score
;
Embryo Transfer
;
Adult
;
Acupuncture Therapy
;
Retrospective Studies
;
Pregnancy
;
Live Birth
;
Birth Rate
;
Cohort Studies
6.Impact of Endometrial Polyps on Pregnancy Outcomes in Patients with Endometriosis and Infertility: A Systematic Review and Meta-analysis.
Liang ZHANG ; Qian HAN ; Mei Ru BAO ; Ying WU
Biomedical and Environmental Sciences 2025;38(3):341-350
OBJECTIVE:
To evaluate the impact of endometrial polyps (EP) on postoperative pregnancy outcomes in infertile women with endometriosis (EMs).
METHODS:
PubMed, Embase, The Cochrane Library, CNKI, VIP, SinoMed, and WanFang Data databases were searched to include clinical studies on the effect of EP on pregnancy outcomes in patients with EMs, published before August 31, 2020. A meta-analysis was performed using Rev Man 5.3 software after two investigators independently screened the literature, extracted information, and evaluated the risk of bias of the included studies.
RESULTS:
The meta-analysis included ten studies (651 and 1,040 in the combined EP and uncomplicated EP groups, respectively). The spontaneous pregnancy rate, clinical pregnancy rate, and live birth rate were significantly lower in the group with combined EPs than in the group without combined EPs [Odd's ratio ( OR) = 0.63, 95% confidence interval ( CI): 0.50-0.80, P = 0.0001; OR = 0.63, 95% CI: 0.48-0.84, P = 0.001; OR = 0.63, 95% CI: 0.42-0.96, P = 0.03], and the rate of embryonic abortion was significantly higher than that in the uncomplicated EP group [ OR = 3.10, 95% CI: 1.52-6.32, P = 0.002].
CONCLUSION
EP may adversely affect pregnancy outcomes in patients with infertility and EMs. Even after surgical treatment, EP can still reduce natural pregnancy, clinical pregnancy, and live birth rates in infertile women with EMs and increase the risk of embryo arrest in these women.
Humans
;
Female
;
Pregnancy
;
Endometriosis/complications*
;
Pregnancy Outcome/epidemiology*
;
Polyps/complications*
;
Infertility, Female/etiology*
;
Pregnancy Rate
;
Uterine Diseases/complications*
7.Artificial intelligence-enhanced physics-based computational modeling technologies for proteins.
Baoyan LIU ; Shuai LI ; Hao SU ; Xiang SHENG
Chinese Journal of Biotechnology 2025;41(3):917-933
Computational modeling is an invaluable tool for mechanism analysis, directed engineering, and rational design of biological parts, metabolic networks, and even cellular systems. It can provide new technological solutions to address biological challenges at different levels and has become a central focus of research in biomanufacturing. In the computational modeling of proteins, which are the key parts in biological systems, the traditional physics-based methods (computer software and mathematical model) have been widely used to study the physical and chemical processes in the functioning of proteins, and have thus been recognized as a powerful tool for understanding complex biological systems and guiding experimental designs. As the scale of computational modeling continues to expand, traditional modeling techniques face difficulties in balancing computational accuracy and speed. In recent years, the explosive growth of biological data has made it possible to construct high-performance artificial intelligence (AI) models, which brings new opportunities to the computational modeling of proteins, and the AI-enhanced physics-based computational modeling technologies have emerged. This combined strategy not only incorporates the chemical knowledge and established physical principles but also is powerful in data processing and pattern recognition, which greatly improves the computational efficiency and prediction accuracy, as well as possesses stronger interpretation ability, transferability, and robustness. The AI-enhanced physics-based computational modeling technologies have already shown great potential and value in biocatalysis, paving a new way for the future development of biomanufacturing.
Artificial Intelligence
;
Proteins/chemistry*
;
Computer Simulation
;
Software
;
Computational Biology/methods*
8.Exogenous triggering with hCG/GnRHa improves outcomes of natural cycle IVF/ICSI in patients with diminished ovarian reserve: a propensity score matching and logistic regression analysis.
Xinyue CHANG ; Ningning YAO ; Yan ZHAO ; Yinfeng WANG ; Ancong WANG ; Huihui ZHANG ; Jing ZHANG
Journal of Southern Medical University 2025;45(7):1519-1526
OBJECTIVES:
To explore the effects of exogenous trigger (hCG/GnRHa) versus endogenous LH surge in natural cycle IVF/ICSI (NC-IVF/ICSI) for patients with diminished ovarian reserve (DOR).
METHODS:
A retrospective analysis was conducted on 1,118 NC-IVF/ICSI cycles from two reproductive centers between 2013 and 2024. Propensity score matching (PSM) and multivariate logistic regression were used to adjust for confounding factors. The trigger-day hormone threshold was determined using receiver operating characteristic (ROC) curve analysis. Outcome measures included oocyte retrieval rate, 2PN fertilization rate, clinical available embryo rate, high-quality embryo rate, fresh cycle clinical pregnancy rate (CPR), and live birth rate (LBR).
RESULTS:
After adjusting for confounders via PSM and logistic regression, the exogenous trigger group demonstrated significantly better outcomes across all the evaluated parameters (oocyte retrieval rate, 2PN fertilization rate, transferable embryo rate, high-quality embryo rate, fresh cycle CPR, and LBR) than the endogenous LH surge group (P<0.05). Age-stratified analysis revealed that for the entire cohort, exogenous triggering significantly increased the number of transferable embryos and high-quality embryos (P<0.001). In the 35-39 years old subgroup, exogenous triggering showed significant advantages in oocyte yield, high-quality embryo rate, CPR, and LBR (P<0.05) and resulted in the most pronounced improvement in LBR (OR=6.25, 95% CI: 1.34-29.23). ROC analysis established a decision-day LH threshold of 19.055 mIU/mL (AUC=0.945, specificity=93.3%) for precise stratification of the clinical pathways.
CONCLUSIONS
For DOR patients undergoing NC-IVF/ICSI, exogenous triggering comprehensively improves the treatment outcomes, particularly providing significant live birth benefits for women aged 35-40 years. An individualized protocol incorporating the LH threshold (19.055 mIU/mL) effectively enhances embryonic developmental potential and live birth rates.
Humans
;
Female
;
Ovarian Reserve
;
Pregnancy
;
Propensity Score
;
Retrospective Studies
;
Fertilization in Vitro
;
Sperm Injections, Intracytoplasmic
;
Chorionic Gonadotropin
;
Pregnancy Rate
;
Logistic Models
;
Ovulation Induction/methods*
;
Gonadotropin-Releasing Hormone
;
Adult
;
Oocyte Retrieval
9.Racial differences in treatment and prognosis of gastric signet ring cell carcinoma: analysis based on SEER and TCGA databases.
Shangping FANG ; Jiameng LIU ; Xingchen YUE ; Huan LI ; Wanning LI ; Xiaoyu TANG ; Pengju BAO
Journal of Southern Medical University 2025;45(8):1706-1717
OBJECTIVES:
To analyze the differences in the prognosis of gastric signet ring cell carcinoma (SRCC) among different races using the US Surveillance Epidemiology and End Results (SEER) database and The Cancer Genome Atlas (TCGA) database.
METHODS:
We analyzed the data of patients with gastric SRCC from the SEER database from 2000 to 2020, and divided the patients into cohorts of whites, blacks, Asians or Pacific Islanders, American Indians/Alaska Natives according to their race. The prognosis and treatment of the cohorts were evaluated using baseline demographic analysis, Kamplan-Meier survival curve, and nomogram analysis.
RESULTS:
We analyzed the data of a total of 2058 patients, including 8.6% blacks, 72.4% whites, 16.6% Asians or Pacific Islanders, 1.0% American Indians/Alaska Natives, and 1.4% other races. The tumor grade varied among different races, and the prevalence and survival rates of patients differed significantly across races. The differences in the white cohort were the most prominent, and all the differences were statistically significant (P<0.05). Racial differences were also noted in patient management and prognosis.
CONCLUSIONS
There are racial differences in tumor grades and prognosis of gastric SRCC, and these differences provide evidence for optimizing clinical diagnosis and treatment strategies for this malignancy.
Aged
;
Female
;
Humans
;
Male
;
Middle Aged
;
Carcinoma, Signet Ring Cell/therapy*
;
Databases, Factual
;
Prognosis
;
Racial Groups
;
SEER Program
;
Stomach Neoplasms/therapy*
;
Survival Rate
;
United States/epidemiology*
;
White
;
Asian American Native Hawaiian and Pacific Islander
;
American Indian or Alaska Native
;
Black or African American
10.AQMFB-DWT: A Preprocessing Technique for Removing Blink Artifacts Before Extracting Pain-evoked Potential EEG.
Wenjia GAO ; Dan LIU ; Qisong WANG ; Yongping ZHAO ; Jinwei SUN
Neuroscience Bulletin 2025;41(12):2285-2295
The pain-evoked potential electroencephalogram (EEG) is an effective electrophysiological indicator for pain assessment, yet its extraction is challenging due to interference from background activity and involuntary blinks. Although existing blink artifact-removal methods show efficacy, they face limitations such as the need for reference signals, neglect of individual differences, and reliance on user input, hindering their practical application in clinical pain assessments. In this paper, we propose a novel framework applying adaptive quadrature mirror filter banks (AQMFB) with discrete wavelet transform (DWT) to remove blink artifacts in pain EEG. Unlike traditional DWT methods that apply fixed wavelets across subjects, our method adapts wavelet construction based on the characteristics of EEG. Experimental results demonstrate that AQMFB-DWT outperforms four leading methods in removing blink artifacts with minimal distortion of pain information, all within an acceptable processing time. This technique is a valuable preprocessing step for enhancing the extraction of pain-evoked potentials.
Humans
;
Artifacts
;
Blinking/physiology*
;
Electroencephalography/methods*
;
Pain/diagnosis*
;
Male
;
Wavelet Analysis
;
Adult
;
Female
;
Evoked Potentials/physiology*
;
Young Adult
;
Brain/physiopathology*
;
Pain Measurement/methods*
;
Signal Processing, Computer-Assisted


Result Analysis
Print
Save
E-mail