Search Results

1.Development and validation of PhenoRAG: A visualization tool for automated human phenotype ontology term annotation based on large language models and retrieval-augmented generation technology.

Wei ZHONG ; Yousheng YAN ; Kai YANG ; Yan LIU ; Xinyu FU ; Zhengyang YAO ; Chenghong YIN

Chinese Journal of Medical Genetics 2026;43(1):36-43

OBJECTIVE: To develop a user-friendly visualization application for the automatic annotation of Human Phenotype Ontology (HPO) terms based on large language models and retrieval-augmented generation (RAG) technology, and to validate its performance in an authoritative case dataset. METHODS: By integrating the domestic open-source large language model DeepSeek-V3 with RAG technology, an interactive web application was deployed on the Streamlit cloud platform. Using only the latest official HPO dataset as the data source, the lightweight sentence-embedding model BAAI/bge-small-en-v1.5 was employed to construct a FAISS vector index. During the online phase, a four-step closed-loop process is automatically completed: multilingual translation, phenotype phrase extraction, RAG candidate retrieval, term mapping, and official database validation. 121 English case reports publicly released by BMJ Case Reports and Oxford Medical Case Reports (with a gold-standard HPO set of 1 794 terms) were selected for application validation. Precision, recall, and F1 score were calculated and compared horizontally with traditional dictionary tools, standalone large language models, and the similar application "RAG-HPO". Finally, replace the model with the more advanced ChatGPT-5 and evaluate its performance on the newly extracted dataset. RESULTS: An HPO term automatic annotation visualization application named PhenoRAG, based on large language models and RAG technology, was successfully developed. Users can access it directly via a web link. Across the 112 cases, a total of 2 150 HPO terms were generated; 2,064 (96.0%) were fully validated by the official database, with a hallucination rate of 1.3% and an HPO ID-name mismatch rate of 2.7%. After deduplication, 1,906 terms remained for testing. The overall precision was 63.65%, recall was 67.34%, and F1 was 65.44%, significantly outperforming traditional annotation tools (F1: 0.45-0.49, P < 0.001). Although PhenoRAG's F1 was lower than that of RAG-HPO (F1 = 0.78, P < 0.001), which relies on a manually constructed synonym database of 54 000 entries plus the HPO dataset, it requires no additional dictionary maintenance and can be used without any background in computer programming. Moreover, after switching to the GPT-5 model, PhenoRAG exhibited no hallucination rate on the new dataset, and its F1 score significantly increased (P = 0.038). CONCLUSION Without constructing a synonym database, the PhenoRAG achieved high-accuracy automatic mapping from clinical text to standard HPO terms. It features a low usage threshold, free access, and a Chinese-language interface, and can directly serve rare disease diagnosis, genetic counseling, and research scenarios in China and worldwide, warranting further clinical promotion and multicenter validation.
Humans ; Phenotype ; Biological Ontologies ; Language ; Software ; Large Language Models

2.Influencing factors of resampling failure for non-invasive prenatal testing and its influence on pregnancy outcomes

Cheng LIU ; Yipeng WANG ; Liying ZOU ; Chenghong YIN

Chinese Journal of Obstetrics and Gynecology 2025;60(4):268-274

Objective:To evaluate the factors influencing test failure after resampling in non-invasive prenatal testing (NIPT) and to explore its impact on pregnancy outcomes.Methods:The information of pregnant women who failed to undergo NIPT for the first time and resampled for testing in Beijing Obstetrics and Gynecology Hospital, Capital Medical University from January 2018 to January 2022 were collected and retrospectively analyzed. According to the results of resampled NIPT, the pregnant women were divided into the failure group (170 cases) and the success group(485 cases), and the general clinical data and pregnancy outcomes of the two groups were compared.Results:(1) A total of 88 928 pregnant women underwent NIPT in Beijing Obstetrics and Gynecology Hospital during the study period, of which 1 299 (1.461%, 1 299/88 928) failed in the first NIPT. Among the 1 299 pregnant women who failed in the first NIPT, 720 were resampled for testing. Finally, 655 pregnant women who met the inclusion criteria and had complete clinical information and perinatal outcomes were collected. The success rate of resampling was 74.0% (485/655). Compared with the success group, the pregnant women in the failure group had a later gestational age at resampling, a higher pre-pregnancy body mass index (BMI) and a higher fetal fraction, and the differences were statistically significant (all P<0.001). (2) Among the 485 pregnant women in the success group, 130 cases (26.8%, 130/485) were detected with chromosome aneuploidy. Among the 170 pregnant women in the failure group, 8 cases had abnormal amniocentesis, 2 cases had abnormal maternal serum screening of aneuploidy in the second trimester, 3 cases had abnormal ultrasound anomaly removal, and 157 cases had no abnormality. (3) The incidence of fetal or neonatal malformation in the failure group was significantly higher than that in the success group [11.2% (19/170) vs 5.8% (28/485), P=0.019], but after adjusting for age and pre-pregnancy BMI, fetal or neonatal malformation was not associated with the success of resampling ( RR=0.675, 95% CI: 0.346-1.319; P=0.250). The incidences of gestational diabetes mellitus and hypertensive disorders in pregnancy in the failure group were significantly higher than those in the success group (all P<0.05), but after adjusting for age and pre-pregnancy BMI, only the incidence of gestational diabetes mellitus in the failure group was higher ( RR=0.630, 95% CI: 0.426-0.932; P=0.021). Conclusions:For pregnant women who failed the initial NIPT, the success of the resampling test is associated with pre-pregnancy BMI and the gestational week at the time of resampling. Those who failed the resampling test are more likely to develop gestational diabetes mellitus. When providing genetic counseling for pregnant women who failed the initial NIPT, it is important to consider the successful rate of resampling testing. The risk of chromosomal abnormalities should be comprehensively considered to develop further screening strategies.

3.Associations of systemic immune-inflammation index and systemic inflammation response index with maternal gestational diabetes mellitus: Evidence from a prospective birth cohort study.

Shuanghua XIE ; Enjie ZHANG ; Shen GAO ; Shaofei SU ; Jianhui LIU ; Yue ZHANG ; Yingyi LUAN ; Kaikun HUANG ; Minhui HU ; Xueran WANG ; Hao XING ; Ruixia LIU ; Wentao YUE ; Chenghong YIN

Chinese Medical Journal 2025;138(6):729-737

BACKGROUND: The role of inflammation in the development of gestational diabetes mellitus (GDM) has recently become a focus of research. The systemic immune-inflammation index (SII) and systemic inflammation response index (SIRI), novel indices, reflect the body's chronic immune-inflammatory state. This study aimed to investigate the associations between the SII or SIRI and GDM. METHODS: A prospective birth cohort study was conducted at Beijing Obstetrics and Gynecology Hospital from February 2018 to December 2020, recruiting participants in their first trimester of pregnancy. Baseline SII and SIRI values were derived from routine clinical blood results, calculated as follows: SII = neutrophil (Neut) count × platelet (PLT) count/lymphocyte (Lymph) count, SIRI = Neut count × monocyte (Mono) count/Lymph count, with participants being grouped by quartiles of their SII or SIRI values. Participants were followed up for GDM with a 75-g, 2-h oral glucose tolerance test (OGTT) at 24-28 weeks of gestation using the glucose thresholds of the International Association of Diabetes and Pregnancy Study Groups (IADPSG). Logistic regression was used to analyze the odds ratios (ORs) (95% confidence intervals [CIs]) for the the associations between SII, SIRI, and the risk of GDM. RESULTS: Among the 28,124 women included in the study, the average age was 31.8 ± 3.8 years, and 15.76% (4432/28,124) developed GDM. Higher SII and SIRI quartiles were correlated with increased GDM rates, with rates ranging from 12.26% (862/7031) in the lowest quartile to 20.10% (1413/7031) in the highest quartile for the SII ( Ptrend <0.001) and 11.92-19.31% for the SIRI ( Ptrend <0.001). The ORs (95% CIs) of the second, third, and fourth SII quartiles were 1.09 (0.98-1.21), 1.21 (1.09-1.34), and 1.39 (1.26-1.54), respectively. The SIRI findings paralleled the SII outcomes. For the second through fourth quartiles, the ORs (95% CIs) were 1.24 (1.12-1.38), 1.41 (1.27-1.57), and 1.64 (1.48-1.82), respectively. These associations were maintained in subgroup and sensitivity analyses. CONCLUSION The SII and SIRI are potential independent risk factors contributing to the onset of GDM.
Humans ; Female ; Pregnancy ; Diabetes, Gestational/immunology* ; Prospective Studies ; Adult ; Inflammation/immunology* ; Glucose Tolerance Test ; Birth Cohort

4.Influencing factors of resampling failure for non-invasive prenatal testing and its influence on pregnancy outcomes

Cheng LIU ; Yipeng WANG ; Liying ZOU ; Chenghong YIN

Chinese Journal of Obstetrics and Gynecology 2025;60(4):268-274

Objective:To evaluate the factors influencing test failure after resampling in non-invasive prenatal testing (NIPT) and to explore its impact on pregnancy outcomes.Methods:The information of pregnant women who failed to undergo NIPT for the first time and resampled for testing in Beijing Obstetrics and Gynecology Hospital, Capital Medical University from January 2018 to January 2022 were collected and retrospectively analyzed. According to the results of resampled NIPT, the pregnant women were divided into the failure group (170 cases) and the success group(485 cases), and the general clinical data and pregnancy outcomes of the two groups were compared.Results:(1) A total of 88 928 pregnant women underwent NIPT in Beijing Obstetrics and Gynecology Hospital during the study period, of which 1 299 (1.461%, 1 299/88 928) failed in the first NIPT. Among the 1 299 pregnant women who failed in the first NIPT, 720 were resampled for testing. Finally, 655 pregnant women who met the inclusion criteria and had complete clinical information and perinatal outcomes were collected. The success rate of resampling was 74.0% (485/655). Compared with the success group, the pregnant women in the failure group had a later gestational age at resampling, a higher pre-pregnancy body mass index (BMI) and a higher fetal fraction, and the differences were statistically significant (all P<0.001). (2) Among the 485 pregnant women in the success group, 130 cases (26.8%, 130/485) were detected with chromosome aneuploidy. Among the 170 pregnant women in the failure group, 8 cases had abnormal amniocentesis, 2 cases had abnormal maternal serum screening of aneuploidy in the second trimester, 3 cases had abnormal ultrasound anomaly removal, and 157 cases had no abnormality. (3) The incidence of fetal or neonatal malformation in the failure group was significantly higher than that in the success group [11.2% (19/170) vs 5.8% (28/485), P=0.019], but after adjusting for age and pre-pregnancy BMI, fetal or neonatal malformation was not associated with the success of resampling ( RR=0.675, 95% CI: 0.346-1.319; P=0.250). The incidences of gestational diabetes mellitus and hypertensive disorders in pregnancy in the failure group were significantly higher than those in the success group (all P<0.05), but after adjusting for age and pre-pregnancy BMI, only the incidence of gestational diabetes mellitus in the failure group was higher ( RR=0.630, 95% CI: 0.426-0.932; P=0.021). Conclusions:For pregnant women who failed the initial NIPT, the success of the resampling test is associated with pre-pregnancy BMI and the gestational week at the time of resampling. Those who failed the resampling test are more likely to develop gestational diabetes mellitus. When providing genetic counseling for pregnant women who failed the initial NIPT, it is important to consider the successful rate of resampling testing. The risk of chromosomal abnormalities should be comprehensively considered to develop further screening strategies.

5.Content and psychometric properties of functional measurements in patients with sarcopenia based on ICF

Hui LIU ; Hang YIN ; Chenghong HU ; Shaohui JIA ; Yewenzhe FENG ; Qingkui HU

Chinese Journal of Rehabilitation Theory and Practice 2024;30(3):273-280

Objective To explore the content and psychometric properties of sarcopenia measurements for older adults. Methods Using the International Classification of Functioning,Disability and Health(ICF)linking and coding rules,a content analysis were conducted on the seven types of measurement tools for geriatric sarcopenia,including Strength,Assistance with walking,Rising from a chair,Climbing stairs,and Falls(SARC-F);Ishii Test;Mini Sar-copenia Risk Assessment-7(MSRA-7);6-minute walk test(6MWT);Barthel Index(BI);Short Physical Perfor-mance Battery(SPPB);and World Health Organization Disability Assessment Schedule 2.0(WHODAS 2.0),and psychometric property was explored. Results The seven measurement tools can be categorized into body functions and structures,activities and participation,and comprehensive categories.ICF coding for SARC-F involved muscle power functions(b730),walking(d450),changing basic body position(d410),etc.Ishii Test involved muscle power functions(b730)and weight maintenance functions(b530).MSRA-7 involved walking(d450),involuntary movement reaction functions(b755),muscle endurance functions(b740),etc.ICF coding for 6MWT involved walking(d450).SPPB involved changing basic body position(d410),control voluntary movement functions(b760),and walking(d450).BI in-volved eating(d550),drinking(d560),washing oneself(d510),caring for body parts(d520),etc.WHODAS 2.0 involved attention functions(b140),acquiring skills(d155),communication-receiving-verbal messages(d310),community life(d910),etc.SARC-F was low to medium in sensitivity,and medium to excellent in specificity.Ishii Test was medium to good in sensitivity,average to good in specificity,and good in diagnostic performance.MSRA-7 was medium to good in sensitivity,average in specificity,and average to medium in diagnostic perfor-mance,while MSRA-5 was average to good in sensitivity,average in specificity,and medium in diagnostic per-formance.6MWT was average to medium in sensitivity,and average in specificity.SPPB was average in sensitiv-ity,average to medium in specificity,and average to excellent in diagnostic performance.BI was good in reliabili-ty,and WHODAS 2.0 was excellent in reliability. Conclusion The measurement tools for sarcopenia can be categorized into body functions and structures,activities and participation,and comprehensive assessment tools combining the above.The psychometric properties of the sev-en measurement tools are low to excellent in sensitivity and average to excellent in specificity.

6.Expert consensus on the clinical application strategy of NIPT2.0, a new-generation non-invasive prenatal screening technology

Chenming XU ; Chenghong YIN ; Aihua YIN ; Shanling LIU ; Yulin JIANG ; Qiong LUO ; Hua WANG ; Hefeng HUANG

Chinese Journal of Medical Genetics 2024;41(10):1155-1163

7.An evaluation of carrier detection for Spinal muscular atrophy using digital PCR assay

Yousheng YAN ; Chianru TAN ; Meng ZHANG ; Fang WANG ; Yipeng WANG ; Xinwen CHEN ; Chenghong YIN ; Yong GUO

Chinese Journal of Medical Genetics 2024;41(1):20-24

8.Discussion on the current situation and countermeasures of clinical research coordinator management mode in China

Rui JU ; Yinmei DAI ; Chenghong YIN

Chinese Journal of Medical Science Research Management 2024;37(4):344-350

9.Analysis of the current status of management in Investigator-Initiated Trials in China

Yue ZHANG ; Shuanghua XIE ; Ningning ZHANG ; Yingyuan ZHANG ; Chenghong YIN ; Wentao YUE

Chinese Journal of Medical Science Research Management 2024;37(4):356-360

10.Application of cluster analysis to evaluate the scientific research performance evaluation index system of departments in an obstetrics and gynecology hospital

Zhuo CHEN ; Wentao YUE ; Chenghong YIN

Chinese Journal of Medical Science Research Management 2023;36(2):99-104

Objective:The cluster analysis method was applied to evaluate the scientific research performance evaluation index system of departments in an obstetrics and gynecology hospital, and analyze the weaknesses of scientific research work of various departments, to provide a basis for the improvement of scientific research strength of departments.Methods:On the basis of the scientific research performance evaluation index system of departments in the obstetrics and gynecology hospital, the indicators and weights of the system were optimized through expert consultation, and the scientific research values from 2019 to 2021 were brought into the optimized scientific research performance evaluation index system to calculate the scientific research scores of each department, and then the cluster analysis method was applied to evaluate the index system.Results:Before and after the optimization of the scientific research performance evaluation index system of departments, the conformity with the discipline classification was 76.00% and 96.67% respectively ( P=0.039). In total of 30 departments were clustered into 4 categories: excellent (7), good (7), medium (4), and concerned (12). The average score of the total scientific research performance evaluation indicators of the 4 categories of departments was 15.022. The highest average score was for papers and monographs, and the lowest was for awards. Conclusions:This study applied the cluster analysis method to evaluate the scientific research performance evaluation index system of departments in the obstetrics and gynecology hospital and replaced quantitative indicators with quality indicators. It will optimize and improve the hospital hierarchical management methods, provide data support and classified guidance for the scientific research development of different categories of departments, and provide a reference basis for hospitals to formulate scientific research management policies such as achievement transformation, award, industry standard guidelines, etc.