1.Development and validation of PhenoRAG: A visualization tool for automated human phenotype ontology term annotation based on large language models and retrieval-augmented generation technology.
Wei ZHONG ; Yousheng YAN ; Kai YANG ; Yan LIU ; Xinyu FU ; Zhengyang YAO ; Chenghong YIN
Chinese Journal of Medical Genetics 2026;43(1):36-43
OBJECTIVE:
To develop a user-friendly visualization application for the automatic annotation of Human Phenotype Ontology (HPO) terms based on large language models and retrieval-augmented generation (RAG) technology, and to validate its performance in an authoritative case dataset.
METHODS:
By integrating the domestic open-source large language model DeepSeek-V3 with RAG technology, an interactive web application was deployed on the Streamlit cloud platform. Using only the latest official HPO dataset as the data source, the lightweight sentence-embedding model BAAI/bge-small-en-v1.5 was employed to construct a FAISS vector index. During the online phase, a four-step closed-loop process is automatically completed: multilingual translation, phenotype phrase extraction, RAG candidate retrieval, term mapping, and official database validation. 121 English case reports publicly released by BMJ Case Reports and Oxford Medical Case Reports (with a gold-standard HPO set of 1 794 terms) were selected for application validation. Precision, recall, and F1 score were calculated and compared horizontally with traditional dictionary tools, standalone large language models, and the similar application "RAG-HPO". Finally, replace the model with the more advanced ChatGPT-5 and evaluate its performance on the newly extracted dataset.
RESULTS:
An HPO term automatic annotation visualization application named PhenoRAG, based on large language models and RAG technology, was successfully developed. Users can access it directly via a web link. Across the 112 cases, a total of 2 150 HPO terms were generated; 2,064 (96.0%) were fully validated by the official database, with a hallucination rate of 1.3% and an HPO ID-name mismatch rate of 2.7%. After deduplication, 1,906 terms remained for testing. The overall precision was 63.65%, recall was 67.34%, and F1 was 65.44%, significantly outperforming traditional annotation tools (F1: 0.45-0.49, P < 0.001). Although PhenoRAG's F1 was lower than that of RAG-HPO (F1 = 0.78, P < 0.001), which relies on a manually constructed synonym database of 54 000 entries plus the HPO dataset, it requires no additional dictionary maintenance and can be used without any background in computer programming. Moreover, after switching to the GPT-5 model, PhenoRAG exhibited no hallucination rate on the new dataset, and its F1 score significantly increased (P = 0.038).
CONCLUSION
Without constructing a synonym database, the PhenoRAG achieved high-accuracy automatic mapping from clinical text to standard HPO terms. It features a low usage threshold, free access, and a Chinese-language interface, and can directly serve rare disease diagnosis, genetic counseling, and research scenarios in China and worldwide, warranting further clinical promotion and multicenter validation.
Humans
;
Phenotype
;
Biological Ontologies
;
Language
;
Software
;
Large Language Models
2.Genetic analysis and prenatal diagnosis of structural brain abnormalities associated with TUBB gene c.155A>G variant.
Yifan LIU ; Wei SONG ; Xinlian WANG ; Yan RUAN ; Meng ZHANG ; Yujiao CHEN ; Yan LIU ; Puqing ZHANG ; Li WANG ; Yousheng YAN
Chinese Journal of Medical Genetics 2026;43(2):136-142
OBJECTIVE:
To explore the genotype-phenotype correlation in a Chinese family with structural brain abnormalities due to variant of the TUBB gene.
METHODS:
A family undergoing prenatal diagnosis at Beijing Obstetrics and Gynecology Hospital in October 2024 was selected as the study subject. Clinical data were collected. Amniotic fluid sample was subjected to chromosomal copy number variation sequencing (CNV-seq). Trio whole-exome sequencing (Trio-WES) was carried out on the amniotic fluid and parental blood samples, and candidate variant was verified by Sanger sequencing. This study was approved by the Medical Ethics Committee of the hospital (Ethics No.: 2023-KY-076-01).
RESULTS:
Both prenatal ultrasound and fetal MRI showed deviation of brain midline, unilateral lateral ventriculomegaly, and bilateral gyral asymmetry. Trio-WES revealed that the fetus has harbored a maternally derived heterozygous missense variant of the TUBB gene [NM_178014.4: c.155A>G (p.N52S)]. Sanger sequencing confirmed that the woman and a previously terminated fetus both harbored the same variant. Both the proband and two fetuses exhibited similar neuroimaging abnormalities including midline deviation and asymmetrical gyri. Based on the guidelines from the American College of Medical Genetics and Genomics (ACMG), the variant was classified as likely pathogenic (PM2_Supporting+PS2_Moderate+PS3).
CONCLUSION
The heterozygous c.155A>G (p.N52S) variant was the TUBB gene probably underlay the pathogenesis of the structural brain abnormalities in this family. Above findings have expanded the phenotypic spectrum associated with the variant and facilitated the prenatal diagnosis for this family.
Humans
;
Female
;
Pregnancy
;
Prenatal Diagnosis
;
Tubulin/genetics*
;
Adult
;
Brain/diagnostic imaging*
;
Male
;
Pedigree
;
DNA Copy Number Variations/genetics*
;
Exome Sequencing
;
Genetic Association Studies
;
Magnetic Resonance Imaging
3.A consensus on prenatal diagnosis and genetic counseling for chromosomal mosaicism.
Shaobin LIN ; Weiqiang LIU ; Li GUO ; Jun ZHANG ; Jian LU ; Hanbiao CHEN ; Yousheng WANG ; Yangyi CHEN ; Juntao SHEN ; Xiaoming WEI ; Huihui ZHU ; Aihua YIN
Chinese Journal of Medical Genetics 2022;39(8):797-802
With the extensive application of highly sensitive genetic techniques in the field of prenatal diagnosis, prenatal chromosomal mosaicisms including true fetal mosaicisms and confined placental mosaicisms are frequently identified in clinical settings, and the diagnostic criteria and principle of genetic counseling and clinical management for such cases may vary significantly among healthcare centers across the country. This not only has brought challenges to laboratory technician, genetic counselor and fetal medicine doctor, but can also cause confusion and anxiety of the pregnant woman and their family members. In this regard, we have formulated a consensus over the prenatal diagnosis and genetic counseling for chromosomal mosaicisms with the aim to promote more accurate and rational evaluation for fetal chromosomal mosaicisms in prenatal clinics.
Consensus
;
Female
;
Genetic Counseling
;
Humans
;
Mosaicism
;
Placenta
;
Pregnancy
;
Prenatal Diagnosis/methods*
4.Expression of METTL14 in epithelial ovarian cancer and the effect on cell proliferation, invasion and migration of A2780 and SKOV3 cells
Yousheng WEI ; Desheng YAO ; Li LI ; Yan LU ; Xinmei YANG ; Wenge ZHANG
Chinese Journal of Obstetrics and Gynecology 2022;57(1):46-56
Objective:To study the expression of methyltransferase-like protein 14 (METTL14) in epithelial ovarian cancer and its clinical significance, and to explore the effect of METTL14 expression on the proliferation, invasion and migration of ovarian cancer cells.Methods:Immunohistochemistry (IHC) was used to detect METTL14 expression in tumor tissue samples, and analyze the relationships among METTL14 expression, clinicopathological factors, and prognosis in ovarian cancer. Lentiviral vectors and small interfering RNA (siRNA) were used to up-regulate and down-regulate the METTL14 expression in ovarian cancer cell lines A2780 and SKOV3, respectively. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) method was used to detect the N6-methyladenosine (m6A) content in ovarian cancer cells. Cell counting kit-8 (CCK-8), wound healing assay, and transwell assay were used to examine the function of METTL14 expression in the cells.Results:(1) The IHC score of METTL14 protein was 6.2±3.7 in 20 samples of ovarian cancer tissues and 3.3±2.5 in 15 samples of normal ovarian tissues, and the difference was statistically significant ( t=-2.64, P=0.012). Among the patients who suffered from ovarian cancer, there were 69 cases with high expression of METTL14 protein (IHC score≥6), accounting for 57.0% (69/121), and the cases with low expression of METTL14 protein (IHC score<6) accounting for 43.0% (52/121). Compared with the patients with low expression of METTL14, the patients with high expression of METTL14 had later stages, higher rates of lymph node metastasis, abdominal metastasis, and more ascite amount. The differences were statistically significant (all P<0.05). The overall survival rate was significantly lower in patients with high METTL14 expression than the low expression ( P=0.009). (2) LC-MS/MS data showed that the relative expression of m6A in A2780 and SKOV3 cells in the lentivirus (LV)-METTL14 group were 0.213±0.024 and 0.181±0.018, which were significantly higher than those in the LV-normal control (NC) group (0.109±0.022 and 0.128±0.020; all P<0.05). While the relative expression of m6A in A2780 and SKOV3 cells in the si-METTL14 group were 0.063±0.012 and 0.069±0.015, which were significantly lower than the expression in si-NC group of 0.108±0.014 and 0.121±0.014 (all P<0.05). CCK-8 assay showed that the absorbance values were significantly lower in the si-METTL14 group compared with the si-NC group at 36, 48, 60 hours (all P<0.05); while were significantly increased in the LV-METTL14 group compared with the LV-NC group at 48, 60 hours (all P<0.01). Scratch wound assays showed that the migration rate of the si-METTL14 group was lower than those of the si-NC group, while the LV-METTL14 group were higher than the LV-NC group by 24 hours, the differences were statistically significant (all P<0.01). Cell migration and invasion were detected by transwell migration and invasion assays. After cultivated for 24 hours, the invasion cell number and the migration cell number in the si-METTL14 group were less than those in the si-NC group. While the invasion cell number and the migration cell number in the LV-METTL14 group were more than those in the LV-NC group, respectively. The differences were statistically significant (all P<0.01). Conclusion:Patients with high METTL14 expression have a worse prognosis in ovarian cancer, which may increase the m6A modification of ovarian cancer cells and promote cells proliferation, invasion and migration.
5.CHCHD2 maintains mitochondrial contact site and cristae organizing system stability and protects against mitochondrial dysfunction in an experimental model of Parkinson’s disease
Lin LU ; Hengxu MAO ; Miaomiao ZHOU ; Yuwan LIN ; Wei DAI ; Jiewen QIU ; Yousheng XIAO ; Mingshu MO ; Xiaoqin ZHU ; Zhuohua WU ; Zhong PEI ; Wenyuan GUO ; Pingyi XU ; Xiang CHEN
Chinese Medical Journal 2022;135(13):1588-1596
Background::Parkinson’s disease (PD) is the second most common neurodegenerative disease after Alzheimer’s dementia. Mitochondrial dysfunction is involved in the pathology of PD. Coiled-coil-helix-coiled-coil-helix domain-containing 2 (CHCHD2) was identified as associated with autosomal dominant PD. However, the mechanism of CHCHD2 in PD remains unclear.Methods::Short hairpin RNA (ShRNA)-mediated CHCHD2 knockdown or lentivirus-mediated CHCHD2 overexpression was performed to investigate the impact of CHCHD2 on mitochondrial morphology and function in neuronal tumor cell lines represented with human neuroblastoma (SHSY5Y) and HeLa cells. Blue-native polyacrylamide gel electrophoresis (PAGE) and two-dimensional sodium dodecyl sulfate-PAGE analysis were used to illustrate the role of CHCHD2 in mitochondrial contact site and cristae organizing system (MICOS). Co-immunoprecipitation and immunoblotting were used to address the interaction between CHCHD2 and Mic10. Serotype injection of adeno-associated vector-mediated CHCHD2 and 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine (MPTP) administration were used to examine the influence of CHCHD2 in vivo.Results::We found that the overexpression of CHCHD2 can protect against methyl-4-phenylpyridinium (MPP+)-induced mitochondrial dysfunction and inhibit the loss of dopaminergic neurons in the MPTP-induced mouse model. Furthermore, we identified that CHCHD2 interacted with Mic10, and overexpression of CHCHD2 can protect against MPP +-induced MICOS impairment, while knockdown of CHCHD2 impaired the stability of MICOS. Conclusion::This study indicated that CHCHD2 could interact with Mic10 and maintain the stability of the MICOS complex, which contributes to protecting mitochondrial function in PD.
6.A consensus recommendation for the interpretation and reporting of exome sequencing in prenatal genetic diagnosis
Yan ZHANG ; Weiqiang LIU ; Jun ZHANG ; Shaobin LIN ; Hui HUANG ; Victor Wei ZHANG ; Zhilin REN ; Yousheng WANG ; Yaping YANG ; Aihua YIN
Chinese Journal of Medical Genetics 2020;37(11):1205-1212
With the rapid development and adaptation of high-throughput sequencing in clinical settings, application of exome sequencing (ES) has been gradually expanded from pediatric to prenatal diagnosis in recent years. There is an urgent need to establish criteria for clinical grade ES in order to facilitate such a complex testing. The standardization of pre- and post-test consultation, quality control for sample processing process and validation of bioinformatics data analysis, and more importantly data interpretation and reporting, as well as appropriate reporting scope, is of great importance for health care stakeholders. To achieve this, a committee composed of a wide range of healthcare professionals has proposed an ES standard for prenatal diagnosis. This has provided expert opinion on the genetic counseling and reporting standards of prenatal ES for the purpose of applying ES technology in prenatal setting.
7. The value of the planned neoadjuvant radiotherapy or chemoradiotherapy for the non-radical resection of esophageal squamous cell carcinoma
Wenjie NI ; Wei DENG ; Zefen XIAO ; Zongmei ZHOU ; Xin WANG ; Dongfu CHEN ; Qinfu FENG ; Jun LIANG ; Jima LYU ; Nan BI ; Lei DENG ; Tao ZHANG ; Wenqing WANG ; Qi XUE ; Shugeng GAO ; Juwei MU ; Yousheng MAO ; Dali WANG ; Jun ZHAO ; Yushun GAO ; Jinfeng HUANG ; Fengwei TAN ; Liang ZHAO ; Fang LYU ; Guochao ZHANG
Chinese Journal of Oncology 2019;41(4):295-302
Objective:
The role of planned neoadjuvant radiotherapy or chemoradiotherapy in the non-radical resection of esophageal squamous cell carcinoma was unclear. The study aimed to evaluate their therapeutic effect and analyze the prognostic factors.
Methods:
We retrospectively analyzed the clinical data of locally advanced esophageal squamous cell carcinoma who received neoadjuvant radio therapy (33 patients) and concurrent chemoradiotherapy (119 patients) from January 2004 to December 2016 in our single-institution database.The survival rates were calculated by Kaplan-Meier method. The prognostic factors were analyzed by using Log rank test and Cox proportional hazards model.
Results:
The median follow-up was 29.8 months. One hundred and one patients survived more than 3 years. The rates of overall survival (OS) and disease-free survival (DFS) at 3 years were 63.9% and 55.6%, respectively.The rates of complete, partial and minimal pathological response of the primary tumor were 50.3%, 38.4%, 11.3%, the corresponding 3-year OS were 75.5%, 57.4%, 27.3% (
9.The value of the planned neoadjuvant radiotherapy or chemoradiotherapy for the non?radical resection of esophageal squamous cell carcinoma
Wenjie NI ; Wei DENG ; Zefen XIAO ; Zongmei ZHOU ; Xin WANG ; Dongfu CHEN ; Qinfu FENG ; Jun LIANG ; Jima LYU ; Nan BI ; Lei DENG ; Tao ZHANG ; Wenqing WANG ; Qi XUE ; Shugeng GAO ; Juwei MU ; Yousheng MAO ; Dali WANG ; Jun ZHAO ; Yushun GAO ; Jinfeng HUANG ; Fengwei TAN ; Liang ZHAO ; Fang LYU ; Guochao ZHANG
Chinese Journal of Oncology 2019;41(4):295-302
Objective The role of planned neoadjuvant radiotherapy or chemoradiotherapy in the non?radical resection of esophageal squamous cell carcinoma was unclear. The study aimed to evaluate their therapeutic effect and analyze the prognostic factors. Methods We retrospectively analyzed the clinical data of locally advanced esophageal squamous cell carcinoma who received neoadjuvant radio therapy ( 33 patients) and concurrent chemoradiotherapy (119 patients) from January 2004 to December 2016 in our single?institution database.The survival rates were calculated by Kaplan?Meier method. The prognostic factors were analyzed by using Log rank test and Cox proportional hazards model. Results The median follow?up was 29.8 months. One hundred and one patients survived more than 3 years. The rates of overall survival (OS) and disease?free survival ( DFS) at 3 years were 63.9% and 55.6%, respectively.The rates of complete, partial and minimal pathological response of the primary tumor were 50.3%, 38.4%, 11.3%, the corresponding 3?year OS were 75.5%, 57.4%, 27.3%( P<0.001) and 3?year DFS were 72.0%, 44.7%, 17.6%(P<0.001), respectively.The postoperative lymph node metastasis rate was 27.0%. The 3?year OS and DFS of the lymph node positive group was 45.6% and 32.8%, significantly lower than 70.8% and 63.7%of the negative group (both P<0.001).The 3?year OS and DFS of pathologic stage Ⅰ,Ⅱ,ⅢA,ⅢB andⅥ A were 76.2%, 57.4%, 64.7%, 35.0%, 33.3%( P<0.001) and 70.1%, 49.3%, 41.2%, 22.1%, 33.3%(P<0.001), respectively.The operation?related mortality was 3.3%. Multivariate analysis showed that chest pain, postoperative respiratory failure, pathological differentiation, more than 15 lymph node dissection and ypTNM stage were the independent prognostic factors of OS ( P<0.05 for all). Conclusions The planned neoadjuvant radiotherapy or chemoradiotherapy for the non?radical resection of advanced esophageal squamous cell carcinoma could result in favorable survival. The chest pain, postoperative respiratory failure, pathological differentiation, the number of lymph node resection and ypTNM stage are the independent prognostic factors of the prognosis of these patients.
10.Publication output of the new integrated strategy for schistosomiasis japonica control in China: a PubMed-based bibliometric assessment
Yao DENG ; Wei WANG ; Yili QIAN ; Hongru ZHU ; Yousheng LIANG
Global Health Journal 2019;3(1):4-8
Background:Although sustainable control since 1950s has achieved great successes,schistosomiasis japonica remains a major public health problem in China.Since 2004,a new integrated strategy was developed aiming to control the transmission of Schistosoma japonicum through the implementation of a package of interventions.To date,no systematic review or meta-analysis assessing the effectiveness of this new integrated strategy for schistosomiasis control in China has been published.We performed a PubMed-based bibliometric assessment of publications on the new integrated strategy for schistosomiasis japonica control in China,to understand the global transmissibility and sharing of the new integrated strategy.Methods:An in-depth bibliometric analysis of all publications on the new integrated strategy for schistosomiasis japonica control in China was performed through a PubMed search using the terms "schistosomiasis" and "China,"from January 1,2004 to August 31,2018.All titles and abstracts were read carefully,and the publications reporting the effectiveness,experiences,lessons,or problems of the new integrated strategy were included in the bibliometric analysis.Results:Overall,2,361 titles were screened,and 70 eligible publications were accessed for analyses,including 23 studies in English,published in 15 international journals,and 47 studies in Chinese with abstracts in English,published in 3 national journals.Chinese Journal of Schistosomiasis Control (Zhongguo Xue Xi Chong Bing Fang Zhi Za Zhi) published 60% of the research output,Research articles (48.6%) and short reports (37.1%) were the dominant manuscript types.Furthermore,471 contributing authors from 277 affiliations across 9 countries produced these 70 publications.Conclusion:This is the first PubMed-based quantitative analysis of the research output of the new integrated strategy,and our data indicate a low global transmissibility of Chinese new integrated strategy.We therefore call for more research outputs of the new integrated strategy for schistosomiasis japonica control in China to be communicated through international platforms.

Result Analysis
Print
Save
E-mail