1.Development and validation of PhenoRAG: A visualization tool for automated human phenotype ontology term annotation based on large language models and retrieval-augmented generation technology.
Wei ZHONG ; Yousheng YAN ; Kai YANG ; Yan LIU ; Xinyu FU ; Zhengyang YAO ; Chenghong YIN
Chinese Journal of Medical Genetics 2026;43(1):36-43
OBJECTIVE:
To develop a user-friendly visualization application for the automatic annotation of Human Phenotype Ontology (HPO) terms based on large language models and retrieval-augmented generation (RAG) technology, and to validate its performance in an authoritative case dataset.
METHODS:
By integrating the domestic open-source large language model DeepSeek-V3 with RAG technology, an interactive web application was deployed on the Streamlit cloud platform. Using only the latest official HPO dataset as the data source, the lightweight sentence-embedding model BAAI/bge-small-en-v1.5 was employed to construct a FAISS vector index. During the online phase, a four-step closed-loop process is automatically completed: multilingual translation, phenotype phrase extraction, RAG candidate retrieval, term mapping, and official database validation. 121 English case reports publicly released by BMJ Case Reports and Oxford Medical Case Reports (with a gold-standard HPO set of 1 794 terms) were selected for application validation. Precision, recall, and F1 score were calculated and compared horizontally with traditional dictionary tools, standalone large language models, and the similar application "RAG-HPO". Finally, replace the model with the more advanced ChatGPT-5 and evaluate its performance on the newly extracted dataset.
RESULTS:
An HPO term automatic annotation visualization application named PhenoRAG, based on large language models and RAG technology, was successfully developed. Users can access it directly via a web link. Across the 112 cases, a total of 2 150 HPO terms were generated; 2,064 (96.0%) were fully validated by the official database, with a hallucination rate of 1.3% and an HPO ID-name mismatch rate of 2.7%. After deduplication, 1,906 terms remained for testing. The overall precision was 63.65%, recall was 67.34%, and F1 was 65.44%, significantly outperforming traditional annotation tools (F1: 0.45-0.49, P < 0.001). Although PhenoRAG's F1 was lower than that of RAG-HPO (F1 = 0.78, P < 0.001), which relies on a manually constructed synonym database of 54 000 entries plus the HPO dataset, it requires no additional dictionary maintenance and can be used without any background in computer programming. Moreover, after switching to the GPT-5 model, PhenoRAG exhibited no hallucination rate on the new dataset, and its F1 score significantly increased (P = 0.038).
CONCLUSION
Without constructing a synonym database, the PhenoRAG achieved high-accuracy automatic mapping from clinical text to standard HPO terms. It features a low usage threshold, free access, and a Chinese-language interface, and can directly serve rare disease diagnosis, genetic counseling, and research scenarios in China and worldwide, warranting further clinical promotion and multicenter validation.
Humans
;
Phenotype
;
Biological Ontologies
;
Language
;
Software
;
Large Language Models
2.Genetic analysis and prenatal diagnosis of structural brain abnormalities associated with TUBB gene c.155A>G variant.
Yifan LIU ; Wei SONG ; Xinlian WANG ; Yan RUAN ; Meng ZHANG ; Yujiao CHEN ; Yan LIU ; Puqing ZHANG ; Li WANG ; Yousheng YAN
Chinese Journal of Medical Genetics 2026;43(2):136-142
OBJECTIVE:
To explore the genotype-phenotype correlation in a Chinese family with structural brain abnormalities due to variant of the TUBB gene.
METHODS:
A family undergoing prenatal diagnosis at Beijing Obstetrics and Gynecology Hospital in October 2024 was selected as the study subject. Clinical data were collected. Amniotic fluid sample was subjected to chromosomal copy number variation sequencing (CNV-seq). Trio whole-exome sequencing (Trio-WES) was carried out on the amniotic fluid and parental blood samples, and candidate variant was verified by Sanger sequencing. This study was approved by the Medical Ethics Committee of the hospital (Ethics No.: 2023-KY-076-01).
RESULTS:
Both prenatal ultrasound and fetal MRI showed deviation of brain midline, unilateral lateral ventriculomegaly, and bilateral gyral asymmetry. Trio-WES revealed that the fetus has harbored a maternally derived heterozygous missense variant of the TUBB gene [NM_178014.4: c.155A>G (p.N52S)]. Sanger sequencing confirmed that the woman and a previously terminated fetus both harbored the same variant. Both the proband and two fetuses exhibited similar neuroimaging abnormalities including midline deviation and asymmetrical gyri. Based on the guidelines from the American College of Medical Genetics and Genomics (ACMG), the variant was classified as likely pathogenic (PM2_Supporting+PS2_Moderate+PS3).
CONCLUSION
The heterozygous c.155A>G (p.N52S) variant was the TUBB gene probably underlay the pathogenesis of the structural brain abnormalities in this family. Above findings have expanded the phenotypic spectrum associated with the variant and facilitated the prenatal diagnosis for this family.
Humans
;
Female
;
Pregnancy
;
Prenatal Diagnosis
;
Tubulin/genetics*
;
Adult
;
Brain/diagnostic imaging*
;
Male
;
Pedigree
;
DNA Copy Number Variations/genetics*
;
Exome Sequencing
;
Genetic Association Studies
;
Magnetic Resonance Imaging
3.Genetic analysis of a de novo EFTUD2 variant causing Mandibulofacial dysostosis with microcephaly in a fetus.
Jianyu REN ; Xiaojiao GUAN ; Shuang LIU ; Yousheng YAN ; Shufa YANG
Chinese Journal of Medical Genetics 2026;43(4):288-294
OBJECTIVE:
To investigate the genetic etiology of a fetus diagnosed with Mandibulofacial dysostosis with microcephaly (MFDM).
METHODS:
A fetus that underwent prenatal diagnosis at Beijing Obstetrics and Gynecology Hospital, Capital Medical University, on May 19, 2025 was selected for analysis. Results of fetal ultrasound findings, chromosomal karyotyping, copy number variation sequencing (CNV-seq), and whole-exome sequencing (WES) were collected. Sanger sequencing was performed for familial validation of the pathogenic variant. The Human Protein Atlas (HPA), STRING, and Simple ClinVar databases were queried to characterize the biological features of the candidate gene. Three-dimensional structures of the wild-type and variant proteins were modeled and analyzed, and the evolutionary conservation of the affected amino acid was assessed using UGENE. Prenatal phenotypes associated with EFTUD2 variants were summarized through a review of the literature. This study was approved by the Ethics Committee of Beijing Obstetrics and Gynecology Hospital, Capital Medical University (Ethics No.: 2025-KY-029-01).
RESULTS:
At 23+2 weeks of gestation, ultrasound examination revealed bilateral microtia with low-set ears, mild micrognathia with a reduced mandibular-facial angle, a single umbilical artery, a slightly narrow aortic diameter, and trivial mitral regurgitation. Amniotic fluid karyotyping and CNV-seq showed no abnormalities. WES identified a de novo, previously unreported EFTUD2 variant, c.698dupA (p.V235Gfs*27), in the fetus. This frameshift variant is predicted to alter the structural integrity of the EFTUD2 protein. Literature review indicated that micrognathia and microtia or low-set ears are the most common sonographic features in fetuses with EFTUD2 variants, while secondary findings may include abnormal stomach bubble, cleft palate, single umbilical artery, gastrointestinal atresia, polyhydramnios, and reduced aortic diameter.
CONCLUSION
The EFTUD2: c.698dupA (p.V235Gfs*27) variant is likely the genetic cause underlying MFDM in this fetus.
Humans
;
Mandibulofacial Dysostosis/diagnostic imaging*
;
Microcephaly/diagnostic imaging*
;
Female
;
Pregnancy
;
Ribonucleoprotein, U5 Small Nuclear/chemistry*
;
Peptide Elongation Factors/chemistry*
;
Fetus
;
DNA Copy Number Variations/genetics*
;
Adult
;
Ultrasonography, Prenatal
4.Research Progress on the Mechanism of Regulating Glycolysis of Hepatic Stellate Cells Against Liver Fibrosis and the Prevention and Treatment of Traditional Chinese Medicine
Mengmeng HAO ; Lu LIU ; Langping YI ; Shuangwei LI ; Xin CHEN ; Hongying YANG ; Minghuang GAO ; Yousheng MO ; Weirong LI ; Qi WANG
Traditional Chinese Drug Research & Clinical Pharmacology 2024;35(7):1101-1106
Hepatic stellate cell(HSC)activation is a key link in the development of liver fibrosis.The metabolic reprogramming of activated HSC has become a hot topic in current research,especially the change of glycolysis is an important factor in regulating HSC activation.Based on the metabolic reprogramming in the process of HSC activation,this paper expounds the mechanism of regulating HSC activation and liver fibrosis through glycolysis,and reviews the research progress of traditional Chinese medicine and its active ingredients in regulating HSC glycolysis to prevent and treat liver fibrosis.Liver fibrosis is a complex pathological process involving multiple factors and pathways.From the perspective of regulating the glycolysis of activated HSC,it can provide a new idea for the development of anti-liver fibrosis drugs.
5.Progress of the Impact of Terahertz Radiation on Ion Channel Kinetics in Neuronal Cells.
Yanjiang LIU ; Xi LIU ; Yousheng SHU ; Yuguo YU
Neuroscience Bulletin 2024;40(12):1960-1974
In neurons and myocytes, selective ion channels in the plasma membrane play a pivotal role in transducing chemical or sensory stimuli into electrical signals, underpinning neural and cardiac functionality. Recent advancements in biomedical research have increasingly spotlighted the interaction between ion channels and electromagnetic fields, especially terahertz (THz) radiation. This review synthesizes current findings on the impact of THz radiation, known for its deep penetration and non-ionizing properties, on ion channel kinetics and membrane fluid dynamics. It is organized into three parts: the biophysical effects of THz exposure on cells, the specific modulation of ion channels by THz radiation, and the potential pathophysiological consequences of THz exposure. Understanding the biophysical mechanisms underlying these effects could lead to new therapeutic strategies for diseases.
Neurons/metabolism*
;
Animals
;
Ion Channels/radiation effects*
;
Humans
;
Terahertz Radiation
;
Kinetics
;
Cell Membrane/radiation effects*
6.Chinese thoracic surgery experts consensus on postoperative follow-up plans for esophageal squamous cell carcinoma
Longqi CHEN ; Xiaofei LI ; Jianhua FU ; Song ZHAO ; Yin LI ; Yousheng MAO ; Shuoyan LIU ; Zhentao YU ; Lijie TAN ; Hui LI ; Yongtao HAN ; Chun CHEN ; Mingqiang KANG ; Jian HU ; Zhigang LI ; Hecheng LI ; Renquan ZHANG ; Shidong XU ; Linyou ZHANG ; Kaican CAI
Chinese Journal of Clinical Thoracic and Cardiovascular Surgery 2022;29(02):141-149
Resection is one of the most important treatments for esophageal squamous cell carcinoma, and routine postoperative follow-up is an effective method for early detection and treatment of recurrent metastases, which can improve patients' quality of life and prognosis. This consensus aims to provide a reference for colleagues responsible for postoperative follow-up of esophageal squamous cell carcinoma patients in China, and further improve the standardization of the diagnosis and treatment of esophageal squamous cell carcinoma.
7.A consensus on prenatal diagnosis and genetic counseling for chromosomal mosaicism.
Shaobin LIN ; Weiqiang LIU ; Li GUO ; Jun ZHANG ; Jian LU ; Hanbiao CHEN ; Yousheng WANG ; Yangyi CHEN ; Juntao SHEN ; Xiaoming WEI ; Huihui ZHU ; Aihua YIN
Chinese Journal of Medical Genetics 2022;39(8):797-802
With the extensive application of highly sensitive genetic techniques in the field of prenatal diagnosis, prenatal chromosomal mosaicisms including true fetal mosaicisms and confined placental mosaicisms are frequently identified in clinical settings, and the diagnostic criteria and principle of genetic counseling and clinical management for such cases may vary significantly among healthcare centers across the country. This not only has brought challenges to laboratory technician, genetic counselor and fetal medicine doctor, but can also cause confusion and anxiety of the pregnant woman and their family members. In this regard, we have formulated a consensus over the prenatal diagnosis and genetic counseling for chromosomal mosaicisms with the aim to promote more accurate and rational evaluation for fetal chromosomal mosaicisms in prenatal clinics.
Consensus
;
Female
;
Genetic Counseling
;
Humans
;
Mosaicism
;
Placenta
;
Pregnancy
;
Prenatal Diagnosis/methods*
8.Construction of HMGA2 knockout model of human papillary thyroid cancer cell line TPC-1
Hong YONG ; Xiaojuan LI ; Shan JIN ; Yousheng LIU ; Yuntu WU ; Yinbao BAI ; La TA
Chinese Journal of Endocrine Surgery 2022;16(4):421-425
Objective:To construct a TPC-1 cell model that stably knocks out the HMGA2 by using CRISPR/Cas9 gene editing technology. Methods:Recombinant pLV[2gRNA]-EGFP:T2A:Puro- U6> {hHMGA2 [gRNA# A1]*}- U6>{hHMGA2 [gRNA#A2]*} of lentiviral plasmid vector was constructed: targeting HMGA2 Dual-gRNA sequence was designed, the synthesized Dual-gRNA fragment into pLV [2gRNA]-EGFP was cloned: T2A:Puro-U6 vector, extract a single clone for sequencing verification. the constructed recombinant plasmid vector with lentivirus was packed, and TPC-1 cells were infected, puromycin was used to obtain HMGA2 knock-out single clone, PCR and sequencing verification were performed, and real-time fluorescent quantitative qPCR was used to detect HMGA2 mRNA in cells Knockout efficiency. Results:After sequencing verification, pLV [2gRNA]-EGFP targeting HMGA2: T2A: Puro-U6>{hHMGA2 [gRNA#A1]*}-U6>{hHMGA2 [gRNA #A2]*} plasmid was successfully constructed; A single clone was picked for PCR identification and gene sequencing, TPC-1 cells were successfully obtained with HMGA2 gene completely knocked out; TPC-1 cells with HMGA2 knocked out were detected by real-time fluorescent quantitative qPCR, and they did not express HMGA2 mRNA.Conclusion:CRISPR/Cas9 gene editing technology enables us to construct a human papillary thyroid cancer cell line TPC-1 cell model with stable knockout of HMGA2.
9.Introduction and application of deep integration strategy of extracurricular scientific research management and information technology for undergraduates in military medical universities
Feifei WU ; Xiaoxia LIU ; Gaixia LI ; Xiaoxu JIANG ; Kaifeng LI ; Xiacheng SUN ; Fei TIAN ; Yousheng WU ; Li WANG ; Nannan LIU ; Haifeng ZHANG ; Yayun WANG
Chinese Journal of Medical Education Research 2022;21(6):664-668
This study deeply analyzes the common problems of three military medical universities in the management of undergraduate extracurricular scientific research, such as lack of communication means, limited online resources, backward laboratory opening and low utilization rate of equipment. We have built a cloud platform management system for undergraduate extracurricular scientific research. This system firstly sets up a teaching resources storage module including videos, PPTs, documents, pictures, electronic materials, question bank, etc. Then four subsystems for different roles of students, mentors, experimental teaching staff and administrators are constructed. Finally, this system realizes independent experiments by students, real-time evaluation by mentors, instrument sharing and efficient management through the seamless connection with the user terminal equipment. And the study also makes evaluation on the present usage.
10.Consensus on technological standards for non-invasive prenatal screening of pathogenic copy number variations by high-throughput sequencing of maternal plasma cell-free DNA.
Weiqiang LIU ; Jiexia YANG ; Jun ZHANG ; Jian LU ; Yangyi CHEN ; Hongmin ZHU ; Jiale XIANG ; Yousheng WANG ; Min WANG ; Juan WANG ; Qixi WU ; Aihua YIN
Chinese Journal of Medical Genetics 2021;38(7):613-619
Genomic disorders caused by pathogenic copy number variation (pCNV) have proven to underlie a significant proportion of birth defects. With technological advance, improvement of bioinformatics analysis procedure, and accumulation of clinical data, non-invasive prenatal screening of pCNV (NIPS-pCNV) by high-throughput sequencing of maternal plasma cell-free DNA has been put to use in clinical settings. Specialized standards for clinical application of NIPS-pCNV are required. Based on the discussion, 10 pCNV-associated diseases with well-defined conditions and 5 common chromosomal aneuploidy syndromes are recommended as the target of screening in this consensus. Meanwhile, a standardized procedure for NIPS-pCNV is also provided, which may facilitate propagation of this technique in clinical settings.
Aneuploidy
;
Cell-Free Nucleic Acids/genetics*
;
Consensus
;
DNA Copy Number Variations
;
Female
;
High-Throughput Nucleotide Sequencing
;
Humans
;
Pregnancy
;
Prenatal Diagnosis

Result Analysis
Print
Save
E-mail