1.QingNangTCM: a parameter-efficient fine-tuning large language model for traditional Chinese medicine
Xuming TONG ; Liyan LIU ; Yanhong YUAN ; Xiaozheng DING ; Huiru JIA ; Xu YANG ; Sio Kei IM ; Mini Han WANG ; Zhang XIONH ; Yapeng WANG
Digital Chinese Medicine 2026;9(1):1-12
Objective:
To develop QingNangTCM, a specialized large language model (LLM) tailored for expert-level traditional Chinese medicine (TCM) question-answering and clinical reasoning, addressing the scarcity of domain-specific corpora and specialized alignment.
Methods:
We constructed QnTCM_Dataset, a corpus of 100 000 entries, by integrating data from ShenNong_TCM_Dataset and SymMap v2.0, and synthesizing additional samples via retrieval-augmented generation (RAG) and persona-driven generation. The dataset comprehensively covers diagnostic inquiries, prescriptions, and herbal knowledge. Utilizing P-Tuning v2, we fine-tuned the GLM-4-9B-Chat backbone to develop QingNangTCM. A multi-dimensional evaluation framework, assessing accuracy, coverage, consistency, safety, professionalism, and fluency, was established using metrics such as bilingual evaluation understudy (BLEU), recall-oriented understudy for gisting evaluation (ROUGE), metric for evaluation of translation with explicit ordering (METEOR), and LLM-as-a-Judge with expert review. Qualitative analysis was conducted across four simulated clinical scenarios: symptom analysis, disease treatment, herb inquiry, and failure cases. Baseline models included GLM-4-9B-Chat, DeepSeek-V2, HuatuoGPT-II (7B), and GLM-4-9B-Chat (freeze-tuning).
Results:
QingNangTCM achieved the highest scores in BLEU-1/2/3/4 (0.425/0.298/0.137/0.064), ROUGE-1/2 (0.368/0.157), and METEOR (0.218), demonstrating a balanced and superior normalized performance profile of 0.900 across the dimensions of accuracy, coverage, and consistency. Although its ROUGE-L score (0.299) was lower than that of HuatuoGPT-II (7B) (0.351), it significantly outperformed domain-specific models in expert-validated win rates for professionalism (86%) and safety (73%). Qualitative analysis confirmed that the model strictly adheres to the “symptom-syndrome-pathogenesis-treatment” reasoning chain, though occasional misclassifications and hallucinations persisted when dealing with rare medicinal materials and uncommon syndromes.
Conclusion:
Combining domain-specific corpus construction with parameter-efficient prompt tuning enhances the reasoning behavior and domain adaptation of LLMs for TCM-related tasks. This work provides a technical framework for the digital organization and intelligent utilization of TCM knowledge, with potential value for supporting diagnostic reasoning and medical education.
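The BLEU-1/2/3/4 scores reported above are built on clipped (modified) n-gram precision. The following is a minimal sketch of that building block only, assuming whitespace-tokenized text and a single reference; it omits the brevity penalty and the geometric mean over n-gram orders, and it is not the evaluation code used in the study. The function names `ngrams` and `modified_precision` are illustrative.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, reference, n):
    """Clipped n-gram precision of a candidate against one reference.

    Each candidate n-gram is credited at most as many times as it
    occurs in the reference, which penalizes repeated words.
    """
    cand_counts = Counter(ngrams(candidate, n))
    ref_counts = Counter(ngrams(reference, n))
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    clipped = sum(min(count, ref_counts[gram]) for gram, count in cand_counts.items())
    return clipped / total


# Hypothetical example sentences, not taken from QnTCM_Dataset.
candidate = "the the the cat".split()
reference = "the cat sat".split()
print(modified_precision(candidate, reference, 1))  # unigram precision
print(modified_precision(candidate, reference, 2))  # bigram precision
```

Clipping explains why higher-order scores such as BLEU-4 are usually much lower than BLEU-1: longer n-grams match the reference far less often.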
2.Association between amino acids and primary malignant bone tumor: a Mendelian randomization study
LI Xiaoshan ; WANG Manyi ; ZHANG Huiru ; WANG Shuntao ; LIU Xinyue ; ZENG Guqing
Journal of Preventive Medicine 2025;37(12):1252-1256
Objective:
To investigate the causal association between amino acids and primary malignant bone tumor, and to explore the underlying mechanism.
Methods:
Genome-wide association study (GWAS) data for glycine, serine, arginine, glutamine, methionine, and leucine were sourced from the IEU OpenGWAS database and the GWAS Catalog. GWAS data for primary malignant bone tumor were obtained from the FinnGen database. Using each of the six amino acids as the exposure and primary malignant bone tumor as the outcome, two-sample Mendelian randomization (MR) analysis was performed with the inverse-variance weighted method as the primary approach. Multivariable MR analysis was employed to control for collinearity among amino acids. Sensitivity analyses were conducted using Cochran's Q test, MR-Egger regression, and the MR Steiger test. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis and protein-protein interaction network analysis were performed to explore potential mechanisms and identify key genes.
Results:
MR analysis indicated a statistically significant causal association between glycine and primary malignant bone tumor (OR=1.719, 95%CI: 1.083-2.728). No significant causal associations were found for the other five amino acids (all P>0.05). Multivariable MR analysis, after adjusting for the other five amino acids, confirmed a positive causal association between glycine and primary malignant bone tumor (OR=1.512, 95%CI: 1.125-2.031). Sensitivity analyses revealed no significant heterogeneity, horizontal pleiotropy, or reverse causality (all P>0.05). Genes associated with both glycine metabolism and primary malignant bone tumor were enriched in the JAK-STAT signaling pathway, with serine hydroxymethyltransferase 2 (SHMT2) identified as a key gene.
Conclusion:
Higher glycine levels may increase the risk of primary malignant bone tumor via the SHMT2-JAK-STAT pathway.
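The inverse-variance weighted (IVW) method named in the Methods can be illustrated with a short sketch. It uses the standard fixed-effect IVW formula on per-variant summary statistics and converts the log-odds estimate to an odds ratio with a 95% confidence interval. The input numbers are hypothetical, not the study's GWAS data, and the function names are illustrative.

```python
import math


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect IVW causal estimate from per-variant summary statistics.

    beta_exposure: variant effects on the exposure (e.g. an amino acid level)
    beta_outcome:  variant effects on the outcome (log-odds scale)
    se_outcome:    standard errors of the outcome effects
    Returns (beta_ivw, se_ivw) on the log-odds scale.
    """
    numerator = sum(bx * by / se ** 2
                    for bx, by, se in zip(beta_exposure, beta_outcome, se_outcome))
    denominator = sum(bx ** 2 / se ** 2
                      for bx, se in zip(beta_exposure, se_outcome))
    return numerator / denominator, math.sqrt(1.0 / denominator)


def odds_ratio_with_ci(beta, se, z=1.96):
    """Convert a log-odds estimate to (OR, lower, upper) for a 95% CI."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)


# Hypothetical summary statistics for three instrumental variants.
bx = [0.10, 0.20, 0.15]
by = [0.05, 0.11, 0.07]
se = [0.02, 0.03, 0.025]

beta, se_beta = ivw_estimate(bx, by, se)
print(odds_ratio_with_ci(beta, se_beta))
```

Because the interval is symmetric on the log scale, the reported OR should sit near the geometric mean of its confidence limits, which is a quick consistency check for results such as OR=1.719 (1.083-2.728).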
3.Construction and gene identification of CSF1R+/- mice
Yuanyuan Zhou ; Chong Liu ; Anqi Wang ; Huiru Zhang ; Jiaqi Qiu ; Mengjuan Zhu ; Jiajie Tu
Acta Universitatis Medicinalis Anhui 2025;60(5):884-889
Objective:
To construct CSF1R+/- mice and analyze their genotypes, so as to provide an animal model for studying disease mechanisms and drug targets.
Methods:
A linearized targeting vector was designed according to the Cre/LoxP system. A LoxP site was inserted upstream of the 5th exon of the CSF1R gene, and a neomycin resistance cassette flanked by LoxP sites was inserted downstream of the 5th exon. The linearized targeting vector was electroporated into embryonic stem cells. The correctly targeted embryonic stem cells were injected into the blastocysts of C57BL/6J mice to obtain chimeric mice, which were bred with Zp3-Cre mice. The newborn mice were numbered and tail-clipped 9-14 days after birth. DNA was extracted, and the genotypes were identified by polymerase chain reaction and agarose gel electrophoresis. The expression of CSF1R in mouse macrophages was detected by flow cytometry. The expression of CSF1R in mouse tissues was detected by Western blot.
Results:
Agarose gel electrophoresis showed that a 453 bp band was amplified in wild-type (WT) mice, and 453 bp and 650 bp bands were amplified in heterozygous mice. Flow cytometry showed that the expression of CSF1R in peritoneal macrophages and bone marrow-derived macrophages of CSF1R heterozygous mice was lower than that of the WT group (P<0.05). Western blot showed that the expression of CSF1R in spleen, kidney, and brain tissue of the CSF1R heterozygous group was lower than that of the WT group (P<0.05).
Conclusion:
CSF1R+/- mice were successfully constructed, bred, and identified, providing an animal model for further revealing the potential mechanism of CSF1R in immune regulation.
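The band pattern reported in the Results (453 bp in wild-type mice; 453 bp and 650 bp in heterozygous mice) maps directly onto a simple genotype-calling rule. The sketch below is illustrative only: the function name and the 20 bp tolerance for reading band sizes off a gel are assumptions, and any pattern not described in the abstract is returned as undetermined rather than guessed.

```python
def call_genotype(band_sizes_bp, tolerance_bp=20):
    """Call a CSF1R genotype from observed PCR band sizes (in bp).

    Uses only the two patterns stated in the abstract:
      453 bp alone      -> wild type
      453 bp and 650 bp -> heterozygous
    tolerance_bp is an assumed allowance for gel reading error.
    """
    def has_band(expected):
        return any(abs(size - expected) <= tolerance_bp for size in band_sizes_bp)

    wild_type_band = has_band(453)
    targeted_band = has_band(650)
    if wild_type_band and targeted_band:
        return "heterozygous"
    if wild_type_band:
        return "wild type"
    return "undetermined"


print(call_genotype([453]))       # wild type
print(call_genotype([650, 453]))  # heterozygous
```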
4.Mendelian randomization and GEO database identification analysis based on potential therapeutic targets for chronic obstructive pulmonary disease
Xianwei JIANG ; Minghang WANG ; Huiru LI ; Xiaosheng DONG ; Yuanyuan LIU
Journal of Jilin University(Medicine Edition) 2025;51(4):1072-1083
Objective: To screen key genetic, diagnostic, and therapeutic targets for chronic obstructive pulmonary disease (COPD) using microarray datasets and the Mendelian randomization (MR) method, and to provide evidence for the clinical diagnosis and treatment of COPD. Methods: Four COPD gene expression profile datasets were obtained from the Gene Expression Omnibus (GEO) database. The data were processed and normalized using R software, and differentially expressed genes (DEGs) were screened. MR analysis was performed to explore the causal relationship between COPD and expression quantitative trait loci (eQTL), and the intersection with DEGs was taken to identify potential key targets. Gene Set Enrichment Analysis (GSEA), Gene Ontology (GO) functional enrichment analysis, and Kyoto Encyclopedia of Genes and Genomes (KEGG) signaling pathway enrichment analysis were conducted to investigate the functional roles and pathways of the key targets, and external datasets were used to validate their expression. Results: A total of 1 571 DEGs were screened, including 820 upregulated genes and 751 downregulated genes. MR analysis identified 286 COPD-related genes, and intersection with DEGs revealed 3 upregulated genes: diacylglycerol kinase gamma (DGKG), neurofilament heavy polypeptide (NEFH), and Fc receptor like B (FCRLB); and 6 downregulated genes: STEAP4 metalloreductase (STEAP4), pleckstrin homology domain containing family F member 2 (PLEKHF2), CD3d molecule (CD3D), transgelin 2 (TAGLN2), tripartite motif containing 22 (TRIM22), and ribosomal protein L9 (RPL9). Biological function analysis indicated that these genes were mainly involved in pathways such as iron ion transport into cells, oxidoreductase activity, primary immunodeficiency, and Th1 and Th2 cell differentiation. The MR analysis results confirmed the causal relationship between these targets and COPD. External validation showed that, compared with healthy controls, the expression level of FCRLB in COPD samples was significantly increased (P<0.01), while the expression levels of CD3D and RPL9 were significantly decreased (P<0.05 or P<0.01), which was consistent with the MR analysis results and supports the reliability of this study. Conclusion: DGKG, NEFH, FCRLB, STEAP4, PLEKHF2, CD3D, TAGLN2, TRIM22, and RPL9 may serve as important regulatory factors and clinical diagnostic/therapeutic targets in the pathogenesis of COPD, providing clues for early screening, diagnosis, and targeted treatment of COPD.
5.Construction, breeding, and gene identification of microRNA-22-3p knockout mice
Anqi Wang ; Huiru Zhang ; Yuanyuan Zhou ; Chong Liu ; Yizhao Chen ; Jiajie Tu
Acta Universitatis Medicinalis Anhui 2025;60(6):1052-1058
Objective:
To construct microRNA (miR)-22 gene knockout (miR-22-/-) mice using CRISPR/Cas9 technology, to breed miR-22-/- mice, and to identify their genotypes.
Methods:
CRISPR/Cas9 technology was used to construct miR-22-/- genetically engineered mice. After gene identification, the F0 generation miR-22-/- mice were mated with wild-type mice in the same litter to obtain F1 generation miR-22-/- mice. The miR-22 knockout efficiency was analyzed at the RNA level by real-time fluorescence quantitative polymerase chain reaction (qPCR). Western blot was used to detect the interaction between miR-22 and target genes.
Results:
miR-22-/- mice were successfully constructed using CRISPR/Cas9 technology, gene identification was performed on the bred mice, and three stable genotypes (miR-22+/+, miR-22+/-, and miR-22-/-) were identified. qPCR confirmed that miR-22-/- mice showed almost no expression of miR-22 in heart, liver, lung, kidney, spleen, and thymus tissues compared with wild-type mice in the same litter. Western blot analysis showed that the relative expression level of NLRP3 protein in miR-22-/- mouse tissues was lower than that in wild-type mice.
Conclusion:
A miR-22-/- mouse model was successfully constructed, and genetically stable homozygous miR-22-/- mice were obtained. This indicates that miR-22 has an inhibitory effect on the downstream target gene NLRP3.
6.Breeding and genotype identification of CCR2 knockout mice
Huiru Zhang ; Anqi Wang ; Chong Liu ; Yuanyuan Zhou ; Hui Xue ; Jiajie Tu
Acta Universitatis Medicinalis Anhui 2025;60(7):1167-1172
Objective:
To explore the breeding and genotyping of CCR2 knockout mice, and to verify the applicability of the polymerase chain reaction (PCR) method for genotype detection of CCR2 knockout mice.
Methods:
The introduced homozygous CCR2 knockout male mice were mated with wild-type female mice to produce offspring, and the resulting F1 generation heterozygous mice were intercrossed. DNA was extracted from tail tissue clipped at 2 weeks of age, the target gene fragment was amplified by PCR, and the genotypes were determined by agarose gel electrophoresis. The proportion of homozygous progeny carrying the CCR2 knockout gene was increased by genetic crosses. The effect of CCR2 knockout in the progeny mice was verified by Western blot in major immune cells and key organs, and flow cytometry of the major immune cells was used to detect whether knockout of the CCR2 gene affected the function of the immune system.
Results:
CCR2 knockout mice were successfully bred and characterized, and three genotypes of F2 generation mice were obtained: CCR2+/+, CCR2+/-, and CCR2-/-. The offspring genotypes were identified by PCR, and Western blot showed extremely low CCR2 protein expression in CCR2 knockout mice. Flow cytometry showed that CCR2 knockout reduced the proportions of CD4+ T cells and Th1 cells among mouse spleen-derived T cells, but did not affect macrophage function.
Conclusion:
Correct breeding and identification are important for obtaining homozygous CCR2 knockout mice, and the PCR method for identifying mouse genotypes is simple, fast, and reliable.
7.Construction and gene identification of myeloid-specific Spi1 knockout mice
Xuming WU ; Huihui WANG ; Xiangling ZHU ; Yuanyuan ZHOU ; Anqi WANG ; Huiru ZHANG ; Chong LIU ; Jiajie TU
Acta Universitatis Medicinalis Anhui 2024;59(3):413-417
Objective To construct myeloid-specific Spi1 gene knockout mice and analyze their genotypes, so as to provide an animal model for studying the pathological mechanisms of diseases and drug targets. Methods According to the principles of CRISPR/Cas9 technology and the Cre/LoxP system, sgRNA and donor vectors were designed and constructed. Exon 2 was used as the knockout region, and LoxP elements were placed on both sides of Exon 2. Cas9 protein, sgRNA, and the donor vector were mixed and microinjected into fertilized eggs of C57BL/6J mice; the fertilized eggs were transplanted into the uterus of C57BL/6J pregnant female mice, and the F0 generation was obtained after 19-20 days. Positive F0 mice were mated with C57BL/6J mice to obtain stable F1 Spi1flox/+ mice. F1 Spi1flox/+ mice were intercrossed to obtain Spi1flox/flox mice. Spi1flox/flox mice were mated with Lyz2-Cre+ mice to obtain Spi1flox/+/Lyz2-Cre+ mice, which were then mated with Spi1flox/flox mice; the resulting Spi1flox/flox/Lyz2-Cre+ mice were myeloid-specific Spi1 gene knockout (KO) mice. Spi1flox/flox/Lyz2-Cre- mice were used as wild-type (WT) mice. DNA of WT and KO mice was extracted, and the genotypes were identified by agarose gel electrophoresis after PCR amplification. Western blot was used to detect the expression of spleen focus forming virus proviral integration oncogene, Spi-1/purine rich box-1 (PU.1), in immune cells of WT and KO mice. Results PCR identification showed that mice with only a 220 bp band amplified by the flox primers were Spi1flox/flox homozygotes, and mice with a 700 bp band amplified by the Lyz2-Cre primers were Lyz2-Cre+. Western blot showed that, compared with the WT group, PU.1 protein was not expressed in bone marrow-derived macrophages (BMDMs) and peritoneal macrophages (PM) in the KO group (P<0.01). There was no statistically significant difference in the expression level of PU.1 in T cells between KO mice and WT mice. The results of PCR and Western blot showed that myeloid-specific Spi1 KO mice were successfully constructed. Conclusion Myeloid-specific Spi1 gene KO mice were successfully constructed and identified, providing an animal model for further revealing the potential mechanism of PU.1 in immune regulation.
8.Breeding and genotyping of T lymphocyte-conditional Spi1 knockout mice
Huihui WANG ; Xiangling ZHU ; Xuming WU ; Huiru ZHANG ; Yuanyuan ZHOU ; Anqi WANG ; Chong LIU ; Jiajie TU
Acta Universitatis Medicinalis Anhui 2024;59(4):595-599
Objective To breed and identify T lymphocyte-conditional Spi1 knockout mice for further investigation of the specific role of the Spi1-encoded protein PU.1. Methods Lck-Cre mice were mated with Spi1flox/flox mice to obtain Lck-Cre×Spi1flox/flox mice (T lymphocyte-specific Spi1 knockout mice), and the genotype was determined by polymerase chain reaction (PCR) and agarose gel electrophoresis. Magnetic beads were used to sort splenic T lymphocytes, and the knockout efficiency of PU.1 in T cells was detected by Western blot, quantitative real-time PCR (qPCR), and flow cytometry. Results The Lck-Cre×Spi1flox/flox mouse genotype was stably inherited. Compared with Spi1flox/flox mice, the expression level of PU.1 was significantly reduced in splenic T cells of Lck-Cre×Spi1flox/flox mice. Conclusion In this study, T lymphocyte-specific Spi1 knockout mice were successfully constructed by applying the Cre/LoxP system and CRISPR/Cas9 technology, providing a reliable animal model for subsequent experiments on the specific role of PU.1 in T cell-related diseases.
9.Construction and efficiency detection of Csf1r-CreERT2 R26REYFP reporter gene mouse based on Cre/Loxp system
Xiangling ZHU ; Xuming WU ; Huihui WANG ; Yuanyuan ZHOU ; Anqi WANG ; Huiru ZHANG ; Chong LIU ; Jiajie TU
Acta Universitatis Medicinalis Anhui 2024;59(7):1175-1180
Objective To construct Csf1r-CreERT2 R26REYFP reporter gene mice and assess the efficiency of Csf1r-CreERT2-mediated enhanced yellow fluorescent protein (EYFP) labeling of CSF1R in CD45+ cells. Methods Csf1r-CreERT2 mice were crossbred with R26REYFP homozygous mice, and Csf1r-CreERT2 R26REYFP mice were identified by PCR and Western blot analyses. Flow cytometry was employed to evaluate the CSF1R labeling efficiency in CD45+ cells across different mouse tissues following tamoxifen induction. Results Csf1r-CreERT2 R26REYFP reporter gene mice were obtained. In addition, Csf1r-CreERT2-mediated EYFP effectively marked CSF1R in CD45+ cells in various mouse tissues. Compared with the R26REYFP group, the highest labeling efficiency was observed in brain tissue (P<0.001) and the lowest in thymus tissue (P<0.05), and no significant difference was observed in spleen tissue. Conclusion Crossing adult Csf1r-CreERT2 mice with R26REYFP mice is an effective way to obtain Csf1r-CreERT2 R26REYFP inducible conditional fluorescent reporter mice. Csf1r-CreERT2 can mediate EYFP expression to effectively trace CSF1R in CD45+ cells in different parts of mice.
10.Expression of Nectin-4 in invasive bladder urothelial carcinoma and its clinical significance
Huiru SONG ; Dan LUO ; Junxiu WEN ; Lu NI ; Kexin ZHANG ; Qi WANG ; Liu YANG ; Xudong SONG ; Liru DONG
Journal of Modern Urology 2024;29(10):903-908
[Objective] To explore the expression of Nectin-4 in invasive bladder urothelial carcinoma (BUC) tissue and its clinical significance, so as to provide a reference for the clinical diagnosis and treatment of BUC. [Methods] Nectin-4 expression in 60 cases of invasive BUC and 40 cases of chronic inflammation of bladder mucosa was detected with immunohistochemical staining (IHC) and RNAscope. The results of the two methods were compared, and the relationship between Nectin-4 expression detected by each method and the clinicopathological characteristics of invasive BUC was analyzed. The correlation between the protein expression of Nectin-4 in BUC tissues and that of human epidermal growth factor receptor 2 (Her-2) and programmed death-ligand 1 (PD-L1) was analyzed. [Results] The positive protein expression rates of Nectin-4 detected by IHC were 78.33% (47/60) and 17.50% (7/40) in the invasive BUC group and inflammatory group, respectively, while the positive mRNA expression rates of Nectin-4 detected by RNAscope were 83.33% (50/60) and 12.50% (5/40), respectively. The Kappa values between the two methods in the invasive BUC group and inflammatory group were 0.732 and 0.610, respectively, indicating substantial agreement. The protein expression of Nectin-4 in invasive BUC was correlated with muscular invasion, histological grade, vascular thrombus, lymph node metastasis, and clinical stage (P<0.05). The mRNA expression of Nectin-4 in invasive BUC was correlated with maximum tumor diameter, muscular invasion, histological grade, vascular thrombus, lymph node metastasis, and clinical stage (P<0.05). High expression of Nectin-4 in invasive BUC was positively correlated with the expression of Her-2 (P=0.002), but not with the expression of PD-L1 (P>0.05). [Conclusion] Nectin-4 is highly expressed in invasive BUC and is usually associated with pathological parameters of poor prognosis. Detection of Nectin-4 expression will help to guide clinical diagnosis and treatment.
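The Kappa values above measure agreement between IHC and RNAscope calls on the same samples. A minimal sketch of Cohen's kappa for a 2x2 agreement table follows. The abstract reports only the marginal positive counts and the Kappa values, so the individual cell counts used here are an assumption: they are back-calculated as counts consistent with the reported marginals (47/60 and 50/60; 7/40 and 5/40) and Kappa values, not figures taken from the paper.

```python
def cohen_kappa(both_pos, ihc_only, rnascope_only, both_neg):
    """Cohen's kappa for two binary raters from a 2x2 agreement table."""
    n = both_pos + ihc_only + rnascope_only + both_neg
    observed = (both_pos + both_neg) / n
    # Marginal positive rates for each method.
    ihc_pos = (both_pos + ihc_only) / n
    rna_pos = (both_pos + rnascope_only) / n
    # Agreement expected by chance from the marginals alone.
    expected = ihc_pos * rna_pos + (1 - ihc_pos) * (1 - rna_pos)
    return (observed - expected) / (1 - expected)


# Assumed (back-calculated) cell counts; see the note above.
buc_kappa = cohen_kappa(both_pos=46, ihc_only=1, rnascope_only=4, both_neg=9)
inflammatory_kappa = cohen_kappa(both_pos=4, ihc_only=3, rnascope_only=1, both_neg=32)
print(round(buc_kappa, 3), round(inflammatory_kappa, 3))
```

Kappa corrects raw agreement for chance: with about 80% of BUC samples positive by either method, two methods would agree on roughly 69% of samples by chance alone, which is why an observed agreement near 92% corresponds to a Kappa of about 0.73 rather than 0.92.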