1.Pathogenesis Reasoning Chain-of-thought Supervision for Large Language Models: Syndrome Manifestation Recognition and Multidimensional Evaluation in Spleen-stomach Disorders
Shu-Han YANG ; Yu-Xin HU ; Xin-Yu YU ; Yu-Ying TU ; Yi-Chang ZANG ; Pan-Fei LI
Progress in Biochemistry and Biophysics 2026;53(5):1240-1263
ObjectiveThe essence of syndrome manifestation recognition in traditional Chinese medicine (TCM) is to infer the body’s latent pathogenesis state from clinical observational information, rather than to perform simple label matching. However, previous studies have largely modeled this task as syndrome pattern classification within a fixed label space, which does not adequately reflect the cognition process of TCM syndrome differentiation centered on pathogenesis reasoning, and is also insufficient to capture the openness, semantic variability, and cross-disease reusability of syndrome manifestation expression. This study aimed to investigate whether introducing pathogenesis reasoning chain-of-thought (PR-CoT) supervision into large language models (LLMs) could improve the quality and cognitive consistency of syndrome manifestation recognition and support cross-disease transfer. MethodsSyndrome manifestation recognition was formulated as a conditional generation task under the framework of clinical observational information (X)→pathogenesis structure (Z)→syndrome pattern output (Y), where Z serves as an explicit intermediate structural variable linking the clinical evidence and syndrome judgment. Within this framework, a PR-CoT-supervised dataset for syndrome manifestation recognition was constructed based on medical case records of spleen-stomach disorders. After preprocessing, information extraction, manual proofreading, and data cleaning, the dataset comprised 4 800 training cases, 400 development cases, and 400 test cases. Each sample was annotated with a structured PR-CoT consisting of three progressive levels: clinical information summarization, comprehensive pathogenesis analysis, and syndrome pattern output. Supervised fine-tuning was conducted on open-source LLMs, with an end-to-end model serving as the baseline. Qwen3-32B was used as the primary experimental model, and Qwen3-14B as the scale comparison model. A progressive multidimensional evaluation framework was further established, comprising a structural parsing level, a semantic similarity level, and an expert blind review level. At the structural parsing level, syndrome pattern expressions were decomposed into structural elements and evaluated using Precision, Recall, F1 score, and Jaccard similarity. At the semantic similarity level, independent LLMs scored the theoretical proximity between predicted and reference syndrome patterns. At the expert blind review level, three TCM experts independently evaluated model outputs on two dimensions: syndrome differentiation consistency and terminology standardization of syndrome patterns. In addition, zero-shot cross-disease transfer evaluation was conducted on gynecological and heart-system disorder test sets. ResultsAt the structural parsing level, PR-CoT supervision did not lead to a stable improvement in the element-wise overlap of syndrome pattern structural components. Compared with the corresponding baselines, neither Qwen3-32B nor Qwen3-14B showed consistent advantages in structural matching metrics after the introduction of PR-CoT supervision. In contrast, at the semantic similarity level, PR-CoT supervision produced stable positive gains across different model scales and evaluation systems. The average semantic score of Qwen3-32B increased from 6.425 8 in the baseline model to 6.585 0 after PR-CoT supervision, and that of Qwen3-14B increased from 5.870 0 to 5.964 2. At the expert blind review level, the overall score of Qwen3-32B (PR-CoT) was 7.026 0±0.107 7, higher than 6.416 3±0.288 9 for its baseline. In zero-shot cross-disease testing, the PR-CoT model still showed advantages in semantic evaluation and expert evaluation on both gynecological and heart-system disorder test sets, indicating a certain degree of transferability. ConclusionThe benefits of PR-CoT supervision are mainly reflected in TCM semantic consistency and clinical plausibility, rather than in improved hard matching of structural elements. These findings support understanding syndrome manifestation recognition as a process of generating and expressing latent pathogenesis structures, rather than as a classification task within a traditional fixed label space. By introducing pathogenesis reasoning as an explicit intermediate structure into the modeling process and combining it with a progressive multidimensional evaluation framework, this study provides a methodological pathway for intelligent TCM syndrome differentiation that integrates theoretical alignment, interpretability, and multi-level evaluation.
2.Pathogenesis Reasoning Chain-of-thought Supervision for Large Language Models: Syndrome Manifestation Recognition and Multidimensional Evaluation in Spleen-stomach Disorders
Shu-Han YANG ; Yu-Xin HU ; Xin-Yu YU ; Yu-Ying TU ; Yi-Chang ZANG ; Pan-Fei LI
Progress in Biochemistry and Biophysics 2026;53(5):1240-1263
ObjectiveThe essence of syndrome manifestation recognition in traditional Chinese medicine (TCM) is to infer the body’s latent pathogenesis state from clinical observational information, rather than to perform simple label matching. However, previous studies have largely modeled this task as syndrome pattern classification within a fixed label space, which does not adequately reflect the cognition process of TCM syndrome differentiation centered on pathogenesis reasoning, and is also insufficient to capture the openness, semantic variability, and cross-disease reusability of syndrome manifestation expression. This study aimed to investigate whether introducing pathogenesis reasoning chain-of-thought (PR-CoT) supervision into large language models (LLMs) could improve the quality and cognitive consistency of syndrome manifestation recognition and support cross-disease transfer. MethodsSyndrome manifestation recognition was formulated as a conditional generation task under the framework of clinical observational information (X)→pathogenesis structure (Z)→syndrome pattern output (Y), where Z serves as an explicit intermediate structural variable linking the clinical evidence and syndrome judgment. Within this framework, a PR-CoT-supervised dataset for syndrome manifestation recognition was constructed based on medical case records of spleen-stomach disorders. After preprocessing, information extraction, manual proofreading, and data cleaning, the dataset comprised 4 800 training cases, 400 development cases, and 400 test cases. Each sample was annotated with a structured PR-CoT consisting of three progressive levels: clinical information summarization, comprehensive pathogenesis analysis, and syndrome pattern output. Supervised fine-tuning was conducted on open-source LLMs, with an end-to-end model serving as the baseline. Qwen3-32B was used as the primary experimental model, and Qwen3-14B as the scale comparison model. A progressive multidimensional evaluation framework was further established, comprising a structural parsing level, a semantic similarity level, and an expert blind review level. At the structural parsing level, syndrome pattern expressions were decomposed into structural elements and evaluated using Precision, Recall, F1 score, and Jaccard similarity. At the semantic similarity level, independent LLMs scored the theoretical proximity between predicted and reference syndrome patterns. At the expert blind review level, three TCM experts independently evaluated model outputs on two dimensions: syndrome differentiation consistency and terminology standardization of syndrome patterns. In addition, zero-shot cross-disease transfer evaluation was conducted on gynecological and heart-system disorder test sets. ResultsAt the structural parsing level, PR-CoT supervision did not lead to a stable improvement in the element-wise overlap of syndrome pattern structural components. Compared with the corresponding baselines, neither Qwen3-32B nor Qwen3-14B showed consistent advantages in structural matching metrics after the introduction of PR-CoT supervision. In contrast, at the semantic similarity level, PR-CoT supervision produced stable positive gains across different model scales and evaluation systems. The average semantic score of Qwen3-32B increased from 6.425 8 in the baseline model to 6.585 0 after PR-CoT supervision, and that of Qwen3-14B increased from 5.870 0 to 5.964 2. At the expert blind review level, the overall score of Qwen3-32B (PR-CoT) was 7.026 0±0.107 7, higher than 6.416 3±0.288 9 for its baseline. In zero-shot cross-disease testing, the PR-CoT model still showed advantages in semantic evaluation and expert evaluation on both gynecological and heart-system disorder test sets, indicating a certain degree of transferability. ConclusionThe benefits of PR-CoT supervision are mainly reflected in TCM semantic consistency and clinical plausibility, rather than in improved hard matching of structural elements. These findings support understanding syndrome manifestation recognition as a process of generating and expressing latent pathogenesis structures, rather than as a classification task within a traditional fixed label space. By introducing pathogenesis reasoning as an explicit intermediate structure into the modeling process and combining it with a progressive multidimensional evaluation framework, this study provides a methodological pathway for intelligent TCM syndrome differentiation that integrates theoretical alignment, interpretability, and multi-level evaluation.
3.Hearing loss prevalence and burden of disease in China: Findings from provincial-level analysis.
Yu WANG ; Yang XIE ; Minghao WANG ; Mengdan ZHAO ; Rui GONG ; Ying XIN ; Jia KE ; Ke ZHANG ; Shaoxing ZHANG ; Chen DU ; Qingchuan DUAN ; Fang WANG ; Tao PAN ; Furong MA ; Xiangyang HU
Chinese Medical Journal 2025;138(1):41-48
BACKGROUND:
Without timely and effective rehabilitation, hearing loss may profoundly affect human life quality. China has a large population of hearing-impaired individuals, which imposes a heavy health burden on society. Moreover, this population is projected to increase rapidly owing to China's aging society.
METHODS:
We used data from a population-representative epidemiological investigation of hearing loss and ear diseases in four Chinese provinces. We estimated the national prevalence using multiple linear regression of the age-group proportions and prevalence in 31 provinces with clustering analysis. We used years lived with disability (YLDs) to analyze the disease burden and forecasted the prevalence of hearing loss by 2060 in China.
RESULTS:
An estimated 115 million people had moderate-to-complete hearing loss in 2015 across the 31 provinces of China (8.4% of 1.37 billion people). Of these, 85.7% were older than age 50 years (99 million people) and 2.4% were younger than 20 years old (2.8 million people). Of all YLDs attributable to hearing loss, 68.9% were attributable to moderate-to-complete cases. By 2060, a projected 242 million people in China will have moderate-to-complete hearing loss, a 110.0% increase from 2015.
CONCLUSIONS
The hearing loss prevalence in China is high. Population aging and socioeconomic factors substantially affect the prevalence and severity of hearing loss and the disease burden. The prevalence and severity of hearing loss are unevenly distributed across different provinces. Future public health policies should take these trends and regional variations into account.
Humans
;
China/epidemiology*
;
Hearing Loss/epidemiology*
;
Prevalence
;
Middle Aged
;
Male
;
Female
;
Adult
;
Aged
;
Adolescent
;
Young Adult
;
Child
;
Child, Preschool
;
Infant
;
Aged, 80 and over
;
Cost of Illness
4.Efficacy and mechanism of Cistanches Herba extract in treating reproductive dysfunction in rats with kidney-Yang deficiency based on metabolomics.
Ze-Hui LI ; Pan-Yu XU ; Jia-Shan LI ; Li GUO ; Yuan LI ; Si-Qi LI ; Na LIN ; Ying XU
China Journal of Chinese Materia Medica 2025;50(7):1850-1860
This study investigates the reproductive protective effect and potential mechanism of Cistanches Herba extract(CHE) on a rat model of kidney-Yang deficiency induced by adenine. Rats were randomly divided into five groups: normal, model, low-dose CHE(0.6 g·kg~(-1)·d~(-1)), high-dose CHE(1.2 g·kg~(-1)·d~(-1)), and L-carnitine(100 mg·kg~(-1)·d~(-1)). The rats were administered adenine(200 mg·kg~(-1)·d~(-1)) by gavage for the first 14 days to induce kidney-Yang deficiency, while simultaneously receiving drug treatment. After 14 days, the modeling was discontinued, but drug treatment continued to 49 days. The content of components in CHE was analyzed by high-performance liquid chromatography. The adenine-induced kidney-Yang deficiency model was assessed through symptom characterization and measurement of testosterone(T) levels using an enzyme-linked immunosorbent assay kit. Pathological damage to the testis and epididymis was evaluated based on the wet weight and performing hematoxylin-eosin staining. Sperm density and motility were measured using computer-aided sperm analysis, and sperm viability was assessed using live/dead sperm staining kits, and sperm morphology was evaluated using eosin staining, thereby determining rat sperm quality. Metabolomics was used to analyze changes in serum metabolites, enrich related metabolic pathways, and explore the mechanism of CHE in improving reproductive function damage in rats with kidney-Yang deficiency syndrome. Compared to the normal group, the model group exhibited significant kidney-Yang deficiency symptoms, reduced T levels, decreased testicular and epididymal wet weights, and significant pathological damage to the testis and epididymis. The sperm density, motility, and viability decreased, with an increased rate of sperm abnormalities. In contrast, rats treated with CHE showed marked improvements in kidney-Yang deficiency symptoms, restored T levels, alleviated pathological damage to the testis and epididymis, and improved various sperm parameters. Metabolomics results revealed 286 differential metabolites between the normal and model groups(191 upregulated and 95 downregulated). Seventy-five differential metabolites were identified between the model and low-dose CHE groups(21 upregulated and 54 downregulated). A total of 24 common differential metabolites were identified across the three groups, with 22 of these metabolites exhibiting opposite regulation trends between the two comparison groups. These metabolites were primarily involved in linoleic acid metabolism, ether lipid metabolism, and pantothenic acid and coenzyme A biosynthesis, as well as metabolites including 13-hydroperoxylinoleic acid, lysophosphatidylcholine, and pantethine. CHE can improve kidney-Yang deficiency symptoms in rats, alleviate reproductive organ damage, and enhance sperm quality. The regulation of lipid metabolism may be a potential mechanism through which CHE improves reproductive function in rats with kidney-Yang deficiency. The potential bioactive compounds of CHE include echinacoside, verbascoside, salidroside, betaine, and cistanoside A.
Animals
;
Male
;
Rats
;
Yang Deficiency/physiopathology*
;
Metabolomics
;
Kidney/physiopathology*
;
Rats, Sprague-Dawley
;
Drugs, Chinese Herbal/administration & dosage*
;
Cistanche/chemistry*
;
Kidney Diseases/metabolism*
;
Testis/metabolism*
;
Humans
;
Reproduction/drug effects*
;
Testosterone/blood*
5.Mechanism related to bile acids metabolism of liver injury induced by long-term administration of emodin.
Jing-Zhuo TIAN ; Lian-Mei WANG ; Yan YI ; Zhong XIAN ; Nuo DENG ; Yong ZHAO ; Chun-Ying LI ; Yu-Shi ZHANG ; Su-Yan LIU ; Jia-Yin HAN ; Chen PAN ; Chen-Yue LIU ; Jing MENG ; Ai-Hua LIANG
China Journal of Chinese Materia Medica 2025;50(11):3079-3087
Emodin is a hydroxyanthraquinone compound that is widely distributed and has multiple pharmacological activities, including anti-diarrheal, anti-inflammatory, and liver-protective effects. Research indicates that emodin may be one of the main components responsible for inducing hepatotoxicity. However, studies on the mechanisms of liver injury are relatively limited, particularly those related to bile acids(BAs) metabolism. This study aims to systematically investigate the effects of different dosages of emodin on BAs metabolism, providing a basis for the safe clinical use of traditional Chinese medicine(TCM)containing emodin. First, this study evaluated the safety of repeated administration of different dosages of emodin over a 5-week period, with a particular focus on its impact on the liver. Next, the composition and content of BAs in serum and liver were analyzed. Subsequently, qRT-PCR was used to detect the mRNA expression of nuclear receptors and transporters related to BAs metabolism. The results showed that 1 g·kg~(-1) emodin induced hepatic damage, with bile duct hyperplasia as the primary pathological manifestation. It significantly increased the levels of various BAs in the serum and primary BAs(including taurine-conjugated and free BAs) in the liver. Additionally, it downregulated the mRNA expression of farnesoid X receptor(FXR), retinoid X receptor(RXR), and sodium taurocholate cotransporting polypeptide(NTCP), and upregulated the mRNA expression of cholesterol 7α-hydroxylase(CYP7A1) in the liver. Although 0.01 g·kg~(-1) and 0.03 g·kg~(-1) emodin did not induce obvious liver injury, they significantly increased the level of taurine-conjugated BAs in the liver, suggesting a potential interference with BAs homeostasis. In conclusion, 1 g·kg~(-1) emodin may promote the production of primary BAs in the liver by affecting the FXR-RXR-CYP7A1 pathway, inhibit NTCP expression, and reduce BA reabsorption in the liver, resulting in BA accumulation in the peripheral blood. This disruption of BA homeostasis leads to liver injury. Even doses of emodin close to the clinical dose can also have a certain effect on the homeostasis of BAs. Therefore, when using traditional Chinese medicine or formulas containing emodin in clinical practice, it is necessary to regularly monitor liver function indicators and closely monitor the risk of drug-induced liver injury.
Emodin/administration & dosage*
;
Bile Acids and Salts/metabolism*
;
Animals
;
Male
;
Liver/injuries*
;
Chemical and Drug Induced Liver Injury/genetics*
;
Drugs, Chinese Herbal/adverse effects*
;
Humans
;
Rats, Sprague-Dawley
;
Mice
;
Rats
6.Pharmaceutical process omics and its applications and prospects in drug quality control.
Yu-Ting LU ; Jia-Yu YANG ; Ying-Ting SHI ; Xin-Yuan XIE ; Jian-Yang PAN ; Hai-Bin QU
China Journal of Chinese Materia Medica 2025;50(13):3628-3634
The quality of drugs is a crucial premise for ensuring the safety and effectiveness of clinical medication, while quality control during the pharmaceutical process directly affects the quality and consistency of the final product formulation. However, there is a lack of a comprehensive and scientific system for assessing and optimizing the quality control level during the manufacturing process in the field of drug quality control. Therefore, this study proposed the concept of "pharmaceutical process omics", clarified its advantages in guiding drug production, and explored in depth the research approaches, diverse analytical techniques, and broad range of applications in drug quality control. In addition, this study anticipated the broad application prospects of pharmaceutical process omics in the field of drug quality control, aiming to provide a scientific basis for the development of pharmaceutical process quality control standards.
Quality Control
;
Humans
;
Drugs, Chinese Herbal/chemistry*
7.Quality evaluation of Xinjiang Rehmannia glutinosa and Rehmannia glutinosa based on fingerprint and multi-component quantification combined with chemical pattern recognition.
Pan-Ying REN ; Wei ZHANG ; Xue LIU ; Juan ZHANG ; Cheng-Fu SU ; Hai-Yan GONG ; Chun-Jing YANG ; Jing-Wei LEI ; Su-Qing ZHI ; Cai-Xia XIE
China Journal of Chinese Materia Medica 2025;50(16):4630-4640
The differences in chemical quality characteristics between Xinjiang Rehmannia glutinosa and R. glutinosa were analyzed to provide a theoretical basis for the introduction and quality control of R. glutinosa. In this study, the high performance liquid chromatography(HPLC) fingerprints of 6 batches of Xinjiang R. glutinosa and 10 batches of R. glutinosa samples were established. The content of iridoid glycosides, phenylethanoid glycosides, monosaccharides, oligosaccharides, and polysaccharides in Xinjiang R. glutinosa and R. glutinosa was determined by high performance liquid chromatography-diode array detection(HPLC-DAD), high performance liquid chromatography-evaporative light scattering detection(HPLC-ELSD), and ultraviolet-visible spectroscopy(UV-Vis). The determination results were analyzed with by chemical pattern recognition and entropy weight TOPSIS method. The results showed that there were 19 common peaks in the HPLC fingerprints of the 16 batches of R. glutinosa, and catalpol, aucubin, rehmannioside D, rehmannioside A, hydroxytyrosol, leonuride, salidroside, cistanoside A, and verbascoside were identified. Hierarchical cluster analysis(HCA) and principal component analysis(PCA) showed that Qinyang R. glutinosa, Mengzhou R. glutinosa, and Xinjiang R. glutinosa were grouped into three different categories, and eight common components causing the chemical quality difference between Xinjiang R. glutinosa and R. glutinosa in Mengzhou and Qinyang of Henan province were screened out by orthogonal partial least squares discriminant analysis(OPLS-DA). The results of content determination showed that there were glucose, sucrose, raffinose, stachyose, polysaccharides, and nine glycosides in Xinjiang R. glutinosa and R. glutinosa samples, and the content of catalpol, rehmannioside A, leonuride, cistanoside A, verbascoside, sucrose, and glucose was significantly different between Xinjiang R. glutinosa and R. glutinosa. The analysis with entropy weight TOPSIS method showed that the comprehensive quality of R. glutinosa in Mengzhou and Qinyang of Henan province was better than that of Xinjiang R. glutinosa. In conclusion, the types of main chemical components of R. glutinosa and Xinjiang R. glutinosa were the same, but their content was different. The chemical quality of R. glutinosa was better than Xinjiang R. glutinosa, and other components in R. glutinosa from two producing areas and their effects need further study.
Rehmannia/classification*
;
Drugs, Chinese Herbal/chemistry*
;
Chromatography, High Pressure Liquid/methods*
;
Quality Control
8.Lumbar temperature change after acupuncture or moxibustion at Weizhong (BL40) or Chize (LU5) in healthy adults: A randomized controlled trial.
Si-Yi ZHENG ; Xiao-Ying WANG ; Li-Nan LIN ; Shan LIU ; Xiao-Xiao HUANG ; Yi-Yue LIU ; Xiao-Shuai YU ; Wei PAN ; Jian-Qiao FANG ; Yi LIANG
Journal of Integrative Medicine 2025;23(2):145-151
BACKGROUND:
There is a gap in understanding the effects of different acupoints and treatment methods (acupuncture and moxibustion) on microcirculatory changes in the lumbar region.
OBJECTIVE:
This study aimed to assess the thermal effects of acupuncture at Weizhong (BL40), with acupuncture at Chize (LU5) and moxibustion at both acupoints as control interventions.
DESIGN, SETTING, PARTICIPANTS AND INTERVENTIONS:
In this randomized controlled trial, 140 healthy participants were equally divided into four groups: acupuncture at BL40 (Acu-BL40), acupuncture at LU5 (Acu-LU5), moxibustion at BL40 (Mox-BL40) and moxibustion at LU5 (Mox-LU5). Participants underwent a 30-minute session of their assigned treatment. Infrared thermal imaging was used to collect temperature data on the areas of interest for analysis.
MAIN OUTCOME MEASURES:
The primary measure was the change in average temperature of the observed area after the intervention. The secondary measures included periodic temperature changes every 5 min and the temperature changes of the Governor Vessel and Bladder Meridian in the observed area after the intervention.
RESULTS:
Significant interactions were observed between treatments and acupoints affecting temperature (P < 0.001). The Acu-BL40 group showed a notably higher increase in mean temperature after 30 min compared to the Acu-LU5 and Mox-BL40 groups, with increases of 0.29 (95% confidence interval [CI] = 0.17 to 0.41) and 0.24 (95% CI = 0.08 to 0.41) °C, respectively.
CONCLUSION:
Acupuncture at BL40 acupoint can significantly increase the mean temperature in the observed area, highlighting the specific thermal effect of acupuncture compared to moxibustion in the lumbar area. This suggests a potential therapeutic benefit of acupuncture at BL40 for managing lumbar conditions.
TRIAL REGISTRATION
ClinicalTrials.gov (NCT05665426). Please cite this article as: Zheng SY, Wang XY, Lin LN, Liu S, Huang XX, Liu YY, Yu XS, Pan W, Fang JQ, Liang Y. Lumbar temperature change after acupuncture or moxibustion at Weizhong (BL40) or Chize (LU5) in healthy adults: A randomized controlled trial. J Integr Med. 2025; 23(2): 145-151.
Adult
;
Female
;
Humans
;
Male
;
Young Adult
;
Acupuncture Points
;
Acupuncture Therapy
;
Body Temperature
;
Healthy Volunteers
;
Lumbosacral Region/physiology*
;
Moxibustion
;
Adolescent
9.Influence of Outdoor Light at Night on Early Reproductive Outcomes of In Vitro Fertilization and Its Threshold Effect: Evidence from a Couple-Based Preconception Cohort Study.
Wen Bin FANG ; Ying TANG ; Ya Ning SUN ; Yan Lan TANG ; Yin Yin CHEN ; Ya Wen CAO ; Ji Qi FANG ; Kun Jing HE ; Yu Shan LI ; Ya Ning DAI ; Shuang Shuang BAO ; Peng ZHU ; Shan Shan SHAO ; Fang Biao TAO ; Gui Xia PAN
Biomedical and Environmental Sciences 2025;38(8):1009-1015
10.Association between PM 2.5 Chemical Constituents and Preterm Birth: The Undeniable Role of Preconception H19 Gene Variation.
Ya Long WANG ; Pan Pan SUN ; Xin Ying WANG ; Jun Xi ZHANG ; Xiang Yu YU ; Jian CHAI ; Ruo DU ; Wen Yi LIU ; Fang Fang YU ; Yue BA ; Guo Yu ZHOU
Biomedical and Environmental Sciences 2025;38(8):1016-1022

Result Analysis
Print
Save
E-mail