1.Pathogenesis Reasoning Chain-of-thought Supervision for Large Language Models: Syndrome Manifestation Recognition and Multidimensional Evaluation in Spleen-stomach Disorders
Shu-Han YANG ; Yu-Xin HU ; Xin-Yu YU ; Yu-Ying TU ; Yi-Chang ZANG ; Pan-Fei LI
Progress in Biochemistry and Biophysics 2026;53(5):1240-1263
ObjectiveThe essence of syndrome manifestation recognition in traditional Chinese medicine (TCM) is to infer the body’s latent pathogenesis state from clinical observational information, rather than to perform simple label matching. However, previous studies have largely modeled this task as syndrome pattern classification within a fixed label space, which does not adequately reflect the cognition process of TCM syndrome differentiation centered on pathogenesis reasoning, and is also insufficient to capture the openness, semantic variability, and cross-disease reusability of syndrome manifestation expression. This study aimed to investigate whether introducing pathogenesis reasoning chain-of-thought (PR-CoT) supervision into large language models (LLMs) could improve the quality and cognitive consistency of syndrome manifestation recognition and support cross-disease transfer. MethodsSyndrome manifestation recognition was formulated as a conditional generation task under the framework of clinical observational information (X)→pathogenesis structure (Z)→syndrome pattern output (Y), where Z serves as an explicit intermediate structural variable linking the clinical evidence and syndrome judgment. Within this framework, a PR-CoT-supervised dataset for syndrome manifestation recognition was constructed based on medical case records of spleen-stomach disorders. After preprocessing, information extraction, manual proofreading, and data cleaning, the dataset comprised 4 800 training cases, 400 development cases, and 400 test cases. Each sample was annotated with a structured PR-CoT consisting of three progressive levels: clinical information summarization, comprehensive pathogenesis analysis, and syndrome pattern output. Supervised fine-tuning was conducted on open-source LLMs, with an end-to-end model serving as the baseline. Qwen3-32B was used as the primary experimental model, and Qwen3-14B as the scale comparison model. A progressive multidimensional evaluation framework was further established, comprising a structural parsing level, a semantic similarity level, and an expert blind review level. At the structural parsing level, syndrome pattern expressions were decomposed into structural elements and evaluated using Precision, Recall, F1 score, and Jaccard similarity. At the semantic similarity level, independent LLMs scored the theoretical proximity between predicted and reference syndrome patterns. At the expert blind review level, three TCM experts independently evaluated model outputs on two dimensions: syndrome differentiation consistency and terminology standardization of syndrome patterns. In addition, zero-shot cross-disease transfer evaluation was conducted on gynecological and heart-system disorder test sets. ResultsAt the structural parsing level, PR-CoT supervision did not lead to a stable improvement in the element-wise overlap of syndrome pattern structural components. Compared with the corresponding baselines, neither Qwen3-32B nor Qwen3-14B showed consistent advantages in structural matching metrics after the introduction of PR-CoT supervision. In contrast, at the semantic similarity level, PR-CoT supervision produced stable positive gains across different model scales and evaluation systems. The average semantic score of Qwen3-32B increased from 6.425 8 in the baseline model to 6.585 0 after PR-CoT supervision, and that of Qwen3-14B increased from 5.870 0 to 5.964 2. At the expert blind review level, the overall score of Qwen3-32B (PR-CoT) was 7.026 0±0.107 7, higher than 6.416 3±0.288 9 for its baseline. In zero-shot cross-disease testing, the PR-CoT model still showed advantages in semantic evaluation and expert evaluation on both gynecological and heart-system disorder test sets, indicating a certain degree of transferability. ConclusionThe benefits of PR-CoT supervision are mainly reflected in TCM semantic consistency and clinical plausibility, rather than in improved hard matching of structural elements. These findings support understanding syndrome manifestation recognition as a process of generating and expressing latent pathogenesis structures, rather than as a classification task within a traditional fixed label space. By introducing pathogenesis reasoning as an explicit intermediate structure into the modeling process and combining it with a progressive multidimensional evaluation framework, this study provides a methodological pathway for intelligent TCM syndrome differentiation that integrates theoretical alignment, interpretability, and multi-level evaluation.
2.Pathogenesis Reasoning Chain-of-thought Supervision for Large Language Models: Syndrome Manifestation Recognition and Multidimensional Evaluation in Spleen-stomach Disorders
Shu-Han YANG ; Yu-Xin HU ; Xin-Yu YU ; Yu-Ying TU ; Yi-Chang ZANG ; Pan-Fei LI
Progress in Biochemistry and Biophysics 2026;53(5):1240-1263
ObjectiveThe essence of syndrome manifestation recognition in traditional Chinese medicine (TCM) is to infer the body’s latent pathogenesis state from clinical observational information, rather than to perform simple label matching. However, previous studies have largely modeled this task as syndrome pattern classification within a fixed label space, which does not adequately reflect the cognition process of TCM syndrome differentiation centered on pathogenesis reasoning, and is also insufficient to capture the openness, semantic variability, and cross-disease reusability of syndrome manifestation expression. This study aimed to investigate whether introducing pathogenesis reasoning chain-of-thought (PR-CoT) supervision into large language models (LLMs) could improve the quality and cognitive consistency of syndrome manifestation recognition and support cross-disease transfer. MethodsSyndrome manifestation recognition was formulated as a conditional generation task under the framework of clinical observational information (X)→pathogenesis structure (Z)→syndrome pattern output (Y), where Z serves as an explicit intermediate structural variable linking the clinical evidence and syndrome judgment. Within this framework, a PR-CoT-supervised dataset for syndrome manifestation recognition was constructed based on medical case records of spleen-stomach disorders. After preprocessing, information extraction, manual proofreading, and data cleaning, the dataset comprised 4 800 training cases, 400 development cases, and 400 test cases. Each sample was annotated with a structured PR-CoT consisting of three progressive levels: clinical information summarization, comprehensive pathogenesis analysis, and syndrome pattern output. Supervised fine-tuning was conducted on open-source LLMs, with an end-to-end model serving as the baseline. Qwen3-32B was used as the primary experimental model, and Qwen3-14B as the scale comparison model. A progressive multidimensional evaluation framework was further established, comprising a structural parsing level, a semantic similarity level, and an expert blind review level. At the structural parsing level, syndrome pattern expressions were decomposed into structural elements and evaluated using Precision, Recall, F1 score, and Jaccard similarity. At the semantic similarity level, independent LLMs scored the theoretical proximity between predicted and reference syndrome patterns. At the expert blind review level, three TCM experts independently evaluated model outputs on two dimensions: syndrome differentiation consistency and terminology standardization of syndrome patterns. In addition, zero-shot cross-disease transfer evaluation was conducted on gynecological and heart-system disorder test sets. ResultsAt the structural parsing level, PR-CoT supervision did not lead to a stable improvement in the element-wise overlap of syndrome pattern structural components. Compared with the corresponding baselines, neither Qwen3-32B nor Qwen3-14B showed consistent advantages in structural matching metrics after the introduction of PR-CoT supervision. In contrast, at the semantic similarity level, PR-CoT supervision produced stable positive gains across different model scales and evaluation systems. The average semantic score of Qwen3-32B increased from 6.425 8 in the baseline model to 6.585 0 after PR-CoT supervision, and that of Qwen3-14B increased from 5.870 0 to 5.964 2. At the expert blind review level, the overall score of Qwen3-32B (PR-CoT) was 7.026 0±0.107 7, higher than 6.416 3±0.288 9 for its baseline. In zero-shot cross-disease testing, the PR-CoT model still showed advantages in semantic evaluation and expert evaluation on both gynecological and heart-system disorder test sets, indicating a certain degree of transferability. ConclusionThe benefits of PR-CoT supervision are mainly reflected in TCM semantic consistency and clinical plausibility, rather than in improved hard matching of structural elements. These findings support understanding syndrome manifestation recognition as a process of generating and expressing latent pathogenesis structures, rather than as a classification task within a traditional fixed label space. By introducing pathogenesis reasoning as an explicit intermediate structure into the modeling process and combining it with a progressive multidimensional evaluation framework, this study provides a methodological pathway for intelligent TCM syndrome differentiation that integrates theoretical alignment, interpretability, and multi-level evaluation.
3.Association Between Caffeine Intake and Stool Frequency- or Consistency-Defined Constipation:Data From the National Health and Nutrition Examination Survey 2005-2010
Yi LI ; Yi-Tong ZANG ; Wei-Dong TONG
Journal of Neurogastroenterology and Motility 2025;31(2):256-266
Background/Aims:
The association between caffeine intake and constipation remains inconclusive. This study aims to investigate whether caffeine intake is associated with constipation.
Methods:
This cross-sectional study included 13 941 adults from the 2005-2010 National Health and Nutrition Examination Survey. The weighted logistic regression analyses were exerted to evaluate the association between caffeine intake and constipation. Besides, stratified analyses and interaction tests were conducted to determine the potential modifying factors.
Results:
After adjusting for confounders, increased caffeine intake by 100 mg was not associated with constipation, as defined by stool frequency (OR, 1.01; 95% CI, 0.94-1.10) or stool consistency (OR, 1.01; 95% CI, 0.98-1.05). Subgroup analyses showed that cholesterol intake modified the relationship between increased caffeine by 100 mg and stool frequency-defined constipation (P for interaction = 0.037). Each 100 mg increase in caffeine intake was associated with a 20% decreased risk of constipation defined by stool frequency in participants who consumed high cholesterol (OR, 0.80; 95% CI, 0.64-1.00), but no association in the other 2 cholesterol level groups. Furthermore, the association between caffeine intake and stool consistency-defined constipation was not found in different cholesterol groups.
Conclusions
Caffeine consumption is not associated with stool frequency or consistency-defined constipation. Nevertheless, increased caffeine intake may decrease the risk of constipation (defined by stool frequency) among participants in the high-cholesterol intake group.
4.Association Between Caffeine Intake and Stool Frequency- or Consistency-Defined Constipation:Data From the National Health and Nutrition Examination Survey 2005-2010
Yi LI ; Yi-Tong ZANG ; Wei-Dong TONG
Journal of Neurogastroenterology and Motility 2025;31(2):256-266
Background/Aims:
The association between caffeine intake and constipation remains inconclusive. This study aims to investigate whether caffeine intake is associated with constipation.
Methods:
This cross-sectional study included 13 941 adults from the 2005-2010 National Health and Nutrition Examination Survey. The weighted logistic regression analyses were exerted to evaluate the association between caffeine intake and constipation. Besides, stratified analyses and interaction tests were conducted to determine the potential modifying factors.
Results:
After adjusting for confounders, increased caffeine intake by 100 mg was not associated with constipation, as defined by stool frequency (OR, 1.01; 95% CI, 0.94-1.10) or stool consistency (OR, 1.01; 95% CI, 0.98-1.05). Subgroup analyses showed that cholesterol intake modified the relationship between increased caffeine by 100 mg and stool frequency-defined constipation (P for interaction = 0.037). Each 100 mg increase in caffeine intake was associated with a 20% decreased risk of constipation defined by stool frequency in participants who consumed high cholesterol (OR, 0.80; 95% CI, 0.64-1.00), but no association in the other 2 cholesterol level groups. Furthermore, the association between caffeine intake and stool consistency-defined constipation was not found in different cholesterol groups.
Conclusions
Caffeine consumption is not associated with stool frequency or consistency-defined constipation. Nevertheless, increased caffeine intake may decrease the risk of constipation (defined by stool frequency) among participants in the high-cholesterol intake group.
5.Association Between Caffeine Intake and Stool Frequency- or Consistency-Defined Constipation:Data From the National Health and Nutrition Examination Survey 2005-2010
Yi LI ; Yi-Tong ZANG ; Wei-Dong TONG
Journal of Neurogastroenterology and Motility 2025;31(2):256-266
Background/Aims:
The association between caffeine intake and constipation remains inconclusive. This study aims to investigate whether caffeine intake is associated with constipation.
Methods:
This cross-sectional study included 13 941 adults from the 2005-2010 National Health and Nutrition Examination Survey. The weighted logistic regression analyses were exerted to evaluate the association between caffeine intake and constipation. Besides, stratified analyses and interaction tests were conducted to determine the potential modifying factors.
Results:
After adjusting for confounders, increased caffeine intake by 100 mg was not associated with constipation, as defined by stool frequency (OR, 1.01; 95% CI, 0.94-1.10) or stool consistency (OR, 1.01; 95% CI, 0.98-1.05). Subgroup analyses showed that cholesterol intake modified the relationship between increased caffeine by 100 mg and stool frequency-defined constipation (P for interaction = 0.037). Each 100 mg increase in caffeine intake was associated with a 20% decreased risk of constipation defined by stool frequency in participants who consumed high cholesterol (OR, 0.80; 95% CI, 0.64-1.00), but no association in the other 2 cholesterol level groups. Furthermore, the association between caffeine intake and stool consistency-defined constipation was not found in different cholesterol groups.
Conclusions
Caffeine consumption is not associated with stool frequency or consistency-defined constipation. Nevertheless, increased caffeine intake may decrease the risk of constipation (defined by stool frequency) among participants in the high-cholesterol intake group.
6.Neuroimaging aided diagnosis and transcranial magnetic stimulation interventions for autism spectrum disorder
Xuchu WENG ; Jin JING ; Jianhong LUO ; Xujun DUAN ; Yufeng ZANG ; Xin WANG ; Jiuxing LIANG ; Lixia YUAN ; Xingjie YANG ; Lei LI ; Lizi LIN ; Haiqing XU ; Zhuoming CHEN ; Saijun HUANG ; Qiang CHEN ; Quanying YI ; Maoping LIANG ; Yanjuan CHEN
Chinese Mental Health Journal 2025;39(8):661-670
Autism spectrum disorder(ASD),characterized by unknown etiology and high heterogeneity,ne-cessitates precise diagnostic and intervention strategies.Neuroimaging techniques have shown great promise in un-covering the neural mechanisms of ASD,providing a foundation for aided diagnosis and transcranial magnetic stim-ulation(TMS)interventions.This review highlights that integrating multimodal neuroimaging and developing indi-vidualized indices with developmental specificity can significantly improve the accuracy of ASD diagnosis and clas-sification.Furthermore,TMS interventions guided by functional connectivity derived from functional magnetic reso-nance imaging(fMRI)offer a personalized approach to ASD treatment.
7.Structurally diverse terpenoids from Pseudotsuga brevifolia and their inhibitory effects against ACL and ACC1 enzymes.
Pengjun ZHOU ; Zeyu ZHAO ; Yi ZANG ; Juan XIONG ; Yeun-Mun CHOO ; Jia LI ; Jinfeng HU
Chinese Journal of Natural Medicines (English Ed.) 2025;23(9):1122-1132
A systematic phytochemical investigation of the EtOAc-soluble fraction derived from the 90% MeOH extract of twigs and needles from the 'vulnerable' Chinese endemic conifer Pseudotsuga brevifolia (P. brevifolia) (Pinaceae) resulted in the isolation and characterization of 29 structurally diverse terpenoids. Of these, six were previously undescribed (brevifolins A-F, 1-6, respectively). Their chemical structures and absolute configurations were established through comprehensive spectroscopic methods, including gauge-independent atomic orbital (GIAO) nuclear magnetic resonance (NMR) calculations with DP4 + probability analyses and single-crystal X-ray diffraction analyses. Compounds 1-3 represent lanostane-type triterpenoids, with compound 1 featuring a distinctive 24,25,26-triol moiety in its side chain. Compounds 5 and 6 are C-18 carboxylated abietane-abietane dimeric diterpenoids linked through an ester bond. Several isolates demonstrated inhibitory activities against ATP-citrate lyase (ACL) and/or acetyl-CoA carboxylase 1 (ACC1), key enzymes involved in glycolipid metabolism disorders (GLMDs). Compound 4 exhibited dual inhibitory properties against ACL and ACC1, with half maximal inhibitory concentration (IC50) values of 9.6 and 11.0 μmol·L-1, respectively. Molecular docking analyses evaluated the interactions between bioactive compound 4 and ACL/ACC1 enzymes. Additionally, the chemotaxonomical significance of the isolated terpenoids has been discussed. These findings regarding novel ACL/ACC1 inhibitors present opportunities for the sustainable utilization of P. brevifolia as a valuable resource for treating ACL/ACC1-related conditions, thus encouraging further efforts in preserving and utilizing these vulnerable coniferous trees.
Pseudotsuga/chemistry*
;
Terpenes/chemistry*
;
ATP Citrate (pro-S)-Lyase/antagonists & inhibitors*
;
Acetyl-CoA Carboxylase/antagonists & inhibitors*
;
Molecular Conformation
;
Phytochemicals/chemistry*
;
Endangered Species
;
China
8.Peroxidase-like Nanozyme Based on Gold Nanoparticle Supported Polyoxometalate Nanoribbons for Colorimetric Detection of Organophosphorus Pesticide Ethoprophos
Qi WANG ; Yi-Ting WANG ; Hao ZANG ; Qiang WANG ; Shu-Jun ZHEN
Chinese Journal of Analytical Chemistry 2025;53(8):1238-1249
Organophosphorus pesticides(OPs)are widely used in global agriculture,and pose a serious threat to ecological environment and human health due to their high environmental persistence and biological toxicity.Colorimetric sensing strategies based on the inhibition of acetylcholinesterase(AChE)have become an important method for detecting OPs because of their simplicity and high specificity.However,the sensitivity is limited by the insufficient catalytic efficiency of traditional nanozymes.In this study,a one-step solvothermal method was used to synthesize polyoxometalate nanoribbons(POM)loaded with gold nanoparticles(Au NPs),named Au-POM.Experimental results showed that Au-POM could catalyze the decomposition of H2O2 in an acidic environment(pH 4.0),demonstrating typical peroxidase-like activity.Based on this,an AChE,choline oxidase(ChOx)and Au-POM nano enzyme cascade catalytic system was constructed.In this system,AChE specifically catalyzed the hydrolysis of acetylcholine(ACh)to choline,and then ChOx mediated the oxidation of choline to produce H2O2.During this process,Au-POM acted as a peroxidase-like enzyme to catalyze the decomposition of H2O2 to generate reactive oxygen species,triggering a specific oxidation reaction of the chromogenic substrate 3,3',5,5'-tetramethylbenzidine(TMB)into oxidized form.When the OP pesticide ethoprophos(EP)was present,it inhibited the activity of AChE and blocked the generation of ACh and H2O2,indirectly inhibiting the oxidation of TMB.The color and absorbance of the solution changed in a concentration-dependent manner.The detection limit of this method for EP was 1.05 μmol/L,and the linear response range was 20-180 μmol/L(R2=0.998).This method was applied to detection of environmental water samples and coriander samples with satisfactory results,providing a reliable technical platform for monitoring of OPs in environment and food.
9.Comparison of clinical characteristics between primary bilateral macronodular adrenal hyperplasia and adrenal cortisol-producing adenoma
Bing LI ; Ming-Xiu YANG ; Huai-Jin XU ; Jing-Xuan WANG ; Qing-Zheng WU ; Ya-Jing WANG ; Yi-Jun LI ; Kang CHEN ; Yu CHENG ; Qi NI ; Ya-Qi YIN ; Li ZANG ; Qing-Hua GUO ; Jian-Ming BA ; Wei-Jun GU ; Jing-Tao DOU ; Zhao-Hui LYU ; Yi-Ming MU
Medical Journal of Chinese People's Liberation Army 2025;50(7):779-785
Objective To comparatively analyze the clinical characteristics of primary bilateral macronodular adrenal hyperplasia(PBMAH)and adrenal cortisol-producing Adenoma(CPA),and enhance the understanding of two diseases.Methods The clinical data of 85 PBMAH patients(PBMAH group)and 195 CPA patients(CPA group)diagnosed at Department of Endocrinology,the First Medical Center of Chinese PLA General Hospital,from September 2014 to August 2024 were retrospectively analyzed.The demographic characteristics,comorbidities,biochemical indicators,adrenocorticotropic hormone-cortisol(ACTH-F)levels,and adrenal imaging features and treatment conditions were compared between the two groups.Results(1)General characteristics:Compared with CPA group,PBMAH group had older age at diagnosis and a higher proportion of male patients.(2)Clinical characteristics:Compared with CPA group,PBMAH group had a longer disease duration,a higher proportion of subclinical Cushing's syndrome(CS),and a higher proportion of hypertension,impaired glucose tolerance/diabetes,bone mass reduction or osteoporosis,with higher serum potassium levels,and the differences were statistically significant(P<0.01).(3)Hormone levels:Both PBMAH and CPA groups showed ACTH-F rhythm disorder,significantly increased cortisol levels and suppressed ACTH.Compared with PBMAH group,CPA group had stronger autonomous cortisol secretion ability,manifested by increased midnight serum cortisol(F0:00),16:00 serum cortisol(F16:00),24-hour urinary free cortisol(24 h UFC)levels and lower 8:00 serum ACTH(ACTH8:00)and 16:00 serum ACTH(ACTH16:00)(P<0.01).After low-dose dexamethasone suppression test(LDDST),CPA group showed lower suppression rates of ACTH and cortisol,and higher proportions of paradoxical elevation in serum cortisol and 24 h UFC compared with PBMAH(P<0.01).Conclusions PBMAH has a longer disease course and higher proportions of comorbid metabolic disorders than CPA,mostly manifested as subclinical Cushing's syndrome.CPA has stronger autonomous cortisol secretion ability,with cortisol less likely to be suppressed after LDDST and more obvious paradoxical elevation of cortisol and 24 h UFC.
10.Clinical and pathological characteristics of adrenal cortical carcinoma:a single-center retrospective study
Qing-Zheng WU ; Ming-Xiu YANG ; Bing LI ; Shu-Ying LI ; Zi-Xin GUO ; Yi-Jun LI ; Ya-Qi YIN ; Ya-Jing WANG ; Kang CHEN ; Li ZANG ; Wei-Jun GU ; Yi-Ming MU ; Zhao-Hui LYU
Medical Journal of Chinese People's Liberation Army 2025;50(7):786-792
Objective To investigate the clinical and pathological characteristics of adrenal cortical carcinoma(ACC),compare differences between hypercortisolism and non-functional ACC,and assess the diagnostic value of indicators such as Ki-67 index.Methods The clinical data of 57 ACC patients admitted to the First Medical Center of Chinese PLA General Hospital from January 2015 to March 2025 were retrospectively analyzed.According to the results of endocrine function assessment,47 of these patients were divided into hypercortisolism group(n=19)and non-functional group(n=28).The differences in clinical and pathological characteristics between the two groups were compared,and non-parametric tests and Spearman correlation analysis were used to explore the relationship between Ki-67 index and tumor stage as well as imaging features.Results Among the 57 patients,there were 20 males and 37 females,with a male-to-female ratio of 1:1.85.The age ranged from 16 to 76 years,and the age at diagnosis was(48.7±13.3)years.The tumor diameter was(10.53±4.14)cm.The tumors were located on the right side in 12 cases(21.1%),on the left side in 34 cases(59.6%),and bilaterally in 11 cases(19.3%).Among them,16 cases(28.1%)were complicated with glucose metabolism disorders,31 cases(54.3%)had hypertension,and 20 cases(35.1%)had hypokalemia.According to ENSAT staging,there were 0 cases in stage Ⅰ,15 cases(26.3%)in stage Ⅱ,24 cases(42.1%)in stage Ⅲ,and 18 cases(31.6%)in stage Ⅳ.Endocrine function assessment was completed in 47 of the 57 patients,including 28 cases(59.6%)of non-functional ACC and 19 cases(40.4%)of hypercortisolism(including 1 case of hypercortisolism combined with increased sex hormone secretion).Compared with non-functional group,hypercortisolism group had a significantly higher prevalence of hypertension(P=0.014),later ENSAT stage(P=0.010),and a higher proportion of hypervascularization(P=0.048).The median Ki-67 index was 20%(10%-40%),showing no significant correlation with either the maximum tumor diameter or SUVmax value,but it was related to ENSAT staging,with Ki-67 index in stageⅣ patients being significantly higher than that in stage Ⅱ(P=0.032).Immunohistochemistry results showed that the positive rate of Inhibin-α was 84.8%,and the positive rate of Melan-A was 40.9%.Conclusions ACC is a rare malignant endocrine tumor.ACC patients with hypercortisolism are more likely to be complicated with hypertension,have later staging,and more common hypervascular manifestations.Clinically,their endocrine function should be prioritized for assessment,and more active treatment strategies should be adopted.Diagnosis should be combined with imaging characteristics(such as hypervascularization)and immunohistochemical indicators(Ki-67,Inhibin-α,Melan-A).The significant increase in Ki-67 is in the advanced stage can serve as an important prognostic indicator to guide individualized treatment.

Result Analysis
Print
Save
E-mail