1.Pathogenesis Reasoning Chain-of-thought Supervision for Large Language Models: Syndrome Manifestation Recognition and Multidimensional Evaluation in Spleen-stomach Disorders
Shu-Han YANG ; Yu-Xin HU ; Xin-Yu YU ; Yu-Ying TU ; Yi-Chang ZANG ; Pan-Fei LI
Progress in Biochemistry and Biophysics 2026;53(5):1240-1263
Objective:
The essence of syndrome manifestation recognition in traditional Chinese medicine (TCM) is to infer the body's latent pathogenesis state from clinical observational information, rather than to perform simple label matching. However, previous studies have largely modeled this task as syndrome pattern classification within a fixed label space. That formulation does not adequately reflect the cognitive process of TCM syndrome differentiation, which centers on pathogenesis reasoning, and it cannot capture the openness, semantic variability, and cross-disease reusability of syndrome manifestation expression. This study aimed to investigate whether introducing pathogenesis reasoning chain-of-thought (PR-CoT) supervision into large language models (LLMs) could improve the quality and cognitive consistency of syndrome manifestation recognition and support cross-disease transfer.
Methods:
Syndrome manifestation recognition was formulated as a conditional generation task under the framework of clinical observational information (X)→pathogenesis structure (Z)→syndrome pattern output (Y), where Z serves as an explicit intermediate structural variable linking the clinical evidence and syndrome judgment. Within this framework, a PR-CoT-supervised dataset for syndrome manifestation recognition was constructed based on medical case records of spleen-stomach disorders. After preprocessing, information extraction, manual proofreading, and data cleaning, the dataset comprised 4,800 training cases, 400 development cases, and 400 test cases. Each sample was annotated with a structured PR-CoT consisting of three progressive levels: clinical information summarization, comprehensive pathogenesis analysis, and syndrome pattern output. Supervised fine-tuning was conducted on open-source LLMs, with an end-to-end model serving as the baseline. Qwen3-32B was used as the primary experimental model, and Qwen3-14B as the scale comparison model. A progressive multidimensional evaluation framework was further established, comprising a structural parsing level, a semantic similarity level, and an expert blind review level. At the structural parsing level, syndrome pattern expressions were decomposed into structural elements and evaluated using Precision, Recall, F1 score, and Jaccard similarity. At the semantic similarity level, independent LLMs scored the theoretical proximity between predicted and reference syndrome patterns. At the expert blind review level, three TCM experts independently evaluated model outputs on two dimensions: syndrome differentiation consistency and terminology standardization of syndrome patterns. In addition, zero-shot cross-disease transfer evaluation was conducted on gynecological and heart-system disorder test sets.
Results:
At the structural parsing level, PR-CoT supervision did not lead to a stable improvement in the element-wise overlap of syndrome pattern structural components. Compared with the corresponding baselines, neither Qwen3-32B nor Qwen3-14B showed consistent advantages in structural matching metrics after the introduction of PR-CoT supervision. In contrast, at the semantic similarity level, PR-CoT supervision produced stable positive gains across different model scales and evaluation systems. The average semantic score of Qwen3-32B increased from 6.4258 in the baseline model to 6.5850 after PR-CoT supervision, and that of Qwen3-14B increased from 5.8700 to 5.9642. At the expert blind review level, the overall score of Qwen3-32B (PR-CoT) was 7.0260±0.1077, higher than 6.4163±0.2889 for its baseline. In zero-shot cross-disease testing, the PR-CoT model still showed advantages in semantic evaluation and expert evaluation on both gynecological and heart-system disorder test sets, indicating some degree of transferability.
Conclusion:
The benefits of PR-CoT supervision are mainly reflected in TCM semantic consistency and clinical plausibility, rather than in improved hard matching of structural elements. These findings support understanding syndrome manifestation recognition as a process of generating and expressing latent pathogenesis structures, rather than as a classification task within a traditional fixed label space. By introducing pathogenesis reasoning as an explicit intermediate structure into the modeling process and combining it with a progressive multidimensional evaluation framework, this study provides a methodological pathway for intelligent TCM syndrome differentiation that integrates theoretical alignment, interpretability, and multi-level evaluation.
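The structural parsing level described in this abstract compares sets of structural elements using Precision, Recall, F1 score, and Jaccard similarity. The following is a minimal sketch of such set-based scoring, not the authors' implementation: the function name `structural_overlap`, the exact-match rule between elements, and the example elements are all assumptions made for illustration, and the abstract does not say how syndrome patterns were decomposed.

```python
from typing import Iterable


def structural_overlap(predicted: Iterable[str], reference: Iterable[str]) -> dict:
    """Set-based Precision, Recall, F1 and Jaccard between two element sets.

    Elements are compared by exact string match, which is an assumption;
    the study may have used a different matching rule.
    """
    pred, ref = set(predicted), set(reference)
    hits = len(pred & ref)  # elements present in both sets
    precision = hits / len(pred) if pred else 0.0
    recall = hits / len(ref) if ref else 0.0
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    union = len(pred | ref)
    jaccard = hits / union if union else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "jaccard": jaccard}


# Hypothetical elements, for illustration only (not taken from the study's data).
scores = structural_overlap(
    predicted=["spleen", "qi deficiency", "dampness"],
    reference=["spleen", "qi deficiency", "stomach", "cold"],
)
```

Scores of this kind reward only literal element overlap, which may be one reason a prediction that is close in meaning but worded differently gains little at this level while still scoring well at the semantic similarity level.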
3.Eye Movement and Gait Variability Analysis in Chinese Patients With Huntington’s Disease
Shu-Xia QIAN ; Yu-Feng BAO ; Xiao-Yan LI ; Yi DONG ; Zhi-Ying WU
Journal of Movement Disorders 2025;18(1):65-76
Objective:
Huntington’s disease (HD) is characterized by motor, cognitive, and neuropsychiatric symptoms. Oculomotor impairments and gait variability have been independently considered as potential markers in HD. However, an integrated analysis of eye movement and gait is lacking. We performed multiple examinations of eye movement and gait variability in HTT mutation carriers, analyzed the consistency between these parameters and clinical severity, and then examined the associations between oculomotor impairments and gait deficits.
Methods:
We included 7 patients with pre-HD, 30 patients with HD, and 30 age-matched controls. We collected demographic data and assessed the Unified Huntington’s Disease Rating Scale (UHDRS) score. Examinations, including saccades, smooth pursuit tests, and optokinetic (OPK) tests, were performed to evaluate eye movement function. Gait parameters included stride length, walking velocity, step deviation, step length, and gait phase.
Results:
HD patients had significant impairments in the latency and velocity of saccades, the gain of smooth pursuit, and the gain and slow phase velocities of OPK tests. Only the speed of saccades significantly differed between pre-HD patients and controls. HD patients also had significant impairments in stride length, walking velocity, step length, and gait phase. The parameters of eye movement and gait variability in HD patients were consistent with the UHDRS scores. There were significant correlations between eye movement and gait parameters.
Conclusion:
Our results show that eye movement and gait are impaired in HD patients and that the speed of saccades is impaired early in pre-HD. Eye movement and gait abnormalities in HD patients are significantly correlated with clinical disease severity.
4.Three-dimensional kinematic analysis can improve the efficacy of acupoint selection for post-stroke patients with upper limb spastic paresis: A randomized controlled trial.
Xin-Yun HUANG ; Ou-Ping LIAO ; Shu-Yun JIANG ; Ji-Ming TAO ; Yang LI ; Xiao-Ying LU ; Yi-Ying LI ; Ci WANG ; Jing LI ; Xiao-Peng MA
Journal of Integrative Medicine 2025;23(1):15-24
BACKGROUND:
China is seeing a growing demand for rehabilitation treatments for post-stroke upper limb spastic paresis (PSSP-UL). Although acupuncture is known to be effective for PSSP-UL, there is room to enhance its efficacy.
OBJECTIVE:
This study explored a semi-personalized acupuncture approach for PSSP-UL that used three-dimensional kinematic analysis (3DKA) results to select additional acupoints, and investigated the feasibility, efficacy and safety of this approach.
DESIGN, SETTING, PARTICIPANTS AND INTERVENTIONS:
This single-blind, single-center, randomized, controlled trial involved 74 participants who experienced a first-ever ischemic or hemorrhagic stroke with spastic upper limb paresis. The participants were then randomly assigned to the intervention group or the control group in a 1:1 ratio. Both groups received conventional treatments and acupuncture treatment 5 days a week for 4 weeks. The main acupoints in both groups were the same, while participants in the intervention group received additional acupoints selected on the basis of 3DKA results. Follow-up assessments were conducted for 8 weeks after the treatment.
MAIN OUTCOME MEASURES:
The primary outcome was the Fugl-Meyer Assessment for Upper Extremity (FMA-UE) response rate (≥ 6-point change) at week 4. Secondary outcomes included changes in motor function (FMA-UE), Brunnstrom recovery stage (BRS), manual muscle test (MMT), spasticity (Modified Ashworth Scale, MAS), and activities of daily life (Modified Barthel Index, MBI) at week 4 and week 12.
RESULTS:
Sixty-four participants completed the trial and were included in the analyses. Compared with the control group, the intervention group exhibited a significantly higher FMA-UE response rate at week 4 (χ² = 5.479, P = 0.019) and greater improvements in FMA-UE at both week 4 and week 12 (both P < 0.001). The intervention group also showed greater improvements from baseline in the MMT grades for shoulder adduction and elbow flexion at weeks 4 and 12 as well as thumb adduction at week 4 (P = 0.007, P = 0.049, P = 0.019, P = 0.008, P = 0.029, respectively). The intervention group showed a better change in the MBI at both week 4 and week 12 (P = 0.004 and P = 0.010, respectively). Although the intervention group had a higher BRS for the hand at week 12 (P = 0.041), no intergroup differences were observed at week 4 (all P > 0.05). The two groups showed no differences in MAS grades or in BRS for the arm at weeks 4 and 12 (all P > 0.05).
CONCLUSION:
Semi-personalized acupuncture prescription based on 3DKA results significantly improved motor function, muscle strength, and activities of daily living in patients with PSSP-UL.
TRIAL REGISTRATION:
Chinese Clinical Trial Registry ChiCTR2200056216. Please cite this article as: Huang XY, Liao OP, Jiang SY, Tao JM, Li Y, Lu XY, Li YY, Wang C, Li J, Ma XP. Three-dimensional kinematic analysis can improve the efficacy of acupoint selection for post-stroke patients with upper limb spastic paresis: A randomized controlled trial. J Integr Med. 2025; 23(1): 15-24.
Humans ; Male ; Female ; Middle Aged ; Acupuncture Points ; Upper Extremity/physiopathology* ; Biomechanical Phenomena ; Single-Blind Method ; Aged ; Stroke/therapy* ; Acupuncture Therapy/methods* ; Stroke Rehabilitation/methods* ; Adult ; Muscle Spasticity/therapy* ; Paresis/physiopathology* ; Treatment Outcome
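The primary outcome in this record is a responder rate, where a participant counts as a responder when the FMA-UE score changes by at least 6 points at week 4, and the two groups are compared with a chi-square statistic. The sketch below shows one way such a calculation could be set up. The function names, the use of an uncorrected Pearson chi-square, and all scores are assumptions for illustration; the abstract does not state which chi-square variant was used, and none of the numbers come from the trial.

```python
def response_rate(baseline, week4, threshold=6):
    """Count participants whose score improved by at least `threshold` points."""
    responders = sum(1 for b, w in zip(baseline, week4) if w - b >= threshold)
    return responders, len(baseline)


def pearson_chi2_2x2(a, b, c, d):
    """Uncorrected Pearson chi-square for the 2x2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * (a * d - b * c) ** 2 / denom if denom else 0.0


# Hypothetical FMA-UE scores, for illustration only.
resp_i, n_i = response_rate([20, 25, 30, 18], [28, 30, 37, 24])  # intervention
resp_c, n_c = response_rate([22, 26, 19, 31], [25, 33, 22, 34])  # control

# Rows are groups; columns are responders and non-responders.
chi2 = pearson_chi2_2x2(resp_i, n_i - resp_i, resp_c, n_c - resp_c)
```

With one degree of freedom, the resulting statistic would then be compared against the chi-square distribution to obtain a P value.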
5.International clinical practice guideline on the use of traditional Chinese medicine for functional dyspepsia (2025).
Sheng-Sheng ZHANG ; Lu-Qing ZHAO ; Xiao-Hua HOU ; Zhao-Xiang BIAN ; Jian-Hua ZHENG ; Hai-He TIAN ; Guan-Hu YANG ; Won-Sook HONG ; Yu-Ying HE ; Li LIU ; Hong SHEN ; Yan-Ping LI ; Sheng XIE ; Jin SHU ; Bin-Fang ZENG ; Jun-Xiang LI ; Zhen LIU ; Zheng-Hua XIAO ; Jing-Dong XIAO ; Pei-Yong ZHENG ; Shao-Gang HUANG ; Sheng-Liang CHEN ; Gui-Jun FEI
Journal of Integrative Medicine 2025;23(5):502-518
Functional dyspepsia (FD), characterized by persistent or recurrent dyspeptic symptoms without identifiable organic, systemic or metabolic causes, is an increasingly recognized global health issue. The objective of this guideline is to equip clinicians and nursing professionals with evidence-based strategies for the management and treatment of adult patients with FD using traditional Chinese medicine (TCM). The Guideline Development Group consulted existing TCM consensus documents on FD and convened a panel of 35 clinicians to generate initial clinical queries. To address these queries, a systematic literature search was conducted across PubMed, EMBASE, the Cochrane Library, China National Knowledge Infrastructure (CNKI), VIP Database, China Biology Medicine (SinoMed) Database, Wanfang Database, Traditional Medicine Research Data Expanded (TMRDE), and the Traditional Chinese Medical Literature Analysis and Retrieval System (TCMLARS). The evidence from the literature was critically appraised using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach. The strength of the recommendations was ascertained through a consensus-building process involving TCM and allopathic medicine experts, methodologists, pharmacologists, nursing specialists, and health economists, leveraging their collective expertise and empirical knowledge. The guideline comprises a total of 43 evidence-informed recommendations that span a range of clinical aspects, including the pathogenesis according to TCM, diagnostic approaches, therapeutic interventions, efficacy assessments, and prognostic considerations. Please cite this article as: Zhang SS, Zhao LQ, Hou XH, Bian ZX, Zheng JH, Tian HH, Yang GH, Hong WS, He YY, Liu L, Shen H, Li YP, Xie S, Shu J, Zeng BF, Li JX, Liu Z, Xiao ZH, Xiao JD, Zheng PY, Huang SG, Chen SL, Fei GJ. International clinical practice guideline on the use of traditional Chinese medicine for functional dyspepsia (2025). J Integr Med. 2025; 23(5):502-518.
Dyspepsia/drug therapy* ; Humans ; Medicine, Chinese Traditional/methods* ; Practice Guidelines as Topic ; Drugs, Chinese Herbal/therapeutic use*
6.Association between Fish Consumption and Stroke Incidence Across Different Predicted Risk Populations: A Prospective Cohort Study from China.
Hong Yue HU ; Fang Chao LIU ; Ke Yong HUANG ; Chong SHEN ; Jian LIAO ; Jian Xin LI ; Chen Xi YUAN ; Ying LI ; Xue Li YANG ; Ji Chun CHEN ; Jie CAO ; Shu Feng CHEN ; Dong Sheng HU ; Jian Feng HUANG ; Xiang Feng LU ; Dong Feng GU
Biomedical and Environmental Sciences 2025;38(1):15-26
OBJECTIVE:
The relationship between fish consumption and stroke is inconsistent, and it is uncertain whether this association varies across predicted stroke risks.
METHODS:
A cohort study comprising 95,800 participants from the Prediction for Atherosclerotic Cardiovascular Disease Risk in China project was conducted. A standardized questionnaire was used to collect data on fish consumption. Participants were stratified into low- and moderate-to-high-risk categories based on their 10-year stroke risk prediction scores. Hazard ratios (HRs) and 95% confidence intervals (CIs) were estimated using Cox proportional hazard models, and additive interaction was assessed using the relative excess risk due to interaction (RERI), attributable proportion (AP), and synergy index (SI).
RESULTS:
During 703,869 person-years of follow-up, 2,773 incident stroke events were identified. Higher fish consumption was associated with a lower risk of stroke, and the association was stronger among moderate-to-high-risk individuals (HR = 0.53, 95% CI: 0.47-0.60) than among low-risk individuals (HR = 0.64, 95% CI: 0.49-0.85). A significant additive interaction between fish consumption and predicted stroke risk was observed (RERI = 4.08, 95% CI: 2.80-5.36; SI = 1.64, 95% CI: 1.42-1.89; AP = 0.36, 95% CI: 0.28-0.43).
CONCLUSION:
Higher fish consumption was associated with a lower risk of stroke, and this beneficial association was more pronounced in individuals with moderate-to-high stroke risk.
Humans ; China/epidemiology* ; Male ; Female ; Stroke/etiology* ; Middle Aged ; Prospective Studies ; Incidence ; Aged ; Animals ; Fishes ; Risk Factors ; Diet ; Seafood ; Adult ; Cohort Studies
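The three additive interaction measures named in this record (RERI, AP, and SI) have standard definitions in terms of relative risks for the two exposures alone and combined. The sketch below applies those textbook formulas. The function name, the argument names, and the input values are hypothetical, and the abstract does not state which exposure categories served as the reference group, so this is not a reproduction of the study's estimates.

```python
def additive_interaction(rr10: float, rr01: float, rr11: float):
    """Return (RERI, AP, SI) from three relative risks.

    rr10: risk with only the first factor present, relative to neither
    rr01: risk with only the second factor present, relative to neither
    rr11: risk with both factors present, relative to neither
    """
    reri = rr11 - rr10 - rr01 + 1.0  # relative excess risk due to interaction
    ap = reri / rr11  # proportion of the joint risk attributable to interaction
    denom = (rr10 - 1.0) + (rr01 - 1.0)
    si = (rr11 - 1.0) / denom if denom else float("nan")  # synergy index
    return reri, ap, si


# Hypothetical relative risks, for illustration only.
reri, ap, si = additive_interaction(rr10=2.0, rr01=3.0, rr11=6.0)
```

No additive interaction corresponds to RERI = 0, AP = 0, and SI = 1, so values above those thresholds, as reported in the abstract, point to a joint effect larger than the sum of the separate effects.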
8.MYCN-Mediated Transcriptional Activation of IDH2 Enhances Proliferation, Migration, and Invasion in Cervical Squamous Cell Carcinoma through the HIF1-α Pathway.
Xiao Juan LIU ; Hui MA ; Xiao Yan LI ; Chun Xing MA ; Li Sha SHU ; Hui Ying ZHANG
Biomedical and Environmental Sciences 2025;38(8):1003-1008
9.Health Risks from Exposure to PM2.5-bound Polycyclic Aromatic Hydrocarbons in Fumes Emitted from Various Cooking Styles and Their Respiratory Deposition in a City Population Stratified by Age and Sex.
Jun Feng ZHANG ; Xi CHEN ; Ke GAO ; Shui Yuan CHENG ; Wen Jiao DUAN ; Li Ying FU ; Jian Jia LI ; Shu Shu LAN ; Cui Lan FANG
Biomedical and Environmental Sciences 2025;38(10):1230-1245
OBJECTIVES:
To characterize fine particulate matter (PM2.5)-bound polycyclic aromatic hydrocarbons (PAHs) emitted from different cooking fumes and their exposure routes and assess their health-associated impact to provide a reference for health risk prevention from PAH exposure across different age and sex groups.
METHODS:
Sixteen PM2.5-bound PAHs emitted from 11 cooking styles were analyzed using GC-MS/MS. The health hazards of these PAHs in the Handan City population (stratified by age and sex) were predicted using the incremental lifetime cancer risk (ILCR) model. The respiratory deposition doses (RDDs) of the PAHs in children and adults were calculated using the PM2.5 deposition rates in the upper airway, tracheobronchial, and alveolar regions.
RESULTS:
The total concentrations of PM2.5-bound PAHs ranged from 61.10 to 403.80 ng/m³. Regardless of cooking style, the total ILCR values for adults (1.23 × 10⁻⁶ to 3.70 × 10⁻⁶) and older adults (1.28 × 10⁻⁶ to 3.88 × 10⁻⁶) exceeded the acceptable limit of 1.00 × 10⁻⁶. With increasing age, the total ILCR value first declined and then increased, varying substantially among the population groups. Cancer risk exhibited particularly high sensitivity to short exposure to barbecue-derived PAHs under equivalent body weights. Furthermore, barbecue, Sichuan and Hunan cuisine, Chinese cuisine, and Chinese fast food were associated with higher RDDs for both adults and children.
CONCLUSION:
Total ILCR values exceeded the acceptable limit for both adult females and adult males, with all cooking styles showing a potentially high cancer risk. Our findings serve as an important reference for refining regulatory strategies related to catering emissions and mitigating health risks associated with cooking styles.
Humans ; Polycyclic Aromatic Hydrocarbons/analysis* ; Cooking/methods* ; Male ; Female ; Particulate Matter/analysis* ; Adult ; Child ; Middle Aged ; Air Pollutants/analysis* ; Adolescent ; Air Pollution, Indoor/analysis* ; Young Adult ; Child, Preschool ; Aged ; China ; Inhalation Exposure ; Age Factors ; Sex Factors ; Cities ; Infant
10.Research Progress in Copper Homeostasis and Diseases.
Shu-Ting QIU ; Xiao-Hua TAN ; Shi-Han SHAO ; Li YU ; Ying-Ying ZHANG ; Yue-Jia CAO ; Di CHUN-HONG
Acta Academiae Medicinae Sinicae 2025;47(1):102-109
As an indispensable trace element in the human body, copper plays an important role in various physiological and biochemical reactions. The dyshomeostasis of copper leads to the disorder of copper metabolism and the occurrence of related diseases. Cuproptosis, a newly proposed regulatory cell death mode, is different from the known apoptosis, pyroptosis, necroptosis, and ferroptosis. Recent studies have found that the dyshomeostasis of copper has been observed in a variety of cancers. Therefore, targeting copper for disease treatment may become a new strategy and a new idea. This article systematically summarizes the fundamental properties of copper, copper dyshomeostasis-related diseases (Menkes syndrome, Wilson's disease, and cancer) and their treatment, and reviews the research progress in cuproptosis.
Humans ; Copper/metabolism* ; Homeostasis ; Neoplasms/metabolism* ; Hepatolenticular Degeneration/metabolism* ; Menkes Kinky Hair Syndrome/metabolism*
