1.Pathogenesis Reasoning Chain-of-thought Supervision for Large Language Models: Syndrome Manifestation Recognition and Multidimensional Evaluation in Spleen-stomach Disorders
Shu-Han YANG ; Yu-Xin HU ; Xin-Yu YU ; Yu-Ying TU ; Yi-Chang ZANG ; Pan-Fei LI
Progress in Biochemistry and Biophysics 2026;53(5):1240-1263
Objective The essence of syndrome manifestation recognition in traditional Chinese medicine (TCM) is to infer the body’s latent pathogenesis state from clinical observational information, rather than to perform simple label matching. However, previous studies have largely modeled this task as syndrome pattern classification within a fixed label space, which does not adequately reflect the cognitive process of TCM syndrome differentiation centered on pathogenesis reasoning and fails to capture the openness, semantic variability, and cross-disease reusability of syndrome manifestation expression. This study aimed to investigate whether introducing pathogenesis reasoning chain-of-thought (PR-CoT) supervision into large language models (LLMs) could improve the quality and cognitive consistency of syndrome manifestation recognition and support cross-disease transfer. Methods Syndrome manifestation recognition was formulated as a conditional generation task under the framework of clinical observational information (X)→pathogenesis structure (Z)→syndrome pattern output (Y), where Z serves as an explicit intermediate structural variable linking the clinical evidence and the syndrome judgment. Within this framework, a PR-CoT-supervised dataset for syndrome manifestation recognition was constructed from medical case records of spleen-stomach disorders. After preprocessing, information extraction, manual proofreading, and data cleaning, the dataset comprised 4 800 training cases, 400 development cases, and 400 test cases. Each sample was annotated with a structured PR-CoT consisting of three progressive levels: clinical information summarization, comprehensive pathogenesis analysis, and syndrome pattern output. Supervised fine-tuning was conducted on open-source LLMs, with an end-to-end model serving as the baseline. Qwen3-32B was used as the primary experimental model, and Qwen3-14B as the scale comparison model.
A progressive multidimensional evaluation framework was further established, comprising a structural parsing level, a semantic similarity level, and an expert blind review level. At the structural parsing level, syndrome pattern expressions were decomposed into structural elements and evaluated using Precision, Recall, F1 score, and Jaccard similarity. At the semantic similarity level, independent LLMs scored the theoretical proximity between predicted and reference syndrome patterns. At the expert blind review level, three TCM experts independently evaluated model outputs on two dimensions: syndrome differentiation consistency and terminology standardization of syndrome patterns. In addition, zero-shot cross-disease transfer evaluation was conducted on gynecological and heart-system disorder test sets. Results At the structural parsing level, PR-CoT supervision did not lead to a stable improvement in the element-wise overlap of syndrome pattern structural components. Compared with the corresponding baselines, neither Qwen3-32B nor Qwen3-14B showed consistent advantages in structural matching metrics after the introduction of PR-CoT supervision. In contrast, at the semantic similarity level, PR-CoT supervision produced stable positive gains across different model scales and evaluation systems. The average semantic score of Qwen3-32B increased from 6.425 8 in the baseline model to 6.585 0 after PR-CoT supervision, and that of Qwen3-14B increased from 5.870 0 to 5.964 2. At the expert blind review level, the overall score of Qwen3-32B (PR-CoT) was 7.026 0±0.107 7, higher than 6.416 3±0.288 9 for its baseline. In zero-shot cross-disease testing, the PR-CoT model still showed advantages in semantic evaluation and expert evaluation on both gynecological and heart-system disorder test sets, indicating a certain degree of transferability.
Conclusion The benefits of PR-CoT supervision are mainly reflected in TCM semantic consistency and clinical plausibility, rather than in improved hard matching of structural elements. These findings support understanding syndrome manifestation recognition as a process of generating and expressing latent pathogenesis structures, rather than as a classification task within a traditional fixed label space. By introducing pathogenesis reasoning as an explicit intermediate structure into the modeling process and combining it with a progressive multidimensional evaluation framework, this study provides a methodological pathway for intelligent TCM syndrome differentiation that integrates theoretical alignment, interpretability, and multi-level evaluation.
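The structural parsing level described in the abstract above can be illustrated with a minimal sketch. It assumes that each syndrome pattern has already been decomposed into a flat collection of element strings; the abstract does not say how the decomposition is done, so the function name `element_scores` and the example elements are hypothetical and are not taken from the paper.

```python
def element_scores(predicted, reference):
    """Set-overlap metrics between predicted and reference structural elements.

    Assumption: elements are compared by exact string match after
    deduplication; the paper's actual matching rule is not given.
    """
    pred, ref = set(predicted), set(reference)
    overlap = len(pred & ref)
    union = len(pred | ref)

    precision = overlap / len(pred) if pred else 0.0
    recall = overlap / len(ref) if ref else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    jaccard = overlap / union if union else 0.0

    return {
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "jaccard": jaccard,
    }


if __name__ == "__main__":
    # Hypothetical elements, for illustration only.
    predicted = ["spleen", "qi deficiency", "dampness"]
    reference = ["spleen", "qi deficiency", "stomach", "cold"]
    for name, value in element_scores(predicted, reference).items():
        print(f"{name}: {value:.4f}")
```

Exact-match overlap of this kind rewards identical wording only, which is consistent with the abstract's observation that gains in semantic consistency need not show up in hard structural matching.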
3.A case report of premature ovarian insufficiency caused by a novel FANCL mutation (c.1033G>A) and in vitro functional validation
Yi-qing LIU ; Shu-ting REN ; Yun-cheng PAN ; Feng ZHANG ; Xiao-jin ZHANG ; Yan-hua WU
Fudan University Journal of Medical Sciences 2025;52(2):270-276,291
Objective To investigate the characteristics of a novel FANCL mutation identified in a patient with premature ovarian insufficiency (POI) and to explore its potential functional impact in vitro. Methods A novel heterozygous FANCL mutation, c.1033G>A (p.Glu345Lys), was identified in a patient with POI by whole exome sequencing (WES); it was inherited from the patient's mother, who had undergone early menopause. The mutation was confirmed by Sanger sequencing, and the conservation of the mutation site was predicted with software. Plasmids overexpressing mutant and wild-type FANCL were constructed and transiently transfected into HEK293T cells, and the effect of the mutation was assessed by qPCR, immunofluorescence and Western blot. Results The mutation site was located within the RING domain of FANCL, which was highly conserved across multiple species. The mutant showed no significant change in mRNA expression level, whereas its protein expression level was significantly down-regulated. In vitro cellular experiments further revealed that the mutation lowers protein levels by reducing protein stability. Conclusion A FANCL c.1033G>A mutation was identified; it may cause disease in this POI patient by decreasing protein stability.
4.Effects of glycerol ingestion on pure tone audiometry, distortion products otoacoustic emission, and electrocochleography in patients with Ménière disease
Hui PAN ; Linlin WANG ; Cheng LUO ; Meng GONG ; Mengjun WU ; Yi SHU ; Wen XIE ; Hongjun XIAO ; Bo LIU
Journal of Audiology and Speech Pathology 2025;33(4):372-376
Objective To investigate the effects of glycerol ingestion on pure tone audiometry (PTA), distortion products otoacoustic emission (DPOAE), and electrocochleography (ECochG) in patients with Ménière disease (MD). Methods The glycerol test was conducted in 50 patients with MD. PTA was performed four times: before glycerol intake and 1, 2 and 3 hours after intake. DPOAE and ECochG were performed before glycerol intake and 2 hours after intake. All results were analyzed to assess the effect of glycerol on cochlear function in patients with MD. Results ① 55% of MD patients tested positive in the PTA glycerol test, and the positive rate increased gradually from 1 to 3 hours after glycerol ingestion (P<0.05). For the 33 positive ears, the pure tone threshold decreased the most between 1 and 2 hours and reached its lowest level at 3 hours. Thresholds at 0.5 kHz, 1 kHz and 2 kHz dropped the most. ② The positive rate of the DPOAE glycerol test was 56.67%, with 34 positive ears showing a significant increase in amplitude at f2 frequencies between 0.75 and 2 kHz. ③ The positive rate of the ECochG glycerol test was 13.64%. The decrease in the -SP/AP ratio after glycerol ingestion was not statistically significant (P>0.05). Conclusion Glycerol ingestion can alter the results of PTA, DPOAE and ECochG to varying degrees and influence cochlear function to some extent.
5.Research progress on mechanism of curcumin in treatment of depression
Lin WANG ; Qi-fei PAN ; Wen-juan LONG ; Jia-rong DU ; Zhong-yang HU ; Xin-yao LI ; Yi-shu CHEN ; Dong-dong QIN ; Xiao-man LYU
Chinese Pharmacological Bulletin 2025;41(9):1618-1623
Depression is a prevalent mental and emotional disorder that often results in significant emotional disturbances, cognitive dysfunction, and memory impairments. It is characterized by a high incidence rate, a substantial disability burden, and limited therapeutic efficacy. Currently, the long-term use of medications for the treatment of depression can result in a range of adverse reactions, highlighting the urgent need to explore novel approaches that can effectively alleviate depressive symptoms while minimizing side effects. Curcumin, a natural polyphenolic compound derived from the rhizome of turmeric, demonstrates considerable potential in the prevention and treatment of depression, owing to its diverse array of biological activities. In recent years, numerous studies have investigated the use of curcumin for the treatment of depression. This article aims to provide a comprehensive review of the mechanisms of action underlying curcumin's efficacy in treating depression. Specifically, it focuses on its ability to improve neurotransmitter imbalances, restore neural plasticity, alleviate neural damage, mitigate dysfunction of the hypothalamic-pituitary-adrenal (HPA) axis, regulate inflammatory factors and neuroinflammatory signaling pathways, and inhibit oxidative stress. This review is intended to offer insights and methodological references for basic research on curcumin, as well as for the development of novel therapeutic agents for the treatment of depression.
9.Simultaneous content determination of seventeen constituents in Yangxue Ruanjian Capsules by UPLC-MS/MS
Yong-Ming LIU ; Shu-Sen LIU ; Yi-Zhe XIONG ; Xiang WANG ; Yu-Yun WU ; Jin LIU ; Ling-Yun PAN ; Guo-Qing DU ; Hong-Sheng ZHAN
Chinese Traditional Patent Medicine 2024;46(2):353-358
AIM To establish a UPLC-MS/MS method for the simultaneous content determination of liquiritin apioside, albiflorin, swertiamarin, methyl gallate, benzoylpaeoniflorin, sweroside, 6′-O-β-D-glucosylgentiopicroside, isoliquiritigenin, loganic acid, liquiritigenin, gallic acid, paeoniflorin, oxypaeoniflorin, gentiopicroside, glycyrrhizic acid, isoliquiritoside and liquiritin in Yangxue Ruanjian Capsules. METHODS The analysis was performed on a Waters BEH C18 column (2.1 mm×100 mm, 1.7 μm) thermostated at 40 ℃, with a mobile phase of 2 mmol/L ammonium acetate (containing 0.1% formic acid)-acetonitrile delivered at 0.3 mL/min by gradient elution; an electrospray ionization source was operated in negative ion mode with multiple reaction monitoring. RESULTS The seventeen constituents showed good linear relationships within their respective ranges (r>0.999 6), with average recoveries of 91.33%-104.03% and RSDs of 1.58%-3.50%. CONCLUSION This rapid, accurate and stable method can be used for the quality control of Yangxue Ruanjian Capsules.
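The abstract above reports average recoveries and RSDs without stating how they were computed. As a hedged illustration only, the sketch below uses the conventional spiked-recovery formula and the sample standard deviation. The function names and all numeric inputs are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def recovery_percent(found, original, added):
    """Spiked recovery: share of the added standard that is measured back.

    found    - amount measured in the spiked sample
    original - amount already present in the unspiked sample
    added    - amount of reference standard spiked in
    """
    if added <= 0:
        raise ValueError("added amount must be positive")
    return (found - original) / added * 100.0


def rsd_percent(values):
    """Relative standard deviation (sample SD divided by mean), in percent."""
    if len(values) < 2:
        raise ValueError("at least two replicates are required")
    return stdev(values) / mean(values) * 100.0


if __name__ == "__main__":
    # Hypothetical replicate measurements (arbitrary units), for illustration.
    replicates = [(1.95, 1.00, 1.00), (2.01, 1.00, 1.00), (1.98, 1.00, 1.00)]
    recoveries = [recovery_percent(f, o, a) for f, o, a in replicates]
    print("mean recovery %:", round(mean(recoveries), 2))
    print("RSD %:", round(rsd_percent(recoveries), 2))
```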
10.Preparation and performance evaluation of S100B time-resolved fluorescence immunoassay kit
Dong-Qing FENG ; Bu-Zhuo XU ; Shu-Hong LUO ; Yu-Nan WU ; Zhuo ZHANG ; Hao TANG ; Yi-Ming WENG ; Ruo-Pan HUANG ; Xu-Dong SONG
Chinese Medical Equipment Journal 2024;45(1):47-55
Objective To develop a time-resolved fluorescence immunoassay kit for the rapid, accurate and quantitative detection of S100B protein in serum and to evaluate its performance. Methods The test strip was prepared from time-resolved fluorescent microsphere-labeled anti-S100B polyclonal antibody and rabbit IgG antibody, labeling pads, sample pads, S100B nitrocellulose membranes and absorbent paper, and the S100B time-resolved fluorescence immunoassay kit was obtained by assembling the cartridge. The kit was evaluated for standard curve, accuracy, minimum detection limit, linear range, specificity, reproducibility and stability. Reference intervals were determined with the kit on 199 serum and plasma samples from healthy individuals in one region, and the clinical performance of the kit was compared with that of the Roche Elecsys S100 kit by a synchronous blind method to assess the agreement of the two kits on 142 samples. Results The standard curve of the S100B time-resolved fluorescence immunoassay kit was y=(1.133 02+1.752 24)/[1+(x/1.082 20)^(-0.603 52)]-1.752 24 (R²=0.999 08), and the linear range was [0.05, 30] ng/mL. The kit met the requirements: relative deviation of accuracy within ±15%, minimum detection limit not higher than 0.05 ng/mL, relative deviation of specificity within ±15%, and intra- and inter-batch coefficients of variation below 15%. The stability test indicated that the kit was valid for 12 months at 2-30 ℃. The reference intervals of serum and plasma samples measured by the kit were both lower than 0.3 ng/mL. Clinical testing showed that the results of the kit and the Roche Elecsys S100 kit were in high agreement (Kappa=0.906 1>0.80) and met the requirements. Conclusion The kit detects the concentration of S100B protein in serum quickly, accurately and quantitatively, and provides a reference for the diagnosis and treatment of neurological diseases, autoimmune diseases, cerebrovascular diseases, etc.
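The standard curve reported above has the shape of a four-parameter logistic (4PL) model, y = (A - D)/[1 + (x/C)^B] + D. The sketch below evaluates and inverts the curve under that assumption, reading the coefficient -0.603 52 as the exponent B. The names `signal` and `concentration` are hypothetical. The abstract does not say whether y is a raw or transformed fluorescence reading, and taking x to be in ng/mL is an inference from the reported linear range.

```python
# Coefficients read from the abstract, assuming the 4PL form
#   y = (A - D) / (1 + (x / C) ** B) + D
A = 1.13302    # asymptote approached as x grows (B is negative here)
D = -1.75224   # asymptote approached as x tends to 0
C = 1.08220    # inflection point, assumed to be in ng/mL
B = -0.60352   # slope factor (assumed exponent)


def signal(x):
    """Predicted response y for an S100B concentration x (x > 0)."""
    if x <= 0:
        raise ValueError("concentration must be positive")
    return (A - D) / (1 + (x / C) ** B) + D


def concentration(y):
    """Invert the 4PL curve: concentration x that yields response y."""
    low, high = min(A, D), max(A, D)
    if not low < y < high:
        raise ValueError("response lies outside the curve's asymptotes")
    return C * ((A - D) / (y - D) - 1) ** (1 / B)


if __name__ == "__main__":
    # Round-trip check across the reported range [0.05, 30].
    for x in (0.05, 1.0, 30.0):
        y = signal(x)
        print(f"x={x:g}  y={y:.5f}  back-calculated x={concentration(y):.5f}")
```

Back-calculating concentrations with `concentration` is how a reader would typically convert a measured response into a concentration, provided the response falls strictly between the two asymptotes.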
