1.Strategies for Building an Artificial Intelligence-Empowered Trusted Federated Evidence-Based Analysis Platform for Spleen-Stomach Diseases in Traditional Chinese Medicine
Bin WANG ; Huiying ZHUANG ; Zhitao MAN ; Lifeng REN ; Chang HE ; Chen WU ; Xulei HU ; Xiaoxiao WEN ; Chenggong XIE ; Xudong TANG
Journal of Traditional Chinese Medicine 2026;67(1):95-102
This paper outlines the development of artificial intelligence (AI) and its applications in traditional Chinese medicine (TCM) research, and elucidates the roles and advantages of large language models, knowledge graphs, and natural language processing in advancing syndrome identification, prescription generation, and mechanism exploration. Using spleen-stomach diseases as an example, it demonstrates the empowering effects of AI in classical literature mining, precise clinical syndrome differentiation, efficacy and safety prediction, and intelligent education, highlighting an upgraded research paradigm that evolves from data-driven and knowledge-driven approaches to intelligence-driven models. To address challenges related to privacy protection and regulatory compliance in cross-institutional data collaboration, a "trusted federated evidence-based analysis platform for TCM spleen-stomach diseases" is proposed, integrating blockchain-based smart contracts, federated learning, and secure multi-party computation. The deep integration of AI with privacy-preserving computing is reshaping research and clinical practice in TCM spleen-stomach diseases, providing feasible pathways and a technical framework for building a high-quality, trustworthy TCM big-data ecosystem and achieving precision syndrome differentiation.
2.Herbal Textual Research on Quisqualis Fructus in Famous Classical Formulas
Xiuping WEN ; Shiying CHEN ; Ying TAN ; Guanwen ZHENG ; Huilong XU ; Wen XU ; Chengzi YANG ; Zehao HUANG ; Yu LIN ; Zhilai ZHAN
Chinese Journal of Experimental Traditional Medical Formulae 2026;32(6):225-237
By consulting ancient materia medica, medical books, prescription books, and local literature, combined with modern literature and standards, this article systematically analyzed the historical evolution of the origin, scientific name, producing area, quality evaluation, harvesting, processing, and other aspects of Quisqualis Fructus. It also summarized and explored the development patterns of its medicinal properties and efficacy, along with their underlying causes, in order to provide support for the development and utilization of famous classical formulas containing this herb. According to the textual research, Shijunzi was first recorded as Liuqiuzi in Nanfang Caomuzhuang of the Jin dynasty. The name Shijunzi was first used in Kaibao Bencao of the Song dynasty and has been used consistently throughout subsequent dynasties, alongside aliases such as Junziren, Sijunzi, and Dujilizi. The mainstream source of Quisqualis Fructus used in past dynasties has been the dried mature fruits of Quisqualis indica, a plant belonging to the family Combretaceae. In modern times, its variety Q. indica var. villosa has also been recorded as a source of the medicinal material Quisqualis Fructus. In 2007, the Flora of China (English edition) designated Q. indica var. villosa as a synonym of Q. indica. Today, the accepted name of Shijunzi has been updated to Combretum indicum. According to ancient herbal records, the producing areas of Quisqualis Fructus were Guangdong, Hong Kong, Macao, Guangxi, Hainan, Sichuan, and Fujian, and then gradually expanded to Yunnan, Taiwan, Jiangxi, and Guizhou. Since the Song dynasty, two major production regions, Sichuan-Chongqing and Fujian, have gradually emerged. Currently, it is primarily cultivated in Chongqing, Guangxi, and other areas, with Chongqing yielding the highest output. Since modern times, superior quality has been defined by large size, a purple-black surface, plump grains, and a yellowish-white kernel. 
According to ancient herbal records, Quisqualis Fructus was harvested in the seventh and eighth months of the lunar calendar and was mostly used raw, either shelled or with the shell intact; processing methods included cleaning, slicing, mixing, steaming, roasting, stewing, and frying. Currently, it is harvested in autumn and then sun-dried or dried at low heat, with processing methods including cleaning, stir-frying, and stewing. In ancient and modern literature, the records of the properties, functions, and indications of Quisqualis Fructus are basically the same: sweet in taste, warm in nature, predominantly non-toxic, and belonging to the spleen and stomach meridians. It kills parasites, eliminates accumulation, and invigorates the spleen, and is used for ascariasis, enterobiasis, abdominal pain due to worm accumulation, and infantile malnutrition. The contraindications primarily include avoiding consumption by individuals without parasitic infestations, limiting use in those with spleen-stomach deficiency-cold, refraining from drinking hot tea during medication, and avoiding excessive intake. Based on the textual research, it is suggested that the dried mature fruits of Q. indica be used as the medicinal material for the development of famous classical formulas containing Quisqualis Fructus. Processing methods may be chosen according to prescription requirements, and the raw product is recommended for medicinal use if no method is specified.
3.Establishment of a new predictive model for esophagogastric variceal rebleeding in liver cirrhosis based on clinical features
Wen GUO ; Xuyulin YANG ; Run GAO ; Yaxin CHEN ; Kun YIN ; Qian LI ; Manli CUI ; Mingxin ZHANG
Journal of Clinical Hepatology 2026;42(1):101-110
Objective: To establish a new noninvasive, simple, and convenient clinical predictive model by identifying independent predictive factors for rebleeding after endoscopic therapy in cirrhotic patients with esophagogastric variceal bleeding (EGVB), and to provide a basis for individualized risk assessment and development of clinical intervention strategies. Methods: Cirrhotic patients with EGVB who were diagnosed and treated in The First Affiliated Hospital of Xi’an Medical University from September 2018 to October 2023 were enrolled as subjects, and according to whether the patient experienced rebleeding within 1 year after endoscopic therapy, they were divided into a rebleeding group with 93 patients and a non-rebleeding group with 84 patients. Clinical data were collected and analyzed. The independent samples t-test was used for comparison of normally distributed continuous data between two groups, and the Mann-Whitney U test was used for comparison of non-normally distributed continuous data between two groups; the chi-square test was used for comparison of categorical data between two groups. A logistic model was established based on the results of the univariate and multivariate analyses, and the receiver operating characteristic (ROC) curve and the area under the ROC curve (AUC) were used to assess the accuracy of the model. R software was used to visualize the model by plotting a nomogram, and the Bootstrap method was used for internal validation of the model. Results: The multivariate analysis showed that red blood cell count (RBC), cholinesterase (ChE), alkaline phosphatase (ALP), albumin (Alb), thrombin time (TT), portal vein trunk diameter, sequential therapy, and primary prevention were independent predictive factors for rebleeding. 
Based on the results of the multivariate analysis, a logistic model was established as logit(P)=-0.805-1.978×(RBC)+0.001×(ChE)-0.020×(ALP)-0.314×(Alb)+0.567×(TT)+0.428×(portal vein trunk diameter)-2.303×[sequential therapy (yes=1, no=0)]-2.368×[primary prevention (yes=1, no=0)]. The logistic model (AUC=0.928, 95% confidence interval [CI]: 0.893-0.964, P<0.001) had a better performance in predicting rebleeding than MELD score (AUC=0.603, 95%CI: 0.520-0.687, P=0.003), Child-Pugh class (AUC=0.650, 95%CI: 0.578-0.722, P=0.001), and FIB-4 index (AUC=0.587, 95%CI: 0.503-0.671, P=0.045). The model had an optimal cut-off value of 0.607, a sensitivity of 0.817, and a specificity of 0.817. Internal validation confirmed that the model had good predictive performance and accuracy. Conclusion: Sequential therapy, implementation of primary prevention, an increase in RBC, and an increase in Alb are protective factors against rebleeding, while prolonged TT and widened main portal vein diameter are risk factors. The logistic model based on these independent predictive factors can predict rebleeding and thus holds promise for clinical application.
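As an illustration of how the reported equation turns into a risk estimate, the following minimal Python sketch applies the inverse logit, P = 1/(1+exp(-logit(P))), to the published coefficients. The abstract does not state measurement units, and it does not say explicitly that the 0.607 cut-off applies to the predicted probability; both are assumptions here. The function names and the example patient values are hypothetical and are not taken from the study.

```python
import math

# Coefficients copied from the logit(P) equation in the abstract.
INTERCEPT = -0.805
COEFFICIENTS = {
    "rbc": -1.978,
    "che": 0.001,
    "alp": -0.020,
    "alb": -0.314,
    "tt": 0.567,
    "portal_vein_diameter": 0.428,
    "sequential_therapy": -2.303,   # yes=1, no=0
    "primary_prevention": -2.368,   # yes=1, no=0
}
# Assumption: the reported optimal cut-off is a threshold on predicted probability.
CUTOFF = 0.607


def rebleeding_probability(patient):
    """Return the predicted probability P from the linear predictor logit(P)."""
    logit = INTERCEPT + sum(
        coef * patient[name] for name, coef in COEFFICIENTS.items()
    )
    return 1.0 / (1.0 + math.exp(-logit))


def is_high_risk(patient):
    """Classify a patient as high risk if P reaches the assumed cut-off."""
    return rebleeding_probability(patient) >= CUTOFF


# Hypothetical patient; values and units are illustrative only.
example_patient = {
    "rbc": 3.0,
    "che": 3000,
    "alp": 100,
    "alb": 30,
    "tt": 18,
    "portal_vein_diameter": 14,
    "sequential_therapy": 0,
    "primary_prevention": 0,
}

if __name__ == "__main__":
    p = rebleeding_probability(example_patient)
    print(f"Predicted probability: {p:.3f}, high risk: {is_high_risk(example_patient)}")
```

Because the coefficients for sequential therapy and primary prevention are negative, setting either indicator to 1 lowers the predicted probability, which matches the protective effect described in the conclusion.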
4.Advances in evaluation techniques for traditional Chinese medicine dermatopharmacokinetics
Yiqiao CHEN ; Lu SUN ; Xiaodong WEN
Journal of China Pharmaceutical University 2026;57(2):172-180
Evaluating the absorption of traditional Chinese medicine (TCM) formulations in skin has long been a challenge in cutaneous pharmacokinetic studies of TCM. In recent years, various new techniques, including diffusion cell, microphysiological system, matrix-assisted laser desorption/ionization mass spectrometry imaging, tape stripping, microdialysis/open-flow microperfusion, and confocal Raman microscopy technology, have been developed to characterize and predict the pharmacokinetic profiles of these formulations more accurately. This review systematically summarizes the application progress of these methods in the evaluation of cutaneous pharmacokinetics of TCM, highlights their technical features and suitable scenarios, and discusses future development trends, providing new research perspectives for further understanding the dynamic changes, spatial distribution characteristics of active ingredients in the skin, and the rationality of compatibility in topical preparations.
5.Epidemiological characteristics of category C intestinal infectious diseases among children and adolescents in Shenzhen from 2012 to 2024 and the association with meteorological factors
Chinese Journal of School Health 2026;47(4):553-557
Objective:
To analyze the epidemiological characteristics of category C intestinal infectious diseases among children and adolescents in Shenzhen from 2012 to 2024 and the association with meteorological factors, so as to provide a scientific basis for the targeted prevention and control of infectious diseases for children and adolescents.
Methods:
Using data from the "Infectious Disease Reporting Information Management System" of the "China Disease Prevention and Control Information System" covering the period from January 1, 2012 to December 31, 2024, the study analyzed clinical and confirmed cases of hand, foot, and mouth disease, other infectious diarrhea, and acute hemorrhagic conjunctivitis among individuals aged 6-19 years to describe demographic and temporal characteristics. Joinpoint regression was used to calculate the average annual percent change (AAPC) and annual percent change (APC) to analyze incidence trends, and Spearman's correlation was combined with generalized linear models to assess the association between category C intestinal infectious diseases and meteorological factors.
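For readers unfamiliar with APC and AAPC, the sketch below shows the usual definitions: APC is derived from the slope b of a log-linear fit ln(rate) = a + b × year as (exp(b) - 1) × 100, and AAPC is the same transformation applied to the length-weighted average of segment slopes. This is a minimal illustration, not the study's analysis: it does not perform joinpoint selection or significance testing, and the function names and example numbers are hypothetical.

```python
import math


def apc(years, rates):
    """Annual percent change from an ordinary least-squares fit of ln(rate) on year."""
    n = len(years)
    logs = [math.log(r) for r in rates]
    mean_x = sum(years) / n
    mean_y = sum(logs) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, logs))
    slope = sxy / sxx
    return (math.exp(slope) - 1.0) * 100.0


def aapc(segment_slopes, segment_lengths):
    """Average annual percent change: length-weighted mean of the segment
    slopes (on the log scale), transformed back to a percentage."""
    total = sum(segment_lengths)
    weighted = sum(b * w for b, w in zip(segment_slopes, segment_lengths)) / total
    return (math.exp(weighted) - 1.0) * 100.0


if __name__ == "__main__":
    # Hypothetical incidence rates growing by 10% per year.
    years = [2012, 2013, 2014, 2015, 2016]
    rates = [10.0 * 1.1 ** i for i in range(5)]
    print(f"APC: {apc(years, rates):.2f}%")
```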
Results:
From 2012 to 2024, a cumulative total of 61 019 cases of hand, foot, and mouth disease, 58 498 cases of other infectious diarrhea, and 6 377 cases of acute hemorrhagic conjunctivitis were reported among children and adolescents. The AAPC in the incidence rates of these three diseases was 19.19%, 31.03%, and 31.48%, respectively (all P < 0.05). Notably, the incidence of hand, foot, and mouth disease increased significantly after 2022 (APC = 133.66%, P < 0.01). The temporal distribution showed that hand, foot, and mouth disease was most prevalent in May, June, and July (seasonal indices of 2.39, 3.64, and 1.97), other infectious diarrhea was most prevalent in February, March, and December (seasonal indices of 1.22, 1.25, and 1.47), and acute hemorrhagic conjunctivitis peaked in September and October (seasonal indices of 4.22 and 2.16). Higher monthly average temperature was associated with an increased risk of hand, foot, and mouth disease (β = 0.18, 95% CI = 0.11 to 0.25); as monthly average wind speed increased, the incidence of other infectious diarrhea (β = -0.86, 95% CI = -1.50 to -0.22) and acute hemorrhagic conjunctivitis (β = -1.32, 95% CI = -2.60 to -0.05) both decreased (all P < 0.05).
Conclusions:
Among children and adolescents in Shenzhen, category C intestinal infectious diseases remain prevalent throughout the year; the number of reported hand, foot, and mouth disease cases has shown an upward trend in recent years. Temperature and wind speed significantly affect the number of reported cases of the three category C intestinal infectious diseases.
6.Comparison of bioelectrical impedance analysis and dual-energy X-ray absorptiometry in measuring body composition among Tibetan children and adolescents
Chinese Journal of School Health 2026;47(4):569-573
Objective:
To compare the consistency between bioelectrical impedance analysis (BIA) and dual-energy X-ray absorptiometry (DXA) in measuring body composition among Tibetan children and adolescents and to explore the applicability of BIA in plateau regions, so as to support scientific and convenient body composition measurement among children and adolescents.
Methods:
From May to June, 2022, a total of 344 Tibetan children and adolescents aged 6-17 years were selected from Golmud Municipal National Middle School and Changjiangyuan Nationality Primary School in Qinghai Province by cluster sampling method, and their fat mass, fat mass percentage and lean mass were measured by DXA and BIA. The consistency and correlation between the two methods were assessed by using the Wilcoxon rank-sum test, Spearman correlation analysis, intraclass correlation coefficient (ICC), and Bland-Altman analysis.
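Two of the agreement statistics named above can be sketched in a few lines of Python. The example below computes Bland-Altman bias with 95% limits of agreement (bias ± 1.96 × SD of the paired differences) and Lin's concordance correlation coefficient, 2·s_xy / (s_x² + s_y² + (mean_x - mean_y)²), using 1/n variances. It is an illustration under those standard definitions, not the study's code; the function names and sample values are hypothetical.

```python
import math


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements.

    Limits of agreement are bias +/- 1.96 * sample SD of the differences.
    """
    diffs = [a - b for a, b in zip(method_a, method_b)]
    n = len(diffs)
    bias = sum(diffs) / n
    sd = math.sqrt(sum((d - bias) ** 2 for d in diffs) / (n - 1))
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


def lins_ccc(method_a, method_b):
    """Lin's concordance correlation coefficient with 1/n variances."""
    n = len(method_a)
    mean_a = sum(method_a) / n
    mean_b = sum(method_b) / n
    var_a = sum((a - mean_a) ** 2 for a in method_a) / n
    var_b = sum((b - mean_b) ** 2 for b in method_b) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(method_a, method_b)) / n
    return 2.0 * cov / (var_a + var_b + (mean_a - mean_b) ** 2)


if __name__ == "__main__":
    # Hypothetical fat mass values (kg) from two devices.
    dxa = [8.1, 10.4, 12.9, 15.2, 9.7]
    bia = [6.9, 9.0, 12.1, 13.8, 8.8]
    print(bland_altman(dxa, bia))
    print(lins_ccc(dxa, bia))
```

A constant offset between two methods leaves the Pearson or Spearman correlation at 1 while pulling Lin's CCC below 1, which is one way strong correlation and weaker agreement can coexist.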
Results:
DXA-measured fat mass and fat mass percentage were significantly higher than those obtained by BIA (6-12 years old: Z = 9.91, 11.28; 13-17 years old: Z = 9.02, 10.21), while lean mass and lean mass percentage were significantly lower than BIA results (6-12 years old: Z = -11.60, -11.30; 13-17 years old: Z = -10.77, -10.36) (all P < 0.05). The two methods showed strong correlations in fat mass and lean mass (all r > 0.80, all ICC > 0.90), but exhibited poor agreement in fat mass percentage and lean mass percentage (6-12 years old: Lin's CCC = 0.64, 0.41; 13-17 years old: Lin's CCC = 0.79, 0.35). Bland-Altman analysis showed that the difference between the two methods was negatively correlated with the average value in FM% (6-12 years old: r = -0.75; 13-17 years old: r = -0.79; both P < 0.01).
Conclusion:
BIA and DXA show high consistency in measuring body fat mass and lean body mass in Tibetan children and adolescents, although some bias is still present in certain individuals.
7.Compact Fundus Imaging System Using Shack-Hartmann Wavefront Sensing for High-speed Auto-focus
Zhe-Kai LIN ; Long CHEN ; Geng-Yong ZHENG ; Jin-Tian HUANG ; Jia-Xin DONG ; Shang-Pan YANG ; Wen-Zheng DING ; Ding-An HAN ; Xue-Hua WANG ; Ya-Guang ZENG
Progress in Biochemistry and Biophysics 2026;53(4):1076-1086
Objective: The widespread adoption of portable fundus cameras for primary care and community screening is hindered by limitations in current autofocus (AF) technologies. Image-based methods relying on sharpness evaluation require iterative searches, resulting in slow convergence, while projection-based techniques are susceptible to optical artifacts and calibration errors. To address these challenges, this study introduces a novel AF system based on direct wavefront sensing, designed to deliver simultaneous high speed, high precision, and operational robustness within the compact form factor essential for portable ophthalmic devices. Methods: Our approach fundamentally reimagines the AF process by directly measuring the ocular wavefront aberration. We developed a custom portable fundus camera integrating a miniaturized Shack-Hartmann wavefront sensor (SHWS) into the optical path. An 850 nm laser diode projects a point source onto the retina via oblique illumination to minimize corneal reflections. Light scattered from this spot carries the eye’s refractive error through the imaging optics and is directed to the SHWS, positioned at a plane optically conjugate to the primary color CMOS imaging sensor. A microlens array within the SHWS samples the incident wavefront, generating a pattern of focal spots on a CCD. Real-time centroid analysis of these spots provides a map of local wavefront slopes. These measurements are processed through a singular value decomposition (SVD) algorithm to fit a Zernike polynomial basis set, enabling real-time reconstruction of the wavefront phase. The defocus component (S) is extracted from the second-order Zernike coefficients, providing a direct, quantitative measure of the refractive error in diopters. This value serves as a precise error signal in a closed-loop control system, which commands a voice-coil actuated focusing lens to its null position in a single, deterministic step, eliminating the need for iterative search algorithms. 
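The slope-to-defocus step described in the Methods can be illustrated with a deliberately reduced model. The sketch below assumes a wavefront W(x, y) = a·x + b·y + c·(2(x² + y²) - 1) over a unit-radius pupil (tip, tilt, and an unnormalized defocus term), so the lenslet slopes are dW/dx = a + 4c·x and dW/dy = b + 4c·y. It solves the least-squares fit through the normal equations with Gaussian elimination, which is a simpler stand-in for the SVD-based fit over a full Zernike basis that the abstract describes. The conversion to diopters, 4c/r² with c in micrometers and pupil radius r in millimeters, follows from that assumed model; the sign convention, function names, and test values are all hypothetical.

```python
def solve_linear(matrix, rhs):
    """Solve a small dense linear system by Gaussian elimination with pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[r][k] -= factor * aug[col][k]
    solution = [0.0] * n
    for i in range(n - 1, -1, -1):
        residual = aug[i][n] - sum(aug[i][k] * solution[k] for k in range(i + 1, n))
        solution[i] = residual / aug[i][i]
    return solution


def fit_tilt_and_defocus(points, slopes):
    """Least-squares fit of (a, b, c) from lenslet positions and measured slopes.

    points: list of (x, y) in normalized pupil coordinates.
    slopes: list of (dW/dx, dW/dy) measured at those points.
    """
    ata = [[0.0] * 3 for _ in range(3)]
    atb = [0.0] * 3
    for (x, y), (sx, sy) in zip(points, slopes):
        # Each lenslet contributes one x-slope row and one y-slope row.
        for row, target in (((1.0, 0.0, 4.0 * x), sx), ((0.0, 1.0, 4.0 * y), sy)):
            for i in range(3):
                atb[i] += row[i] * target
                for j in range(3):
                    ata[i][j] += row[i] * row[j]
    return solve_linear(ata, atb)


def defocus_diopters(c_um, pupil_radius_mm):
    """Convert the assumed unnormalized defocus coefficient to diopters."""
    return 4.0 * c_um / pupil_radius_mm ** 2


if __name__ == "__main__":
    grid = [(i / 2.0, j / 2.0) for i in range(-2, 3) for j in range(-2, 3)]
    true_a, true_b, true_c = 0.1, -0.2, 0.5
    measured = [(true_a + 4 * true_c * x, true_b + 4 * true_c * y) for x, y in grid]
    a, b, c = fit_tilt_and_defocus(grid, measured)
    print(a, b, c, defocus_diopters(c, pupil_radius_mm=2.0))
```

Because the defocus value comes out of a single linear solve, the focusing lens can be commanded directly from it, which is the property the abstract contrasts with iterative sharpness searches.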
Results: Comprehensive evaluation demonstrated the system’s high performance. Testing on a calibrated model eye (OEMI-7) established a highly linear relationship between the computed defocus S and the focusing lens position across a ±20 Diopter (D) compensation range, achievable within a 5 mm mechanical travel. The system achieved a focusing precision of 0.08 D, corresponding to an 18-fold improvement over a conventional projection spot-size method tested under identical conditions. The total focus acquisition time, encompassing wavefront measurement, computation, and lens actuation, averaged under 0.5 s. Clinical validation with 25 human volunteers (50 eyes, refractive range -15 D to +10 D) confirmed practical efficacy. The wavefront-sensing AF succeeded in 92% of attempts with a mean time of 0.5 s, substantially outperforming a projection-based benchmark which achieved only a 32% success rate with an average time of 4.25 s. The system provided instantaneous directional guidance and maintained stability during minor ocular movements. Objective assessment of image quality, via amplitude contrast of retinal vasculature, showed consistent and significant enhancement following AF correction across the entire tested diopter range. Conclusion: This work successfully implements and validates a direct wavefront-sensing autofocus paradigm for portable fundus cameras. By directly quantifying and compensating for the optical defocus aberration, this method bypasses the fundamental limitations of image-processing and projection-based techniques, enabling rapid, precise, and deterministic diopter compensation. The developed system delivers an exceptional combination of a wide operational range (±20 D), high accuracy (0.08 D), fast convergence (0.5 s), and a compact physical footprint. 
This technology provides a practical and high-performance focusing solution capable of enhancing the reliability, throughput, and diagnostic utility of portable retinal imaging in large-scale screening applications. Future efforts will be directed towards system cost optimization and performance adaptation for diverse ocular conditions.
8.The Regulatory Effects and Mechanisms of Piezo1 Channel on Chondrocytes and Bone Metabolic Dysregulation in Osteoarthritis
Yan LI ; Tao LIU ; Yu-Biao GU ; Hui-Qing TIAN ; Lei ZHANG ; Bi-Hui BAI ; Zhi-Jun HE ; Wen CHEN ; Jin-Peng LI ; Fei LI
Progress in Biochemistry and Biophysics 2026;53(3):564-576
Osteoarthritis (OA), a highly prevalent degenerative joint disease worldwide, is defined by articular cartilage degradation, abnormal bone remodeling, and persistent chronic inflammation. It severely compromises patients’ quality of life, and currently, there is no radical cure. Abnormal mechanical stress is widely regarded as a core driver of OA pathogenesis, and the exploration of mechanical signal perception and transduction mechanisms has become crucial for deciphering OA’s pathophysiological processes. Piezo1, a key mechanosensitive cation channel belonging to the Piezo protein family, has recently gained significant attention due to its pivotal role in mediating cellular responses to mechanical stimuli in joint tissues. This review systematically examines Piezo1’s expression patterns, regulatory mechanisms, and pathological functions in OA, with a particular focus on its dual roles in modulating chondrocyte homeostasis and bone metabolism disorders, while also delving into the underlying molecular signaling pathways and potential therapeutic implications. Piezo1, consisting of approximately 2 500 amino acids and forming a unique trimeric propeller-like structure, is widely expressed in chondrocytes, osteocytes, mesenchymal stem cells, and synovial cells. It exhibits permeability to cations such as Ca2+, K+, and Na+, and directly responds to membrane tension changes induced by mechanical stimuli like fluid shear stress and mechanical overload. In OA patients and animal models, Piezo1 expression is significantly upregulated, especially in cartilage regions subjected to abnormal mechanical stress (e.g., human temporomandibular joint cartilage). This overexpression is closely associated with aggravated cartilage degeneration, increased chondrocyte apoptosis, accelerated cellular senescence, and intensified inflammatory responses. 
Mechanical overload and pro-inflammatory cytokines (e.g., IL-1β) are key inducers of Piezo1 upregulation: IL-1β activates the PI3K/AKT/mTOR signaling pathway to enhance Piezo1 expression, forming a pathogenic positive feedback loop that inhibits chondrocyte autophagy, promotes apoptosis, and further accelerates joint degeneration. Mechanistically, Piezo1 mediates OA progression through multiple interconnected pathways. When activated by mechanical stress, Piezo1 triggers excessive Ca2+ influx, leading to endoplasmic reticulum stress (ERS) and mitochondrial dysfunction, which directly induce chondrocyte apoptosis. This process involves the activation of downstream signaling cascades such as cGAS-STING and YAP-MMP13/ADAMTS5. YAP, a transcriptional regulator, upregulates the expression of matrix metalloproteinase 13 (MMP13) and aggrecanase (ADAMTS5), thereby accelerating cartilage matrix degradation. Additionally, Piezo1-driven Ca2+ overload promotes the accumulation of reactive oxygen species (ROS) and upregulates senescence markers (p16 and p21), accelerating chondrocyte senescence via the p38MAPK and NF-κB pathways. Senescent chondrocytes secrete senescence-associated secretory phenotype (SASP) factors (e.g., IL-6, IL-1β), further amplifying joint inflammation. In terms of bone metabolism, Piezo1 maintains joint homeostasis by promoting the differentiation of fibrocartilage stem cells into chondrocytes and balancing bone formation and resorption through regulating the FoxC1/YAP axis and RANKL/OPG ratio. Therapeutically, targeting Piezo1 shows promising potential. Preclinical studies have demonstrated that Piezo1 inhibitors (e.g., GsMTx4) can reduce joint damage and alleviate pain in OA mice. Simultaneously, siRNA-mediated co-silencing of Piezo1 and TRPV4 (another mechanosensitive channel) decreases intracellular Ca2+ concentration, inhibits chondrocyte apoptosis, and promotes cartilage repair. 
Conditional knockout of Piezo1 using Gdf5-Cre transgenic mice alleviates cartilage degeneration in post-traumatic OA models by downregulating MMP13 and ADAMTS5 expression. Despite existing challenges, such as off-target effects of inhibitors, inefficient local drug delivery, and interindividual genetic variability, strategies like developing selective Piezo1 antagonists, optimizing targeted nanocarriers, and combining Piezo1-targeted therapy with physical therapy provide viable avenues for clinical translation. The authors propose that Piezo1 serves as a critical therapeutic target for OA, and future research should focus on deciphering its context-dependent regulatory networks, developing tissue-specific intervention strategies, and validating their efficacy and safety in clinical trials to address the unmet medical needs of OA patients.
9.The Dual Role of p21 in Hormone-related Cancers and Its Therapeutic Implications
Jia-Wen LI ; Yang CHEN ; Jia-Qi WANG ; Yu-Kai MA ; Zhi-Yi GUO
Progress in Biochemistry and Biophysics 2026;53(3):593-608
p21 (encoded by the CDKN1A gene) is a critical cell cycle regulatory protein endowed with versatile biological functions. In various sex hormone-related cancers, p21 exhibits a paradoxical dual role, capable of both inhibiting tumorigenesis and promoting cancer progression, exerting dual, often opposing, effects on cellular fate that are dictated by the specific context. The clinical targeting of p21 remains elusive, largely due to its functionally pleiotropic and context-dependent nature within intricate regulatory networks. During the initial, hormone-dependent phase of cancers like breast and prostate cancer, p21 expression and activity are largely governed by the transcriptional programs of estrogen or androgen receptor signaling. This hormonal regulation contributes to the control of tumor cell proliferation and underpins the initial efficacy of endocrine therapies. In contrast, as these diseases advance to late stages or evolve into non-hormone-dependent subtypes—exemplified by castration-resistant prostate cancer (CRPC) and specific forms of triple-negative breast cancer (TNBC)—these conventional hormonal control mechanisms often become dysfunctional or are entirely bypassed. This fundamental transition creates a critical therapeutic void, highlighting the urgent need to identify and exploit alternative molecular pathways to effectively target p21’s function. Promising strategies may include the precise modulation of its upstream transcriptional regulators, downstream effector proteins, or the intersecting parallel signaling networks that critically influence its activity. This review provides a systematic synthesis of the intricate and interconnected mechanisms that underpin the dual effects of p21 in sex hormone-related tumors. These mechanisms are categorized into three core, interrelated functional domains. 
(1) cell cycle regulation: p21 executes its canonical tumor-suppressive role by binding to and inhibiting cyclin-dependent kinases (CDKs) and by directly interacting with proliferating cell nuclear antigen (PCNA), thereby inducing cell cycle arrest, predominantly at the G1/S checkpoint; (2) apoptosis modulation: p21 exerts a highly context-dependent influence on programmed cell death, functioning either as a pro-apoptotic agent under severe genotoxic stress or as a pro-survival factor by inhibiting apoptosis through interactions with proteins like Bcl-2; (3) hormonal and signaling crosstalk: p21 is an integral node within broader cellular networks, engaging in direct physical interactions with hormone receptors (e.g., AR, ER) and participating in complex feedback loops with key oncogenic pathways, including PI3K/AKT, MAPK/ERK, and p53. Critically, the role of p21 is not static but highly dynamic. It can undergo a functional switch from tumor-suppressive to tumor-promoting in response to therapeutic pressures, metabolic alterations, or evolving tumor microenvironment cues. These adaptive shifts are frequently implicated in the development of therapy resistance and disease recurrence, particularly in advanced, hormone-resistant cancers. By synthesizing these insights, this review aims to establish a coherent theoretical framework to guide the future development of novel therapeutic strategies that target the p21 pathway. It underscores the necessity of moving beyond a simplistic, binary view of p21 and emphasizes the forthcoming challenges, such as the discovery of reliable biomarkers to predict its functional state and the rational design of context-specific pharmacological modulators to selectively harness its therapeutic potential.
10.Research on The Genealogical Inference Efficiency of High-density SNPs
Jing LI ; Yi-Jie SUN ; Wen-Ting ZHAO ; Zi-Chen TANG ; Jing LIU ; Cai-Xia LI
Progress in Biochemistry and Biophysics 2026;53(3):740-753
Objective: This study aims to explore the potential of single-nucleotide polymorphism (SNP) locus combinations of different orders of magnitude for predicting distant kinship relationships. A high-density SNP locus set was constructed, and a comprehensive assessment of its inference capability was conducted. Methods: First, we selected three commercial chip panels, CGA (Chinese genotyping array, Illumina), GSA (Global screening array, Illumina), and Affy (23MF_V2 high-density SNP array, Affymetrix), and merged them after quality control, forming a high-density SNP locus panel (1 180 k). Second, we collected peripheral blood from 161 individuals and performed whole-genome sequencing. Within this sample population, the kinship relationships fully covered levels 1 to 9, and the number of kinship pairs at each level was kept above 50. The 1 180 k locus set was extracted from the whole-genome sequencing data of the 161 samples and is referred to below as the high-density SNP locus set. Kinship inference was conducted using the identity-by-descent (IBD) algorithm with the selected optimal parameters. To comprehensively evaluate the performance of the high-density SNP locus set in kinship inference, we compared it with the three commercial chip panels, the intersection of the three chip loci, and control sets constructed by randomly reducing the number of loci in the high-density SNP locus set. Based on the changes in IBD lengths and the trends in prediction accuracy, we assessed the kinship inference capability of the high-density SNP locus set. Results: After screening, a set of 1 184 334 autosomal SNPs was obtained. During screening of the optimal IBD length threshold, the results showed that 0 cM, 1 cM, and 2 cM all demonstrated good applicability. 
However, to avoid the large amount of redundant information caused by setting the IBD length threshold too low, this study ultimately selected 2 cM as the optimal threshold. Compared with the average results of the three chip panels, the high-density SNP locus set increased the total IBD length and the average IBD length across levels 1-9; the accuracy of the confidence interval for level 8 was 70.97%, which represented a 3.50% improvement; the average confidence interval accuracy for levels 1-8 was 91.39%, representing a 1.00% increase; and the false negative rates at levels 8 and 9 were reduced by 2.42% and 6.76%, respectively. The system efficacy of the high-density SNP locus set for kinship inference of first to eighth degree relationships reached 98.91%. Random reduction of the high-density SNP locus set showed that the detection efficiency of IBD length rose markedly as the number of SNPs in the panel increased. At the same time, the overall accuracy of kinship relationship prediction and the confidence interval accuracy both increased steadily with the addition of more loci. Conclusion: The results show that the high-density SNP panel significantly enhances the efficacy of distant kinship inference, accurately covering kinship degrees, with the average confidence interval accuracy for first to eighth degree relationships stably above 90%. The study finds that increasing the number of SNPs in the panel can improve the ability to predict distant kinship.
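The role of the IBD length threshold can be illustrated with a small Python sketch that discards segments shorter than a minimum length and then reports the total and average IBD length for one pair of individuals. It illustrates the thresholding idea only and is not the study's IBD algorithm; the function name, the default of 2 cM taken from the abstract's chosen threshold, and the example segment lengths are assumptions for illustration.

```python
def summarize_ibd(segment_lengths_cm, min_length_cm=2.0):
    """Filter IBD segments by a minimum length and summarize what remains.

    segment_lengths_cm: lengths (in cM) of IBD segments shared by one pair.
    min_length_cm: segments shorter than this are treated as noise and dropped.
    """
    kept = [length for length in segment_lengths_cm if length >= min_length_cm]
    total = sum(kept)
    mean = total / len(kept) if kept else 0.0
    return {"count": len(kept), "total_cm": total, "mean_cm": mean}


if __name__ == "__main__":
    # Hypothetical segment lengths for one pair of individuals.
    segments = [0.5, 1.5, 2.0, 10.0, 30.0]
    print(summarize_ibd(segments))
    print(summarize_ibd(segments, min_length_cm=0.0))
```

Lowering the threshold keeps more short segments, which raises the segment count while adding little total length; this is the redundancy the abstract cites as the reason for preferring 2 cM over 0 cM or 1 cM.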

