1.scPANDA: PAN-Blood Data Annotator with a 10-Million Single-Cell Atlas.
Chang-Xiao LI ; Can HUANG ; Dong-Sheng CHEN
Chinese Medical Sciences Journal 2025;40(1):68-87
OBJECTIVES:
Recent advancements in single-cell RNA sequencing (scRNA-seq) have revolutionized the study of cellular heterogeneity, particularly within the hematological system. However, accurately annotating cell types remains challenging due to the complexity of immune cells. To address this challenge, we develop a PAN-blood single-cell Data Annotator (scPANDA), which leverages a comprehensive 10-million-cell atlas to provide precise cell type annotation.
METHODS:
The atlas, constructed from data collected in 16 studies, incorporated rigorous quality control, preprocessing, and integration steps to ensure a high-quality reference for annotation. scPANDA utilizes a three-layer inference approach, progressively refining cell types from broad compartments to specific clusters. Iterative clustering and harmonization processes were employed to maintain cell type purity throughout the analysis. Furthermore, the performance of scPANDA was evaluated in three external datasets.
RESULTS:
The atlas was structured hierarchically, consisting of 16 compartments, 54 classes, 4,460 low-level clusters (pd_cc_cl_tfs), and 611 high-level clusters (pmid_cts). Robust performance of the tool was demonstrated in annotating diverse immune scRNA-seq datasets, analyzing immune-tumor coexisting clusters in renal cell carcinoma, and identifying conserved cell clusters across species.
CONCLUSIONS
scPANDA exemplifies effective reference mapping with a large-scale atlas, enhancing the accuracy and reliability of blood cell type identification.
Humans
;
Single-Cell Analysis/methods*
;
Sequence Analysis, RNA/methods*
;
Blood Cells
2.Profiling and functional characterization of long noncoding RNAs during human tooth development.
Xiuge GU ; Wei WEI ; Chuan WU ; Jing SUN ; Xiaoshan WU ; Zongshan SHEN ; Hanzhang ZHOU ; Chunmei ZHANG ; Jinsong WANG ; Lei HU ; Suwen CHEN ; Yuanyuan ZHANG ; Songlin WANG ; Ran ZHANG
International Journal of Oral Science 2025;17(1):38-38
The regulatory processes in developmental biology research are significantly influenced by long non-coding RNAs (lncRNAs). However, the dynamics of lncRNA expression during human tooth development remain poorly understood. In this research, we examined the lncRNAs present in the dental epithelium (DE) and dental mesenchyme (DM) at the late bud, cap, and early bell stages of human fetal tooth development through bulk RNA sequencing. Developmental regulators co-expressed with neighboring lncRNAs were significantly enriched in odontogenesis. Specific lncRNAs expressed in the DE and DM, such as PANCR, MIR205HG, DLX6-AS1, and DNM3OS, were identified through a combination of bulk RNA sequencing and single-cell analysis. Further subcluster analysis revealed lncRNAs specifically expressed in important regions of the tooth germ, such as the inner enamel epithelium and coronal dental papilla (CDP). Functionally, we demonstrated that CDP-specific DLX6-AS1 enhanced odontoblastic differentiation in human tooth germ mesenchymal cells and dental pulp stem cells. These findings suggest that lncRNAs could serve as valuable cell markers for tooth development and potential therapeutic targets for tooth regeneration.
Humans
;
RNA, Long Noncoding/metabolism*
;
Odontogenesis/genetics*
;
Tooth Germ/embryology*
;
Cell Differentiation
;
Gene Expression Regulation, Developmental
;
Mesoderm/metabolism*
;
Tooth/embryology*
;
Gene Expression Profiling
;
Sequence Analysis, RNA
;
Dental Pulp/cytology*
3.High-throughput single-microbe RNA sequencing reveals adaptive state heterogeneity and host-phage activity associations in human gut microbiome.
Yifei SHEN ; Qinghong QIAN ; Liguo DING ; Wenxin QU ; Tianyu ZHANG ; Mengdi SONG ; Yingjuan HUANG ; Mengting WANG ; Ziye XU ; Jiaye CHEN ; Ling DONG ; Hongyu CHEN ; Enhui SHEN ; Shufa ZHENG ; Yu CHEN ; Jiong LIU ; Longjiang FAN ; Yongcheng WANG
Protein & Cell 2025;16(3):211-226
Microbial communities such as those residing in the human gut are highly diverse and complex, and many with important implications for health and diseases. The effects and functions of these microbial communities are determined not only by their species compositions and diversities but also by the dynamic intra- and inter-cellular states at the transcriptional level. Powerful and scalable technologies capable of acquiring single-microbe-resolution RNA sequencing information in order to achieve a comprehensive understanding of complex microbial communities together with their hosts are therefore utterly needed. Here we report the development and utilization of a droplet-based smRNA-seq (single-microbe RNA sequencing) method capable of identifying large species varieties in human samples, which we name smRandom-seq2. Together with a triple-module computational pipeline designed for the bacteria and bacteriophage sequencing data by smRandom-seq2 in four human gut samples, we established a single-cell level bacterial transcriptional landscape of human gut microbiome, which included 29,742 single microbes and 329 unique species. Distinct adaptive response states among species in Prevotella and Roseburia genera and intrinsic adaptive strategy heterogeneity in Phascolarctobacterium succinatutens were uncovered. Additionally, we identified hundreds of novel host-phage transcriptional activity associations in the human gut microbiome. Our results indicated that smRandom-seq2 is a high-throughput and high-resolution smRNA-seq technique that is highly adaptable to complex microbial communities in real-world situations and promises new perspectives in the understanding of human microbiomes.
Humans
;
Gastrointestinal Microbiome/genetics*
;
Bacteriophages/physiology*
;
High-Throughput Nucleotide Sequencing
;
Sequence Analysis, RNA/methods*
;
Bacteria/virology*
4.Gene print-based cell subtypes annotation of human disease across heterogeneous datasets with gPRINT.
Ruojin YAN ; Chunmei FAN ; Shen GU ; Tingzhang WANG ; Zi YIN ; Xiao CHEN
Protein & Cell 2025;16(8):685-704
Identification of disease-specific cell subtypes (DSCSs) has profound implications for understanding disease mechanisms, preoperative diagnosis, and precision therapy. However, achieving unified annotation of DSCSs in heterogeneous single-cell datasets remains a challenge. In this study, we developed the gPRINT algorithm (generalized approach for cell subtype identification with single cell's voicePRINT). Inspired by the principles of speech recognition in noisy environments, gPRINT transforms gene position and gene expression information into voiceprints based on ordered and clustered gene expression phenomena, obtaining unique "gene print" patterns for each cell. Then, we integrated neural networks to mitigate the impact of background noise on cell identity label mapping. We demonstrated the reproducibility of gPRINT across different donors, single-cell sequencing platforms, and disease subtypes, and its utility for automatic cell subtype annotation across datasets. Moreover, gPRINT achieved higher annotation accuracy of 98.37% when externally validated based on the same tissue, surpassing other algorithms. Furthermore, this approach has been applied to fibrosis-associated diseases in multiple tissues throughout the body, as well as to the annotation of fibroblast subtypes in a single tissue, tendon, where fibrosis is prevalent. We successfully achieved automatic prediction of tendinopathy-specific cell subtypes, key targets, and related drugs. In summary, gPRINT provides an automated and unified approach for identifying DSCSs across datasets, facilitating the elucidation of specific cell subtypes under different disease states and providing a powerful tool for exploring therapeutic targets in diseases.
Humans
;
Algorithms
;
Single-Cell Analysis
;
Databases, Genetic
;
Molecular Sequence Annotation
5.Identification of the Novel Allele HLA-B*54:01:11 Detected by NGS Using the Third Generation Sequencing Technology.
Nan-Ying CHEN ; Yi-Zheng HE ; Wen-Wen PI ; Qi LI ; Li-Na DONG ; Wei ZHANG
Journal of Experimental Hematology 2025;33(2):565-568
OBJECTIVE:
To distinguish the ambiguous genotyping results of human leukocyte antigen (HLA), identify a novel HLA-B allele and analyze the nucleotide sequence.
METHODS:
A total of 2 076 umbilical core blood samples from the Zhejiang Cord Blood Bank in 2022 were detected using the next generation sequencing technology (NGS) based on the Ion Torrent S5 platform. Among these a rare HLA-B allele with ambiguous combination result containing a base mutation was identified, and was further confimed by the third-generation sequencing (TGS) based on the nanopore technology.
RESULTS:
The NGS typing result of HLA-B locus showed HLA-B* 46:18, 54:06 or HLA-B*46:01, 54:XX (including a base mutation), and nanopore sequencing confirmed the typing as HLA-B*46:01, 54:XX (including a base mutation). Compared with HLA-B*54:01:01:01, the HLA-B*54:XX allele showed one single nucleotide substitution at position 1014 T>C in exon 6, with no amino acid change. The nucleotide sequence of the novel HLA-B*54:XX has been submitted to the GenBank nucleotide sequence database and the accession number OP853532 was assigned.
CONCLUSION
A ambiguous genotyping of the HLA-B Locus detected by NGS was distinguished by nanopore sequencing and a new HLA-B allele was successfully identified, which was officially named as HLA-B*54:01:11 by the World Health Organization Nomenclature Committee for Factors of the HLA System.
Humans
;
High-Throughput Nucleotide Sequencing
;
Alleles
;
HLA-B Antigens/genetics*
;
Genotype
;
Mutation
;
Sequence Analysis, DNA
;
Base Sequence
6.RNA Sequencing Reveals Molecular Alternations of Splenocytes Associated with Anti-FⅧ Immune Response in Hemophilia A Murine Model.
Chen-Chen WANG ; Ya-Li WANG ; Yuan-Hua CAI ; Qiao-Yun ZHENG ; Zhen-Xing LIN ; Ying-Yu CHEN
Journal of Experimental Hematology 2025;33(5):1476-1485
OBJECTIVE:
To investigate the molecular alterations of splenocytes associated with anti-factor Ⅷ (FⅧ) immune response and the underlying mechanisms based on hemophilia A (HA) murine model via RNA sequencing (RNA-seq) technology.
METHODS:
Severe HA mice were immunized with recombinant human factor Ⅷ (rhF8) weekly for 4 weeks to establish an FⅧ inhibitor model. High quality raw data were obtained by using bulk RNA-seq and CASAVA base identification technology, and the differentially expressed genes (DEGs) were identified. The DEGs were statistically classified by gene ontology (GO) annotation to obtain information on the major signaling pathways and biological processes involved in anti-FⅧ immune response in HA mouse splenocytes. The cell clusters, genes, and signaling pathway datasets were comprehensively analyzed by GO, Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis and single cell RNA-seq (ScRNA-seq) analysis, respectively. Flow cytometry analysis was used to verify the changes in T follicular helper cells (Tfh) and regulatory T cells (Treg).
RESULTS:
A total of 3731 DEGs was identified, including 2275 genes with up-regulated expression and 1456 genes with down-regulated expression. The DEGs were enriched in helper T cell differentiation, cytokine receptor, T cell receptor signaling pathway, ferroptosis, etc. Uniform Manifold Approximation and Project (UMAP) downscaling and visualization analysis yielded a total number of 11 T/NK cell subsets, visualizing the overall expression distribution of C-X-C chemokine-specific receptor gene cxcr5 among these T/NK cell subsets. Higher expression of cxcr5 was found in activated Tfh from FⅧ inhibitor mice, in comparison to the control group. The visualization using Upset plot R language showed a close interaction between Tfh and Treg. Moreover, the increased frequencies of Tfh and the decreased frequencies of Treg in inhibitor mouse splenocytes were further verified by flow cytometry analysis.
CONCLUSION
Multiple immune cell subsets, signaling pathways, and characteristic genes may be involved in the process of anti-FⅧ immune response in HA mouse splenocytes. The molecules involved in the regulation of Tfh/Treg may play key roles, which provide potential biological targets and therapeutic strategies for HA patients with inhibitors in the future.
Animals
;
Hemophilia A/genetics*
;
Mice
;
Sequence Analysis, RNA
;
Disease Models, Animal
;
Spleen/cytology*
;
T-Lymphocytes, Regulatory/immunology*
;
Humans
;
Signal Transduction
;
Factor VIII/immunology*
;
T-Lymphocytes, Helper-Inducer/immunology*
7.Relationship between sterol carrier protein 2 gene and prostate cancer: Based on single-cell RNA sequencing combined with Mendelian randomization.
Jia-Xin NING ; Shu-Hang LUO ; Hao-Ran WANG ; Hui-Min HOU ; Ming LIU
National Journal of Andrology 2025;31(5):403-411
Objective: To investigate the relationship between the lipid metabolism-related gene sterol carrier protein 2(SCP2) and prostate cancer (PCa) from a multi-omics perspective using single-cell transcriptomes combined with Mendelian randomization. Methods: Single-cell transcriptome data of benign and malignant prostate tissues were obtained from GSE120716, GSE157703 and GSE141445 datasets, respectively. Integration, quality control and annotation were performed on the data to categorize the epithelial cells into high and low SCP2 expression groups, followed by further differential and trajectory analyses. Single nucleotide polymorphism (SNP) data for SCP2 expression quantitative trait loci (eQTL) were subsequently downloaded from Genotype-Tissue Expression (GTEx) and investigated from the PCa Society Cancer-Related Genomic Alteration Panel for the Investigation of Cancer-Related Alterations (PRACTICAL) to obtain PCa outcome data for Mendelian randomization analysis to validate the causal relationship between SCP2 and PCa. Results: High SCP2-expressing epithelial cells had higher energy metabolism and proliferation capacity with low immunotherapy response and metastatic tendency. Trajectory analysis showed that epithelial cells with high SCP2 expression may have a higher degree of malignancy, and SCP2 may be a key marker gene for differentiation of malignant epithelial cells in the prostate. Further Mendelian randomization results showed a significant causal relationship between SCP2 and PCa development (OR=1.045, 95% CI: 1.010 -1.083, P=0.011). Conclusion: By combining single-cell transcriptome and Mendelian randomization, the role of the lipid metabolism-related gene SCP2 in PCa development has been confirmed, and new targets and therapeutic directions for PCa treatment have been provided.
Humans
;
Prostatic Neoplasms/genetics*
;
Male
;
Mendelian Randomization Analysis
;
Polymorphism, Single Nucleotide
;
Quantitative Trait Loci
;
Single-Cell Analysis
;
Sequence Analysis, RNA
;
Carrier Proteins/genetics*
;
Transcriptome
;
Lipid Metabolism
8.Vascular Protection of Neferine on Attenuating Angiotensin II-Induced Blood Pressure Elevation by Integrated Network Pharmacology Analysis and RNA-Sequencing Approach.
A-Ling SHEN ; Xiu-Li ZHANG ; Zhi GUO ; Mei-Zhu WU ; Ying CHENG ; Da-Wei LIAN ; Chang-Geng FU ; Jun PENG ; Min YU ; Ke-Ji CHEN
Chinese journal of integrative medicine 2025;31(8):694-706
OBJECTIVE:
To explore the functional roles and underlying mechanisms of neferine in the context of angiotensin II (Ang II)-induced hypertension and vascular dysfunction.
METHODS:
Male mice were infused with Ang II to induce hypertension and randomly divided into treatment groups receiving neferine or a control vehicle based on baseline blood pressure using a random number table method. The hypertensive mouse model was constructed by infusing Ang II via a micro-osmotic pump (500 ng/kg per minute), and neferine (0.1, 1, or 10 mg/kg), valsartan (10 mg/kg), or double distilled water was administered intragastrically once daily for 6 weeks. A non-invasive blood pressure system, ultrasound, and hematoxylin and eosin staining were performed to assess blood pressure and vascular changes. RNA sequencing and network pharmacology were employed to identify differentially expressed transcripts (DETs) and pathways. Vascular ring tension assay was used to test vascular function. A7R5 cells were incubated with neferine for 24 h and then treated with Ang II to record the real-time Ca2+ concentration by confocal microscope. Immunohistochemistry (IHC) and Western blot were used to evaluate vasorelaxation, calcium, and the extracellular signal-regulated kinase (ERK)1/2 pathway.
RESULTS:
Neferine treatment effectively mitigated the elevation in blood pressure, pulse wave velocity, aortic thickening in the abdominal aorta of Ang II-infused mice (P<0.05). RNA sequencing and network pharmacology analysis identified 355 DETs that were significantly reversed by neferine treatment, along with 25 potential target genes, which were further enriched in multiple pathways and biological processes, such as ERK1 and ERK2 cascade regulation, calcium pathway, and vascular smooth muscle contraction. Further investigation revealed that neferine treatment enhanced vasorelaxation and reduced Ca2+-dependent contraction of abdominal aortic rings, independent of endothelium function (P<0.05). The underlying mechanisms were mediated, at least in part, via suppression of receptor-operated channels, store-operated channels, or voltage-operated calcium channels. Neferine pre-treatment demonstrated a reduction in intracellular Ca2+ release in Ang II stimulated A7R5 cells. IHC staining and Western blot confirmed that neferine treatment effectively attenuated the upregulation of p-ERK1/2 both in vivo and in vitro, which was similar with treatment of ERK1/2 inhibitor PD98059 (P<0.05).
CONCLUSIONS
Neferine remarkably alleviates Ang II-induced elevation of blood pressure, vascular dysfunction, and pathological changes in the abdominal aorta. This beneficial effect is mediated by the modulation of multiple pathways, including calcium and ERK1/2 pathways.
Animals
;
Angiotensin II
;
Male
;
Benzylisoquinolines/therapeutic use*
;
Network Pharmacology
;
Blood Pressure/drug effects*
;
Sequence Analysis, RNA
;
Mice
;
Hypertension/chemically induced*
;
Mice, Inbred C57BL
;
Calcium/metabolism*
9.Application progress of single-cell RNA sequencing technology in breast development and related diseases.
Shiyi WEN ; Yang HU ; Xiangyu CHEN ; Jianda ZHOU ; Ping LI
Journal of Central South University(Medical Sciences) 2025;50(6):1080-1087
The spatio-temporal heterogeneity of breast cell subsets forms the fundamental biological basis for physiological development and pathological progression, including tumorigenesis; however, its complex regulatory mechanisms are not yet fully elucidated. With its high-resolution capabilities, single-cell RNA sequencing (scRNA-seq) technology offers a powerful tool for dissecting this cellular heterogeneity. This technology enables the construction of high-precision breast cell atlases, the accurate identification of distinct cell subsets, and the reconstruction of differentiation trajectories from stem/progenitor cells to functional epithelial cells. By resolving the transcriptional regulatory networks that govern cell fate determination, intercellular communication patterns, and dynamic microenvironmental interactions, scRNA-seq has unveiled the molecular foundations of breast development and provided new perspectives on the pathogenesis of related diseases such as breast cancer and macromastia. Furthermore, scRNA-seq demonstrates significant potential for discovering early molecular markers of disease, deciphering tumor heterogeneity, and elucidating mechanisms of therapeutic resistance. The continued application of scRNA-seq for dissecting breast cell heterogeneity, combined with its integration with multi-modal data such as spatial omics, promises to provide critical evidence and new insights for revealing the molecular mechanisms of breast development-related diseases and for formulating precision therapeutic strategies.
Humans
;
Single-Cell Analysis/methods*
;
Female
;
Breast Neoplasms/pathology*
;
Sequence Analysis, RNA/methods*
;
Breast/cytology*
10.Progress of scRNA-seq technology in nasopharyngeal carcinoma research.
Journal of Clinical Otorhinolaryngology Head and Neck Surgery 2025;39(9):889-893
Nasopharyngeal carcinoma(NPC) is a distinct type of head and neck cancer closely associated with Epstein-Barr virus(EBV) infection and exhibits significant geographic variations in its incidence. Despite recent advancements in radiotherapy techniques and precision medicine for NPC, the overall survival rate remains unsatisfactory due to tumor metastasis, recurrence, and drug resistance. Single-cell RNA sequencing(scRNA-seq) is an emerging technology that allows for the analysis of gene expression at single-cell resolution, providing a clearer understanding of tumor cell subpopulations, the evolutionary trajectory of tumor cells, and the functional roles and interactions of cells within the tumor microenvironment. This provides new ideas for the development of precision medicine in NPC. Here, we review the applications of scRNA-seq in exploring the mechanisms of NPC pathogenesis, tumor heterogeneity, the tumor microenvironment, drug resistance, and therapeutic response.
Humans
;
Nasopharyngeal Neoplasms/genetics*
;
Tumor Microenvironment
;
Nasopharyngeal Carcinoma
;
Single-Cell Analysis
;
Sequence Analysis, RNA
;
Precision Medicine
;
Drug Resistance, Neoplasm
;
Epstein-Barr Virus Infections
;
Herpesvirus 4, Human
;
Single-Cell Gene Expression Analysis

Result Analysis
Print
Save
E-mail