1.Prediction of protein Kbhb sites based on learnable feature embedding.
Zhisen WEI ; Zhiwei WANG ; Jinyao YU ; Cheng DENG ; Dongjun YU
Journal of Biomedical Engineering 2025;42(5):1029-1035
Protein lysine β-hydroxybutyrylation (Kbhb) is a newly discovered post-translational modification associated with a wide range of biological processes. Identifying Kbhb sites is critical for a better understanding of its mechanism of action. However, biochemical experimental methods for probing Kbhb sites are costly and have a long cycle. Therefore, a feature embedding learning method based on the Transformer encoder was proposed to predict Kbhb sites. In this method, amino acid residues were mapped into numerical vectors according to their amino acid class and position in a learnable feature embedding method. Then the Transformer encoder was used to extract discriminating features, and the bidirectional long short-term memory network (BiLSTM) was used to capture the correlation between different features. In this paper, a benchmark dataset was constructed, and a Kbhb site predictor, AutoTF-Kbhb, was implemented based on the proposed method. Experimental results showed that the proposed feature embedding learning method could extract effective features. AutoTF-Kbhb achieved an area under curve (AUC) of 0.87 and a Matthews correlation coefficient (MCC) of 0.37 on the independent test set, significantly outperforming other methods in comparison. Therefore, AutoTF-Kbhb can be used as an auxiliary means to identify Kbhb sites.
Protein Processing, Post-Translational
;
Lysine/chemistry*
;
Proteins/chemistry*
;
Machine Learning
;
Algorithms
2.Research progress of histone 3 methyltransferase MLL4.
Hui-Hui GUO ; Sheng-Feng LU ; Yun CAI ; Shu-Bao LIU ; Bing-Mei ZHU
Acta Physiologica Sinica 2019;71(4):637-644
Mixed linked leukemia 4 (MLL4) is a specific methyltransferase of histone 3 position lysine 4 (H3K4). It is also one of the important members of COMPASS/Set1-like protein complex. Both MLL4 protein itself and its mediated H3K4 methylation modification can cause changes in chromatin structure and function, thus regulating gene transcription and expression. With the studies of MLL4 protein in recent years, the roles of MLL4 gene, MLL4 protein and protein complex in the development of tissues and organs, tumor diseases and other physiological and pathophysiological processes have been gradually revealed. In this paper, the research progress of MLL4 gene, MLL4 protein characteristics, biological function and its effect on disease were reviewed, in order to further understand the effect of histone methyltransferase on gene expression regulation, as well as its non-enzyme dependent function. This paper may provide new ideas for the prevention, diagnosis and treatment of related diseases.
DNA-Binding Proteins
;
physiology
;
Histone-Lysine N-Methyltransferase
;
physiology
;
Histones
;
chemistry
;
Humans
;
Methylation
3.Advances in isopeptide bond-mediated molecular superglue.
Deying GAO ; Jiawen GAO ; Xiaobao SUN ; Kexin ZHOU ; Tietao ZHANG ; Qian WANG
Chinese Journal of Biotechnology 2019;35(4):607-615
Isopeptide bond-mediated molecular superglue is the irreversible covalent bond spontaneously formed by the side chains of lysine (Lys) and asparagine/aspartic acid (Asn/Asp) residues. The peptide-peptide interaction is specific, stable, and can be achieved quickly without any particular physicochemical factor. In the light of recent progress by domestic and foreign researchers, here we summarize the origin, assembly system and mechanism of isopeptide bond reaction, as well as the molecular cyclization and protein topological structure mediated by it. The prospect for its application in synthetic vaccine, hydrogel and bacterial nanobiological reactor is further discussed.
Cyclization
;
Lysine
;
Peptides
;
chemistry
;
Proteins
4.Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites.
Zhen CHEN ; Ningning HE ; Yu HUANG ; Wen Tao QIN ; Xuhan LIU ; Lei LI
Genomics, Proteomics & Bioinformatics 2018;16(6):451-459
As a newly-identified protein post-translational modification, malonylation is involved in a variety of biological functions. Recognizing malonylation sites in substrates represents an initial but crucial step in elucidating the molecular mechanisms underlying protein malonylation. In this study, we constructed a deep learning (DL) network classifier based on long short-term memory (LSTM) with word embedding (LSTM) for the prediction of mammalian malonylation sites. LSTM performs better than traditional classifiers developed with common pre-defined feature encodings or a DL classifier based on LSTM with a one-hot vector. The performance of LSTM is sensitive to the size of the training set, but this limitation can be overcome by integration with a traditional machine learning (ML) classifier. Accordingly, an integrated approach called LEMP was developed, which includes LSTM and the random forest classifier with a novel encoding of enhanced amino acid content. LEMP performs not only better than the individual classifiers but also superior to the currently-available malonylation predictors. Additionally, it demonstrates a promising performance with a low false positive rate, which is highly useful in the prediction application. Overall, LEMP is a useful tool for easily identifying malonylation sites with high confidence. LEMP is available at http://www.bioinfogo.org/lemp.
Amino Acid Sequence
;
genetics
;
Amino Acids
;
Animals
;
Deep Learning
;
Forecasting
;
methods
;
Lysine
;
chemistry
;
Machine Learning
;
Malonates
;
chemistry
;
Protein Processing, Post-Translational
;
genetics
5.Parkin promotes proteasomal degradation of p62: implication of selective vulnerability of neuronal cells in the pathogenesis of Parkinson's disease.
Pingping SONG ; Shanshan LI ; Hao WU ; Ruize GAO ; Guanhua RAO ; Dongmei WANG ; Ziheng CHEN ; Biao MA ; Hongxia WANG ; Nan SUI ; Haiteng DENG ; Zhuohua ZHANG ; Tieshan TANG ; Zheng TAN ; Zehan HAN ; Tieyuan LU ; Yushan ZHU ; Quan CHEN
Protein & Cell 2016;7(2):114-129
Mutations or inactivation of parkin, an E3 ubiquitin ligase, are associated with familial form or sporadic Parkinson's disease (PD), respectively, which manifested with the selective vulnerability of neuronal cells in substantia nigra (SN) and striatum (STR) regions. However, the underlying molecular mechanism linking parkin with the etiology of PD remains elusive. Here we report that p62, a critical regulator for protein quality control, inclusion body formation, selective autophagy and diverse signaling pathways, is a new substrate of parkin. P62 levels were increased in the SN and STR regions, but not in other brain regions in parkin knockout mice. Parkin directly interacts with and ubiquitinates p62 at the K13 to promote proteasomal degradation of p62 even in the absence of ATG5. Pathogenic mutations, knockdown of parkin or mutation of p62 at K13 prevented the degradation of p62. We further showed that parkin deficiency mice have pronounced loss of tyrosine hydroxylase positive neurons and have worse performance in motor test when treated with 6-hydroxydopamine hydrochloride in aged mice. These results suggest that, in addition to their critical role in regulating autophagy, p62 are subjected to parkin mediated proteasomal degradation and implicate that the dysregulation of parkin/p62 axis may involve in the selective vulnerability of neuronal cells during the onset of PD pathogenesis.
Adaptor Proteins, Signal Transducing
;
chemistry
;
metabolism
;
Animals
;
HEK293 Cells
;
Heat-Shock Proteins
;
chemistry
;
metabolism
;
Humans
;
Lysine
;
metabolism
;
Mice
;
Neurons
;
metabolism
;
pathology
;
Oxidopamine
;
pharmacology
;
Parkinson Disease
;
metabolism
;
pathology
;
Proteasome Endopeptidase Complex
;
metabolism
;
Protein Stability
;
Proteolysis
;
drug effects
;
Sequestosome-1 Protein
;
Ubiquitin-Protein Ligases
;
metabolism
;
Ubiquitination
;
drug effects
6.Inhibition of Nuclear Receptor Binding SET Domain 2/Multiple Myeloma SET Domain by LEM-06 Implication for Epigenetic Cancer Therapies.
Journal of Cancer Prevention 2015;20(2):113-120
BACKGROUND: Multiple myeloma SET domain (MMSET)/nuclear receptor binding SET domain 2 (NSD2) is a lysine histone methyltransferase (HMTase) and bona fide oncoprotein found aberrantly expressed in several cancers, suggesting potential role for novel therapeutic strategies. In particular, MMSET/NSD2 is emerging as a target for therapeutic interventions against multiple myeloma, especially t(4;14) myeloma that is associated with a significantly worse prognosis than other biological subgroups. Multiple myeloma is the second most common hematological malignancy in the United States, after non-Hodgkin lymphoma and remains an incurable malignancy. Thus, effective therapeutic strategies are greatly needed. HMTases inhibitors are scarce and no NSDs inhibitors have been isolated. METHODS: We used homology modeling, molecular modeling simulations, virtual ligand screening, computational chemistry software for structure-activity relationship and performed in vitro H3K36 histone lysine methylation inhibitory assay using recombinant human NSD2-SET and human H3.1 histone. RESULTS: Here, we report the discovery of LEM-06, a hit small molecule inhibitor of NSD2, with an IC50 of 0.8 mM against H3K36 methylation in vitro. CONCLUSIONS: We propose LEM-06 as a hit inhibitor that is useful to further optimize for exploring the biology of NSD2. LEM-06 derivatives may pave the way to specific NSD2 inhibitors suitable for therapeutic efforts against malignancies.
Biology
;
Chemistry
;
Drug Design
;
Epigenomics*
;
Hematologic Neoplasms
;
Histone-Lysine N-Methyltransferase
;
Histones
;
Humans
;
Inhibitory Concentration 50
;
Lymphoma, Non-Hodgkin
;
Lysine
;
Mass Screening
;
Methylation
;
Models, Molecular
;
Multiple Myeloma
;
Prognosis
;
Structure-Activity Relationship
;
United States
8.Proteomic identification and functional characterization of MYH9, Hsc70, and DNAJA1 as novel substrates of HDAC6 deacetylase activity.
Linlin ZHANG ; Shanshan LIU ; Ningning LIU ; Yong ZHANG ; Min LIU ; Dengwen LI ; Edward SETO ; Tso-Pang YAO ; Wenqing SHUI ; Jun ZHOU
Protein & Cell 2015;6(1):42-54
Histone deacetylase 6 (HDAC6), a predominantly cytoplasmic protein deacetylase, participates in a wide range of cellular processes through its deacetylase activity. However, the diverse functions of HDAC6 cannot be fully elucidated with its known substrates. In an attempt to explore the substrate diversity of HDAC6, we performed quantitative proteomic analyses to monitor changes in the abundance of protein lysine acetylation in response to HDAC6 deficiency. We identified 107 proteins with elevated acetylation in the liver of HDAC6 knockout mice. Three cytoplasmic proteins, including myosin heavy chain 9 (MYH9), heat shock cognate protein 70 (Hsc70), and dnaJ homolog subfamily A member 1 (DNAJA1), were verified to interact with HDAC6. The acetylation levels of these proteins were negatively regulated by HDAC6 both in the mouse liver and in cultured cells. Functional studies reveal that HDAC6-mediated deacetylation modulates the actin-binding ability of MYH9 and the interaction between Hsc70 and DNAJA1. These findings consolidate the notion that HDAC6 serves as a critical regulator of protein acetylation with the capability of coordinating various cellular functions.
Acetylation
;
Actins
;
chemistry
;
metabolism
;
Animals
;
Cell Line
;
Chromatography, High Pressure Liquid
;
HSC70 Heat-Shock Proteins
;
metabolism
;
HSP40 Heat-Shock Proteins
;
metabolism
;
Histone Deacetylase 6
;
Histone Deacetylases
;
metabolism
;
Isotope Labeling
;
Liver
;
metabolism
;
Lysine
;
metabolism
;
Mice
;
Mice, Inbred C57BL
;
Mice, Knockout
;
Microscopy, Confocal
;
Nonmuscle Myosin Type IIA
;
metabolism
;
Protein Binding
;
Proteomics
;
Substrate Specificity
;
Tandem Mass Spectrometry
9.Properties and biocompatibility of collagen scaffold modified by genipin cross-linked L-lysine.
Donglin JIANG ; Junyu YANG ; Shengyang JIANG ; Guozhong LU ; Peng ZHAO
Journal of Biomedical Engineering 2014;31(4):816-821
Collagen (Coll), as the basic material of matrix scaffolds for cell growth, has been widely used in the field of tissue engineering and regenerative medicine. In this study, collagen protein was modified by L-lysine (Lys), and cross-linked by genipin (GN) to prepare the L-lysine-modified collagen (Lys-Coll-GN) scaffolds. Microstructure, pore size, porosity, stability and biocompatibility of Lys-Coll-GN scaffolds were observed. The results showed that the bond between L-lysine and collagen protein molecule was formed by generating amide linkage, and mouse embryo fibroblasts proliferation was not inhibited in the Lys-Coll-GN scaffolds. In the multiple comparisons of Coll-scaf- folds, Coll-GN scaffolds and Lys-Coll-GN scaffolds, Coll-scaffolds was the worst in mechanical characteristics while the highest in biodegradation rate. Compared to Coll-GN scaffolds, Lys-Coll-GN scaffolds had more fiber structure, higher interval porosity (P<0. 01). Although the tensile stress of Lys-Coll-GN scaffolds reduced significantly, its e- longation length extended when the scaffolds was fractured (P<0. 01). The percentage of Lys-Coll-GN scaffolds residual weight was lower than that of Coll-GN scaffolds after all the scaffolds were treated by collagenase for 5 days (P<0. 01). This study suggested that Lys-Coll-GN scaffold had good biocompatibility, and it improved the mechanical property and degradation velocity for collagen-based scaffold. This study gave a new predominant type of tissue engineering scaffold for the regenerative medicine.
Animals
;
Biocompatible Materials
;
chemistry
;
Cell Proliferation
;
Collagen
;
chemistry
;
Cross-Linking Reagents
;
Fibroblasts
;
cytology
;
Iridoids
;
chemistry
;
Lysine
;
chemistry
;
Mice
;
Porosity
;
Tissue Engineering
;
Tissue Scaffolds
10.Feeding mouse with stable isotope labelled with amino acid.
Fengxu FAN ; Huiying GAO ; Zhongwei XU ; Linhui ZHAI ; Tailong YI ; Tao ZHANG ; Feilin WU ; Chunping CUI ; Ping XU
Chinese Journal of Biotechnology 2014;30(10):1602-1611
The stable isotope labeling by amino acids in culture (SILAC) based quantitative proteomics serves as a gold standard because of the high accuracy and throughput for protein identifications and quantification. In this study, we discussed the application of SILAC technology in mammal model, and developed quantitative internal standard for comparative proteomics of disease model. The C57BL/6J mice fed by special diet containing the 13C6-Lysine and bred F2 generation. We identified and analyzed total proteins of 9 mice tissues of F2 generation, including brain, lung, heart, stomach, intestine, liver, spleen, kidney, and muscle. Quantitative analysis information could evaluate the mice and different tissues' labeling efficiency. Liver was the most efficient, brain the least, and the labeling efficiency were 96.34%±0.90% and 92.62%±1.98% respectively. The average of the labeling efficiency of F2 generation was 95.80%±0.64%, which met the international standard (≥ 95%) for SILAC quantitative proteomics effective study. SILAC technology was successfully extended to mammalian model system, which will provide powerful tools for the mechanism study of the pathophysiology process with mouse model.
Amino Acids
;
chemistry
;
Animals
;
Diet
;
veterinary
;
Isotope Labeling
;
Lysine
;
chemistry
;
Mice
;
Mice, Inbred C57BL
;
Proteins
;
chemistry
;
Proteomics
;
methods

Result Analysis
Print
Save
E-mail