A Computational Perspective on Differences Between MHC-I and MHC-II in TCR-pMHC Structure Prediction Resources: Review and Benchmarking

Xiao-Qin WU; Da-Wei LIU; Bin-Yu LI; Yang LIU; Yang CAO; Wen-Tao DAI

Return

A Computational Perspective on Differences Between MHC-I and MHC-II in TCR-pMHC Structure Prediction Resources: Review and Benchmarking

VernacularTitle:计算差异视角下MHC-I与MHC-II的TCR-pMHC结构预测资源概述与测评
Author: Xiao-Qin WU ¹ ; Da-Wei LIU ² ; Bin-Yu LI ² ; Yang LIU ³ ; Yang CAO ³ ; Wen-Tao DAI ¹
Author Information

1. School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
2. Shanghai-MOST Key Laboratory of Health and Disease Genomics, NHC Key Lab of Reproduction Regulation, Shanghai Institute for Biomedical and Pharmaceutical Technologies, School of Pharmacy, Fudan University, Shanghai200237, China
3. Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
Publication Type:Journal Article
Keywords: immunology; TCR-pMHC complexes; structural prediction; PFR deviation index; CDR conformational consistency
From: Progress in Biochemistry and Biophysics 2026;53(5):1376-1399
CountryChina
Language:Chinese
Abstract: The initiation of adaptive immune responses relies on the precise recognition and interpretation of antigenic information. In this process, the specific binding of T cell receptors (TCRs) to peptide-major histocompatibility complex (pMHC) molecules represents one of the key molecular events in the initiation of adaptive immune responses. Accordingly, the structural features of TCR-pMHC complexes provide a fundamental basis for dissecting antigen recognition mechanisms and support rational vaccine design, therapeutic target discovery in TCR-based immunotherapy, and TCR identification and optimization. However, experimental determination of TCR-pMHC structures remains costly, time-consuming, and limited in coverage, making computational approaches essential for rapidly obtaining reliable structural information. Computational methods for predicting the structures of TCR-pMHC complexes have advanced rapidly in recent years, driven by progress in deep learning-based modeling frameworks and the increasing availability of structural and sequence resources. Despite these developments, most existing tools do not adequately distinguish the key structural and biophysical differences between MHC class I (MHC-I) and MHC class II (MHC-II) complexes during model construction. As a consequence, their predictive performance differs substantially between class I and class II complexes. In general, structural predictions for class I complexes outperform those for class II complexes. This discrepancy may be related to several fundamental differences between the two systems, including the architecture of the peptide-binding groove, the distribution of peptide lengths, and the properties of peptide flanking residues (PFRs). Compared with MHC-I molecules, MHC-II molecules usually bind longer antigenic peptides, which typically range from 13 to 25 amino acids in length. PFRs at both termini of these peptides participate in regulating the overall conformation of TCR-pMHC class II complexes and exert a pronounced effect on the geometric and physicochemical characteristics of the TCR-pMHC binding interface. Furthermore, within the TCR recognition interface, the complementarity-determining regions (CDRs) consist of segments that differ markedly in conformational behavior. They commonly include regions that are relatively rigid and structurally stable, together with highly flexible segments exhibiting substantial conformational plasticity. These rigidity-flexibility features constitute an essential structural basis enabling TCRs to recognize diverse peptide-MHC ligands and to accommodate conformational heterogeneity at the interface. However, many current modeling tools, in an effort to enforce global conformational stability or reduce structural noise, tend to over-constrain intrinsically flexible regions. Such oversimplification may lead to inappropriate rigidification of flexible CDR loops, resulting in local structural distortions, compromised interface geometry, or even complete modeling failure for specific complexes. Against this background, the review approaches the field from the perspective of computational differences between MHC-I and MHC-II complexes. We first systematically organize and summarize available resources related to TCRs and pMHCs, including structural datasets, sequence databases, prediction tools, and benchmarking studies. We then focus on five representative tools capable of predicting both class I and class II complexes—AlphaFold2, AlphaFold3, TCRmodel2, tFold-TCR, and TCR-pHLA_ModellerS. After excluding structures present in the training sets of these tools, we constructed a benchmark dataset comprising 25 class I and 10 class II TCR-pMHC complexes in the bound state and conducted a systematic evaluation using this dataset. We first employ widely used general evaluation metrics, including All-Atom Root Mean Square Deviation (All-Atom RMSD), Backbone RMSD, Template Modeling score (TM-score), and DockQ, to assess the global conformational accuracy and interface modeling quality of class I and class II complexes. For class II complexes, we propose for the first time a peptide flanking residue deviation index, including the PFRs-Deviation Index (PFRs-DI), N-PFR-Deviation Index (N-PFR-DI), and C-PFR-Deviation Index (C-PFR-DI), to quantitatively characterize conformational deviations in PFRs. In addition, we propose the CDR conformational consistency index (CCC) designed to qualitatively evaluate the ability of prediction tools to capture TCR CDR conformational flexibility. These metrics collectively assess a tool’s ability to model both overall conformation and critical functional regions, thereby addressing the limitations of existing evaluation criteria that overemphasize global structure while inadequately capturing modeling quality in key functional areas. This establishes a unified analytical framework for MHC-I and MHC-II complexes to guide data resource selection, modeling strategy formulation, and evaluation system development. The framework further advances computational modeling and provides crucial support for multi-scale analysis of TCR-pMHC recognition mechanisms and their biological functions.