T-cell epitope prediction methods: an overview

DV Desai, U Kulkarni-Kale - Immunoinformatics, 2014 - Springer
Immunoinformatics, 2014Springer
The scientific community is overwhelmed by the voluminous increase in the quantum of data
on biological systems, including but not limited to the immune system. Consequently,
immunoinformatics databases are continually being developed to accommodate this ever
increasing data and analytical tools are continually being developed to analyze the same.
Therefore, researchers are now equipped with numerous databases, analytical and
prediction tools, in anticipation of better means of prevention of and therapeutic intervention …
Abstract
The scientific community is overwhelmed by the voluminous increase in the quantum of data on biological systems, including but not limited to the immune system. Consequently, immunoinformatics databases are continually being developed to accommodate this ever increasing data and analytical tools are continually being developed to analyze the same. Therefore, researchers are now equipped with numerous databases, analytical and prediction tools, in anticipation of better means of prevention of and therapeutic intervention in diseases of humans and other animals.
Epitope is a part of an antigen, recognized either by B- or T-cells and/or molecules of the host immune system. Since only a few amino acid residues that comprise an epitope (instead of the whole protein) are sufficient to elicit an immune response, attempts are being made to identify or predict this critical stretch or patch of amino acid residues, i.e., T-cell epitopes and B-cell epitopes to be included in multiple-subunit vaccines.
T-cell epitope prediction is a challenge owing to the high degree of MHC polymorphism and disparity in the volume of data on various steps encountered in the generation and presentation of T-cell epitopes in the living systems. Many algorithms/methods developed to predict T-cell epitopes and Web servers incorporating the same are available. These are based on approaches like considering amphipathicity profiles of proteins, sequence motifs, quantitative matrices (QM), artificial neural networks (ANN), support vector machines (SVM), quantitative structure activity relationship (QSAR) and molecular docking simulations, etc. This chapter aims to introduce the reader to the principle(s) underlying some of these methods/algorithms as well as procedural and practical aspects of using the same.
Springer