T-cell receptor (TCR) recognition of antigenic peptides presented by major histocompatibility complex (MHC) molecules is central to adaptive immunity, driving pathogen-specific responses and informing therapeutic vaccine development. Computational tasks such as predicting TCR-antigen binding affinity (NetTCR, Montemurro et al., 2021; ImRex, Moris et al., 2021) and clustering TCR sequences by epitope specificity (GLIPH, Glanville et al., 2017; TCRdist, Dash et al., 2017) have emerged as key challenges to decoding immune specificity. While recent models leveraging convolutional neural networks, transformers (e.g., ATM-TCR, Xu et al., 2021), and multimodal embeddings (ERGO, Springer et al., 2020; TCRMatch, Chronister et al., 2021) have significantly advanced performance, fragmented datasets and inconsistent evaluation methods have limited direct model comparisons and generalization. We propose a unified benchmark dataset integrating rigorously curated TCR sequences from human, mouse, and macaque responses to major pathogens (Influenza A, CMV, EBV, SARS-CoV-2) sourced from comprehensive databases such as VDJdb (Shugay et al., 2018) and IEDB (Vita et al., 2019). The benchmark incorporates standardized evaluation splits, structural representations enabled by AlphaFold2 predictions (Jumper et al., 2021), and robust evaluation metrics to ensure fair, reproducible comparisons. By consolidating disparate data and evaluation practices, our benchmark provides clarity on current progress, facilitating future innovation in computational TCR-antigen interaction modeling.