Over the last two decades, there have been three deadly human outbreaks of coronaviruses (CoVs) caused by SARS-CoV, MERS-CoV, and SARS-CoV-2, which has caused the current COVID-19 global pandemic. All three deadly CoVs originated from bats and transmitted to humans via various intermediate animal reservoirs. It remains highly possible that other global COVID pandemics will emerge in the coming years caused by yet another spillover of a bat-derived SARS-like coronavirus (SL-CoV) into humans. Determining the Ag and the human B cells, CD4+ and CD8+ T cell epitope landscapes that are conserved among human and animal coronaviruses should inform in the development of future pan-coronavirus vaccines. In the current study, using several immunoinformatics and sequence alignment approaches, we identified several human B cell and CD4+ and CD8+ T cell epitopes that are highly conserved in 1) greater than 81,000 SARS-CoV-2 genome sequences identified in 190 countries on six continents; 2) six circulating CoVs that caused previous human outbreaks of the common cold; 3) nine SL-CoVs isolated from bats; 4) nine SL-CoV isolated from pangolins; 5) three SL-CoVs isolated from civet cats; and 6) four MERS strains isolated from camels. Furthermore, the identified epitopes: 1) recalled B cells and CD4+ and CD8+ T cells from both COVID-19 patients and healthy individuals who were never exposed to SARS-CoV-2, and 2) induced strong B cell and T cell responses in humanized HLA-DR1/HLA-A*02:01 double-transgenic mice. The findings pave the way to develop a preemptive multiepitope pan-coronavirus vaccine to protect against past, current, and future outbreaks.

As deforestation continues to expand and humans progressively conquer wildlife habitats around the globe, the wildlife fights back by spilling over many zoonotic viruses into human populations (1, 2). Among these is the large family coronaviruses. Since the first human coronavirus was identified in 1965, many additional coronavirus strains have continued to emerge (35). These caused several major human coronavirus outbreaks within the last two decades (i.e., from 2002 to 2019): SARS-CoV (6), CoV-NL63 (7), CoV-HKU1 (8), CoV-229E (8), CoV-OC43 (9), MERS-CoV (10), and the highly contagious and deadly SARS-CoV-2 (11, 12). The many deadly coronavirus outbreaks in the past twenty years should have been the impetus for urgently developing a preemptive pan-coronavirus vaccine.

The first two deadly coronaviruses, the MERS-CoV and the SARS-CoV, originated from bats, their natural hosts and reservoirs, and were transmitted to humans from intermediate animals, namely camels and civet cats, respectively (10, 1316). The third deadly SARS-CoV-2 appears to be 96% identical to a bat SARS-like coronavirus (SL-CoV) strain termed Bat-CoV-RaTG13 and transmitted to humans from a yet-to-be determined intermediate animal (17, 18). Although human-to-human spread of the common cold coronaviruses occurs frequently, only rarely do animal-to-human coronavirus transmissions occur (19). However, the highly contagious SARS-CoV-2 successfully produces both animal-to-human spread and human-to-human transmission (20, 21). The first known human-to-human transmission of SARS-CoV-2, which causes coronavirus disease 2019 (COVID-19), was reported in late January 2020, prompting the World Health Organization and United States authorities to declare a global public health emergency (22).

All human coronaviruses are associated with respiratory illnesses, ranging from mild common colds to more severe lower respiratory tract symptoms (23). Within 2–14 d after SARS-CoV-2 exposure, newly infected individuals may develop fever, fatigue, myalgia, and respiratory symptoms, including cough and shortness of breath (24). Although 40–45% of newly infected individuals remained asymptomatic, 55–60% individuals are symptomatic, ranging from mild/severe to critically ill patients, especially the elderly and those with comorbidities: they develop severe pulmonary inflammatory disease and may need a rapid medical intervention to prevent acute respiratory distress syndrome and death (24, 26). The SARS-CoV-2 infection induces antiviral CD4+ T cells, helping the production of neutralizing/blocking Abs and the formation of effector IFN-γ–producing CD4+ T cells and cytotoxic CD8+ T cells, all arms of immunity critical in reducing viral load in the majority of asymptomatic and convalescence patients (2733). Although SARS-CoV-2–specific IgG/IgM Abs and CD4+ and CD8+ T cells are critical to reducing viral infection in a majority of asymptomatic and convalescence patients (32), an excessive proinflammatory cytokine storm appears to lead to acute respiratory distress syndrome and death in many symptomatic individuals (3440). Thus, it is crucial to determine the B cell and T cell epitope specificities and the repertoire, phenotype, and function of B cells and CD4+ and CD8+ T cells that are associated with natural resistance seen in asymptomatic patients (4143). The information in this study should guide in the development of pan-coronavirus vaccines.

In the current study, we identified several human B and CD4+ and CD8+ T cell epitopes that are highly conserved among six strains of coronaviruses previously reported to infect humans and over 81,000 genome sequences of SARS-CoV-2 that currently circulate in 190 countries on six continents. Moreover, as immune targets for preemptive pan-coronavirus vaccines, we identified the epitopes that are common among the above human coronaviruses and 25 animal strains isolated from bats, pangolins, civet cats, and camels. We demonstrated the antigenicity of these epitopes in both SARS-CoV-2 patients and unexposed healthy individuals and their immunogenicity in humanized HLA-DR1/HLA-A*02:01 double–transgenic (Tg) mice. Our findings pave the way for incorporating these highly conserved B cell and T cell epitopes in future preemptive multiepitope pan-coronavirus vaccines that would be expected to not only protect against COVID-19 but also against subsequent global outbreaks.

Sixty-three COVID-19 patients and ten unexposed healthy individuals who had never been exposed to SARS-CoV-2 or COVID-19 patients were enrolled in this study (Table I). Seventy-eight percent were non-White (African, Asian, Hispanic, and others), and 22% were White. Forty-four percent were females and 56% were males, with an age range of 26–95 y (median, 62 y)

Table I.

Clinical and demographic features of symptomatic and asymptomatic COVID-19 patients and unexposed healthy individuals enrolled in the study

Patient CharacteristicsSevere Symptoms (n = 9)Moderate Symptoms (n = 11)Mild Symptoms (n = 32)Asymptomatic (n = 11)Healthy Individuals (n = 10)
Demographic features Age 62 (26–95) 56 (24–91) 62 (24–87) 54 (22–78) 51 (25–67) 
 Gender (male/female) 5/4 (56%/44%) 9/2 (82%/18%) 21/11 (66%/34%) 2/9 (18%/82%) 6/4 (60%/40%) 
 Race (White/non-White [%]) 2/7 (22%/78%) 1/10 (9%/91%) 4/28 (12%/88%) 5/6 (45%/55%) 5/5 (50%/50%) 
HLA phenotype HLA-A*0201 (+ve) 4/9 (44%) 5/11 (45%) 14/32 (44%) 4/11 (36%) 10/10 (100%) 
 HLA-DRB1 (+ve) 9/9(100%) 11/11 (100%) 32/32 (100%) 11/11 (100%) 10/10 (100%) 
Clinical parameters BMI 25.2 (20.3–57.9) 26.5 (20.9–33.5) 30.3 (21.1–46.5) 29.3 (17.6–60.8) — 
 Temperature/fever/chills 98.4 (97.9–99.9) 100.3 (97.7–102.8) 99.1 (97.8–102.8) 98.7 (97.7–102.5) — 
 Cough 4 (44%) 6 (55%) 16 (50%) 1 (9%) — 
 Shortness of breath/dyspnea 7 (78%) 7 (63%) 21 (66%) 1 (9%) — 
 Fatigue/myalgia 0 (0%) 4 (36%) 15 (47%) 1 (9%) — 
 Headache 4 (44%) 6 (54%) 16 (50%) 1 (9%) — 
 ICU admission 9 (100%) 11 (100%) 2 (6%) 0 (0%) — 
 Ventilator support 6 (67%) 1 (9%) 1 (3%) 0 (0%) — 
 WBC 10.9 (7.4–14.8) 8 (6–29.8) 7.1 (3.9–18.9) 30.6 (4.9–60.8) — 
 RBC 4.07 (2.97–5.92) 4.04 (2.68–4.59) 4.4 (2.69–5.41) 7.1 (4.17–16.2) — 
 Hemoglobin (g/l) 11.1 (8.3–16.2) 12.1 (8.4–13.8) 13.1 (8.1–16.9) 4.4 (4.01–12.9) — 
Comorbidities Diabetes 2 (22%) 7 (64%) 18 (56%) 5 (46%) — 
 Hypertension 7 (78%) 7 (64%) 22 (69%) 4 (36%) — 
 Cardiovascular disease 1 (11%) 2 (18%) 4 (13%) 1 (9%) — 
 CAD 0 (0%) 1 (9%) 2 (6%) 0 (0%) — 
 ESRD 1 (11%) 3 (27%) 4 (13%) 0 (0%) — 
 Asthma/COPD 1 (11%) 1 (9%) 1 (3%) 2 (18%) — 
 Obesity 3 (33%) 1 (9%) 16 (50%) 5 (46%) — 
 Cancer 1 (11%) 0 (0%) 6 (19%) 1 (9%) — 
Patient CharacteristicsSevere Symptoms (n = 9)Moderate Symptoms (n = 11)Mild Symptoms (n = 32)Asymptomatic (n = 11)Healthy Individuals (n = 10)
Demographic features Age 62 (26–95) 56 (24–91) 62 (24–87) 54 (22–78) 51 (25–67) 
 Gender (male/female) 5/4 (56%/44%) 9/2 (82%/18%) 21/11 (66%/34%) 2/9 (18%/82%) 6/4 (60%/40%) 
 Race (White/non-White [%]) 2/7 (22%/78%) 1/10 (9%/91%) 4/28 (12%/88%) 5/6 (45%/55%) 5/5 (50%/50%) 
HLA phenotype HLA-A*0201 (+ve) 4/9 (44%) 5/11 (45%) 14/32 (44%) 4/11 (36%) 10/10 (100%) 
 HLA-DRB1 (+ve) 9/9(100%) 11/11 (100%) 32/32 (100%) 11/11 (100%) 10/10 (100%) 
Clinical parameters BMI 25.2 (20.3–57.9) 26.5 (20.9–33.5) 30.3 (21.1–46.5) 29.3 (17.6–60.8) — 
 Temperature/fever/chills 98.4 (97.9–99.9) 100.3 (97.7–102.8) 99.1 (97.8–102.8) 98.7 (97.7–102.5) — 
 Cough 4 (44%) 6 (55%) 16 (50%) 1 (9%) — 
 Shortness of breath/dyspnea 7 (78%) 7 (63%) 21 (66%) 1 (9%) — 
 Fatigue/myalgia 0 (0%) 4 (36%) 15 (47%) 1 (9%) — 
 Headache 4 (44%) 6 (54%) 16 (50%) 1 (9%) — 
 ICU admission 9 (100%) 11 (100%) 2 (6%) 0 (0%) — 
 Ventilator support 6 (67%) 1 (9%) 1 (3%) 0 (0%) — 
 WBC 10.9 (7.4–14.8) 8 (6–29.8) 7.1 (3.9–18.9) 30.6 (4.9–60.8) — 
 RBC 4.07 (2.97–5.92) 4.04 (2.68–4.59) 4.4 (2.69–5.41) 7.1 (4.17–16.2) — 
 Hemoglobin (g/l) 11.1 (8.3–16.2) 12.1 (8.4–13.8) 13.1 (8.1–16.9) 4.4 (4.01–12.9) — 
Comorbidities Diabetes 2 (22%) 7 (64%) 18 (56%) 5 (46%) — 
 Hypertension 7 (78%) 7 (64%) 22 (69%) 4 (36%) — 
 Cardiovascular disease 1 (11%) 2 (18%) 4 (13%) 1 (9%) — 
 CAD 0 (0%) 1 (9%) 2 (6%) 0 (0%) — 
 ESRD 1 (11%) 3 (27%) 4 (13%) 0 (0%) — 
 Asthma/COPD 1 (11%) 1 (9%) 1 (3%) 2 (18%) — 
 Obesity 3 (33%) 1 (9%) 16 (50%) 5 (46%) — 
 Cancer 1 (11%) 0 (0%) 6 (19%) 1 (9%) — 

Patients were scored on a scale of 1 to 4 and then classified into three groups of symptomatic patients (severe symptoms [i.e., ICU admission+/– intubation or death], moderate symptoms [i.e., ICU admission], mild symptoms [i.e., in-patient only],, and asymptomatic patients [i.e., infected patients but with no symptoms]). Unexposed healthy individuals with no history of COVID-19 or contact with COVID-19 patients. Median values are shown along with range. Dashes (—) show the absence of parameters.

BMI, body mass index, CAD, coronary artery disease; COPD, chronic obstructive pulmonary disease; ESRD, end-stage renal disease.

Detailed clinical and demographic characteristics of the COVID-19 patients and the unexposed healthy individuals with respect to age, gender, HLA-A*02:01 and HLA-DRB1 distribution, COVID-19 disease severity, comorbidity, and biochemical parameters are presented in Table I. None of the symptomatic patients were on antiviral or anti-inflammatory drug treatments at the time of blood sample collections. The COVID-19 patients (n = 63) were divided into four groups depending on the severity of the symptoms: group 1 comprised of SARS-CoV-2–infected patients who never developed any symptoms or any viral diseases (i.e., asymptomatic patients; n = 11), group 2 had mild symptoms (i.e., in-patient only; n = 32), group 3 had moderate symptoms (i.e., intensive care unit [ICU] admission; n = 11), and group 4 had severe symptoms (i.e., ICU admission ± intubation or death; n = 9). As expected, compared with the asymptomatic group, all of the three symptomatic groups (i.e., mild, moderate, and severe) had higher percentages of comorbidities, including diabetes (22–64%), hypertension (64–78%), cardiovascular disease (11–18%), and obesity (9–50%) (Table I). The final group, group 5, was comprised of unexposed healthy individuals (controls) with no history of COVID-19 or contact with COVID-19 patients (n = 10) collected prior to 2019. All subjects were enrolled at the University of California Irvine under Institutional Review Board–approved protocols (no. 2020-5779). A written informed consent was received from all participants prior to inclusion in this study.

We retrieved 81,963 human SARS-CoV-2 genome sequences from GISAID database, representing countries from North America, South America, Central America, Europe, Asia, Oceania, and Africa (Fig. 1). Furthermore, the full-length sequences of SARS-CoV strains (SARS-CoV-2-Wuhan-Hu-1 (MN908947.3), SARS-CoV-Urbani (AY278741.1), HKU1–genotype B (AY884001), CoV-OC43 (KF923903), CoV-NL63 (NC_005831), CoV-229E (KY983587), and MERS (NC_019843) found in the human host were obtained from the National Center for Biotechnology Information (NCBI) GenBank. SARS-CoV-2 genome sequences from bat (RATG13 [MN996532.2], ZXC21 [MG772934.1], YN01 [EPI_ISL_412976], and YN02 [EPI_ISL_412977]), and pangolin (GX-P2V [MT072864.1], GX-P5E [MT040336.1], GX-P5L [MT040335.1], GX-P1E [MT040334.1], GX-P4L [MT040333.1], GX-P3B [MT072865.1], MP789 [MT121216.1], and Guangdong-P2S [EPI_ISL_410544]) were obtained from NCBI (www.ncbi.nlm.nih.gov/nuccore) and GISAID (www.gisaid.org). More so, the SARS-CoV strains from bat (WIV16 [KT444582.1], WIV1 [KF367457.1], YNLF_31C [KP886808.1], Rs672 [FJ588686.1], and recombinant strain [FJ211859.1]), camel (KT368891.1, MN514967.1, KF917527.1, and NC_028752.1), and civet (Civet007, A022, and B039) were also retrieved from the NCBI GenBank. The sequences were aligned using ClustalW algorithm in MEGAX.

FIGURE 1.

Evolutionary comparison of genome sequences among β-coronavirus strains isolated from humans and animals. (A) Left panel, Phylogenetic analysis performed between SARS-CoV-2 strains (obtained from humans [H. sapiens (black)], along with the animal’s SL-CoV genome sequence, sequences obtained from bats [R. affinis and R. malayanus (red)], pangolins [M. javanica (blue)], civet cats [P. larvata (green)], and camels [C. dromedaries (brown)]). The included SARS-CoV/MERS-CoV strains are from previous outbreaks (obtained from humans (Urbani, MERS-CoV, OC43, NL63, 229E, and HKU1–genotype B), bats (WIV16, WIV1, YNLF-31C, Rs672, and recombinant strains), camel (C. dromedaries [KT368891.1, MN514967.1, KF917527.1, and NC_028752.1]), and civet (Civet007, A022, and B039). The human SARS-CoV-2 genome sequences are represented from six continents. (B) Phylogenetic analysis performed among SARS-CoV-2 strains from human and other species with previous strains of SARS/MERS-CoV showed minimum genetic distance between the first SARS-CoV-2 isolate Wuhan-Hu-1 reported from the Wuhan Seafood market with bat strains hCoV-19-bat-Yunnan-RmYN02, bat-CoV-19-ZXC21, and hCoV-19-bat-Yunnan-RaTG13. This makes the bat strains nearest precursor to the human-SARS-CoV-2 strain. (C) Genetic distances based on maximum composite likelihood model among the human, bat, pangolin, civet cat, and camel genome sequences. Results indicate least genetic distance among SARS-CoV-2 isolate Wuhan-Hu-1 and bat strains bat-CoV-19-ZXC21 (0.1), hCoV-19-bat-Yunnan-RaTG13 (0.1), and hCoV-19-bat-Yunnan-RmYN02 (0.2). (D) Evolutionary analysis performed among the human-SARS-CoV-2 genome sequences reported from six continents and SARS-CoV-2 genome sequences obtained from bats (R. affinis and R. malayanus) and pangolins (M. javanica) (E) Venn diagram showing the number of SARS-CoV-2 genome sequences reported from Africa (n = 1574), Asia (n = 7533), North America (n = 19659), South America (n = 1303), Europe (n = 48752), and Oceania region (n = 3142) as of August 18, 2020. (F) Complete genome tree derived from 81,963 outbreak SARS-CoV-2 genome sequences submitted from Asian, African, North American, South American, European, and Oceanian regions.

FIGURE 1.

Evolutionary comparison of genome sequences among β-coronavirus strains isolated from humans and animals. (A) Left panel, Phylogenetic analysis performed between SARS-CoV-2 strains (obtained from humans [H. sapiens (black)], along with the animal’s SL-CoV genome sequence, sequences obtained from bats [R. affinis and R. malayanus (red)], pangolins [M. javanica (blue)], civet cats [P. larvata (green)], and camels [C. dromedaries (brown)]). The included SARS-CoV/MERS-CoV strains are from previous outbreaks (obtained from humans (Urbani, MERS-CoV, OC43, NL63, 229E, and HKU1–genotype B), bats (WIV16, WIV1, YNLF-31C, Rs672, and recombinant strains), camel (C. dromedaries [KT368891.1, MN514967.1, KF917527.1, and NC_028752.1]), and civet (Civet007, A022, and B039). The human SARS-CoV-2 genome sequences are represented from six continents. (B) Phylogenetic analysis performed among SARS-CoV-2 strains from human and other species with previous strains of SARS/MERS-CoV showed minimum genetic distance between the first SARS-CoV-2 isolate Wuhan-Hu-1 reported from the Wuhan Seafood market with bat strains hCoV-19-bat-Yunnan-RmYN02, bat-CoV-19-ZXC21, and hCoV-19-bat-Yunnan-RaTG13. This makes the bat strains nearest precursor to the human-SARS-CoV-2 strain. (C) Genetic distances based on maximum composite likelihood model among the human, bat, pangolin, civet cat, and camel genome sequences. Results indicate least genetic distance among SARS-CoV-2 isolate Wuhan-Hu-1 and bat strains bat-CoV-19-ZXC21 (0.1), hCoV-19-bat-Yunnan-RaTG13 (0.1), and hCoV-19-bat-Yunnan-RmYN02 (0.2). (D) Evolutionary analysis performed among the human-SARS-CoV-2 genome sequences reported from six continents and SARS-CoV-2 genome sequences obtained from bats (R. affinis and R. malayanus) and pangolins (M. javanica) (E) Venn diagram showing the number of SARS-CoV-2 genome sequences reported from Africa (n = 1574), Asia (n = 7533), North America (n = 19659), South America (n = 1303), Europe (n = 48752), and Oceania region (n = 3142) as of August 18, 2020. (F) Complete genome tree derived from 81,963 outbreak SARS-CoV-2 genome sequences submitted from Asian, African, North American, South American, European, and Oceanian regions.

Close modal

The SARS-CoV-2-Wuhan-Hu-1 (MN908947.3) protein sequence was compared with SARS-CoV– and MERS-CoV–specific protein sequences obtained from human, bat, pangolin, civet, and camel. The Sequence Variation Analysis was performed on the consensus-aligned protein sequences from each virus strain. This Sequence Homology Analysis identified consensus protein sequences from the SARS-CoV and MERS-CoV and predicted the Epitope Sequence Analysis.

Epitope prediction was carried out using the 12 proteins predicted for the reference SARS-CoV-2 isolate Wuhan-Hu-1. The corresponding SARS-CoV-2 protein accession identification numbers obtained from NCBI (www.ncbi.nlm.nih.gov/protein) are as follows: YP_009724389.1 (open reading frame [ORF]1ab), YP_009725295.1 (ORF1a), YP_009724390.1 (spike glycoprotein [S]), YP_009724391.1 (ORF3a), YP_009724392.1 (envelope protein [E]), YP_009724393.1 (membrane glycoprotein), YP_009724394.1 (ORF6), YP_009724395.1 (ORF7a), YP_009725318.1 (ORF7b), YP_009724396.1 (ORF8), YP_009724397.2 (nucleocapsid phosphoprotein), and YP_009725255.1 (ORF10). The tools used for CD8+ T cell–based epitope prediction were SYFPEITHI, MHC class I (MHC-I) binding predictions, and class I immunogenicity. Of these, the latter two were hosted on the Immune Epitope Database (IEDB) platform. For the prediction of CD4+ T cell epitopes, we used multiple databases and algorithms, namely SYFPEITHI, MHC class II (MHC-II) binding predictions, TepiTool, and TEPITOPEpan. For CD8+ T cell epitope prediction, we selected the five most frequent HLA-A class I alleles (HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01, and HLA-A*23:01) with large coverage of the world population, regardless of race and ethnicity (Supplemental Fig. 1A, 1C), using a phenotypic frequency cutoff ≥6%. Similarly, for CD4 T cell epitope prediction, we selected HLA-DRB1*01:01, HLA-DRB1*11:01, HLA-DRB1*15:01, HLA-DRB1*03:01, and HLA-DRB1*04:01 alleles with large population coverage (Supplemental Fig. 1B, 1D). Subsequently, using NetMHC, we analyzed the SARS-CoV-2 protein sequence against all the aforementioned MHC-I and MHC-II alleles. Epitopes with 9-mer length for MHC-I and 15-mer length for MHC-II were predicted. Subsequently, the peptides were analyzed for binding stability to the respective HLA allotype. Our stringent epitope selection criteria were based on picking the top 1% epitopes focused on prediction percentile scores.

Linear B cell epitope predictions were carried out on the S, the primary target of B cell immune responses for SARS-CoV. We used the BepiPred 2.0 algorithm embedded in the B cell prediction analysis tool hosted on IEDB platform. For each protein, the epitope probability score for each amino acid and the probability of exposure were retrieved. Potential B cell epitopes were predicted using a cutoff of 0.55 (corresponding to a specificity >0.81 and sensitivity <0.3) and considering sequences having more than 5-aa residues. This screening process resulted in 28 B cell peptides (Supplemental Table III). From this pool, we selected 10 B cell epitopes with 19–62-aa lengths. Three B cell epitopes were observed to possess receptor-binding domain (RBD) region–specific amino acids. Structure-based Ab prediction was performed by using Discotope 2.0, and a positivity cutoff >−2.5 was applied (corresponding to specificity ≥0.80 and sensitivity <0.39) using the SARS-CoV-2 spike glycoprotein structure (Protein Data Bank identifier: 6M1D).

Computational peptide docking of B cell peptides into the ACE2 complex (binding protein) was performed using the GalaxyPepDock under GalaxyWEB. To retrieve the ACE2 structure, we used the x-ray crystallographic structure ACE2–B0AT1 complex–6M1D available on the Protein Data Bank. The 6M1D with a structural weight of 334.09 kDa possesses two unique protein chains, 2,706 residues, and 21,776 atoms. In this study, flexible target docking based on an energy-optimization algorithm was carried out on the ligand-binding domain containing ACE2 within the 4GBX structure. Similarity scores were calculated for protein–peptide interaction pairs for each residue. The prediction accuracy is estimated from a linear model as the relationship between the fraction of correctly predicted binding site residues and the template-target similarity measured by the protein structure similarity score and interaction similarity (SInter) score obtained by linear regression. SInter shows the similarity of amino acids of the B cell peptides aligned to the contacting residues in the amino acids of the ACE2 template structure. Higher SInter score represents a more significant binding affinity among the ACE2 molecule and B cell peptides. Subsequently, molecular docking models were built based on distance restraints for protein–peptide pairs using GalaxyPepDock. Based on the optimized energy scores, docking models were ranked.

While performing the protein–peptide docking analysis for CD8+ T cell epitope peptides, we used the x-ray crystal structure of HLA-A*02:01 in complex 4UQ3 available on the Protein Data Bank and for CD4 peptides x-ray crystallographic structure HLA-DM–HLA-DRB1 complex 4GBX.

The epitope conservancy analysis tool was used to compute the degree of the conservancy of CD8+ T cell, CD4+ T cell, and B cell epitopes within a given protein sequence of SARS-CoV-2 set at 100% identity level. The fraction of protein sequences that contain the regions similar to epitopes were evaluated on the degree of similarity or correspondence among two sequences. The CD8+ T cell and CD4+ T cell epitopes were screened against all the 12 structural and nonstructural proteins of SARS-CoV-2, namely YP_009724389.1 (ORF1ab), YP_009725295.1 (ORF1a), YP_009724390.1 (S), YP_009724391.1 (ORF3a), YP_009724392.1 (E), YP_009724393.1 (membrane glycoprotein), YP_009724394.1 (ORF6), YP_009724395.1 (ORF7a), YP_009725318.1 (ORF7b), YP_009724396.1 (ORF8), YP_009724397.2 (nucleocapsid phosphoprotein), and YP_009725255.1 (ORF10). B cell epitopes were screened for their conservancy against S (YP_009724390.1) of SARS-CoV-2. Epitope linear sequence conservancy approach was used for linear epitope sequences with a sequence identity threshold set at ≥50%. This analysis resulted in 1) the calculated degree of conservancy (percentage of protein sequence matches a specified identity level); and 2) the matching minimum/maximum identity levels within the protein sequence set. The CD8+ and CD4+ T cell epitopes that showed ≥50% conservancy in at least two human SARS-CoV strains and two SARS-CoV strains (from bat/civet/pangolin/camel) were selected as candidate epitopes. N- and O-glycosylation sites were screened using NetNGlyc 1.0 and NetOGlyc 4.0 prediction servers, respectively (44).

For a robust epitope screening, we evaluated the conservancy of CD8+ T cell, CD4+ T cell, and B cell epitopes within human-SARS-CoV-2 genome sequences representing North America, South America, Africa, Europe, Asia, and Australia. As of August 27, 2020, the NextStrain database recorded 81,963 human-SARS-CoV-2 genome sequences, and the number of genome sequences continues growing daily. In the present analysis, 81,963 human-SARS-CoV-2 genome sequences were extrapolated from the GISAID and NCBI GenBank databases. We therefore considered all the 81,963 SARS-CoV-2 genome sequences representing six continents for subsequent conservancy analysis. We set a threshold for a candidate CD8+ T cell, CD4+ T cell, and B cell epitope if the epitope showed 100% sequence conservancy in ≥95 human-SARS-CoV-2 genome sequences. Furthermore, population coverage calculation was carried out using the Population Coverage software hosted on IEDB platform (45). Population coverage calculation was performed to evaluate the distribution of screened CD8+ and CD4+ T cell epitopes in world population at large in combination with HLA-I (HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01, and HLA-A*23:01) and HLA-II (HLA-DRB1*01:01, HLA-DRB1*11:01, HLA-DRB1*15:01, HLA-DRB1*03:01, and HLA-DRB1*04:01) alleles.

Potential peptide epitopes (9-mer long for CD8+ T cell epitopes and 15-mer long for CD4+ T cell epitopes) identified from 12 human-SARS-CoV-2 proteins, namely ORF1ab, ORF1a, S, ORF3a, E, membrane glycoprotein, ORF6, ORF7a, ORF7b, ORF8, nucleocapsid phosphoprotein, and ORF10, were synthesized using solid-phase peptide synthesis and standard 9-fluorenylmethoxycarbonyl technology (21st Century Biochemicals, Marlborough, MA). The purity of peptides was over 90%, as determined by reversed-phase HPLC (Vydac C18) and mass spectroscopy (Voyager MALDI-TOF System). Stock solutions were made at 1 mg/ml in 10% DMSO in PBS. Similar method of synthesis was used for B cell peptide epitopes from the spike protein of SARS-CoV-2.

T2 (174 × CEM.T2) mutant hybrid cell line derived from the T lymphoblast cell line CEM was obtained from the American Type Culture Collection (www.atcc.org). The T2 cell line was maintained in IMDM (American Type Culture Collection, Manassas, VA) supplemented with 10% heat-inactivated FCS and 100 U of penicillin/ml, 100 U of streptomycin/ml (Sigma-Aldrich, St. Louis, MO). T2 cells lack the functional TAP heterodimer and failed to express normal amounts of HLA-A*02:01 on the cell surface. HLA-A*02:01 surface expression is stabilized following the binding of exogenous peptides to these MHC-I molecules.

To determine whether synthetic peptides could stabilize HLA-A*02:01 molecule expression on the T2 cell surface, peptide-inducing HLA-A*02:01 upregulation on T2 cells was examined according to a previously described protocol (46, 47). T2 cells (3 × 105 per well) were incubated with different concentrations (30, 10, and 3 µM) of 91 individual CD8+ T cell–specific peptides in 48-well plates for 18 h at 26°C. Cells were then incubated at 37°C for 3 h in the presence of 0.7 μl/ml BD GolgiStop to block cell surface expression of newly synthesized HLA-A*02:01 molecules and human β-2 microglobulin (1 μg/ml). The cells were subsequently washed with FACS buffer (1% BSA and 0.1% sodium azide in PBS) and stained with anti–HLA-A2–specific mAb (clone BB7.2) (BD Pharmingen, San Diego, CA) at 4°C for 30 min. After incubation, the cells were washed with FACS buffer, fixed with 2% paraformaldehyde in PBS, and analyzed by flow cytometry using a Fortessa (Becton Dickinson) flow cytometer equipped with a BD High Throughput Sampler for rapid analysis of samples prepared in plate format. The acquired data were analyzed with FlowJo software (BD Biosciences, San Jose, CA), and expression was measured by mean fluorescence intensity (MFI). Percentage of MFI increase was calculated as follows: percentage MFI increase = (MFI with the given peptide − MFI without peptide)/(MFI without peptide) × 100. Each experiment was performed three times, and means ± SD values were calculated.

A colony of HLA class I and class II double-Tg mice was maintained at the University of California Irvine (48) vivarium and treated in accordance with the Association for Assessment and Accreditation of Laboratory Animal Care according to Institutional Animal Care and Use Committee–approved animal protocols (no. 2020-19-111) and National Institutes of Health guidelines. The HLA-Tg mice retain their endogenous mouse MHC locus and express human HLA-A*02:01 and HLA-DRB*01 under the control of its normal promoter (49, 50). Prior to this study, the expression of HLA-A*02:01 and DR1 molecules on the PBMCs of each HLA-Tg mouse was confirmed by FACS.

Groups of age-matched HLA-transgenic mice/B6 mice (n = 3) were immunized s.c. on days 0 and 14, with a mixture of four SARS-CoV-2–derived human CD4+ T/CD8+T/B cell peptide epitopes delivered in aluminum hydroxide (alum) and CpG1826 adjuvants. As a negative control, mice received adjuvants alone (mock immunized).

Spleens were harvested from mice in 2 wk after second immunization. Spleens were placed in 10 ml of cold PBS with 10% FBS and 2× antibiotic–antimycotic (Life Technologies, Carlsbad, CA). Spleens were minced finely and sequentially passed through a 100-µm screen and a 70-µm screen (BD Biosciences). Cells were then pelleted via centrifugation at 400 × g for 10 min at 4°C. RBCs were lysed using a lysis buffer (ammonium chloride) and washed again. Isolated splenocytes were diluted to 1 × 106 viable cells/ml in RPMI media with 10% (v/v) FBS and 2× antibiotic–antimycotic. Viability was determined by trypan blue staining.

PBMCs/splenocytes were analyzed by flow cytometry. The following Abs were used: CD8, CD4, CD62L, CD107a/b, CD44, CD69, TNF-α, and IFN-γ. For surface staining, mAbs against various cell markers were added to a total of 1 × 106 cells in PBS containing 1% FBS and 0.1% sodium azide (FACS buffer) and left for 45 min at 4°C. At the end of the incubation period, the cells were washed twice with FACS buffer. A total of 100,000 events were acquired by LSR II (Becton Dickinson, Mountain View, CA), followed by analysis using FlowJo software (Tree Star, Ashland, OR).

All reagents used were filtered through a 0.22-µm filter. Wells of 96-well MultiscreenHTS plates (MilliporeSigma, Billerica, MA) were prewet with 30% ethanol and then coated with 100 µl primary anti–IFN-γ Ab solution (10 µg/ml of 1-D1K coating Ab from Mabtech in PBS [pH 7.4], V-E4) overnight at 4°C. After washing, nonspecific binding was blocked with 200 µl of RPMI media with 10% (v/v) FBS for 2 h at room temperature. Following the blockade, 0.5 × 106 cells from patients PBMCs (or from mouse splenocytes) in 100 µl of RPMI were mixed with 10 µg individual peptides (with DMSO for no stimulation or with individual peptide at a final concentration of 10 µg/ml). After incubation in humidified 5% CO2 at 37°C for 72 h (samples from COVID-19 patients) or 5 d (for healthy donor samples to recall their T cell memory), cells were removed by washing (using PBS and PBS–Tween 0.02% solution), and 100 µl of biotinylated secondary anti–IFN-γ Ab (clone 7-B6-1; Mabtech) in blocking buffer (PBS with 0.5% FBS) was added to each well. Following a 2-h incubation and washing, HRP-conjugated streptavidin was diluted 1:1000, and wells were incubated with 100 µl for 1 h at room temperature. Following washing, wells were incubated for 1 h at room temperature with 100 µl of tetramethylbenzidine detection reagent and spots counted with an automated EliSpot Reader System (ImmunoSpot reader; Cellular Technology, Shaker Heights, OH).

The efficacy of our B cell peptide epitopes toward inducing specific Abs was measured in the HLADR1/A*02:01-immunized mice by ELISA. ELISA plates (catalog M5785; Sigma-Aldrich) were first coated overnight at 4°C with 10 μg/ml of each B cell peptide epitope. Subsequently, plates were washed five times with PBS–Tween 0.01% before starting the blocking by adding PBS with 1% BSA for 3 h at room temperature, followed by a second wash. Sera of C57BL/6 mice immunized either with pool B cell peptides alum/CpG or adjuvant alone (control) were added into the wells at varying dilutions (1/5, 1/25, 1/125, and 1/625 or PBS only in triplicates). Plates were incubated at 4°C overnight with the sera, then washed with PBS–Tween 0.01% before to add anti-mouse IgG Ab (1/500 dilution; Mabtech). After the last washing, streptavidin–HRP (1/1000 dilution; Mabtech) was added for 30 min at room temperature. Finally, we added 100 μl of filtered tetramethylbenzidine substrate for 15 min and blocked the reaction with H2S04 before the readout (OD measurement was done at 450 nm on the Bio-Rad iMark microplate reader). The same procedure was followed to measure the titers of Abs specific against our 15 screened B cell epitopes in the sera of COVID-19 patients (n = 40) and healthy donors (n = 10), using anti-human IgG Ab as the secondary Ab (1/500 dilution; Mabtech).

Phylogenetic analyses were conducted in MEGAX. The evolutionary history was performed, and phylogenetic tree was constructed using the maximum likelihood method and Tamura-Nei model. The maximum likelihood method assumes that each locus evolves independently by pure genetic drift. The tree with the highest log likelihood was selected. Initial tree(s) for the heuristic search were obtained by applying neighbor-joining and BioNJ algorithms to a matrix of pairwise genetic distances estimated using the Tamura-Nei model and then selecting the topology with superior log likelihood value. This analysis involved available nucleotide sequences of SARS-CoV-2 from human (Homo sapiens), bat (Rhinolophus affinis and R. malayanus), and pangolin (Manis javanica). In addition, genome sequences from previous outbreaks of SARS-CoV in human, bat, civet, and camel were taken into consideration while performing the evolutionary analyses.

The human-specific SARS-CoV-2 complete genome sequences were retrieved from the GISAID database, whereas the SARS-CoV-2 sequences for pangolin (M. javanica), and bat (R. affinis and R. malayanus) were retrieved from NCBI. Genome sequences of previous strains of SARS-CoV for human, bat, civet, and camel were retrieved from the NCBI GenBank.

Data for each differentially expressed markers among blockade-treated and mock-treated groups of HLA-Tg mice were compared by ANOVA and Student t test using GraphPad Prism version 6 (GraphPad Software, La Jolla, CA). Statistical differences observed in the measured CD8 and CD4 T cells and Ab responses between healthy donors and COVID-19 patients were calculated using ANOVA and multiple t test comparison procedures in GraphPad Prism. Data are expressed as the mean ± SD. Results were considered statistically significant at p ≤ 0.05.

Understanding the animal origins of SARS-CoV-2 is critical for the development of a preemptive pan-coronavirus vaccine to protect from future human outbreaks and deter future zoonosis.

We first screened for the evolutionary relationship among human SARS-CoV-2 and SARS-CoV/MERS-CoV strains from previous outbreaks (i.e., Urbani, MERS-CoV, OC43, NL63, 229E, and HKU1–genotype B) along with 25 SL-CoV genome sequences obtained from different animal species: bats (R. affinis and R. malayanus), civet cats (Paguma larvata), and pangolins (M. javanica) and MERS-CoVs from camels (Camelus dromedarius and C. bactrianus) (Fig. 1). These sequence alignments revealed similarity of the original human-SARS-CoV-2 strain found in Wuhan, China, to four bat SL-CoV strains: hCoV-19-bat-Yunnan-RmYN02, bat-CoV-19-ZXC21, and hCoV-19-bat-Yunnan-RaTG13 obtained from the Yunnan and Zhejiang provinces of China (Fig. 1A). With further genetic distance analysis, we discovered the least evolutionary divergence between SARS-CoV-2 isolate Wuhan-Hu-1 and the above-mentioned three SL-CoV isolates from bats, namely 1) Bat-CoV-RaTG13 (0.1), 2), bat-CoV-19-ZXC21 (0.1), and 3) Bat-CoV-YN02 (0.2) (Fig. 1B, 1C). Moreover, the phylogenetic analysis performed among the whole genome sequences of a total of 81,963 SARS-CoV-2 strains for which sequences have been reported in circulation in 190 countries suggest an evolutionary convergence of bat and pangolin SL-CoVs into the human SARS-CoV-2 strains (Fig. 1D, 1E). Furthermore, through a complete genome tree derived from the 81,963 SARS-CoV-2 genome sequences submitted from Asian, African, North American, South American, European, and Oceanian regions, we confirmed that the least evolutionary divergence for SARS-CoV-2 strains is in SL-CoVs isolated from bats and pangolins (Fig. 1D–F).

Altogether, the phylogenetic analysis and genetic distance suggest that the highly contagious and deadly human-SARS-CoV-2 strain originated from bats, most likely from either the Bat-CoV-19-ZXC21 (MG772934.1) or Bat-CoV-RaTG13 (MN996532.2) strains that spilled over into humans after further mutations and/or recombination. These mutations and/or recombination(s) possibly contributed to the rapid global expansion of the highly contagious and deadly SARS-CoV-2 (51, 52).

We first predicted potential CD8+ T cell epitopes from the entire genome sequence of the first SARS-CoV-2-Wuhan-Hu-1 strain (NCBI GenBank accession number MN908947.3) (5359). For this, we used multiple databases and algorithms, including the SYFPEITHI, MHC-I processing predictions, MHC-I binding predictions, MHC-I immunogenicity, and IEDB (56, 60). We focused on epitopes restricted to the five most frequent HLA class I alleles with large coverage in worldwide human populations, regardless of race and ethnicity (i.e., HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01, and HLA-A*23:01) (6163) (Supplemental Fig. 1A, 1C).

Using the aforementioned criteria, we originally identified a total of 9,660 potential CD8+ T cell epitopes derived from 12 structural proteins (S, membrane glycoprotein, and nucleocapsid phosphoprotein) and ORFs of SARS-CoV-2-Wuhan-Hu-1 strain (MN908947.3) (Supplemental Table I). Subsequently, this large pool of epitopes was narrowed down to 91 epitopes that are highly conserved among 1) over 81,000 SARS-CoV-2 strains (that currently circulate in 190 countries on six continents); 2) the four major common cold coronaviruses that caused previous outbreaks (i.e., hCoV-OC43 [KF923903], hCoV-229E [KY983587], hCoV-HKU1 genotype B [AY884001], and hCoV-NL63 [NC_005831]); and 3) the SL-CoVs that are isolated from bats, civet cats, pangolins, and camels (Fig. 2A). Although the highest degree of similarity (expressed as percentage of resemblance) was identified among 81,963 SARS-CoV-2 strains, 6 strains of previous human SARS-CoVs and 18 animal SL-CoVs strains isolated from bats and pangolins, only a small percentage of similarity was found between the SARS-CoV-2 and MERS-CoV strains (Supplemental Fig. 2). However, a significantly lower degree of similarity was recorded among the SARS-CoV-2 and the SL-CoVs strains isolated from civet cats’ and camels’ CoVs (Supplemental Fig. 2).

FIGURE 2.

Identification of highly conserved potential SARS-CoV-2–derived human CD8+ T cell epitopes that bind with high affinity to HLA-A*02:01 molecules. (A) Ninety-one genome-wide, in silico predicted, and highly conserved SARS-CoV-2–derived CD8+ T cell epitope peptides were synthetized and were tested for their binding affinity in vitro to HLA-A*02:01 molecules expressed on the surface of T2 cells. (B) Out of the 91 CD8+ T cell epitopes, four epitopes were selected as high binders to HLA-A*02:01 molecules, even at the lowest molarity of 3 μM. Further, 20 epitopes with high and three epitopes with moderate binding affinity were found to stabilize the expression of HLA-A*02:01 molecules on the surface of the T2 cells. The levels of HLA-A*02:01 surface expression was determined by MFI, measured by flow cytometry on T2 cells following an overnight incubation of T2 cells at 26°C with decreasing peptide epitopes molarity (30, 15, and 5 μM) as shown in graphs. Percentage of MFI increase was calculated as follows: percentage MFI increase = (MFI with the given peptide − MFI without peptide)/(MFI without peptide) × 100.

FIGURE 2.

Identification of highly conserved potential SARS-CoV-2–derived human CD8+ T cell epitopes that bind with high affinity to HLA-A*02:01 molecules. (A) Ninety-one genome-wide, in silico predicted, and highly conserved SARS-CoV-2–derived CD8+ T cell epitope peptides were synthetized and were tested for their binding affinity in vitro to HLA-A*02:01 molecules expressed on the surface of T2 cells. (B) Out of the 91 CD8+ T cell epitopes, four epitopes were selected as high binders to HLA-A*02:01 molecules, even at the lowest molarity of 3 μM. Further, 20 epitopes with high and three epitopes with moderate binding affinity were found to stabilize the expression of HLA-A*02:01 molecules on the surface of the T2 cells. The levels of HLA-A*02:01 surface expression was determined by MFI, measured by flow cytometry on T2 cells following an overnight incubation of T2 cells at 26°C with decreasing peptide epitopes molarity (30, 15, and 5 μM) as shown in graphs. Percentage of MFI increase was calculated as follows: percentage MFI increase = (MFI with the given peptide − MFI without peptide)/(MFI without peptide) × 100.

Close modal

We further identified 27 SARS-CoV-2 human CD8+ T cell epitopes out of the 91 epitopes that bound with high affinity with HLA-A*02:01 molecules using in vitro peptide–HLA binding assay (Fig. 2A). Four epitopes were found to be very high affinity binders (Fig. 2B). The 27 epitopes with high binding affinity were later confirmed in silico using molecular docking models across five major HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01, and HLA-A*23:01 haplotypes (Supplemental Fig. 3) (64). The highest binding affinity to HLA-A*02:01 molecules with the highest Sinter scores (blue squares) were recorded for ORF1ab6749–6757, S2–10, S958–966, S1220–1228, E26–34, ORF883–91, ORF103–11, and ORF105–13, whereas minimum Sinter score was observed for ORF1ab3732–3740, S691–699, and membrane protein (M)89–97. Other CD8+ T cell epitopes like ORF1ab1675–1683, ORF1ab2363–2371, ORF1ab3013–3021, and ORF7b26–34 were also found with intermediate Sinter scores (Supplemental Fig. 3A, 3B). Although the identified highly conserved CD8+ T cell epitopes were distributed within 8 of the 12 structural and nonstructural ORFs (i.e., ORF1ab, S, E, M, ORF6, ORF7b, ORF8, and ORF10), the highest numbers of epitopes were localized in the replicase polyprotein 1ab/1a (ORF1ab) (nine epitopes) followed by the spike glycoprotein (five epitopes) (Supplemental Figs. 2, 8).

FIGURE 3.

CD8+ T cells specific to highly conserved SARS-CoV-2 epitopes detected in COVID-19 patients and unexposed healthy individuals. (A) Experimental design: PBMCs from HLA-A*02:01 positive COVID-19 patients (n = 30) (B) and controls unexposed healthy individuals (n = 10) (C) were isolated and stimulated overnight with 10 μM of each of the 27 SARS-CoV-2–derived CD8+ T cell epitopes. The number of IFN-γ–producing cells were quantified using ELISpot assay (B, C, and D). Dotted lines represent threshold to evaluate the relative magnitude of the response: mean SFCs between 25 and 50 correspond to a medium/intermediate response, whereas a strong response is defined for mean SFCs >50. PBMCs from HLA-A*02:01–positive COVID-19 patients (E) were further stimulated for an additional 5 h in the presence of mAbs specific to CD107a and CD107b and GolgiPlug and GolgiStop. Tetramers specific to Spike epitopes, CD107a/b and CD69, and TNF-α expression were then measured by FACS. Representative FACS plot showing the frequencies of tetramer+CD8+ T cells, CD107a/b+CD8+ T cells, CD69+CD8+ T cells, and TNF-α+CD8+ T cells following priming with a group of four Spike CD8+ T cell epitope peptides. Average frequencies of tetramer+CD8+ T cells, CD107a/b+CD8+ T cells, CD69+CD8+ T cells, and TNF-α+CD8+ T cells. *p < 0.05.

FIGURE 3.

CD8+ T cells specific to highly conserved SARS-CoV-2 epitopes detected in COVID-19 patients and unexposed healthy individuals. (A) Experimental design: PBMCs from HLA-A*02:01 positive COVID-19 patients (n = 30) (B) and controls unexposed healthy individuals (n = 10) (C) were isolated and stimulated overnight with 10 μM of each of the 27 SARS-CoV-2–derived CD8+ T cell epitopes. The number of IFN-γ–producing cells were quantified using ELISpot assay (B, C, and D). Dotted lines represent threshold to evaluate the relative magnitude of the response: mean SFCs between 25 and 50 correspond to a medium/intermediate response, whereas a strong response is defined for mean SFCs >50. PBMCs from HLA-A*02:01–positive COVID-19 patients (E) were further stimulated for an additional 5 h in the presence of mAbs specific to CD107a and CD107b and GolgiPlug and GolgiStop. Tetramers specific to Spike epitopes, CD107a/b and CD69, and TNF-α expression were then measured by FACS. Representative FACS plot showing the frequencies of tetramer+CD8+ T cells, CD107a/b+CD8+ T cells, CD69+CD8+ T cells, and TNF-α+CD8+ T cells following priming with a group of four Spike CD8+ T cell epitope peptides. Average frequencies of tetramer+CD8+ T cells, CD107a/b+CD8+ T cells, CD69+CD8+ T cells, and TNF-α+CD8+ T cells. *p < 0.05.

Close modal

Altogether, our findings identified 27 highly conserved potential human CD8+ T cell epitopes from the sequence of SARS-CoV-2 that are highly conserved among 81,963 SARS-CoV-2 strains, the four major common cold coronaviruses (i.e., hCoV-OC43, hCoV-229E, hCoV-HKU1 genotype B, and hCoV-NL63), newly found highly transmissible variants (Supplemental Fig. 9), and several SL-CoV strains that are isolated from bats and pangolins. These results suggest that both the structural and the nonstructural proteins are immunodominant Ags that are targeted by human CD8+ T cells from both COVID-19 patients and common cold coronavirus–infected healthy individuals.

We subsequently identified a total of 9,594 potential HLA-DR–restricted CD4+ T cell epitopes from the whole genome sequence of SARS-CoV-2-Wuhan-Hu-1 strain (MN908947.3) using multiple databases and algorithms including the SYFPEITHI, MHC-II binding predictions, TepiTool, and TEPITOPEpan (Supplemental Table II). These potential promiscuous CD4+ T cell epitopes were screened in silico against the five most frequent HLA-DR alleles with large coverage in the human population, regardless of race or ethnicity: HLA-DRB1*01:01, HLA-DRB1*11:01, HLA-DRB1*15:01, HLA-DRB1*03:01, and HLA-DRB1*04:01 (Supplemental Fig. 1B, 1D). The number of potential CD4+ T cell epitopes was later narrowed down to 16 epitopes based on 1) the epitope sequences that are highly conserved among 81,963 SARS-CoV-2 strains, the four major common cold and 25 SL-CoV strains isolated from bats, civet cats, pangolins, and camels (Supplemental Fig. 4); and 2) their high binding affinity to HLA-DR molecules using in silico molecular docking models (Supplemental Fig. 5). The sequences of most of the 16 CD4+ T cell epitopes are 100% conserved and common among 81,963 SARS-CoV-2 strains currently circulating in six continents (Supplemental Fig. 4). A high degree of sequence similarities was also identified in the sequences of most 16 CD4+ T cell epitopes among the SARS-CoV-2 strains and the six strains of previous human SARS-CoVs (e.g., up to 100% sequence identity for epitopes ORF1ab5019–5033, ORF1ab6088–6102, ORF1ab 6420–6434, E20–34, E26–40, and M176–190). Moreover, a high degree of sequence similarities was also identified among the SARS-CoV-2 and the SL-CoV strains isolated from bats and pangolins. In contrast, a lower sequence similarity was identified among CD4+ T cell epitopes from SARS-CoV-2 strains and the SL-CoV strains isolated from civet cats, followed by MERS-like CoV strains isolated from camels (Supplemental Figs. 4, 8).

FIGURE 4.

CD4+ T cells specific to highly conserved SARS-CoV-2 epitopes detected in COVID-19 patients and unexposed healthy individuals. (A) Experimental design: PBMCs from HLA-DRB1–positive COVID-19 patients (n = 30) (B) and controls unexposed healthy individuals (n = 10) (C) were isolated and stimulated for 48 h. with 10 μM of each of the 16 SARS-CoV-2-derived CD4+ T cell epitopes. The number of IFN-γ–producing cells were quantified using ELISpot assay (B, C, and D). Dotted lines represent a threshold to evaluate the relative magnitude of the response: mean SFCs between 25 and 50 correspond to a medium/intermediate response, whereas a strong response is defined for mean SFCs >50. PBMCs from HLA-DRB1–positive COVID-19 patients (E) were further stimulated for an additional 5 h in the presence of mAbs specific to CD107a and CD107b, and GolgiPlug and GolgiStop. Tetramers specific to two Spike epitopes, CD107a/b and CD69, and TNF-α expression were then measured by FACS. Representative FACS plot showing the frequencies of tetramer+CD4+ T cells, CD107a/b+CD4+ T cells, CD69+CD4+ T cells, and TNF-α+CD4+ T cells following priming with a group of two Spike CD4+ T cell epitope peptides. Average frequencies are shown for tetramer+CD4+ T cells, CD107a/b+CD4+ T cells, CD69+CD4+ T cells, and TNF-α+CD4+ T cells. *p < 0.05.

FIGURE 4.

CD4+ T cells specific to highly conserved SARS-CoV-2 epitopes detected in COVID-19 patients and unexposed healthy individuals. (A) Experimental design: PBMCs from HLA-DRB1–positive COVID-19 patients (n = 30) (B) and controls unexposed healthy individuals (n = 10) (C) were isolated and stimulated for 48 h. with 10 μM of each of the 16 SARS-CoV-2-derived CD4+ T cell epitopes. The number of IFN-γ–producing cells were quantified using ELISpot assay (B, C, and D). Dotted lines represent a threshold to evaluate the relative magnitude of the response: mean SFCs between 25 and 50 correspond to a medium/intermediate response, whereas a strong response is defined for mean SFCs >50. PBMCs from HLA-DRB1–positive COVID-19 patients (E) were further stimulated for an additional 5 h in the presence of mAbs specific to CD107a and CD107b, and GolgiPlug and GolgiStop. Tetramers specific to two Spike epitopes, CD107a/b and CD69, and TNF-α expression were then measured by FACS. Representative FACS plot showing the frequencies of tetramer+CD4+ T cells, CD107a/b+CD4+ T cells, CD69+CD4+ T cells, and TNF-α+CD4+ T cells following priming with a group of two Spike CD4+ T cell epitope peptides. Average frequencies are shown for tetramer+CD4+ T cells, CD107a/b+CD4+ T cells, CD69+CD4+ T cells, and TNF-α+CD4+ T cells. *p < 0.05.

Close modal
FIGURE 5.

Immunogenicity of genome-wide identified human SARS-CoV-2 CD8+ T epitopes in HLA-A*02:01/HLA-DRB1 double-transgenic mice. (A) Timeline of immunization and immunological analyses. Eight groups of age-matched HLA-A*02:01–transgenic mice (n = 3) were immunized s.c., on days 0 and 14, with a mixture of four SARS-CoV-2–derived human CD8+ T cell peptide epitopes mixed with pan DR epitope (PADRE) CD4+ Th epitope, delivered in alum and CpG1826 adjuvants. As a negative control, mice received adjuvants alone (mock immunized). (B) Gating strategy used to characterize spleen-derived CD8+ T cells. Lymphocytes were identified by a low forward scatter (FSC) and low side scatter (SSC) gate. Singlets were selected by plotting FSC area (FSC-A) versus FSC height (FSC-H). CD8-positive cells were then gated by the expression of CD8 and CD3 markers. (C) Representative ELISpot images (left panel) and average frequencies (right panel) of IFN-γ–producing cell spots from splenocytes (1 × 106 cells per well) stimulated for 48 h with 10 µM of 10 immunodominant CD8+ T cell peptides and 1 subdominant CD8+ T cell peptide out of the total pool of 27 CD8+ T cell peptides derived from SARS-CoV-2 structural and nonstructural proteins. The number on the top of each ELISpot image represents the number of IFN-γ–producing, SFCs per one million splenocytes. (D) Representative FACS plot (left panel) and average frequencies (right panel) of IFN-γ and TNF-α production by CD107a/b and CD69 expression on 10 immunodominant CD8+ T cell peptides and 1 subdominant CD8+ T cell peptide out of the total pool of 27 CD8+ T cell peptides derived from SARS-CoV-2 structural and nonstructural proteins determined by FACS. Numbers indicate frequencies of IFN-γ+CD8+ T cells, CD107+CD8+ T cells, CD69+CD8+ T cells, and TNF-α+CD8+ T cells detected in three immunized mice.

FIGURE 5.

Immunogenicity of genome-wide identified human SARS-CoV-2 CD8+ T epitopes in HLA-A*02:01/HLA-DRB1 double-transgenic mice. (A) Timeline of immunization and immunological analyses. Eight groups of age-matched HLA-A*02:01–transgenic mice (n = 3) were immunized s.c., on days 0 and 14, with a mixture of four SARS-CoV-2–derived human CD8+ T cell peptide epitopes mixed with pan DR epitope (PADRE) CD4+ Th epitope, delivered in alum and CpG1826 adjuvants. As a negative control, mice received adjuvants alone (mock immunized). (B) Gating strategy used to characterize spleen-derived CD8+ T cells. Lymphocytes were identified by a low forward scatter (FSC) and low side scatter (SSC) gate. Singlets were selected by plotting FSC area (FSC-A) versus FSC height (FSC-H). CD8-positive cells were then gated by the expression of CD8 and CD3 markers. (C) Representative ELISpot images (left panel) and average frequencies (right panel) of IFN-γ–producing cell spots from splenocytes (1 × 106 cells per well) stimulated for 48 h with 10 µM of 10 immunodominant CD8+ T cell peptides and 1 subdominant CD8+ T cell peptide out of the total pool of 27 CD8+ T cell peptides derived from SARS-CoV-2 structural and nonstructural proteins. The number on the top of each ELISpot image represents the number of IFN-γ–producing, SFCs per one million splenocytes. (D) Representative FACS plot (left panel) and average frequencies (right panel) of IFN-γ and TNF-α production by CD107a/b and CD69 expression on 10 immunodominant CD8+ T cell peptides and 1 subdominant CD8+ T cell peptide out of the total pool of 27 CD8+ T cell peptides derived from SARS-CoV-2 structural and nonstructural proteins determined by FACS. Numbers indicate frequencies of IFN-γ+CD8+ T cells, CD107+CD8+ T cells, CD69+CD8+ T cells, and TNF-α+CD8+ T cells detected in three immunized mice.

Close modal

The 16 highly conserved CD4+ T cell epitopes are distributed within 9 out of the 12 structural and nonstructural ORFs (i.e., ORF1ab, S, E, M, ORF6, ORF7a, ORF7b, ORF8, and nucleoprotein [N]). The highest numbers of epitopes were localized in the replicase polyprotein ORF1ab/1a (five epitopes) followed by ORF7a (three epitopes) (Supplemental Figs. 4, 8). Unlike the human CD8+ T cell epitopes, the human CD4+ T cell epitopes are found to be expressed in each of the structural S, E, M, and N proteins. Two epitopes are from the E, one epitope from the M, one epitope from the N protein, and one epitope from the spike protein. The remaining CD4+ T cell epitopes are distributed among the ORF6, ORF7a, ORF7b, and ORF8 proteins (Supplemental Figs. 4, 8).

Altogether, these results identified 16 potential CD4+ T cell epitopes from the whole sequence of SARS-CoV-2 that cross-react and have high sequence similarity among 81,963 SARS-CoV-2 strains, the main four major common cold coronaviruses, and the SL-CoV strains isolated from bats and pangolins. Similar to CD8+ T cell epitopes, the replicase polyprotein ORF1ab appeared to be the most immunodominant Ag, with a high number of conserved epitopes that may possibly be targeted by human CD4+ T cells.

Next, we assessed whether the potential SARS-CoV-2 CD4+ and CD8+ T cell epitopes that are highly conserved between human and animal coronaviruses would recall memory CD8+ T cells from COVID-19 patients as well as from healthy individuals who have never been exposed to SARS-CoV-2 or to COVID-19 patients (i.e., from healthy individuals’ blood samples that were collected from 2014 to 2018; Figs. (3A, 4A). Detailed clinical and demographic characteristics of the COVID-19 patients and the unexposed healthy individuals enrolled in the current study, with respect to age, gender, HLA-A*02:01 and HLA-DRB1 distribution, COVID-19 disease severity, comorbidity, and biochemical parameters, are described in Table I and in Materials and Methods.

Blood-derived PBMCs from COVID-19 patients (black, (Fig. 3B) and healthy individuals (white, (Fig. 3C) were analyzed by ELISpot for frequencies in SARS-CoV-2 epitope–specific, IFN-γ–producing CD8+ T cells. As shown in (Fig. 3B and 3D, significant numbers of SARS-CoV-2 epitope–specific memory CD8+ T cells producing IFN-γ were detected in PBMCs of COVID-19 patients. Out of the 27 highly conserved cross-reactive SARS-CoV-2 CD8+ T cell epitopes (Supplemental Fig. 2) selected for their binding affinity with HLA-A*02:01 molecules (Fig. 2B), strong T cell responses (mean spot-forming T cells [SFCs] > 50 per 0.5 × 106 PBMCs fixed as threshold) were detected in COVID-19 patients against 10 epitopes derived from 1) structural proteins like Spike (i.e., S958–966, S976–984, S1000–1008, and S1220–1228) or the Es (i.e., E26–34); and 2) nonstructural proteins (i.e., ORF1ab1675–1683, ORF1ab2210–2218, ORF1ab6749–6757, ORF63–11, and ORF103–11) (Fig. 3B, 3D). In addition, 12 other SARS-CoV-2 CD8+ T cell epitopes from structural of nonstructural SARS-CoV-2 proteins induced an intermediate response (with a mean SFCs between 25 and 50 per 0.5 × 106 PBMCs) in COVID-19 patients: ORF1ab84–92, ORF1ab3013–3021, ORF1ab3183–3191, ORF1ab3732–3740, ORF1ab4283–4291, ORF1ab6419–6427, S2–10, S691–699, E20–28, M52–60, M89–97, and ORF105–13.

Moreover, among the 27 SARS-CoV-2 epitopes, seven epitopes recalled a strong memory CD8+ T cells response (mean SFCs > 50) from unexposed healthy individuals (i.e., ORF1ab1675–1683, ORF1ab3732–3740, ORF1ab4283–4290, ORF1ab5470–5478, ORF1ab6749–6757, S976–984, S1000–1008, and S1220–1228), and five epitopes recalled a memory CD8+ T cells response that was intermediate (ORF1ab6419–6427, S2–10, E26–34, ORF103–11, and ORF105–13) (Fig. 3C, 3D). However, the unexposed healthy individuals exhibited a different pattern of CD8+ T cell immunodominance as compared with COVID-19 patients. We then compared the epitopes specificity and function of memory CD8+ T cells in HLA-*A02:01–positive COVID-19 patients and healthy individuals using flow cytometry (Fig. 3E). For a better comparison, a similar FACS gating strategy was applied to PBMC-derived T cells from both COVID-19 and healthy donors (data not shown). Our COVID-19 patients appeared to have a higher frequency of CD8+ T cells compared with healthy donors (Fig. 3E). Tetramer staining showed that many of SARS-CoV-2 epitope–specific CD8+ T cells are multifunctional, producing IFN-γ and TNF-α and expressing CD69 and CD107a/b markers of activation and cytotoxicity in COVID-19 patients (Fig. 3E).

Similar to SARS-CoV-2 memory CD8+ T cells, memory CD4+ T cells specific to several highly conserved SARS-CoV-2 epitopes were detected in both COVID-19–recovered patients and unexposed healthy individuals (Fig. 4B–D). Out of the 16 highly conserved cross-reactive SARS-CoV-2 CD4+ T cell epitopes (Supplemental Fig. 4), strong T cell responses (mean SFCs > 50 per 0.5 × 106 PBMCs fixed as a threshold) were detected in COVID-19 patients against two epitopes, one derived from the structural protein M (M176–190) and one from the nonstructural protein ORF1a (ORF1a1350–1365) (Fig. 4B, 4D). Moreover, six additional SARS-CoV-2 CD8+ T cell epitopes from nonstructural SARS-CoV-2 proteins (i.e., ORF1a1801–1815, ORF1a6088–6102, ORF1a6420–6434, ORF612–26, ORF7a3–17, and ORF8b1–15) and two more epitopes from structural proteins (i.e., S1–13 and N388–403) induced an intermediate CD4+ T cell response (mean SFCs between 25 and 50 per 0.5 × 106 PBMCs) in COVID-19 patients (Fig. 4B, 4D).

Besides, among the 16 SARS-CoV-2 epitopes, two epitopes recalled a strong memory CD4+ T cells response (mean SFCs > 50) from unexposed healthy individuals with no history of COVID-19 (i.e., ORF1a1350–1365 and ORF612–26) (Fig. 4C, 4D). Furthermore, five additional epitopes recalled an intermediate CD4+ T cells response in these unexposed healthy individuals (i.e., ORF1a1801–1815, S1–13, M176–190, ORF8b1–15, and N388–403). Unlike for CD8+ T cell responses, the unexposed healthy individuals exhibited a similar pattern of CD4+ T cell immunodominance as compared with COVID-19 patients, with few differences in the magnitude of the responses only. Multifunctional SARS-CoV-2 epitope–specific CD4+ T cells expressing CD69, CD107a/b, and TNF-α were detected using specific tetramers in PBMCs of HLA-DR1–positive COVID-19 patients and healthy individuals (Fig. 4E) with a trend showing higher percentage of these cells in COVID-19 patients, although not significantly higher.

The immunogenicity of the identified SARS-CoV-2 human CD4+ and CD8+ T cell epitopes was assessed in humanized HLA-DR1/HLA-A*02:01 double-transgenic mice (Figs. 5A, 6A). A mixture of peptides incorporating CD4+ T cell or CD8+ T cell epitopes was delivered with CpG and alum, as shown in Figs. (5A and 6A and detailed in the Materials and Methods. As a negative control, mice received adjuvant alone. The induced SARS-CoV-2 epitope–specific CD4+ and CD8+ T cell responses were determined in the spleen using multiple immunological assays, including IFN-γ ELISpot, FACS surface markers of activation, markers of cytotoxic degranulation, and intracellular cytokine staining. The gating strategy used for mice is shown in Figs. (5B and 6B. Two weeks after the second immunization with the mixture of CD8+ T cell peptides, 10 out of 27 highly conserved SARS-CoV-2 human CD8+ T cell epitope peptides were immunogenic in humanized HLA-DR1/HLA-A*02:01 double-transgenic mice (Fig. 5C, 5D). The remaining 17 CD8+ T cell epitopes presented moderate/low immunogenicity levels in HLA-DR1/HLA-A*02:01 double-transgenic mice. The immunogenic epitopes were derived from both structural Spike protein (S2–10, S958–966, S1000–1008, and S1220–1228) and E (E20–28) and from nonstructural proteins (i.e., ORF1ab2363–2371, ORF1ab3732–3740, ORF1ab5470–5478, ORF873–81, and ORF105–13). Moreover, 7 out of 16 SARS-CoV-2 peptides induced significant CD4+ T cell responses in humanized HLA-DR1/HLA-A*02:01 double-transgenic mice (Fig. 6C, 6D). The immunogenic epitopes were derived from both structural Spike protein (S1–13) and M (M176–190) and from nonstructural proteins (ORF1a1350–1365, ORF1a5019–5033, ORF1a6420–6434, ORF612–26, ORF7b8–22, and ORF8b1–15). The remaining 9 CD4+ T cell epitopes presented moderate/low level of immunogenicity in HLA-DR1/HLA-A*02:01 double-transgenic mice.

FIGURE 6.

Immunogenicity of genome-wide identified human SARS-CoV-2 CD4+ T epitopes in HLA-A*02:01/HLA-DRB1 double-transgenic mice. (A) Timeline of immunization and immunological analyses. Four groups of age-matched HLA-DRB1–transgenic mice (n = 3) were immunized s.c. on days 0 and 14 with a mixture of four SARS-CoV-2–derived human CD4+ T cell peptide epitopes delivered in alum and CpG1826 adjuvants. As a negative control, mice received adjuvants alone (mock immunized). (B) Gating strategy used to characterize spleen-derived CD4+ T cells. CD4-positive cells were gated by the CD4 and CD3 expression markers. (C) Representative ELISpot images (left panel) and average frequencies (right panel) of IFN-γ–producing cell spots from splenocytes (1 × 106 cells per well) stimulated for 48 h with 10 µM of seven immunodominant CD4+ T cell peptides and one subdominant CD4+ T cell peptide out of the total pool of 16 CD4+ T cell peptides derived from SARS-CoV-2 structural and nonstructural proteins. The number of IFN-γ–producing SFCs per one million of total cells is presented on the top of each ELISpot image. (D) Representative FACS plot (left panel) and average frequencies (right panel) show IFN-γ and TNF-α production by CD107a/b and CD69 expression on seven immunodominant CD4+ T cell peptides and one subdominant CD4+ T cell peptide out of the total pool of 16 CD4+ T cell peptides derived from SARS-CoV-2 determined by FACS. The numbers indicate percentages of IFN-γ+CD4+ T cells, CD107+CD4+ T cells, CD69+CD4+ T cells, and TNF-α+CD4+ T cells detected in three immunized mice.

FIGURE 6.

Immunogenicity of genome-wide identified human SARS-CoV-2 CD4+ T epitopes in HLA-A*02:01/HLA-DRB1 double-transgenic mice. (A) Timeline of immunization and immunological analyses. Four groups of age-matched HLA-DRB1–transgenic mice (n = 3) were immunized s.c. on days 0 and 14 with a mixture of four SARS-CoV-2–derived human CD4+ T cell peptide epitopes delivered in alum and CpG1826 adjuvants. As a negative control, mice received adjuvants alone (mock immunized). (B) Gating strategy used to characterize spleen-derived CD4+ T cells. CD4-positive cells were gated by the CD4 and CD3 expression markers. (C) Representative ELISpot images (left panel) and average frequencies (right panel) of IFN-γ–producing cell spots from splenocytes (1 × 106 cells per well) stimulated for 48 h with 10 µM of seven immunodominant CD4+ T cell peptides and one subdominant CD4+ T cell peptide out of the total pool of 16 CD4+ T cell peptides derived from SARS-CoV-2 structural and nonstructural proteins. The number of IFN-γ–producing SFCs per one million of total cells is presented on the top of each ELISpot image. (D) Representative FACS plot (left panel) and average frequencies (right panel) show IFN-γ and TNF-α production by CD107a/b and CD69 expression on seven immunodominant CD4+ T cell peptides and one subdominant CD4+ T cell peptide out of the total pool of 16 CD4+ T cell peptides derived from SARS-CoV-2 determined by FACS. The numbers indicate percentages of IFN-γ+CD4+ T cells, CD107+CD4+ T cells, CD69+CD4+ T cells, and TNF-α+CD4+ T cells detected in three immunized mice.

Close modal

Altogether, these results indicate that preexisting memory CD4+ T and CD8+ T cells specific to both structural and nonstructural protein Ags and epitopes are present in COVID-19 patients and unexposed healthy individuals. Although SARS-CoV-2–specific CD4+ and CD8+ T cells in COVID-19 patients and healthy donors target epitopes from the whole virus proteome, most T cell epitopes are concentrated in the nonstructural proteins, with ORF1a/b being the most targeted Ags. These memory T cells recognized highly conserved SARS-CoV-2 epitopes that cross-react with the human and animal coronaviruses. It is likely that infection with a common cold coronavirus and/or human exposition with animal- and pet-related coronaviruses induced long-lasting memory CD4+ and CD8+ T cells specific to the structural and nonstructural SARS-CoV-2 epitopes in healthy unexposed individuals. Heterologous immunity and heterologous immunopathology orchestrated by these cross-reactive, epitope-specific memory CD4+ and CD8+ T cells following previous multiple exposures to common cold coronaviruses may have shaped protection versus susceptibility to SARS-CoV-2 infection and disease with a yet-to-be determined mechanism(s).

We next predicted potential linear B cell (Ab) epitopes on Spike protein sequence of the first SARS-CoV-2-Wuhan-Hu-1 strain (NCBI GenBank accession number MN908947.3) using BepiPred 2.0, with a cutoff of 0.55 (corresponding to a specificity >0.81 and sensitivity <0.30) and considering sequences having more than 5-aa residues (65). This stringent screening process initially resulted in the identification of 28 linear B cell epitopes (Supplemental Table III). From this pool of 28 potential epitopes, we later selected 15 B cell epitopes (19–62 aa in length) based on 1) their sequences being highly conserved between SARS-CoV-2, the main four major common cold coronaviruses (CoV-OC43 [KF923903], CoV-229E [KY983587], CoV-HKU1 [AY884001], and CoV-NL63 [NC_005831]) (66), and the SL-CoVs that are isolated from bats, civet cats, pangolins, and camels; and 2) the probability of exposure each linear epitope to the surface of infected target cells (Supplemental Fig. 6). The Spike epitope sequences highlighted in blue indicate a high degree of homology among the currently circulating 81,963 SARS-CoV-2 strains and at least a 50% conservancy among two or more human SARS-CoV strains from previous outbreaks and the SL-CoV strains isolated from bats, civet cats, pangolins, and camels (Supplemental Fig. 6). Two of the fifteen B cell epitopes, namely S369–393 and S440–501, overlap with the Spike’s RBD regions that bind to the ACE2 receptor (designated as RBD-1 and RBD-2 in Supplemental Fig. 7A). Higher SInter scores were observed for RBD-derived epitopes S369–393 and S471–501 when molecular docking was performed against the ACE2 receptor (Supplemental Fig. 7B). Upon screening for the glycosylation regions, we observed B cell epitopes S13–37, S59–81, S329–363, S601–640, and S1133–1172 with Asparagines predicted to be N-glycosylated. In contrast, B cell epitopes S516–536, S524–598, and S802–819 were observed to be the O-glycosylated. The remaining B cell epitopes S287–317, S304–322, S369–393, S404–501, S440–501, S672–690, and S888–909 were found to possess no glycosylation.

FIGURE 7.

IgG Abs specific to SARS-CoV-2 Spike protein–derived B cell epitopes in immunized B6 mice and in convalescent COVID-19 patients. (A) Timeline of immunization and immunological analyses. A total of 22 SARS-CoV-2–derived B cell epitope peptides selected from SARS-CoV-2 Spike protein and tested in B6 mice were able to induce Ab responses. Four groups of age-matched B6 mice (n = 3) were immunized s.c. on days 0 and 14 with a mixture of four or five SARS-CoV-2–derived B cell peptide epitopes emulsified in alum and CpG1826 adjuvants. Alum/CpG1826 adjuvants alone were used as negative controls (mock immunized). (B and C) The frequencies of IgG-producing CD3(–)CD138(+)B220(+) plasma B cells were determined in the spleen of immunized mice by flow cytometry. (B) The gating strategy was as follows: lymphocytes were identified by a low forward scatter (FSC) and low side scatter (SSC) gate. Singlets were selected by plotting FSC area (FSC-A) versus FSC height (FSC-H). B cells were then gated by the expression of CD3(–) and B220(+) cells and CD138 expression on plasma B cells determined. (C) Representative FACS plot (left panels) and average frequencies (right panel) of plasma B cells detected in spleen of immunized mice. The percentages of plasma CD138(–)B220(+)B cells is indicated on the top left of each dot plot. (D) SARS-CoV-2–derived B cell epitope–specific IgG responses were quantified in immune serum, 14 d after second immunization (i.e., day 28) by ELISpot (number of IgG(+)spots). Representative ELISpot images (left panels) and average frequencies (right panel) of anti-peptide specific IgG-producing B cell spots (1 × 106 splenocytes per well) following 4 d in vitro B cell polyclonal stimulation with mouse Poly-S (ImmunoSpot). The top/left of each ELISpot image shows the number of IgG-producing B cells per half a million cells. ELISA plates were coated with each individual immunizing peptide. The B cell epitope–specific IgG concentrations (micrograms per milliliter) measured by ELISA in (E) levels of IgG detected in peptide-immunized B6 mice after subtraction of the background measured from mock-vaccinated mice. The dashed horizontal line indicates the limit of detection and in (F) level of IgG specific to each of the 22 Spike peptides detected SARS-CoV-2–infected patients (n = 40) after subtraction of the background measured from healthy nonexposed individuals, as shown in (G) (n = 10). Black bars and gray bars show high and medium immunogenic B cell peptides, respectively. The dashed line indicates the limit of detection.

FIGURE 7.

IgG Abs specific to SARS-CoV-2 Spike protein–derived B cell epitopes in immunized B6 mice and in convalescent COVID-19 patients. (A) Timeline of immunization and immunological analyses. A total of 22 SARS-CoV-2–derived B cell epitope peptides selected from SARS-CoV-2 Spike protein and tested in B6 mice were able to induce Ab responses. Four groups of age-matched B6 mice (n = 3) were immunized s.c. on days 0 and 14 with a mixture of four or five SARS-CoV-2–derived B cell peptide epitopes emulsified in alum and CpG1826 adjuvants. Alum/CpG1826 adjuvants alone were used as negative controls (mock immunized). (B and C) The frequencies of IgG-producing CD3(–)CD138(+)B220(+) plasma B cells were determined in the spleen of immunized mice by flow cytometry. (B) The gating strategy was as follows: lymphocytes were identified by a low forward scatter (FSC) and low side scatter (SSC) gate. Singlets were selected by plotting FSC area (FSC-A) versus FSC height (FSC-H). B cells were then gated by the expression of CD3(–) and B220(+) cells and CD138 expression on plasma B cells determined. (C) Representative FACS plot (left panels) and average frequencies (right panel) of plasma B cells detected in spleen of immunized mice. The percentages of plasma CD138(–)B220(+)B cells is indicated on the top left of each dot plot. (D) SARS-CoV-2–derived B cell epitope–specific IgG responses were quantified in immune serum, 14 d after second immunization (i.e., day 28) by ELISpot (number of IgG(+)spots). Representative ELISpot images (left panels) and average frequencies (right panel) of anti-peptide specific IgG-producing B cell spots (1 × 106 splenocytes per well) following 4 d in vitro B cell polyclonal stimulation with mouse Poly-S (ImmunoSpot). The top/left of each ELISpot image shows the number of IgG-producing B cells per half a million cells. ELISA plates were coated with each individual immunizing peptide. The B cell epitope–specific IgG concentrations (micrograms per milliliter) measured by ELISA in (E) levels of IgG detected in peptide-immunized B6 mice after subtraction of the background measured from mock-vaccinated mice. The dashed horizontal line indicates the limit of detection and in (F) level of IgG specific to each of the 22 Spike peptides detected SARS-CoV-2–infected patients (n = 40) after subtraction of the background measured from healthy nonexposed individuals, as shown in (G) (n = 10). Black bars and gray bars show high and medium immunogenic B cell peptides, respectively. The dashed line indicates the limit of detection.

Close modal

We later determined the ability of each of the 15 B cell epitopes selected from the Spike protein, which showed a high conservancy between human and animal coronaviruses to induce SARS-CoV-2 epitope–specific, Ab-producing plasma B cells and IgG Abs in B6 mice (Fig. 7). Synthetic peptides corresponding to each linear B cell epitope were produced. Because four epitopes were too long to synthetize (e.g., 62 aa), they were divided into two or three short fragments, resulting in a total of 22 B cell epitope peptides (Supplemental Table III). As illustrated in (Fig. 7A, groups of five B6 mice each received two s.c. injections with mixtures of three to four B cell epitope peptides mixed with CpG and alum adjuvants. Negative control mice received adjuvant alone without Ags. The frequency of Ab-producing plasma B cells and the level of IgG Abs specific to each SARS-CoV-2 B cell epitope were determined in the spleen and in the serum using FACS staining of CD138 and B220 surface markers and IgG ELISpot and ELISA assays, respectively. The gating strategy used to determine the frequencies of plasma B cells in the spleen is shown in (Fig. 7B. Out of the 22 Spike B cell epitopes, seven epitopes (S13–37, S287–317, S524–558, S544–578, S565–598, S601–628, and S614–640) induced high frequencies of CD138+B220+ plasma B cells in the spleen of B6 mice (Fig. 7C). The IgG ELISpot assay confirmed that 7 out of the 22 Spike B cell epitopes induced significant numbers of IgG-producing B cells in the spleen (Fig. 7D). Moreover, significant amounts of IgG were detected in the serum of the immunized B6 mice. These IgG Abs were specific to 6 out of the 22 Spike B cell peptide epitopes (S13–37, S59–81, S287–317, S565–598, S601–628, and S614–640) (Fig. 7E). As expected, nonimmunized animals or those that received adjuvant alone did not develop detectable IgG responses. Of these six highly immunogenic B cell peptides, five peptides (S13–37, S59–81, S287–317, S601–628, and S614–640) were highly antigenic, as they were recognized by serum IgG from COVID-19 patients, confirming the presence of at least one native linear B cell epitope in each peptide (Fig. F). In summary, we identified five highly conserved immunogenic and antigenic human B cell target epitopes from the Spike SARS-CoV-2 virus that recall IgG Abs from COVID-19 patients (Fig. 8). This study further discovered five highly conserved B cell epitopes from SARS-CoV-2: S13–37, S287–317, S338–363, S614–640, and S1133–1160, which are recognized by IgG Abs from healthy individuals who were never exposed to COVID-19, suggesting B cell epitopes cross-reactivity to other human coronaviruses (Fig. 7G).

FIGURE 8.

Illustrations of the SARS-CoV/SARS-CoV-2 genome-wide location of the highly conserved, antigenic, and immunogenic CD4+ T cell, CD8+ T cell, and B cell epitopes. (A) Enveloped, spherical, ∼120 nm in diameter SARS-CoV/SARS-CoV-2 genome encodes four structural proteins: spike, E, M, and N, highlighted in blue, green, gray, and black, respectively. (B) The SARS-CoV/SARS-CoV-2 genome encodes two large nonstructural genes, ORF1a (green) and ORF1b (gray), encoding 16 nonstructural proteins (NSP1–NSP16). The genome encodes at least six accessory proteins (shades of light gray) that are unique to SARS-CoV/SARS-CoV-2 in terms of number, genomic organization, sequence, and function. The common SARS-CoV, SARS-CoV-2, and SL-CoVs–derived human B (blue), CD4+ (green), and CD8+ (black) T cell epitopes are shown. Structural and nonstructural ORFs used in this study were from SARS-CoV-2-Wuhan-Hu-1 strain (NCBI accession number MN908947.3). The amino acid sequence of the SARS-CoV-2-Wuhan-Hu-1 structural and nonstructural proteins was screened for human B and CD4+ and CD8+ T cell epitopes using different computational algorithms as previously described in Materials and Methods. Shown are genome-wide identified SARS-CoV-2 human B cell epitopes (in blue), CD4+ T cell epitopes (in green), and CD8+ T cell epitopes (in black) that are highly conserved between human and animal coronaviruses.

FIGURE 8.

Illustrations of the SARS-CoV/SARS-CoV-2 genome-wide location of the highly conserved, antigenic, and immunogenic CD4+ T cell, CD8+ T cell, and B cell epitopes. (A) Enveloped, spherical, ∼120 nm in diameter SARS-CoV/SARS-CoV-2 genome encodes four structural proteins: spike, E, M, and N, highlighted in blue, green, gray, and black, respectively. (B) The SARS-CoV/SARS-CoV-2 genome encodes two large nonstructural genes, ORF1a (green) and ORF1b (gray), encoding 16 nonstructural proteins (NSP1–NSP16). The genome encodes at least six accessory proteins (shades of light gray) that are unique to SARS-CoV/SARS-CoV-2 in terms of number, genomic organization, sequence, and function. The common SARS-CoV, SARS-CoV-2, and SL-CoVs–derived human B (blue), CD4+ (green), and CD8+ (black) T cell epitopes are shown. Structural and nonstructural ORFs used in this study were from SARS-CoV-2-Wuhan-Hu-1 strain (NCBI accession number MN908947.3). The amino acid sequence of the SARS-CoV-2-Wuhan-Hu-1 structural and nonstructural proteins was screened for human B and CD4+ and CD8+ T cell epitopes using different computational algorithms as previously described in Materials and Methods. Shown are genome-wide identified SARS-CoV-2 human B cell epitopes (in blue), CD4+ T cell epitopes (in green), and CD8+ T cell epitopes (in black) that are highly conserved between human and animal coronaviruses.

Close modal

Although the current COVID-19 pandemic will likely be overcome through the implementation of physical distancing, barriers together with a mass vaccination, it is indispensable that a safe and effective preemptive vaccine be developed and in place ready to protect against another inevitable COVID pandemic that will emerge in the years to come.

Toward this goal of developing a multiepitope preemptive pan-coronavirus human vaccine, we identified several cross-reactive human B and T cell epitopes of SARS-CoV-2 that are highly conserved among the human SARS-CoVs and animal SL-CoVs. Although antiviral SARS-CoV-2–specific Abs and CD4+ and CD8+ T cell responses appear crucial in protecting asymptomatic patients and convalescent patients, very little information exists with regards to the repertoire of targeted SARS-CoV-2 B and T cell epitopes that are common within a substantial group of human and animal coronaviruses (56, 67, 68). In agreement with our results, 4 out of the 27 CD8+ T cell epitopes reported in this study have been recently reported to be cross-reactive between SARS-CoV and SARS-CoV-2: ORF1ab2363–2371, ORF1ab3013–3021, S958–966, and S1220–1228 (56, 68). Similarly, B cell epitope S287–317 has been reported as cross-reactive between SARS-CoV and SARS-CoV-2 (56, 68). However, to the best of our knowledge, none of the 16 CD4+ T cell epitopes identified in this study have been reported previously. The highly conserved human B and T cell epitopes reported in this study have massive implications for the development of a universal preemptive pan-coronavirus vaccine to induce (or to boost) neutralizing Abs, CD4+ Ths (Th1), and antiviral CD8+ cytotoxic T cells (21, 69, 70). Some of our identified epitopes are similar to those recently reported by Grifoni et al. (56, 68), whereas other epitopes have never been reported. Moreover, in agreement with recent reports (56, 68, 71), our study revealed a high degree of similarity among SARS-CoV-2, SARS-CoV, and bat-SL-CoV epitope sequences, but not the MERS-CoV epitope sequences. In the current study, we have identified B cell epitopes S13–37, S59–81, S329–363, S601–640, and S1133–1172 with N-glycosylated regions and S516–536, S524–598, and S802–819 with O-glycosylated regions. Extensive glycosylation has been observed in CoV’s S, representing the most extensive known class I viral fusion proteins. SARS spike glycoprotein is known to encode 69 N-linked glycan sequons per trimeric spike, with SARS-CoV-2 containing 66 sites (72). These modifications may mask immunogenic B cell epitopes from the host humoral immune system by occluding them with host-derived glycans. However, some studies underscore the importance of glycosylation in the lack of immunogenicity and viral immune evasion. Watanabe et al. (72) have reported that extensive N-linked glycan modifications of SARS and MERS-CoV S do not constitute an effective shield compared with glycan shields of certain other viruses, which is reflected by the overall structure, density, and oligomannose abundance across the corresponding trimeric glycoproteins (66).

Whether the identified epitopes will contribute to protection is beyond the scope of the current study. Unlike most coronavirus subunit vaccines (7375), our multiepitope coronavirus vaccine (e.g., the pan-coronavirus candidate 1 illustrated in Supplemental Fig. 8) incorporates multiple human asymptomatic B and CD4+/CD8+ T cell epitopes that are selected carefully from the whole genome of SARS-CoV-2 for being recognized by Abs and CD4+/CD8+ T cells from asymptomatic and convalescent patients that are naturally protected from COVID-19. The present study employed a combinatorial approach for designing an all-in-one multiepitope pan-coronavirus vaccine candidate (Supplemental Fig. 8) by applying highly conserved genome-wide human B and T cell epitopes from 12 genome-derived antigenic proteins of SARS-CoV-2. The present study focused on HLA-A*02:01–restricted epitopes represented by more than 50% of the human population. However, epitopes restricted to other HLA-A, HLA-B, and HLA-C haplotypes, including the forecasted population coverage of the chosen T cell epitope ensemble (combined HLA class I), are expected to cover 99.8% of the global population, regardless of race and ethnicity. In addition, for a wider vaccine coverage (i.e., close to 99%), our multiepitope pan-coronavirus vaccine platform would be easily adapted to include CD8+ T cell epitopes for other HLA supertypes that are distributed in the various human populations. The polymorphic HLA molecules can be clustered into a handful of HLA-A supertypes that bind largely overlapping peptide repertoires (76). Moreover, such a multiepitope vaccine would be easily adapted to exclude undesirable epitopes that are restricted to HLA-B*44 and HLA-C*01 alleles, which appear to correlate with SARS-CoV-2 virus spreading across certain countries (77), and HLA-B*35 allele, which appear to be associated with severe pneumonia developed by SARS-CoV-2 in young patients (78).

We do not exclude that the highly conserved B cell and CD4+ and CD8+ T cell epitopes identified from bat’s coronavirus variants will mutate following recombination that often occurs for zoonotic events before an animal SL-CoV spills over into humans. In this context, our preemptive multiepitope pan-coronavirus vaccine is highly adaptable to newly mutated coronavirus strains. If a coronavirus epitope mutates, that single epitope can be easily adjusted and replaced in the multiepitope vaccine (79). In lieu of this, we have screened all of our candidate epitopes against SARS-CoV-2 variants, which have been evidenced with increased transmissibility, including B.1.1.7 variant emerging from the U.K. (variant 20I/501Y.V1), B.1.351 variant emerging from South Africa (variant 20H/501Y.V2), B.1.1.28 variant emerging from Brazil (P.1 variant 20J/501Y.V3), and CAL.20C variant observed in California (Supplemental Fig. 9). We found 100% conservancy for 15 out of 16 CD4+ T cell epitopes against B.1.1.7 (U.K.), B.1.351 (South Africa), and B.1.1.28 (Brazil) variants. One mutation in S13I region that was found in the CAL.20C variant is observed in one of our CD4+ T cell epitopes S1–13 (MFVFLVLLPLVSS), whereas the remaining 15 CD4+ T cell epitopes showed 100% conservancy CAL.20C variant. Notably, all our 27 CD8+ T cell epitopes showed conservancy against highly transmissible South African, Brazilian, and Californian (CAL.20C) variants. However, one region specific to the nonsynonymous mutation S982A from the U.K. variant was observed in our CD8+ T cell epitope S976–984 (VLNDILSRL). The remaining 26 CD8+ T cell epitopes showed 100% conservancy with the B.1.1.7 (U.K.) variant. In comparison, two of the screened B cell epitopes (S59–81 and S601–640) belonged to regions specific to South African variant B.1.351 (20H/501Y.V2). More so, two B cell epitopes (S404–426 and S440–501) represent the regions specific to South African variant B.1.351 (20H/501Y.V2), Brazilian variant B.1.1.28 (P.1 variant 20J/501Y.V3), and Californian variant (CAL.20C). Two B cell epitopes, S524–598 and S672–690, represent regions specific to nonsynonymous spike protein–specific mutations A570D and P681H found in the B1.1.7 strain from the U.K. This emphasizes that our preemptive multiepitope pan-coronavirus vaccine strategy could be easily adapted to any variant as well as to any new zoonotic bat SL-CoVs that may spill over into humans in the future. This high adaptability is expected to speed up the implementation of a future preemptive multiepitope vaccine before a local outbreak spreads and transforms into a global pandemic.

It is inevitable that future COVID-like outbreaks, caused by yet another spillover of a bat SL-CoV, could lead to other COVID-like pandemics with global health, social, and economic disasters in the years to come. However, because it is almost impossible to predict which viral strain might cause the next coronavirus pandemic, it is urgent to develop a pan-coronavirus vaccine that targets a wide range of human and animal coronavirus strains. Unlike conventional monovalent vaccines made from epitopes selected from a single virus strain, a preemptive multiepitope pan-coronavirus vaccine (Supplemental Fig. 8) that includes several highly conserved human B and CD4+ and CD8+ T cell epitopes identified from the entire genome sequences of human SARS-CoVs that cross-react and are shared with bat and pangolin SL-CoVs (18, 8083). The current ongoing collaborative research efforts should not only focus on developing a vaccine for COVID-19 but should also be oriented toward developing preemptive pan-coronavirus vaccines. Such a proactive vaccine strategy would help fight and contain future local outbreaks and epicenters of highly contagious and deadly zoonotic coronaviruses globally before becoming the next deadly pandemic worldwide (21, 70). Moreover, because it is impossible to predict the time and location of the next deadly global pandemic, it is essential to have ready, at least preclinically, several pan-coronavirus vaccine candidates that would be quickly implemented in a clinical trial against a substantial group of coronaviruses before an outbreak spreads into a global pandemic(s).

In the current study, CD4+ and CD8+ T cells specific to highly conserved SARS-CoV2 epitopes were detected in healthy adults, recruited between 2014 and 2018, who have never been exposed to the SARS-CoV-2 virus. These findings suggest cross-reactive T cells between current SARS-CoV-2 and previous circulating common cold coronaviruses, as confirmed by recent reports (68, 70, 71). However, because it is unknown whether the healthy adults used in this study were indeed exposed to any common cold coronaviruses, such an assertion may not be conclusive. Among the many circulating common cold coronaviruses known to infect humans, four serotypes that cause severe respiratory infections are highly seasonal: CoV-OC43, CoV-229E, CoV-HKU1, and CoV-NL63 (66), and appear to have a similar transmission potential to influenza A (H3N2). The seasonality of these common cold coronaviruses is predictable, as their outbreaks often emerged in December, peaked in January/February, and began to decrease in March of every year (66).

The human SARS-CoV-2 CD4+ and CD8+ T cell epitopes identified in this study are highly conserved between 81,963 strains of SARS-CoV-2 and CoV-OC43, CoV-229E, CoV-HKU1, and CoV-NL63 (Supplemental Figs. 2, 4). Whether these apparent cross-reactive CD4+ and CD8+ T cells play a protective or a harmful role or an entirely negligible role in SARS-CoV-2 infection and disease remains to be determined (70, 71). Nevertheless, because common cold coronavirus infections are frequent in children, it will be interesting to determine whether children who appeared more resistant to COVID-19 compared with adults will have robust antiviral memory T cell responses to some of the common SARS-CoV-2 epitopes identified in this study. Stronger CD4+ and CD8+ T cell responses to common coronavirus epitopes in children would shed some light on the unique situation currently seen in COVID-19 in which immune children tend to be more resistant to SARS-CoV-2 infection and disease, as compared with more susceptible adults (84, 85). Such a result would also imply that a pan-coronavirus vaccine incorporating these cross-reactive highly conserved SARS-CoV-2 human CD4+ and CD8+ T cell epitopes would boost protective T cell immunity that would have been previously induced by a common cold coronavirus. This would protect not only from seasonal circulating common cold coronaviruses but also from SARS-CoV-2 infection and disease.

Even though the highly conserved coronavirus human CD4+ and CD8+ T cell epitopes identified in this report can be enlightening for a pan-coronavirus vaccine, humans are not immunologically naive, and they often have memory CD4+ and CD8+ T cell populations that can cross-react with and respond to other infectious agents, a phenomenon termed heterologous immunity (86). Therefore, we cannot exclude that some SARS-CoV-2–specific CD4+ and CD8+ T cell epitopes identified in this study are cross-reactive with other viral pathogen-derived epitopes, such as epitopes from circulating seasonal influenza or common cold coronaviruses (87, 88). This may explain, in part, the high proportion of asymptomatic infections with SARS-CoV-2 in the current pandemic. The latter is supported by a recent elegant study that detected SARS-CoV-2–reactive CD8+ and CD4+ T cells in healthy individuals that were never exposed to SARS-CoV-2 (68). SARS-CoV-2–specific, but cross-reactive, CD4+ and CD8+ T cells can become activated and modulate the immune responses and clinical outcome of subsequent heterologous SARS-CoV-2 infections. Therefore, T cell cross-reactivity may be crucial in protective heterologous immunity instead of damaging heterologous immunopathology, as has been reported in other systems (89). To confirm SARS-CoV-2 heterologous CD4+ and CD8+ T cell epitopes that may potentially cross-react with other pathogenic (noncoronaviruses) epitopes, we are currently comparing the CD4+ and CD8+ T cell response to those highly conserved SARS-CoV-2 epitopes identified using humans CD4+ and CD8+ T cell responses to those of pathogen-free SARS-CoV-2–infected transgenic mice.

In conclusion, we report, in this study, to our knowledge, several previously unknown human universal B and CD4+ and CD8+ T cell target epitopes identified from the whole SARS-CoV-2 genome. These epitopes are highly conserved and common between SARS-CoV-2 Wuhan strain and 1): seven circulating common cold human coronaviruses that caused previous human SARS and MERS outbreaks (70); 2) 81,963 strains of human SARS-CoV-2 that now circulate in six continents; 3); several bat-derived SL-CoV strains (13, 14); and 4) several SL-CoV strains isolated from pangolins (90). The findings from this report pave the way for the development of preemptive multiepitope pan-coronavirus vaccine candidates that would target not only the current human COVID-19 outbreak but also possible future coronavirus outbreaks that might come from a bat-derived SL-CoV strain that would spill over again into humans.

The authors thank Dr. Dale Long from the National Institutes of Health Tetramer Facility (Emory University, Atlanta, GA) for providing the tetramers used in this study. We thank University of California Irvine Center for Clinical Research and Institute for Clinical and Translational Science for providing human blood samples used in this study. A special thanks to Dr. Delia F. Tifrea for continuous efforts and dedication in providing COVID-19 samples that are crucial for this clinical research. We also thank Dr. Steven A. Goldstein, Dr. Michael J. Stamos, Dr. Suzanne B. Sandmeyer, Jim Mazzo, Dr. Daniela Bota, Dr. Beverly L. Alger, Dr. Dan Forthal, Dr. Tahseen Muzaffar, Dr. Ilhem Messaoudi, Anju Subba, Janice Briggs, Marge Brannon, Beverley Alberola, Jessica Sheldon, Rosie Magallon, and Andria Pontello for contributing directly or indirectly to this COVID-19 vaccine project.

This work was supported by the Emergent Ventures Fast-Grant PR12501, a Gavin Herbert Eye Institute internal grant, and National Institute of Allergy and Infectious Diseases Public Health Service Research Grants AI158060, AI150091, AI143348, AI147499, AI143326, AI138764, AI124911, and AI110902 (to L.B.).

The online version of this article contains supplemental material.

Abbreviations used in this article

COVID-19

coronavirus disease 2019

E

envelope protein

IEDB

Immune Epitope Database

M

membrane protein

MFI

mean fluorescence intensity

MHC-I

MHC class I

MHC-II

MHC class II

N

nucleoprotein

NCBI

National Center for Biotechnology Information

ORF

open reading frame

RBD

receptor-binding domain

S

spike glycoprotein

SFC

spot-forming T cell

SInter

interaction similarity

SL-CoV

SARS-like coronavirus

Tg

transgenic

1.
Carroll
D.
,
B.
Watson
,
E.
Togami
,
P.
Daszak
,
J. A.
Mazet
,
C. J.
Chrisman
,
E. M.
Rubin
,
N.
Wolfe
,
C. M.
Morel
,
G. F.
Gao
, et al
.
2018
.
Building a global atlas of zoonotic viruses.
Bull. World Health Organ.
96
:
292
294
.
2.
Allen
T.
,
K. A.
Murray
,
C.
Zambrana-Torrelio
,
S. S.
Morse
,
C.
Rondinini
,
M.
Di Marco
,
N.
Breit
,
K. J.
Olival
,
P.
Daszak
.
2017
.
Global hotspots and correlates of emerging zoonotic diseases.
Nat. Commun.
8
:
1124
.
3.
Valitutto
M. T.
,
O.
Aung
,
K. Y. N.
Tun
,
M. E.
Vodzak
,
D.
Zimmerman
,
J. H.
Yu
,
Y. T.
Win
,
M. T.
Maw
,
W. Z.
Thein
,
H. H.
Win
, et al
.
2020
.
Detection of novel coronaviruses in bats in Myanmar.
PLoS One
15
:
e0230802
.
4.
Vijaykrishna
D.
,
G. J.
Smith
,
J. X.
Zhang
,
J. S.
Peiris
,
H.
Chen
,
Y.
Guan
.
2007
.
Evolutionary insights into the ecology of coronaviruses. [Published erratum appears in 2007 J. Virol. 81: 8371.]
J. Virol.
81
:
4012
4020
.
5.
Masters
P. S.
2006
.
The molecular biology of coronaviruses.
Adv. Virus Res.
66
:
193
292
.
6.
Guan
Y.
,
B. J.
Zheng
,
Y. Q.
He
,
X. L.
Liu
,
Z. X.
Zhuang
,
C. L.
Cheung
,
S. W.
Luo
,
P. H.
Li
,
L. J.
Zhang
,
Y. J.
Guan
, et al
.
2003
.
Isolation and characterization of viruses related to the SARS coronavirus from animals in southern China.
Science
302
:
276
278
.
7.
Pyrc
K.
,
M. F.
Jebbink
,
B.
Berkhout
,
L.
van der Hoek
.
2004
.
Genome structure and transcriptional regulation of human coronavirus NL63.
Virol. J.
1
:
7
.
8.
Farsani
S. M.
,
R.
Dijkman
,
M. F.
Jebbink
,
H.
Goossens
,
M.
Ieven
,
M.
Deijs
,
R.
Molenkamp
,
L.
van der Hoek
.
2012
.
The first complete genome sequences of clinical isolates of human coronavirus 229E.
Virus Genes
45
:
433
439
.
9.
St-Jean
J. R.
,
H.
Jacomy
,
M.
Desforges
,
A.
Vabret
,
F.
Freymuth
,
P. J.
Talbot
.
2004
.
Human respiratory coronavirus OC43: genetic stability and neuroinvasion.
J. Virol.
78
:
8824
8834
.
10.
Lu
G.
,
Q.
Wang
,
G. F.
Gao
.
2015
.
Bat-to-human: spike features determining ‘host jump’ of coronaviruses SARS-CoV, MERS-CoV, and beyond.
Trends Microbiol.
23
:
468
478
.
11.
Wurzer
W. J.
,
K.
Obojes
,
R.
Vlasak
.
2002
.
The sialate-4-O-acetylesterases of coronaviruses related to mouse hepatitis virus: a proposal to reorganize group 2 Coronaviridae.
J. Gen. Virol.
83
:
395
402
.
12.
Agnihothram
S.
,
B. L.
Yount
Jr.
,
E. F.
Donaldson
,
J.
Huynh
,
V. D.
Menachery
,
L. E.
Gralinski
,
R. L.
Graham
,
M. M.
Becker
,
S.
Tomar
,
T. D.
Scobey
, et al
.
2014
.
A mouse model for Betacoronavirus subgroup 2c using a bat coronavirus strain HKU5 variant.
MBio
5
:
e00047-14
.
13.
Menachery
V. D.
,
B. L.
Yount
Jr.
,
A. C.
Sims
,
K.
Debbink
,
S. S.
Agnihothram
,
L. E.
Gralinski
,
R. L.
Graham
,
T.
Scobey
,
J. A.
Plante
,
S. R.
Royal
, et al
.
2016
.
SARS-like WIV1-CoV poised for human emergence.
Proc. Natl. Acad. Sci. USA
113
:
3048
3053
.
14.
Menachery
V. D.
,
B. L.
Yount
Jr.
,
K.
Debbink
,
S.
Agnihothram
,
L. E.
Gralinski
,
J. A.
Plante
,
R. L.
Graham
,
T.
Scobey
,
X. Y.
Ge
,
E. F.
Donaldson
, et al
.
2016
.
Corrigendum: a SARS-like cluster of circulating bat coronaviruses shows potential for human emergence.
Nat. Med.
22
:
446
.
15.
Menachery
V. D.
,
B. L.
Yount
Jr.
,
K.
Debbink
,
S.
Agnihothram
,
L. E.
Gralinski
,
J. A.
Plante
,
R. L.
Graham
,
T.
Scobey
,
X. Y.
Ge
,
E. F.
Donaldson
, et al
.
2015
.
A SARS-like cluster of circulating bat coronaviruses shows potential for human emergence. [Published errata appear in 2016 Nat. Med. 22: 446 and 2020 Nat. Med. 26: 1146.]
Nat. Med.
21
:
1508
1513
.
16.
Liu
J.
,
X.
Zheng
,
Q.
Tong
,
W.
Li
,
B.
Wang
,
K.
Sutter
,
M.
Trilling
,
M.
Lu
,
U.
Dittmer
,
D.
Yang
.
2020
.
Overlapping and discrete aspects of the pathology and pathogenesis of the emerging human pathogenic coronaviruses SARS-CoV, MERS-CoV, and 2019-nCoV.
J. Med. Virol.
92
:
491
494
.
17.
Menachery
V. D.
,
R. L.
Graham
,
R. S.
Baric
.
2017
.
Jumping species-a mechanism for coronavirus persistence and survival.
Curr. Opin. Virol.
23
:
1
7
.
18.
Zhou
P.
,
X. L.
Yang
,
X. G.
Wang
,
B.
Hu
,
L.
Zhang
,
W.
Zhang
,
H. R.
Si
,
Y.
Zhu
,
B.
Li
,
C. L.
Huang
, et al
.
2020
.
A pneumonia outbreak associated with a new coronavirus of probable bat origin.
Nature
579
:
270
273
.
19.
Wan
Y.
,
J.
Shang
,
R.
Graham
,
R. S.
Baric
,
F.
Li
.
2020
.
Receptor recognition by the novel coronavirus from Wuhan: an analysis based on decade-long structural studies of SARS coronavirus.
J. Virol.
94
:
e00127-20
.
20.
Layne
S. P.
,
J. M.
Hyman
,
D. M.
Morens
,
J. K.
Taubenberger
.
2020
.
New coronavirus outbreak: framing questions for pandemic prevention.
Sci. Transl. Med.
12
:
eabb1469
.
21.
Lun
Z. R.
,
L. H.
Qu
.
2004
.
Animal-to-human SARS-associated coronavirus transmission?
Emerg. Infect. Dis.
10
:
959
.
22.
Chowell
G.
,
K.
Mizumoto
.
2020
.
The COVID-19 pandemic in the USA: what might we expect?
Lancet
395
:
1093
1094
.
23.
Decru
B.
,
J.
Van Elslande
,
M.
Weemaes
,
E.
Houben
,
I.
Empsen
,
E.
André
,
M.
Van Ranst
,
K.
Lagrou
,
P.
Vermeersch
.
2020
.
Comparison of the diagnostic performance with whole blood and plasma of four rapid antibody tests for SARS-CoV-2.
Clin. Chem. Lab. Med.
58
:
e197
e199
.
24.
An
P.
,
P.
Song
,
Y.
Wang
,
B.
Liu
.
2020
.
Asymptomatic patients with novel coronavirus disease (COVID-19).
Balkan Med. J.
37
:
229
230
.
25.
Oran
D. P.
,
E. J.
Topol
.
2020
.
Prevalence of asymptomatic SARS-CoV-2 infection: a narrative review.
Ann. Intern. Med.
173
:
362
367
.
26.
Breslin
N.
,
C.
Baptiste
,
C.
Gyamfi-Bannerman
,
R.
Miller
,
R.
Martinez
,
K.
Bernstein
,
L.
Ring
,
R.
Landau
,
S.
Purisch
,
A. M.
Friedman
, et al
.
2020
.
Coronavirus disease 2019 infection among asymptomatic and symptomatic pregnant women: two weeks of confirmed presentations to an affiliated pair of New York City hospitals.
Am. J. Obstet. Gynecol. MFM
2
:
100118
.
27.
Zhao
J.
,
J.
Zhao
,
A. K.
Mangalam
,
R.
Channappanavar
,
C.
Fett
,
D. K.
Meyerholz
,
S.
Agnihothram
,
R. S.
Baric
,
C. S.
David
,
S.
Perlman
.
2016
.
Airway memory CD4(+) T cells mediate protective immunity against emerging respiratory coronaviruses.
Immunity
44
:
1379
1391
.
28.
Chen
H.
,
J.
Hou
,
X.
Jiang
,
S.
Ma
,
M.
Meng
,
B.
Wang
,
M.
Zhang
,
M.
Zhang
,
X.
Tang
,
F.
Zhang
, et al
.
2005
.
Response of memory CD8+ T cells to severe acute respiratory syndrome (SARS) coronavirus in recovered SARS patients and healthy individuals.
J. Immunol.
175
:
591
598
.
29.
Rajendran
K.
,
N.
Krishnasamy
,
J.
Rangarajan
,
J.
Rathinam
,
M.
Natarajan
,
A.
Ramachandran
.
2020
.
Convalescent plasma transfusion for the treatment of COVID-19: systematic review.
J. Med. Virol.
92
:
1475
1483
.
30.
Pinto
D.
,
Y. J.
Park
,
M.
Beltramello
,
A. C.
Walls
,
M. A.
Tortorici
,
S.
Bianchi
,
S.
Jaconi
,
K.
Culap
,
F.
Zatta
,
A.
De Marco
, et al
.
2020
.
Cross-neutralization of SARS-CoV-2 by a human monoclonal SARS-CoV antibody.
Nature
583
:
290
295
.
31.
Ni
L.
,
F.
Ye
,
M. L.
Cheng
,
Y.
Feng
,
Y. Q.
Deng
,
H.
Zhao
,
P.
Wei
,
J.
Ge
,
M.
Gou
,
X.
Li
, et al
.
2020
.
Detection of SARS-CoV-2-specific humoral and cellular immunity in COVID-19 convalescent individuals.
Immunity
52
:
971
977.e3
32.
Lee
C. Y.
,
R. T. P.
Lin
,
L.
Renia
,
L. F. P.
Ng
.
2020
.
Serological approaches for COVID-19: epidemiologic perspective on surveillance and control.
Front. Immunol.
11
:
879
.
33.
Azkur
A. K.
,
M.
Akdis
,
D.
Azkur
,
M.
Sokolowska
,
W.
van de Veen
,
M. C.
Bruggen
,
L.
O’Mahony
,
Y.
Gao
,
K.
Nadeau
,
C. A.
Akdis
.
2020
.
Immune response to SARS-CoV-2 and mechanisms of immunopathological changes in COVID-19.
Allergy
75
:
1564
1581
34.
Mehta
P.
,
D. F.
McAuley
,
M.
Brown
,
E.
Sanchez
,
R. S.
Tattersall
,
J. J.
Manson
;
HLH Across Speciality Collaboration, UK
.
2020
.
COVID-19: consider cytokine storm syndromes and immunosuppression.
Lancet
395
:
1033
1034
.
35.
Vaninov
N.
2020
.
In the eye of the COVID-19 cytokine storm.
Nat. Rev. Immunol.
20
:
277
.
36.
Ma
J.
,
P.
Xia
,
Y.
Zhou
,
Z.
Liu
,
X.
Zhou
,
J.
Wang
,
T.
Li
,
X.
Yan
,
L.
Chen
,
S.
Zhang
, et al
.
2020
.
Potential effect of blood purification therapy in reducing cytokine storm as a late complication of critically ill COVID-19.
Clin. Immunol.
214
:
108408
.
37.
Henderson
L. A.
,
S. W.
Canna
,
G. S.
Schulert
,
S.
Volpi
,
P. Y.
Lee
,
K. F.
Kernan
,
R.
Caricchio
,
S.
Mahmud
,
M. M.
Hazen
,
O.
Halyabar
, et al
.
2020
.
On the alert for cytokine storm: immunopathology in COVID-19.
Arthritis Rheumatol.
72
:
1059
1063
38.
Walsh
K. B.
,
J. R.
Teijaro
,
L. G.
Brock
,
D. M.
Fremgen
,
P. L.
Collins
,
H.
Rosen
,
M. B.
Oldstone
.
2014
.
Animal model of respiratory syncytial virus: CD8+ T cells cause a cytokine storm that is chemically tractable by sphingosine-1-phosphate 1 receptor agonist therapy.
J. Virol.
88
:
6281
6293
.
39.
Matheu
M. P.
,
J. R.
Teijaro
,
K. B.
Walsh
,
M. L.
Greenberg
,
D.
Marsolais
,
I.
Parker
,
H.
Rosen
,
M. B.
Oldstone
,
M. D.
Cahalan
.
2013
.
Three phases of CD8 T cell response in the lung following H1N1 influenza infection and sphingosine 1 phosphate agonist therapy.
PLoS One
8
:
e58033
.
40.
Walsh
K. B.
,
J. R.
Teijaro
,
P. R.
Wilker
,
A.
Jatzek
,
D. M.
Fremgen
,
S. C.
Das
,
T.
Watanabe
,
M.
Hatta
,
K.
Shinya
,
M.
Suresh
, et al
.
2011
.
Suppression of cytokine storm with a sphingosine analog provides protection against pathogenic influenza virus.
Proc. Natl. Acad. Sci. USA
108
:
12018
12023
.
41.
Weiskopf
D.
,
K. S.
Schmitz
,
M. P.
Raadsen
,
A.
Grifoni
,
N. M. A.
Okba
,
H.
Endeman
,
J. P. C.
van den Akker
,
R.
Molenkamp
,
M. P. G.
Koopmans
,
E. C. M.
van Gorp
, et al
.
2020
.
Phenotype and kinetics of SARS-CoV-2-specific T cells in COVID-19 patients with acute respiratory distress syndrome.
Sci. Immunol.
5
:
eabd2071
.
42.
Braun
J.
,
L.
Loyal
,
M.
Frentsch
,
D.
Wendisch
,
P.
Georg
,
F.
Kurth
,
S.
Hippenstiel
,
M.
Dingeldey
,
B.
Kruse
,
F.
Fauchere
, et al
.
2020
.
SARS-CoV-2-reactive T cells in healthy donors and patients with COVID-19.
Nature
587
:
270
274
.
43.
Jeyanathan
M.
,
S.
Afkhami
,
F.
Smaill
,
M. S.
Miller
,
B. D.
Lichty
,
Z.
Xing
.
2020
.
Immunological considerations for COVID-19 vaccine strategies.
Nat. Rev. Immunol.
20
:
615
632
.
44.
Steentoft
C.
,
S. Y.
Vakhrushev
,
H. J.
Joshi
,
Y.
Kong
,
M. B.
Vester-Christensen
,
K. T.
Schjoldager
,
K.
Lavrsen
,
S.
Dabelsteen
,
N. B.
Pedersen
,
L.
Marcos-Silva
, et al
.
2013
.
Precision mapping of the human O-GalNAc glycoproteome through SimpleCell technology.
EMBO J.
32
:
1478
1488
.
45.
Bui
H. H.
,
J.
Sidney
,
K.
Dinh
,
S.
Southwood
,
M. J.
Newman
,
A.
Sette
.
2006
.
Predicting population coverage of T-cell epitope-based diagnostics and vaccines.
BMC Bioinformatics
7
:
153
.
46.
Oh
S.
,
M.
Terabe
,
C. D.
Pendleton
,
A.
Bhattacharyya
,
T. K.
Bera
,
M.
Epel
,
Y.
Reiter
,
J.
Phillips
,
W. M.
Linehan
,
C.
Kasten-Sportes
, et al
.
2004
.
Human CTLs to wild-type and enhanced epitopes of a novel prostate and breast tumor-associated protein, TARP, lyse human breast cancer cells.
Cancer Res.
64
:
2610
2618
.
47.
Stuber
G.
,
G. H.
Leder
,
W. T.
Storkus
,
M. T.
Lotze
,
S.
Modrow
,
L.
Székely
,
H.
Wolf
,
E.
Klein
,
K.
Kärre
,
G.
Klein
.
1994
.
Identification of wild-type and mutant p53 peptides binding to HLA-A2 assessed by a peptide loading-deficient cell line assay and a novel major histocompatibility complex class I peptide binding assay.
Eur. J. Immunol.
24
:
765
768
.
48.
Khan
A. A.
,
R.
Srivastava
,
D.
Spencer
,
S.
Garg
,
D.
Fremgen
,
H.
Vahed
,
P. P.
Lopes
,
T. T.
Pham
,
C.
Hewett
,
J.
Kuang
, et al
.
2015
.
Phenotypic and functional characterization of herpes simplex virus glycoprotein B epitope-specific effector and memory CD8+ T cells from symptomatic and asymptomatic individuals with ocular herpes.
J. Virol.
89
:
3776
3792
.
49.
Srivastava
R.
,
X.
Dervillez
,
A. A.
Khan
,
A. A.
Chentoufi
,
S.
Chilukuri
,
N.
Shukr
,
Y.
Fazli
,
N. N.
Ong
,
R. E.
Afifi
,
N.
Osorio
, et al
.
2016
.
The herpes simplex virus latency-associated transcript gene is associated with a broader repertoire of virus-specific exhausted CD8+ T cells retained within the trigeminal ganglia of latently infected HLA transgenic rabbits.
J. Virol.
90
:
3913
3928
.
50.
Khan
A. A.
,
R.
Srivastava
,
H.
Vahed
,
S.
Roy
,
S. S.
Walia
,
G. J.
Kim
,
M. A.
Fouladi
,
T.
Yamada
,
V. T.
Ly
,
C.
Lam
, et al
.
2018
.
Human asymptomatic epitope peptide/CXCL10-based prime/pull vaccine induces herpes simplex virus-specific gamma interferon-positive CD107+ CD8+ T cells that infiltrate the corneas and trigeminal ganglia of humanized HLA transgenic rabbits and protect against ocular herpes challenge.
J. Virol.
92
:
e00535-18
.
51.
Korber
B.
,
W. M.
Fischer
,
S.
Gnanakaran
,
H.
Yoon
,
J.
Theiler
,
W.
Abfalterer
,
N.
Hengartner
,
E. E.
Giorgi
,
T.
Bhattacharya
,
B.
Foley
, et al
.
2020
.
Tracking changes in SARS-CoV-2 spike: evidence that D614G increases infectivity of the COVID-19 virus.
Cell
182
:
812
827.e19
.
52.
Li
X.
,
E. E.
Giorgi
,
M. H.
Marichannegowda
,
B.
Foley
,
C.
Xiao
,
X.-P.
Kong
,
Y.
Chen
,
S.
Gnanakaran
,
B.
Korber
,
F.
Gao
.
2020
.
Emergence of SARS-CoV-2 through recombination and strong purifying selection.
Sci. Adv.
6
:
eabb9153
.
53.
Zheng
M.
,
L.
Song
.
2020
.
Novel antibody epitopes dominate the antigenicity of spike glycoprotein in SARS-CoV-2 compared to SARS-CoV.
Cell. Mol. Immunol.
17
:
536
538
.
54.
Walls
A. C.
,
Y. J.
Park
,
M. A.
Tortorici
,
A.
Wall
,
A. T.
McGuire
,
D.
Veesler
.
2020
.
Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein.
Cell
181
:
281
292.e6
55.
Tilocca
B.
,
A.
Soggiu
,
V.
Musella
,
D.
Britti
,
M.
Sanguinetti
,
A.
Urbani
,
P.
Roncada
.
2020
.
Molecular basis of COVID-19 relationships in different species: a one health perspective.
Microbes Infect.
22
:
218
220
.
56.
Grifoni
A.
,
J.
Sidney
,
Y.
Zhang
,
R. H.
Scheuermann
,
B.
Peters
,
A.
Sette
.
2020
.
A sequence homology and bioinformatic approach can predict candidate targets for immune responses to SARS-CoV-2.
Cell Host Microbe
27
:
671
680.e2
.
57.
Bhattacharya
M.
,
A. R.
Sharma
,
P.
Patra
,
P.
Ghosh
,
G.
Sharma
,
B. C.
Patra
,
S. S.
Lee
,
C.
Chakraborty
.
2020
.
Development of epitope-based peptide vaccine against novel coronavirus 2019 (SARS-COV-2): Immunoinformatics approach.
J. Med. Virol.
92
:
618
631
58.
Baruah
V.
,
S.
Bose
.
2020
.
Immunoinformatics-aided identification of T cell and B cell epitopes in the surface glycoprotein of 2019-nCoV.
J. Med. Virol.
92
:
495
500
.
59.
Ahmed
S. F.
,
A. A.
Quadeer
,
M. R.
McKay
.
2020
.
Preliminary identification of potential vaccine targets for the COVID-19 coronavirus (SARS-CoV-2) based on SARS-CoV immunological studies.
Viruses
12
:
254
.
60.
Vita
R.
,
S.
Mahajan
,
J. A.
Overton
,
S. K.
Dhanda
,
S.
Martini
,
J. R.
Cantrell
,
D. K.
Wheeler
,
A.
Sette
,
B.
Peters
.
2019
.
The immune epitope database (IEDB): 2018 update.
Nucleic Acids Res.
47
:
D339
D343
.
61.
Sette
A.
,
J.
Sidney
.
1999
.
Nine major HLA class I supertypes account for the vast preponderance of HLA-A and -B polymorphism.
Immunogenetics
50
:
201
212
.
62.
Sette
A.
,
J.
Sidney
.
1998
.
HLA supertypes and supermotifs: a functional perspective on HLA polymorphism.
Curr. Opin. Immunol.
10
:
478
482
.
63.
Hertz
T.
,
C.
Yanover
.
2007
.
Identifying HLA supertypes by learning distance functions.
Bioinformatics
23
:
e148
e155
.
64.
Chentoufi
A. A.
,
X.
Zhang
,
K.
Lamberth
,
G.
Dasgupta
,
I.
Bettahi
,
A.
Nguyen
,
M.
Wu
,
X.
Zhu
,
A.
Mohebbi
,
S.
Buus
, et al
.
2008
.
HLA-A*0201-restricted CD8+ cytotoxic T lymphocyte epitopes identified from herpes simplex virus glycoprotein D.
J. Immunol.
180
:
426
437
.
65.
Jespersen
M. C.
,
B.
Peters
,
M.
Nielsen
,
P.
Marcatili
.
2017
.
BepiPred-2.0: improving sequence-based B-cell epitope prediction using conformational epitopes.
Nucleic Acids Res.
45
:
W24
W29
.
66.
Monto
A. S.
,
P. M.
DeJonge
,
A. P.
Callear
,
L. A.
Bazzi
,
S. B.
Capriola
,
R. E.
Malosh
,
E. T.
Martin
,
J. G.
Petrie
.
2020
.
Coronavirus occurrence and transmission over 8 years in the HIVE cohort of households in Michigan.
J. Infect. Dis.
222
:
9
16
.
67.
Mateus
J.
,
A.
Grifoni
,
A.
Tarke
,
J.
Sidney
,
S. I.
Ramirez
,
J. M.
Dan
,
Z. C.
Burger
,
S. A.
Rawlings
,
D. M.
Smith
,
E.
Phillips
, et al
.
2020
.
Selective and cross-reactive SARS-CoV-2 T cell epitopes in unexposed humans.
Science
370
:
89
94
.
68.
Grifoni
A.
,
D.
Weiskopf
,
S. I.
Ramirez
,
J.
Mateus
,
J. M.
Dan
,
C. R.
Moderbacher
,
S. A.
Rawlings
,
A.
Sutherland
,
L.
Premkumar
,
R. S.
Jadi
, et al
.
2020
.
Targets of T cell responses to SARS-CoV-2 coronavirus in humans with COVID-19 disease and unexposed individuals.
Cell
181
:
1489
1501.e15
.
69.
Zost
S. J.
,
P.
Gilchuk
,
J. B.
Case
,
E.
Binshtein
,
R. E.
Chen
,
J. P.
Nkolola
,
A.
Schäfer
,
J. X.
Reidy
,
A.
Trivette
,
R. S.
Nargi
, et al
.
2020
.
Potently neutralizing and protective human antibodies against SARS-CoV-2.
Nature
584
:
443
449
.
70.
Ng
O. W.
,
Y. J.
Tan
.
2017
.
Understanding bat SARS-like coronaviruses for the preparation of future coronavirus outbreaks - implications for coronavirus vaccine development.
Hum. Vaccin. Immunother.
13
:
186
189
.
71.
Wu
A.
,
Y.
Peng
,
B.
Huang
,
X.
Ding
,
X.
Wang
,
P.
Niu
,
J.
Meng
,
Z.
Zhu
,
Z.
Zhang
,
J.
Wang
, et al
.
2020
.
Genome composition and divergence of the novel coronavirus (2019-nCoV) originating in China.
Cell Host Microbe
27
:
325
328
.
72.
Watanabe
Y.
,
J. D.
Allen
,
D.
Wrapp
,
J. S.
McLellan
,
M.
Crispin
.
2020
.
Site-specific glycan analysis of the SARS-CoV-2 spike.
Science
369
:
330
333
.
73.
Martin
J. E.
,
M. K.
Louder
,
L. A.
Holman
,
I. J.
Gordon
,
M. E.
Enama
,
B. D.
Larkin
,
C. A.
Andrews
,
L.
Vogel
,
R. A.
Koup
,
M.
Roederer
, et al
VRC 301 Study Team
.
2008
.
A SARS DNA vaccine induces neutralizing antibody and cellular immune responses in healthy adults in a phase I clinical trial.
Vaccine
26
:
6338
6343
.
74.
He
Y.
,
S.
Jiang
.
2005
.
Vaccine design for severe acute respiratory syndrome coronavirus.
Viral Immunol.
18
:
327
332
.
75.
He
Y.
,
Y.
Zhou
,
P.
Siddiqui
,
S.
Jiang
.
2004
.
Inactivated SARS-CoV vaccine elicits high titers of spike protein-specific antibodies that block receptor binding and virus entry.
Biochem. Biophys. Res. Commun.
325
:
445
452
.
76.
Sidney
J.
,
B.
Peters
,
N.
Frahm
,
C.
Brander
,
A.
Sette
.
2008
.
HLA class I supertypes: a revised and updated classification.
BMC Immunol.
9
:
1
.
77.
Correale
P.
,
L.
Mutti
,
F.
Pentimalli
,
G.
Baglio
,
R. E.
Saladino
,
P.
Sileri
,
A.
Giordano
.
2020
.
HLA-B*44 and C*01 prevalence correlates with Covid19 spreading across Italy.
Int. J. Mol. Sci.
21
:
5205
.
78.
Correale
P.
,
R. E.
Saladino
,
D.
Giannarelli
,
A.
Sergi
,
M. A.
Mazzei
,
G.
Bianco
,
R.
Giannicola
,
E.
Iuliano
,
I. M.
Forte
,
N. D.
Calandruccio
, et al
.
2020
.
HLA expression correlates to the risk of immune checkpoint inhibitor-induced pneumonitis.
Cells
9
:
1964
.
79.
Crooke
S. N.
,
I. G.
Ovsyannikova
,
R. B.
Kennedy
,
G. A.
Poland
.
2020
.
Immunoinformatic identification of B cell and T cell epitopes in the SARS-CoV-2 proteome.
Sci. Rep.
10
:
14179
.
80.
Wang
L. F.
,
D. E.
Anderson
,
J. S.
Mackenzie
,
M. H.
Merson
.
2020
.
From Hendra to Wuhan: what has been learned in responding to emerging zoonotic viruses.
Lancet
395
:
e33
e34
.
81.
Plowright
R. K.
,
A. J.
Peel
,
D. G.
Streicker
,
A. T.
Gilbert
,
H.
McCallum
,
J.
Wood
,
M. L.
Baker
,
O.
Restif
.
2016
.
Transmission or within-host dynamics driving pulses of zoonotic viruses in reservoir-host populations.
PLoS Negl. Trop. Dis.
10
:
e0004796
.
82.
Eng
C. L.
,
J. C.
Tong
,
T. W.
Tan
.
2016
.
Distinct host tropism protein signatures to identify possible zoonotic influenza A viruses.
PLoS One
11
:
e0150173
.
83.
Simons
R. R.
,
P.
Gale
,
V.
Horigan
,
E. L.
Snary
,
A. C.
Breed
.
2014
.
Potential for introduction of bat-borne zoonotic viruses into the EU: a review.
Viruses
6
:
2084
2121
.
84.
Baggio
S.
,
A. G.
L’Huillier
,
S.
Yerly
,
M.
Bellon
,
N.
Wagner
,
M.
Rohr
,
A.
Huttner
,
G.
Blanchard-Rohner
,
N.
Loevy
,
L.
Kaiser
,
F.
Jacquerioz
,
I.
Eckerle
.
2020
.
SARS-CoV-2 viral load in the upper respiratory tract of children and adults with early acute COVID-19.
Clin. Infect. Dis.
85.
Moratto
D.
,
M.
Giacomelli
,
M.
Chiarini
,
L.
Savarè
,
B.
Saccani
,
M.
Motta
,
S.
Timpano
,
P.
Poli
,
S.
Paghera
,
L.
Imberti
, et al
.
2020
.
Immune response in children with COVID-19 is characterized by lower levels of T-cell activation than infected adults.
Eur. J. Immunol.
50
:
1412
1414
.
86.
Nelde
A.
,
T.
Bilich
,
J. S.
Heitmann
,
Y.
Maringer
,
H. R.
Salih
,
M.
Roerden
,
M.
Lubke
,
J.
Bauer
,
J.
Rieth
,
M.
Wacker
, et al
.
2020
.
SARS-CoV-2-derived peptides define heterologous and COVID-19-induced T cell recognition.
Nat. Immunol.
22
:
74
85
87.
Ong
E. Z.
,
Y. F. Z.
Chan
,
W. Y.
Leong
,
N. M. Y.
Lee
,
S.
Kalimuddin
,
S. M.
Haja Mohideen
,
K. S.
Chan
,
A. T.
Tan
,
A.
Bertoletti
,
E. E.
Ooi
,
J. G. H.
Low
.
2020
.
A dynamic immune response shapes COVID-19 progression.
Cell Host Microbe
27
:
879
882.e2
.
88.
Le Bert
N.
,
A. T.
Tan
,
K.
Kunasegaran
,
C. Y. L.
Tham
,
M.
Hafezi
,
A.
Chia
,
M. H. Y.
Chng
,
M.
Lin
,
N.
Tan
,
M.
Linster
, et al
.
2020
.
SARS-CoV-2-specific T cell immunity in cases of COVID-19 and SARS, and uninfected controls.
Nature
584
:
457
462
.
89.
Welsh
R. M.
,
L. K.
Selin
.
2002
.
No one is naive: the significance of heterologous T-cell immunity.
Nat. Rev. Immunol.
2
:
417
426
.
90.
Becker
M. M.
,
R. L.
Graham
,
E. F.
Donaldson
,
B.
Rockx
,
A. C.
Sims
,
T.
Sheahan
,
R. J.
Pickles
,
D.
Corti
,
R. E.
Johnston
,
R. S.
Baric
,
M. R.
Denison
.
2008
.
Synthetic recombinant bat SARS-like coronavirus is infectious in cultured cells and in mice.
Proc. Natl. Acad. Sci. USA
105
:
19944
19949
.

The authors have no financial conflicts of interest. The University of California Irvine has filed a patent application on the results reported in this manuscript.

Supplementary data