Abstract
Proteins of the TNF superfamily (TNFSF) are cytokines involved in diverse immunological and developmental pathways. Little is known about their evolution or expression in lower vertebrate species. Bioinformatic searches of the zebrafish, Tetraodon, and Fugu genomes and of other teleost expressed sequence tag databases identified 44 novel gene sequences containing a TNF homology domain. This work reveals the following: 1) teleosts possess orthologs of BAFF, APRIL, EDA, TWEAK, 4-1BBL, Fas ligand, LIGHT, CD40L, RANKL, and possibly TL1A; 2) the BAFF-APRIL subfamily is enriched by a third member, BALM, unique to fish; 3) orthologs of lymphotoxins α and β were not clearly identified in teleosts and are substituted by a related ligand, TNF-New; 4) as many as four TRAIL-like genes are present in teleosts, as compared with only one in mammals; and 5) T cell activation ligands OX40L, CD27L, CD30L, and GITRL were not identified in any fish species. Finally, we characterize mRNA expression of TNFSF members CD40L, LIGHT, BALM, APRIL, Fas ligand, RANKL, TRAIL-like, and TNF-New in immune and nonimmune tissues of rainbow trout, Oncorhynchus mykiss. In conclusion, we identified a total of 14 distinct TNFSF members in fishes, indicating expansion of this superfamily before the divergence of bony fish and tetrapods, ∼360–450 million years ago. Based on these findings, we extend a model of TNFSF evolution and the coemergence of the vertebrate adaptive immune system.
Ligands within the TNF superfamily (TNFSF) are effector proteins involved in a variety of pathways ranging from inflammation, lymphocyte maturation, and apoptosis to lymphoid and epithelial tissue development (1, 2). These ligands are type II membrane proteins with an intracellular N terminus and an extracellular C terminus. The majority of these ligands are membrane bound, while many members also contain a proteolytic cleavage site to generate soluble forms (3). The C terminus is modestly conserved between TNFSF ligands (20–30%) and is referred to as the TNF homology domain (THD) that is typically composed of 10 β-strands (designated A, A′, B′, B, C, D, E, F, G, and H) (1). Structurally, each TNFSF ligand’s conical trimer is formed by the THD region of three monomers. After trimer formation, the ligands can bind respective receptor(s) to initiate signaling.
Eighteen distinct TNFSF genes have been identified in humans, and nearly all members are physically located on the chromosome adjacent to one or two additional TNFSF genes. It has been proposed that this clustered organization of human TNFSF members on MHC-paralogous chromosomes 1 (FASL, GITRL, and OX40L), 6 (LTB, TNF, and LTA), 9 (CD30L and TL1A), and 19 (LIGHT, CD27L, and 4-1BBL) arose from two ancestral TNFSF members within the proto-MHC, followed by partial cis-duplication, and subsequently by large-scale genomic duplications. This was followed by discrete deletion and/or either cis- or trans-duplications (4). Similarly, the remaining genes on chromosomes X (EDA and CD40L), 3 (TRAIL), 13 (BAFF and RANKL), and 17 (APRIL and TWEAK) may have also evolved by localized or genome-wide duplication with rearrangement by translocation. Collette et al. (4) postulate that the duplication and evolution of the TNFSF and TNFRSF families paralleled the emergence of the adaptive immune system.
Teleosts (ray-finned, bony fish) possess an immune system with B cells and T cells and primary and secondary lymphoid organs, and they are capable of adaptive responses to pathogens. Teleosts, however, display a number of characteristics that differ from the mammalian immune system: the anterior kidney (AK) is the primary hemopoietic organ; they lack germinal centers and lymph nodes; genes of the MHC are dispersed across chromosomes; and they fail to undergo isotype switching. Given that TNFSF members play critical roles in many of these aspects of immune system organization and function, identification of teleost TNFSF orthologs and paralogs is of interest to better understand immune system evolution and the immunological pathways elicited by pathogens.
To date, only a limited number of TNFSF members have been identified in teleosts. A TNF has been cloned and characterized in many fish species and has been found to be similar to mammalian TNF-α (TNFSF2) (5, 6, 7, 8, 9, 10). Other genes that have been characterized in teleosts include TRAIL-like (11), EDA (12), and BAFF (TNFSF13b) genes (G. D. Wiens, S. Gahr, F. Rodriguez, Y. Palti, C. Rexroad, and G. Glenney, manuscript in preparation). A homolog of mammalian LT-α (TNFSF1) has not been described in fish and has been suggested not to exist (13). However, Savan et al. (14) recently described a novel TNF gene (TNF-New) found both in fugu, Takifugu rubripes, and zebrafish, Danio rerio, that is similar to LT-A due to its genomic proximity to TNF-A and similar transcriptional orientation. While this manuscript was in preparation, Kono et al. (15) identified two orthologs of TNF-New in rainbow trout and concluded that the proteins are more similar to LT-β (TNFSF3) than to LT-α due to their absence of a signal sequence and their phylogenetic clustering with other mammalian and Xenopus LT-β proteins.
In an effort to better understand the evolution of the TNF superfamily, we systematically searched teleost expressed sequence tag (EST) and genomic databases for additional orthologs and paralogs. In this study, we have assembled 71 teleost sequences that contain a THD, of which 44 are novel, and we determined their relationship to mammalian TNFSF members by phylogenetic and synteny analyses. We have examined expression of TNFSF members in rainbow trout because it is a commercially important species and a good model species for functional analysis (16). Our analyses identified orthologs of mammalian TNFSF members and also genes that appear to be unique to teleosts. One rainbow trout protein has similarities to both BAFF (TNFSF13b) and APRIL (TNFSF13) and shares sequence identity with a protein identified in the threespine stickleback, Gasterosteus aculeatus (12). This gene will be referred to as BAFF-APRIL-like molecule (BALM). We also identify four TRAIL-like molecules in teleosts. Finally, we use gene synteny, intron/exon organization, predicted secondary structure, and molecular phylogeny to compare all known teleost TNFSF members and extend a model of the evolution of this superfamily.
Materials and Methods
TNFSF member search and identification
Teleost TNF orthologs were identified by searching the EST and genomic databases at www.tigr.org/tdb/tgi/, www.ncbi.nlm.nih.gov/BLAST/, and www.ensembl.org/ (Ensembl version 33, September 2005). Amino acid sequences from known mammalian and teleost TNF family members were used as queries in blastp (protein vs protein) and tblastn (protein vs translated DNA sequence) searches. Putative transmembrane domains (TMDs) and signal peptides were predicted by submitting amino acid sequences to the respective prediction servers: http://sosui.proteome.bio.tuat.ac.jp/sosuiframe0.html, www.cbs.dtu.dk/services/TMHMM/, and www.cbs.dtu.dk/services/SignalP/. For assigning nomenclature to teleost TNFSF members, we followed previously proposed standards (1, 3, 4, 13). When there was evidence for gene synteny in the zebrafish or Tetraodon genomes with the human genome, we designated the gene/protein with the equivalent vertebrate designation. If synteny was not definitive but phylogenetic analysis grouped the sequence with other mammalian genes, the gene was designated with the suffix “like.”
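The naming rule described above can be written as a small decision function. The following is a minimal sketch, not the authors' tooling: the function name, its arguments, and the choice to return `None` when neither criterion is met are illustrative assumptions, since the text does not say how such sequences were labeled.

```python
from typing import Optional


def assign_designation(
    closest_member: str,
    has_synteny: bool,
    groups_in_phylogeny: bool,
) -> Optional[str]:
    """Apply the two-step naming rule described in the text.

    closest_member      -- name of the vertebrate TNFSF member (e.g. "RANKL")
    has_synteny         -- True if gene synteny with the human genome was found
    groups_in_phylogeny -- True if the sequence grouped with mammalian genes
    """
    if has_synteny:
        # Synteny evidence: use the equivalent vertebrate designation.
        return closest_member
    if groups_in_phylogeny:
        # Phylogenetic grouping only: append the "like" suffix.
        return f"{closest_member}-like"
    # Assumption: the text does not cover this case, so no name is assigned.
    return None


if __name__ == "__main__":
    print(assign_designation("RANKL", True, True))
    print(assign_designation("TRAIL", False, True))
    print(assign_designation("TL1A", False, False))
```

Synteny takes precedence in this sketch because the text treats it as the stronger criterion; phylogenetic grouping is consulted only when synteny is not definitive.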
Cloning and sequencing
Full (CD40L, RANKL, and TRAIL-like) and partial (FasL, BALM, and Om TNFSF-N) Oncorhynchus mykiss cDNA sequences were obtained from rainbow trout EST libraries at the National Center for Cool and Coldwater Aquaculture (Kearneysville, WV). To complete the partial sequences, the 5′ ends of FasL and BALM were amplified by RNA ligase-mediated RACE (GeneRacer kit; Invitrogen Life Technologies) using the GeneRacer 5′ Primer with FasL R1 or BALM R1. Nested PCR was then conducted to amplify the desired genes (FasL: GeneRacer 5′ Nested Primer and FasL-like Rext.; BALM: GeneRacer 5′ Nested Primer and BALM R1). All 5′ RACE procedures were conducted on total RNA extracted from splenic tissue of an unstimulated rainbow trout.
To complete the initial cDNA Om TNFSF-N sequence, four sets of primers (Om TNFSF-N F1, R1, F2, R2, F3, and R3) were used to establish a complete open reading frame (ORF) from total RNA extracted from the AK, posterior kidney (PK), and gill of an unstimulated rainbow trout. Amplification was performed in 20-μl samples containing 4.6 μl of PCR grade water (Sigma-Aldrich), 2 μl of 10× PCR buffer, 1.2 μl of 25 mM MgCl2, 2 μl of 2.0 μM dNTP (Sigma-Aldrich), 4 μl of forward and reverse primers (5 μM), and 0.2 μl of Hotstar Taq polymerase (5 U/μl). PCR products were extracted from 1% agarose gels with QIAquick Gel Extraction Kit (250) (Qiagen). Cloning of initial Om TNFSF-N products was conducted in pCR2.1-TOPO vector chemically transformed into Transforming One Shot TOP10 Competent Cells and selected on Luria-Bertani plates containing 50 μg/ml kanamycin. To complete the 3′ untranslated region (UTR), SuperScript III reverse transcriptase was used to generate cDNA (GeneRacer Oligo dT Primer) from the splenic total RNA of a rainbow trout injected intramuscularly with Flavobacterium psychrophilum (strain CSF259-93). This cDNA was initially amplified by PCR (Om TNFSF-N F3, GeneRacer 3′ Primer), followed by a nested PCR (Om TNFSF-N F2, GeneRacer 3′ Nested Primer).
To obtain the initial cDNA of the rainbow trout Om LIGHT gene, primers were designed from a Salmo salar (S. salar LIGHT F1 and R1) LIGHT sequence. Only a partial LIGHT-like sequence was obtained from the total RNA extracted from the kidney and spleen of an unstimulated rainbow trout. To complete the LIGHT sequence, 5′ RACE (GeneRacer 5′ Primer, LIGHT R1) was conducted. To complete the 3′ end, the same SuperScript III reverse transcribed cDNA described above to amplify the Om TNFSF-N 3′ end was used. This cDNA was initially amplified by PCR (S. salar LIGHT F1, GeneRacer 3′ Primer), followed by a nested PCR (S. salar LIGHT F2, GeneRacer 3′ Nested Primer). Two variant sequences were found by these processes, so two forward primers and two reverse primers were designed in variable regions (Om LIGHT F1, R1, F2, and R2). PCR products were cloned and sequenced to establish true variants. Cloning of trout BAFF and APRIL will be described elsewhere (G. D. Wiens, S. Gahr, F. Rodriguez, C. Morrison, Y. Palti, C. Rexroad, and G. Glenney, manuscript in preparation).
All product amplifications, unless stated otherwise, were performed in 50-μl samples containing 32 μl of PCR grade water (Sigma-Aldrich), 5 μl of 10× Pfu PCR buffer, 1 μl of 10 mM dNTP (Sigma-Aldrich), 1 μl of 0.5% gelatin, 4 μl of forward and reverse primers, and 1 μl of PfuULTRA Hotstart DNA Polymerase (2.5 U/μl). PCR products were extracted from 1% agarose gels with the QIAquick Gel Extraction kit (Qiagen).
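A quick way to sanity-check the two reaction recipes given in this section is to sum the listed component volumes. The sketch below does this under one loudly stated assumption: "4 μl of forward and reverse primers" is read as 4 μl of each primer, which the text does not spell out. The dictionary and function names are illustrative only.

```python
# Component volumes in microliters, copied from the recipes in the text.
# Assumption: 4 ul is added for EACH primer (forward and reverse).
HOTSTAR_20UL = {
    "PCR-grade water": 4.6,
    "10x PCR buffer": 2.0,
    "25 mM MgCl2": 1.2,
    "dNTP": 2.0,
    "forward primer": 4.0,
    "reverse primer": 4.0,
    "Hotstar Taq polymerase": 0.2,
}

PFU_50UL = {
    "PCR-grade water": 32.0,
    "10x Pfu PCR buffer": 5.0,
    "10 mM dNTP": 1.0,
    "0.5% gelatin": 1.0,
    "forward primer": 4.0,
    "reverse primer": 4.0,
    "PfuULTRA Hotstart polymerase": 1.0,
}


def unassigned_volume(components: dict, final_volume: float) -> float:
    """Return the volume (ul) left over once all listed components are added."""
    return round(final_volume - sum(components.values()), 1)


if __name__ == "__main__":
    print("20-ul recipe, volume left for template:",
          unassigned_volume(HOTSTAR_20UL, 20.0))
    print("50-ul recipe, volume left for template:",
          unassigned_volume(PFU_50UL, 50.0))
```

Under this reading, both recipes leave 2 μl unassigned, which equals the 2.0 μl of cDNA listed in the semiquantitative recipe under Expression analysis. Whether the same template volume was used in the 50-μl reactions is not stated in the text.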
All cloning, unless stated otherwise, was conducted in pCRII-Blunt-TOPO vector chemically transformed into Transforming One-Shot Competent Cells. EST library-competent cells containing pCMV · SPORT 6 and pT7T3D-PAC vectors were grown overnight in 5.0 ml of Luria-Bertani broth (Difco) containing 100 μg/ml ampicillin. Plasmid DNA was extracted with the QIAprep Spin Miniprep kit (Qiagen) and sequenced using an ABI 3100 Sequencer. Contigs were established, and extension primers were used until two complete sequences from either clonal inserts or PCR products were obtained. PCR primers and extension primers are listed in Table I. All custom primers were designed using primer design software (http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi).
Table I. Primers used for sequencing and expression analysis of rainbow trout TNF family ligand members: BALM, CD40L, RANKL, TRAIL-like, LIGHT, FASL, APRIL, and TNF-New
Name | Nucleotide Sequence |
---|---|
Om_BALM F1 | 5′-CCAACAGGAAGGCTTCTTCA-3′ |
Om_BALM R1 | 5′-CCGTATGATGCCAAAGAAGG-3′ |
Om_BALM F2 | 5′-TGCTCCGCTGTCTACA-3′ |
Om_BALM R2 | 5′-TGCCAAAGAAGGTTGAGTCT-3′ |
Om_CD40L F1 | 5′-TTCCTGTCTCCCCCTTCCT-3′ |
Om_CD40L R1 | 5′-CCCCAGTGCGTTTGACAG-3′ |
Om_CD40L Fext. | 5′-TTCCTGTCTCCCCCTTCCT-3′ |
Om_CD40L Rext. | 5′-AAAAGCTGTCAAACGCACTG-3′ |
Om_RANKL-like F1 | 5′-CCCTAGTTGAGACTTCGGAC-3′ |
Om_RANKL-like R1 | 5′-GTGACTCCTCTACCTGGTCGT-3′ |
Om_RANKL Fext. | 5′-CTGTTGGAGTCTGGGTCTCA-3′ |
Om_RANKL Rext. | 5′-CTGTGAATGCTAGGCTCTGCT-3′ |
Om_TRAIL-like F1 | 5′-GGCAGGTCACACAGCAACTA-3′ |
Om_TRAIL-like R1 | 5′-GCAACTTTGGGACGAGA-3′ |
Om_TRAIL-like Fext. | 5′-CTTTTTCGGGGCTGTCCT-3′ |
Om_TRAIL-like Rext. | 5′-CTCCCTCCCTCGCTGCTT-3′ |
S. salar_LIGHT F1 | 5′-CAACCAGGGATCTGTGGAGT-3′ |
S. salar_LIGHT F2 | 5′-CATCTGACAGCTGGACCTCA-3′ |
S. salar_LIGHT R1 | 5′-TCCTGGACTTAAGGAGTTCAATG-3′ |
Om_LIGHT F1 | 5′-ACCATGAGGAATTTCCGAAG-3′ |
Om_LIGHT R1 | 5′-AACCAATGATAATCAGCCAGCTA-3′ |
Om_LIGHT F2 | 5′-ACCAAGAGGACATTCCGAAG-3′ |
Om_LIGHT R2 | 5′-CCAATGATAATCAGCCAGCTATC-3′ |
Om_FASL F1 | 5′-CGAGAAGGATGTTTCCCAGA-3′ |
Om_FASL R1 | 5′-GTGGAGTCCAAAGAAGTTGG-3′ |
Om_FASL Fext. | 5′-TACTGCAGCTCGGGCTCTAA-3′ |
Om_FASL Rext. | 5′-ATAGACCACGCCTCCCTCA-3′ |
Om_APRIL F | 5′-TGTGACATAATGACCAACCGTTA-3′ |
Om_APRIL R | 5′-TCGTCACTCAGTGTCAAATCC-3′ |
Om_TNF-N F1 | 5′-CGAGCCTCAACTCAAGTGAAA-3′ |
Om_TNF-N R1 | 5′-AGGTTTCTCTCCCCCATACCT-3′ |
Om_TNF-N F2 | 5′-CTCCGCGACAACTCTGTCTAC-3′ |
Om_TNF-N R2 | 5′-CAATAACAATGTCTGTTGGCTCA-3′ |
Om_TNF-N F3 | 5′-CCAGATACCTGCTGCTCCAT-3′ |
Om_TNF-N R3 | 5′-TGCACTCTTTCTCTCCGTGAT-3′ |
Om_TNF-N R4 | 5′-CAATGCATATGCTAAACAGTTACA-3′ |
Sequence analysis
Seventy-one teleost sequences, which possessed distinctive THDs, were identified (Table II), and all sequences are included in Supplementary Data 1. The teleost sequences were aligned with known mammalian, avian, and amphibian TNFSF ligand sequences (Table II and Supplementary Data 2). For the phylogenetic analysis, a total of 124 full-length sequences and 9 partial sequences were used. All partial sequences contained a complete THD (1). The rainbow trout sequences characterized in this article were translated using ExPASy proteomics and sequence analysis tools (http://us.expasy.org/tools/dna.html). All variant forms were included in the analysis unless identical ORFs were observed. Sequences were aligned using Clustal X (BLOSUM matrix). Alignment files were imported into the molecular evolutionary genetics analysis (MEGA) program, version 2.1. A phylogenetic tree was constructed using the Neighbor-Joining method (Poisson correction), with the bootstrap resampling technique to test the reliability of the inferred tree (1000 replications).
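The Poisson correction used for the neighbor-joining distances converts the observed proportion of differing amino acid sites, p, into an estimated number of substitutions per site, d = −ln(1 − p). The sketch below illustrates that formula for one pair of aligned sequences; it is not the MEGA implementation. Skipping any column with a gap in either sequence (pairwise deletion) is an assumption, because the text does not say how gaps were treated, and the function names are hypothetical.

```python
import math


def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing residues between two aligned sequences.

    Assumption: columns with a gap ('-') in either sequence are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(a, b) for a, b in zip(seq_a, seq_b) if a != "-" and b != "-"]
    if not pairs:
        raise ValueError("no comparable (ungapped) sites")
    differences = sum(1 for a, b in pairs if a != b)
    return differences / len(pairs)


def poisson_distance(seq_a: str, seq_b: str) -> float:
    """Poisson-corrected distance d = -ln(1 - p)."""
    p = p_distance(seq_a, seq_b)
    if p >= 1.0:
        raise ValueError("distance is undefined when every site differs")
    return -math.log(1.0 - p)


if __name__ == "__main__":
    # Toy aligned fragments, not real TNFSF sequences.
    a = "ACDEFG-H"
    b = "ACDQFGKH"
    print(f"p = {p_distance(a, b):.4f}")
    print(f"d = {poisson_distance(a, b):.4f}")
```

The correction grows faster than p as sequences diverge, which matters for a family whose members share only 20–30% identity in the THD.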
Known mammalian, chicken, and frog sequences included in the phylogenetic analyses were as follows: Hs_LT-α, EMBL:OTTHUMT00000076237; Mm_LT-α, GenBank:P09225; Mmx_LT-α, GenBank:AAF34868; Bt_LT-α, EMBL:ENSBTAG00000000016; Oc_LT-α, EMBL:ENSOCUG00000006694; Xt_LT-α, EMBL:752:311946:312803; Hs_TNF-α, GenBank:P01375; Mm_TNF-α, GenBank:P06804; Xt_TNF-α, EMBL:752:293085:294756; Hs_LT-β, GenBank:Q06643; Pt_LT-β, EMBL:6:32052431:32054074; Mm_LT-β, GenBank:P41155; Pm_LT-β, GenBank:AAP34710; Mm_LT-β, GenBank:AAF34865; Me_LT-β, GenBank:AAD41774; Xt_LT-β, EMBL:752:236178:237176; Hs_OX-40L, GenBank:P23510; Pt_OX-40L, EMBL:ENSPTRG00000001692; Bt_OX-40L, EMBL:ENSBTAG00000002894; Cf_OX-40L, EMBL:ENSCAFG00000014587; Mm_OX-40L, GenBank:P43488; Hs_CD40-L, GenBank:P29965; Mm_CD40-L, GenBank:P27548; Xt_CD40-L, EMBL:ENSXETG00000017494; Hs_FasL, GenBank:P48023; Mm_FasL, GenBank:P41047; Gg_FasL, EMBL:ENSGALP00000004854; Hs_CD27L, GenBank:P32970; Bt_CD27L, EMBL:ENSBTAG00000009752; Cf_CD27L, EMBL:ENSCAFG00000018630; Mm_CD27L, GenBank:O55237; Mmu_CD27L, EMBL:ENSMMUG00000011042; Hs_CD30L, GenBank:P32971; Pt_CD30L, EMBL:ENSPTRG00000021293; Bt_CD30L, EMBL:ENSBTAG00000025782; Cf_CD30L, EMBL:ENSCAFP00000005002; Dr_4-1BBL, EMBL:GENSCAN00000025997; Fr_4-1BBL, EMBL:NEWSINFRUG00000160956; Ga_4-1BBL, EMBL:GENSCAN00000042857; Hs_4-1BBL, GenBank:P41273; Bt_4-1BBL, EMBL:ENSBTAG00000020500; Mm_4-1BBL, GenBank:P41274; Rn_4-1BBL, GenBank:AA13993; Tn_4-1BBL, EMBL:GSTENG00026327001; Hs_TRAIL, GenBank:P50591; Mm_TRAIL, GenBank:P50592; Gg_TRAIL_v1, GenBank:NM_204379; Gg_TRAIL_v2, GenBank:NP_989922; Hs_RANKL, GenBank:O14788; Mm_RANKL, GenBank:O35235; Xt_RANKL, EMBL:ENSXETG00000025365; Hs_TWEAK, GenBank:O75888; Mm_TWEAK, GenBank:O54907; Hs_APRIL, GenBank:O75888; Rr_APRIL, GenBank:NP_001009623; Mm_APRIL, GenBank:Q9D777; Hs_BAFF, GenBank:Q9WU72; Mm_BAFF, GenBank:Q9WU72; Gg_BAFF, GenBank:NM_204327; Hs_LIGHT, GenBank:O43557; Mm_LIGHT, GenBank:Q9QYH9; Hs_TL1A, EMBL:OTTHUMP00000022739; Rn_TL1A, EMBL:ENSRNOG00000008930; Gg_TL1A, EMBL:ENSF00000000980; Mm_TL1A, ENSMUSG00000050395; Xt_TL1A, ENSXET00000026658; Hs_GITRL, GenBank:Q9UNG2; Bt_GITRL, EMBL:ENSBTAG00000016468; Cf_GITRL, EMBL:ENSCAFG00000014590; Rn_GITRL, EMBL:ENSRNOP00000035225; Mm_GITRL, GenBank:Q7TNY2; Hs_EDA, GenBank:Q92838; Mm_EDA, GenBank:O54693; Gg_EDA, EMBL:ENSGALP00000007125; Xt_EDA, EMBL:117:1427238:1508975. Novel Xenopus sequences included within the phylogenetic analysis were as follows: Xt_BAFF, EMBL:42:1929088:1912327; Xt_FASL, CX366641; Xt_GITRL-like, EMBL:84:2095534:2101116; Xt_OX-40L-like, EMBL:85:2186432:2198707; Xt_TRAIL_v1, DV034135; Xt_TRAIL_v2, Scaff.239:55915:61329, GenBank:BJ618569.
Table II. Identified fish TNFSF members and the highest percent identity and similarity with human TNFSF membersᵃ
Teleost Species | TNFSF Designation | Accession Source and Numberᵇ | Highest Percent Identity (and Similarity) with Human TNFSF |
---|---|---|---|
Fugu rubripes | Fr_TNF-α | EN:SINFRUG00000137029 | 30.9(47.1) TNF-α |
Cyprinus carpio | Cc_TNF-α_v1 | GB:CAC84641 | 27.0(44.2) TNF-α |
Cyprinus carpio | Cc_TNF-α_v2 | GB:CAC84642 | 29.4(42.4) LT-α |
Cyprinus carpio | Cc_TNF-α_v3 | GB:BAC77690.1 | 24.0(42.6) LIGHT |
Danio rerio | Dr_TNF-α_v1 | EN:ENSDARP00000025726 | 28.9(44.2) TNF-α |
Danio rerio | Dr_TNF-α_v2 | EN:ENSDARP00000010728 | 26.4(43.6) TNF-α |
Danio rerio | Dr_TNF-α_v3 | GB:NM_212859 | 29.3(45.4) TNF-α |
Oncorhynchus mykiss | Om_TNF-α_v1 | GB:AJ277604.2 | 29.4(43.5) LT-α |
Oncorhynchus mykiss | Om_TNF-α_v2 | GB:AJ401377.1 | 29.8(43.4) TNF-α |
Salvelinus fontinalis | Sf_TNF-α | GB:AAF86331.1 | 28.0(43.3) TNF-α |
Ictalurus punctatus | Ip_TNF-α | GB:AJ417565.2 | 28.5(46.7) TNF-α |
Paralichthys olivaceus | Po_TNF-α | GB:AB040448.1 | 26.7(41.3) TL1A |
Oreochromis niloticus | On_TNF-α | GB:AY428948 | 26.5(41.1) TNF-α |
Tetraodon nigroviridis | Tn_TNF-α | EN:Un_random:31626073:31627489 | 24.9(39.9) TNF-α |
Oncorhynchus mykiss | Om_CD40-L* | GB:EF160131 | 25.4(40.8) CD40L |
Salmo salar | Ss_CD40-L* | TIGR:TC62196 | 25.2(43.1) CD40L |
Tetraodon nigroviridis | Tn_CD40-L* | EN:1:8832563:8833687 | 18.7(34.7) TL1A |
Gasterosteus aculeatus | Ga_CD40-L* | EN:ENSGACP00000022786 | 25.5(40.0) TNF-α |
Fugu rubripes | Fr_CD40-L* | EN:scaffold_132:150026:157827 | 17.3(26.8) LIGHT |
Danio rerio | Dr_CD40-L* | EN:14:45076326:45077282 | 17.5(30.5) LT-α |
Danio rerio | Dr_FasL | EN:ENSDARP00000002615 | 30.4(47.8) FASL |
Oncorhynchus mykiss | Om_FasL* | TIGR:TC93531 | 17.2(30.7) TRAIL |
Tetraodon nigroviridis | Tn_FasL* | EN:1_random:2832242:2836658 | 16.7(28.5) LIGHT |
Fugu rubripes | Fr_FasL* | EN:25:1939216:1944081 | 16.3(26.6) LIGHT |
Tetraodon nigroviridis | Tn_4-1BBL* | EN:GSTENG00026327001 | 15.0(32.4) CD40L |
Fugu rubripes | Fr_4-1BBL* | EN:NEWSINFRUG00000160956 | 17.8(33.1) TWEAK |
Gasterosteus aculeatus | Ga_4-1BBL* | EN:GENSCAN00000042857 | 20.7(33.6) TNF-α |
Danio rerio | Dr_4-1BBL* | EN:Chr.1:66706975-66707541 | 17.3(26.2) LT-α |
Pimephales promelas | Pp_4-1BBL* | GB:DT146109.1 | 24.2(35.7) LT-α |
Fugu rubripes | Fr_TRAIL-like* | EN:SINFRUG00000130060 | 38.0(54.7) TRAIL |
Tetraodon nigroviridis | Tn_TRAIL-like v1* | EN:GSTENP00012727001 | 36.8(54.7) TRAIL |
Tetraodon nigroviridis | Tn_TRAIL-like v2* | EN:GSTENT00016551001 | 36.6(55.5) TRAIL |
Danio rerio | Dr_TRAIL-like v1 (DL1b)ᶜ | EN:ENSDARP00000038477 | 28.2(47.1) TRAIL |
Danio rerio | Dr_TRAIL-like v2 (DL1a)ᶜ | GB:AAH44336 | 26.3(44.3) TRAIL |
Danio rerio | Dr_TRAIL-like v3 (DL2)ᶜ | GB:AAH76005 | 40.3(60.3) TRAIL |
Danio rerio | Dr_TRAIL-like v4 (DL3)ᶜ | GB:XP_689994 | 22.1(42.9) TRAIL |
Oncorhynchus mykiss | Om_TRAIL-like* | GB:DQ218468 | 37.2(57.7) TRAIL |
Ctenopharyngodon idella | Ci_TRAIL-like | GB:AY_697730 | 40.5(61.1) TRAIL |
Tetraodon nigroviridis | Tn_RANKL* | EN:GSTENP00021782001 | 25.9(39.9) RANKL |
Oncorhynchus mykiss | Om_RANKL* | GB:DQ218471 | 20.7(40.4) TRAIL |
Fugu rubripes | Fr_RANKL* | EN:1703:32157:37016 | 24.6(38.3) RANKL |
Danio rerio | Dr_RANKL* | GB:XM_688158 | 22.2(40.5) TRAIL |
Danio rerio | Dr_TWEAK* | EN:ENSDARG00000036431 | 31.5(44.2) TWEAK |
Tetraodon nigroviridis | Tn_TWEAK* | EN:GSTENG00024352001 | 30.9(46.5) TWEAK |
Fugu rubripes | Fr_TWEAK* | EN:NEWSINFRUG00000160536 | 25.7(39.0) TWEAK |
Salmo salar | Ss_APRIL* | GB:DY736744.1 | 31.6(44.0) APRIL |
Oncorhynchus mykiss | Om_APRIL* | GB:EF451543 | 31.4(44.5) APRIL |
Ictalurus punctatus | Ip_APRIL* | GB:CB940845 | 27.1(41.5) APRIL |
Danio rerio | Dr_APRIL* | EN:7:53725064:53725888 | 19.2(30.1) APRIL |
Oncorhynchus mykiss | Om_BAFF* | GB:DQ218467 | 37.5(51.0) BAFF |
Danio rerio | Dr_BAFF* | GB:XM_684671 | 35.3(49.0) BAFF |
Tetraodon nigroviridis | Tn_BAFF* | EN:SINFRUP00000153079 | 27.4(39.0) BAFF |
Oncorhynchus mykiss | Om_BALM_v2* | GB:DQ218469 | 24.1(42.8) BAFF |
Tetraodon nigroviridis | Tn_BALM* | EN:1:1996703:1997806 | 21.7(32.9) BAFF |
Fugu rubripes | Fr_BALM* | EN:SINFRUG00000144148 | 19.9(28.7) BAFF |
Gasterosteus aculeatus | Ga_BALM | GB:AAY27077 | 27.5(44.1) BAFF |
Salmo salar | Ss_LIGHT* | GB:CB513825 | 20.1(32.7) TNF-α |
Tetraodon nigroviridis | Tn_LIGHT_v1* | EN:GSTENP00026326001 | 19.9(34.1) TRAIL |
Tetraodon nigroviridis | Tn_LIGHT_v2* | GB:CR730934 | 17.8(34.1) TNF-α |
Fugu rubripes | Fr_LIGHT* | EN:194:420,541-422,231 | 24.0(37.1) LIGHT |
Oncorhynchus mykiss | Om_LIGHT_v1* | GB:DQ218470 | 31.8(44.7) LIGHT |
Danio rerio | Dr_LIGHT* | EN:3:58,324,378-58,345,745 | 16.5(28.1) LIGHT |
Danio rerio | Dr_TL1A-like* | EN:ENSDARESTG00000020986 | 20.0(34.0) TL1A |
Tetraodon nigroviridis | Tn_EDA | EN:GSTENP00017090001 | 36.2(44.4) EDA |
Fugu rubripes | Fr_EDA | EN:NEWSINFRUG00000144147 | 25.0(32.7) EDA |
Gasterosteus aculeatus | Ga_EDA | GB:AAY27076 | 49.0(64.4) EDA |
Danio rerio | Dr_EDA | TIGR:TC27579 | 39.2(46.1) EDA |
Oncorhynchus mykiss | Om_TNF-N_v1* | GB:DQ218472 | 22.0(36.0) TNF-α |
Oncorhynchus mykiss | Om_TNF-N_v2* | GB:DQ218473 | 19.9(33.6) CD27L |
Fugu rubripes | Fr_TNF-N | EN:170:255876:257702 | 21.7(36.6) LT-α |
Danio rerio | Dr_TNF-N | EN:15:38337533:38337913 | 20.3(35.2) OX40L |
ᵃ An asterisk (∗) designates novel teleost sequences.
ᵇ EN, Ensembl; GB, GenBank accession numbers. For genes without accession numbers, the Ensembl chromosome number and genomic location are listed.
ᶜ TRAIL nomenclature according to Eimon et al. (46).
To compare amino acid identity and similarity, we used the global sequence alignment program Needle (http://sbcr.bii.a-star.edu.sg/emboss/). Synteny analysis of TNFSF ligand member genes was conducted by comparing gene order and orientation. Schematic diagrams were constructed between fish and human chromosomes (Ensembl version 33, September 2005). Only teleost members with sequenced genomes were used for synteny analyses.
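To make the percent-identity figures in Table II concrete, the sketch below shows how identity can be derived from a global (Needleman-Wunsch) alignment. It is an illustration only and will not reproduce the Needle values: it uses a simple match/mismatch score with a linear gap penalty (assumed values), whereas Needle scores with a substitution matrix and affine gap penalties. Identity is taken as identical columns divided by alignment length, gaps included. All function names are hypothetical.

```python
def global_align(a: str, b: str, match: int = 1, mismatch: int = -1,
                 gap: int = -2) -> tuple:
    """Needleman-Wunsch global alignment with a linear gap penalty."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Identical aligned columns as a percentage of alignment length."""
    aligned_a, aligned_b = global_align(a, b)
    if not aligned_a:
        raise ValueError("cannot align two empty sequences")
    identical = sum(1 for x, y in zip(aligned_a, aligned_b)
                    if x == y and x != "-")
    return 100.0 * identical / len(aligned_a)


if __name__ == "__main__":
    # Toy peptides, not real TNFSF sequences.
    print(global_align("ACDEFG", "ACDFG"))
    print(round(percent_identity("ACDEFG", "ACDFG"), 1))
```

Percent similarity, the second number reported in Table II, would additionally count columns whose residue pair scores positively in the substitution matrix, which this simplified scoring scheme cannot express.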
To predict secondary structure of BALM and TNF-New, we used PredictProtein (http://cubic.bioc.columbia.edu/predictprotein/). For sequence alignment, we incorporated known secondary structure from the following crystal structures: 1ALY of human CD40L (17), 1XU1 of murine APRIL bound to Taci (18); 1JTZ of human TNF-related activation-induced cytokine/RANKL (19); and 1D2Q of human TRAIL (20). For TWEAK, LIGHT, 4-1BBL, and FasL, β-sheet predictions from Bodmer et al. (1) were used.
Expression analysis
Tissues (80.0 mg) were collected from five adult rainbow trout (2.0 kg) and placed into RNAlater (1.0 ml). Blood was drawn into heparinized collection tubes. Peripheral blood leukocytes were isolated by collecting cells at the interface using Histopaque 1077 (500 × g for 40 min). RNA was extracted from 30.0 mg of tissue using an RNeasy Mini Extraction kit (Qiagen), and cDNA was prepared as described previously (21).
Semiquantitative amplification was performed in 20-μl samples containing 2.0 μl of cDNA, 4.6 μl of PCR grade water (Sigma-Aldrich), 2 μl of 10× PCR buffer, 1.2 μl of 25 mM MgCl2, 2 μl of 2.0 μM dNTP (Sigma-Aldrich), 4 μl of forward and reverse primers (5 μM), and 0.2 μl of Hotstar Taq polymerase (5 U/μl). PCR products were extracted from 1 to 1.5% agarose gels with QIAquick Gel Extraction Kit (250) (Qiagen) and sequenced.
Results
Phylogenetic analysis of teleost TNFSF members
A total of 71 teleost proteins containing TNF-homology domains was identified from our sequencing efforts, database searches, and the published literature (Table II). Of these, 44 have not been described previously. Accession numbers for all genes are listed in Table II, and amino acid sequences used in the phylogenetic analysis are included in the Supplementary Data 1. The majority of the teleost sequences formed clades, defined by bootstrap values >70%, with mammalian TNFSF members (Fig. 1). These clades include BAFF (TNFSF 13b), APRIL (TNFSF 13), EDA, TWEAK (TNFSF 12), FasL (TNFSF 6), LIGHT (TNFSF 14), CD40L (TNFSF 5), RANKL (TNFSF 11), TRAIL (TNFSF 10), and TNF-α (TNFSF 2). In some trees, teleost 4-1BBL (TNFSF9) sequences grouped with high bootstrap values with mammalian 4-1BBL sequences, whereas in other trees, they grouped at the base of the 4-1BBL sequences. A single TL1A-like protein sequence was identified in zebrafish, but this protein (Dr_TL1A-like) failed to group closely with mammalian TL1A. Rather, this sequence grouped most closely with the sequence of Xenopus TNF-α. This protein does not appear to be zebrafish TNF-α because there are two other zebrafish genes that have higher sequence similarity and have been previously annotated as TNF-α, and we designated them here as Dr_TNF-α_v1 and Dr_TNF-α_v2. We also identified a third variant sequence in GenBank, Dr_TNF-α_v3, which is a putative splice variant of Dr_TNF-α_v2. These three Dr_TNF-α protein sequences cluster closely with other teleost TNF-α proteins. Interestingly, mammalian TNF-α and LT-α branch more closely to each other than to the teleost TNF-α sequences, suggesting a recent, common evolutionary origin for mammalian TNF-α and LT-α before the fish and amphibian divergence.
An unrooted phylogenetic tree of mammalian, avian, amphibian, and teleost TNF ligand family members (TNFSF). Colored boxes outline TNFSF ligand families found to have fish representatives, whereas dotted boxes demarcate fish-only sequences within each ligand family. Groups of TNFSF members with >70% bootstrap values are considered a clade and putatively share a common ancestor. Note some clades also share high bootstrap values with other clades, suggesting an evolutionary relationship between members such as BAFF-BALM-APRIL-EDA, FasL-LIGHT, RANKL-TRAIL, and TNF-α-LTα. Only confidence probability values >50% are listed. The tree was constructed with full-length amino acid sequences (except partial sequences for T. nigroviridis BALM, FasL, EDA; F. rubripes LIGHT, CD40L, 4-1BBL; D. rerio 4-1BBL; O. mykiss FasL; X. tropicalis LT-α, LT-β, TRAIL-like v1, and TL1A) by the neighbor-joining method in MEGA 2.1 (1000 bootstrap replications-Poisson correction). A F. rubripes EDA sequence was not included within the tree due to unresolved intron/exon borders. O. mykiss APRIL was also not included due to its recent cloning but does not alter the phylogenetic analysis. Rainbow trout, O. mykiss, sequences characterized in the current article are indicated with a “∗.” Abbreviations: Bt, Bos taurus; Cf, Canis familiaris; Ci, Ctenopharyngodon idella; Cc, Cyprinus carpio; Dr, D. rerio; Fr, F. rubripes; Gg, Gallus gallus; Ga, Gasterosteus aculeatus; Hs, Homo sapiens; Ip, Ictalurus punctatus; Mmu, Macaca mulatta; Me, Macropus eugenii; Mmx, Marmota monax; Mm, Mus musculus; Oc, Oryctolagus cuniculus; Om, O. mykiss; On, Oreochromis niloticus; Pt, Pan troglodytes; Po, Paralichthys olivaceus; Pm, Peromyscus maniculatus; Rn, Rattus norvegicus; Ss, S. salar; Sf, Salvelinus fontinalis; Tn, T. nigroviridis; Xt, X. tropicalis.
In many clades, multiple sequences were obtained from the same fish species. These include TRAIL, TNF-α, LIGHT, TNF-New, and BALM (Fig. 1, Table II, and Supplementary Data 1). A total of four TRAIL-like sequences was identified from Zebrafish; however, Dr_TRAIL-like_v4 is more distantly related and grouped only weakly with other TRAIL and RANKL sequences.
There were two groups of teleost protein sequences that were distinct: BALM, which is related to BAFF and APRIL, and TNF-New. The position of the TNF-New clade was unstable in different trees and branched deeply between putative teleost TNF-New orthologs. There were six mammalian TNFSF members that consistently did not group closely with teleost sequences: CD27L (TNFSF 7), LT-β (TNFSF 3), TL1A (TNFSF 15), OX40L (TNFSF 4), GITRL (TNFSF 18), and CD30L (TNFSF 8). Similar trees were obtained using maximum parsimony analyses and by systematically adding or removing sequences from the phylogenetic analyses.
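The neighbor-joining trees described here rest on Poisson-corrected amino acid distances. As a hedged illustration of that correction only (not a reimplementation of MEGA), the sketch below computes the proportion of differing sites p between two aligned sequences and the standard Poisson distance d = -ln(1 - p). The function names are hypothetical, the toy sequences are invented rather than taken from the data set, and skipping gapped sites is an assumption about gap handling.

```python
import math


def p_distance(seq_a, seq_b, gap="-"):
    """Proportion of aligned, ungapped sites at which two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = differing = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # assumption: sites with a gap in either sequence are skipped
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no ungapped sites to compare")
    return differing / compared


def poisson_distance(seq_a, seq_b):
    """Poisson-corrected distance d = -ln(1 - p)."""
    p = p_distance(seq_a, seq_b)
    if p >= 1.0:
        raise ValueError("distance is undefined when every site differs")
    return -math.log(1.0 - p)


# Invented toy alignment (not real TNFSF sequence).
toy_a = "MKVLAAGIC-"
toy_b = "MKILSAGVCQ"

print(f"p = {p_distance(toy_a, toy_b):.3f}, d = {poisson_distance(toy_a, toy_b):.3f}")
```

The correction matters most for divergent pairs: as p grows, d increases faster than p, reflecting multiple substitutions at the same site.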
Amino acid similarity, gene synteny, and expression of teleost TNFSF members
To confirm the phylogenetic analysis and further extend the characterization of these molecules, we determined amino acid identity/similarity, gene synteny, intron/exon conservation, secondary structure prediction, and finally, mRNA expression of select teleost TNFSF members in rainbow trout. Most of the fish TNFSF ligands shared highest percent similarity with other members of their respective clade determined by phylogenetic analysis (Table II and Supplementary Data 2). Local gene synteny, defined as two or more common flanking genes, was identified between teleost and human TNFSF members, including BAFF, APRIL, EDA, TWEAK, 4-1BBL, FasL, LIGHT, CD40L, and RANKL. For teleost TRAIL-like and TL1A-like members, the phylogenetic or sequence identity analyses suggested TNFSF family member grouping, but there was less convincing syntenic support, and thus, we use the “like” suffix for these genes. Below, we discuss sequence characteristics of each of the clades and mRNA expression in rainbow trout.
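The working definition of local synteny used here (two or more common flanking genes) can be expressed as a simple set comparison. The sketch below is illustrative only: the function names are hypothetical, the gene labels are placeholders rather than real annotations, and it assumes that flanking genes have already been assigned comparable names across species.

```python
def shared_flanking_genes(flank_a, flank_b):
    """Return the flanking genes common to two loci, sorted by name."""
    return sorted(set(flank_a) & set(flank_b))


def has_local_synteny(flank_a, flank_b, minimum_shared=2):
    """Apply the criterion of at least `minimum_shared` common flanking genes."""
    return len(shared_flanking_genes(flank_a, flank_b)) >= minimum_shared


# Placeholder gene labels (hypothetical, for illustration only).
teleost_flank = ["geneA", "geneB", "geneC", "geneD"]
human_flank = ["geneB", "geneD", "geneE"]
unrelated_flank = ["geneD", "geneX", "geneY"]

print(shared_flanking_genes(teleost_flank, human_flank))
print(has_local_synteny(teleost_flank, human_flank))
print(has_local_synteny(teleost_flank, unrelated_flank))
```

A stricter variant could also require conserved gene order or transcriptional orientation, which the text considers for several loci.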
BALM-variant1 and -variant2, APRIL (TNFSF 13), BAFF (TNFSF 13B), and EDA
TNFSF clades containing mammalian and teleost BAFF and APRIL also grouped with a distinctly teleost subgroup containing sequences from trout, stickleback, and pufferfish (Fig. 1). Initially, a 1203-nt rainbow trout cDNA (tcba 0010c.a.20; Table III) encoding a putative protein with a THD but without a TM region was identified in the rainbow trout EST database (Fig. 2,A). This sequence had high sequence similarity values with BAFF but had a short D-E loop characteristic of APRIL and was thus designated BALM. While performing 5′ RACE, we obtained a second sequence (BALM-v2) that was 98% identical but contained a start methionine and a predicted TM region. Interestingly, the BALM-v2 protein also has a predicted signal sequence cleavage site between aa 17 and 18 that would interrupt the predicted TMD, similar to mammalian LT-α, which is predominantly secreted (Fig. 2,A). We failed to identify a variant-1 sequence with a predicted TM region or a secretory signal, suggesting it is a pseudogene. Rainbow trout BALM-v2 contains a potential polybasic region 92 aa downstream of the start methionine; in mammalian BAFF, APRIL, EDA, and TWEAK, such a region has been shown to include a furin cleavage site that generates a soluble form. Another similarity among mammalian BAFF, APRIL, and EDA and teleost BALM is the presence of cysteines in the E and F β-strands (Fig. 2,A). Rainbow trout BALM shows highest sequence identity (48%) with a stickleback TNFSF initially annotated as TNFSF 13b but which we have redesignated here as Ga_BALM based on phylogenetic (Fig. 1) and synteny (Fig. 2 B) analyses. We have identified the BAFF (TNFSF 13b) locus in rainbow trout, Tetraodon, Fugu, and zebrafish and have determined that it has nine flanking genes with synteny to the human BAFF locus. Characterization of this locus and functional analysis will be published elsewhere (G. D. Wiens, et al. manuscript in preparation).
Sequence alignments of teleost and mammalian BAFF and EDA orthologs are contained in Supplementary Data 2.
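Putative furin recognition sites of the Arg-X-Lys-Arg form (for example, RNKR) can be located in a peptide with a simple pattern search. The following sketch is a minimal illustration under stated assumptions: the motif is restricted to the strict R-X-K-R pattern, the function name is hypothetical, and the toy peptide is invented rather than taken from any sequence in this study. A match indicates only a candidate site, not demonstrated cleavage.

```python
import re

# Lookahead so that overlapping candidate motifs would also be reported.
FURIN_MOTIF = re.compile(r"(?=(R[A-Z]KR))")


def find_furin_sites(peptide):
    """Return (1-based start position, motif) for each Arg-X-Lys-Arg match."""
    return [(m.start() + 1, m.group(1)) for m in FURIN_MOTIF.finditer(peptide.upper())]


# Invented toy peptide containing two candidate sites.
toy_peptide = "MSTLLAGRNKRSVDPQRAKRGG"
sites = find_furin_sites(toy_peptide)

for position, motif in sites:
    print(f"candidate site {motif} starting at residue {position}")
```

Positions are reported relative to the first residue, which makes them directly comparable to statements such as "92 aa downstream of the start methionine".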
Summary of characteristics of rainbow trout TNFSF genes and peptides
Gene | Clone | Total Length (bp) | 5′ UTR (bp) | ORF (bp) | 3′ UTR (bp) | RNA Instability Motif Number and Base Pair from Stop | Predicted Peptide (aa) | Predicted Peptide Mass (Da) |
---|---|---|---|---|---|---|---|---|
BALM v1 | tcba 0010c.a.20 | 1203 | ND | Pseudogene (?) | 60 | None | ND | ND |
BALM v2 | PCR product | 789 | 48 | 741 | N/A | N/A | 246 | 27,352 |
CD40L | 1RT9M08_C_G04 | 1237 | 94 | 795 | 348 | 1 (280 bp) | 264 | 29,707 |
CD40L | tcbk0052c.k.15 | 1194 | 48 | 795 | 351 | 1 (280 bp) | 264 | 29,707 |
CD40L | tcbk0066c.d.15 | 1205 | 47 | 795 | 363 | 1 (280 bp) | 264 | 29,707 |
FasL | tcbi0027c.i.17 | 927 | ND | Pseudogene (?) | 226 | 2 (85 bp, 187 bp) | ND | ND |
FasL | tcba0001c.n.08 | 930 | ND | Pseudogene (?) | 211 | 2 (85 bp, 187 bp) | ND | ND |
LIGHT v1 | PCR product | 1360 | 382 | 717 | 261 | 1 (81 bp) | 238 | 26,408 |
LIGHT v2 | PCR product | 1155 | 382 | 717 | 56 (partial) | ND | 238 | 26,465 |
LIGHT v3 | PCR product | 727 | ND | 479 (partial) | 245 | 1 (80 bp) | ND | ND |
TNF-N v1 | PCR product | 962 | 121 | 612 | 229 | 1 (99 bp) | 203 | 23,060 |
TNF-N v2 | PCR product | 789 | ND | partial | 198 | 1 (97 bp) | ND | ND |
RANKL-like | tcba0001c.i.04 | 1199 | 92 | 780 | 327 | 1 (212 bp) | 259 | 28,746 |
TRAIL-like | tcay0005b.j.21 | 1396 | 26 | 876 | 494 | None | 291 | 32,189 |
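The lengths in this table are related by simple arithmetic: the 5′ UTR, ORF, and 3′ UTR sum to the total length, and the ORF length divided by three, minus one for the stop codon, gives the predicted peptide length. The sketch below checks those relationships for the rows with complete entries. It is an illustrative consistency check, not the authors' procedure; the function names are hypothetical, and it assumes the ORF length includes the stop codon.

```python
def peptide_length(orf_bp):
    """Residues encoded by a complete ORF whose length includes the stop codon."""
    if orf_bp % 3 != 0:
        raise ValueError("a complete ORF must be a multiple of 3 bp")
    return orf_bp // 3 - 1


def transcript_length(utr5_bp, orf_bp, utr3_bp):
    """Total cDNA length as the sum of its three parts."""
    return utr5_bp + orf_bp + utr3_bp


# (5' UTR, ORF, 3' UTR, total length, predicted peptide aa) copied from the table.
ROWS = {
    "CD40L (1RT9M08_C_G04)": (94, 795, 348, 1237, 264),
    "LIGHT v1": (382, 717, 261, 1360, 238),
    "TNF-N v1": (121, 612, 229, 962, 203),
    "RANKL-like": (92, 780, 327, 1199, 259),
    "TRAIL-like": (26, 876, 494, 1396, 291),
}

checks = {
    name: transcript_length(u5, orf, u3) == total and peptide_length(orf) == aa
    for name, (u5, orf, u3, total, aa) in ROWS.items()
}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'inconsistent'}")
```

Each listed row satisfies both relationships, which supports reading the ORF column as inclusive of the stop codon.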
Amino acid alignment, synteny, and expression of BALM. A, Alignment of deduced amino acid sequence for two TNF superfamily member variants from rainbow trout (Om_BALM_v1 and Om_BALM_v2) with the threespine stickleback, G. aculeatus (Ga_BALM). Identical (∗) and similar amino acids (: , .) are identified. Cysteines and individual β-strands (A-H) are shaded. Putative signal peptide is indicated by < >, whereas putative TMDs are indicated by solid underlines. A polybasic region and a putative cleavage site are marked by double underline (RNKR). Stop codon in 5′ end of Om_BALM_v1 is indicated (∗). The locations of introns within the gene sequence of Ga-BALM are denoted by a short, bold underline. If a codon is formed by the junction of two exons, the single amino acid formed is underlined. If the intron is located between two codons, then amino acids on either side of the junction are underlined. Rainbow trout BALM introns are not known. B, Diagram of synteny of genes between Stickleback, Tetraodon, and human EDA/BALM locus. The TNF ligand members are highlighted in gray boxes, whereas transcriptional orientation is indicated by arrows. C, Graphic representation of semiquantitative PCR analysis (using primers BALM F2 and BALM R2) of BALM expression across a panel of healthy adult rainbow trout tissues. Error bars indicate SD (n = 5).
Interestingly, BALM is directly downstream of EDA in stickleback, Tetraodon, and Fugu (Fig. 2 B and data not shown). There is additional synteny surrounding the Tetraodon and stickleback BALM genome loci with GAP Junction Connexin, Melatonin receptor, and Neuralized 1 genes downstream, and EDA just upstream in the same transcriptional orientation. Searches of sequence surrounding human EDA for a human BALM ortholog were unsuccessful; however, there was a weak sequence similarity with an ovarian tumor otubain EST that does not have a THD (Supplementary Data 2). Presently, BALM appears to be unique to teleosts.
We examined expression of rainbow trout BALM in adult fish; the highest constitutive expression was observed in the spleen, PBL, PK, and AK, suggesting a potential immunological role (Fig. 2 C). To a lesser extent, BALM expression was observed in the gill, heart, skin, liver, and intestinal tissues.
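The close spacing of EDA and BALM can be illustrated from the Tetraodon coordinates reported in the chromosomal-location table (Table IV). The sketch below computes the number of bases separating the two annotated intervals. It is a minimal illustration: the `Locus` type and function name are hypothetical, and it assumes the coordinates are 1-based and inclusive on a shared assembly.

```python
from typing import NamedTuple


class Locus(NamedTuple):
    name: str
    chrom: str
    start: int
    end: int


def intergenic_gap(a, b):
    """Bases separating two loci on one chromosome; 0 if they touch or overlap."""
    if a.chrom != b.chrom:
        raise ValueError("loci are on different chromosomes")
    first, second = sorted((a, b), key=lambda locus: locus.start)
    return max(0, second.start - first.end - 1)


# Tetraodon Chr 1 coordinates as listed in Table IV.
tn_eda = Locus("EDA", "Tetraodon Chr 1", 1_993_863, 1_995_537)
tn_balm = Locus("BALM", "Tetraodon Chr 1", 1_996_703, 1_997_806)

gap = intergenic_gap(tn_eda, tn_balm)
print(f"{tn_eda.name} and {tn_balm.name} are separated by {gap} bp")
```

For comparison, the same function applied to loci tens of megabases apart (as for mammalian EDA and CD40L) would return a gap several orders of magnitude larger.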
CD40L (TNFSF 5)
The finding of BALM adjacent to EDA prompted us to examine whether CD40L was also present. In mammals, CD40L is located on chromosome X as is EDA, albeit at a large distance (∼55 Mb). Surprisingly, similar to mammals, we were able to locate a TNFSF member in Tetraodon and zebrafish on the same chromosome as teleost EDA in addition to BALM (Fig. 3,A and Table IV). There was synteny of three upstream and downstream Tetraodon genes with human CD40L, supporting the designation as teleost CD40L. Furthermore, sequences from trout and pufferfish grouped with high bootstrap values to Xenopus, chicken, and mammalian CD40L (Fig. 1 and data not shown). Three rainbow trout cDNA clones were sequenced (Table III), each encoding an identical 264-aa CD40L-like peptide (Fig. 3 B).
Amino acid alignment, synteny, and expression of CD40L. A, Diagram of synteny of genes between Fugu, Tetraodon, and human CD40L locus. B, CD40L amino acid sequence alignment. Putative TMDs are indicated by solid underlines, cysteines and individual β-strands (A-H) are shaded, and the locations of introns within the gene sequence are denoted by a short, bold underline as described in Fig. 2. Amino acids directly involved in CD40L-CD40 binding in mammals are indicated (∩) (22). A potential glycosylation site (Asn240) described in human CD40L is indicated (‡) (22). Abbreviations: Hs, Homo sapiens; Mm, Mus musculus; Xt, X. tropicalis; Om, O. mykiss; Tn, T. nigroviridis. C, Semiquantitative PCR analysis (primers CD40L F1 and CD40L R1) of CD40L expression as described in Fig. 2.
Chromosomal location of teleost TNF ligand members by species
TNFSF | Zebrafish | Tetraodon | Fugu |
---|---|---|---|
TNFα v1 | Chr 19: 22,127,207–22,131,451 | Unk_random: 31,626,073–31,627,489 | scaf. 170: 247,439–248,739 |
TNFα v2 | Chr 15: 45,228,810–45,231,372 | ND | ND |
TNF New | Chr 15: 38,337,533–38,337,913 | Unk_random: 129,073,599–129,074,492 | scaf. 170: 255,876–257,702 |
BALM | ND | Chr 1: 1,996,703–1,997,806 | scaf. 89: 490,418–503,593 |
CD40L | Chr 14: 45,076,326–45,077,282 | Chr 1: 8,832,563–8,833,687 | scaf. 132: 150,026–157,827 |
FasL | Chr 20: 16,305,001–16,308,978 | Chr 1_random: 2,832,242–2,836,658 | scaf. 25: 1,939,216–1,944,081 |
TRAIL-like v1 | Chr 21: 28,980,389–28,993,105 | Chr 10: 8,895,767–8,898,836 | scaf. 158: 594,828–598,067 |
TRAIL-like v2 | Chr 7: 28,552,963–28,593,143 | Unk_random: 132,835,101–132,838,210 | ND |
TRAIL-like v3 | Chr 24: 32,976,123–32,980,198 | ND | ND |
TRAIL-like v4 | Chr 5: 14,646,900–14,649,877 | ND | ND |
RANKL | Chr 9: 45,199,277–45,219,042 | Chr 2: 6,593,541–6,597,011 | scaf. 437: 94,391–99,739 |
TWEAK | Chr 7: 13,790,061–13,790,882 | Chr 20: 646,398–651,415 | scaf. 63: 1,106,386–1,112,003 |
APRIL | Chr 5: 53,725,064–53,725,888 | ND | ND |
BAFF | Chr 9: 2,071,777–2,077,013 | Chr 2: 9,602,179–9,604,901 | scaf. 42: 226,949–319,996 |
LIGHT | Chr 3: 58,324,378–58,345,745 | Chr 18: 1,619,349–1,621,099 | scaf. 194: 420,541–422,231 |
VEGI-like | Chr 8: 51,515,179–51,518,306 | ND | ND |
Ectodysplasin | ND | Chr 1: 1,993,863–1,995,537 | scaf. 89: 493,132–494,322 |
scaf., scaffold.
Through single and double amino acid substitutions, five amino acids (Gln220, Arg203, Lys143, Tyr145, and Tyr146) have been found to be important for CD40L-CD40 binding in mammals (22). None of these residues were found to be conserved in alignment analysis of teleost O. mykiss, S. salar, Tetraodon nigroviridis, Fugu rubripes, D. rerio, P. olivaceus, and G. aculeatus CD40L sequences (Fig. 3,B and data not shown). However, two of the five residues in human CD40L (Lys143 and Tyr145) were conserved in Xenopus tropicalis (Fig. 3,B). A potential glycosylation site (Asn240) has also been described in human CD40L and is conserved in murine, chicken, and amphibian CD40L. This glycosylation site is not found in any of the seven teleost sequences examined; the corresponding position is always a tryptophan (Fig. 3 B and Supplementary Data 1). A striking difference between mammalian and teleost CD40L is the location of cysteines in the THD. In human and mouse, there is a disulfide bond between the C and F β-strands, whereas in fish, two cysteines are located in the E and F strands similar to human BAFF, APRIL, and EDA. Modeling of the secondary structure of O. mykiss CD40L was attempted with human CD40L as a template. Although the first four β-strands (aa 120–158) of the O. mykiss CD40L structure could not be modeled, we did find that Cys213 and Cys224 in the fish E and F β-strands likely form a disulfide bond, suggesting that protein structural changes accompanied the further evolution of this locus.
The highest constitutive expression of rainbow trout CD40L was observed in the spleen, PBL, gill, PK, and AK. To a lesser extent, CD40L expression was observed in the heart, skin, liver, and intestinal tissues (Fig. 3 C).
TWEAK (TNFSF 12) and APRIL (TNFSF 13)
TWEAK and APRIL are both located on human Chr 17 within 700 bp of one another. Sequences resembling both genes were identified in teleosts; however, these two genes are not linked in Zebrafish, and linkage is uncertain in Tetraodon (Fig. 4,A), suggesting that this close proximity between these two genes arose after the fish-tetrapod divergence. The locations of the cysteines in the teleost and mammalian molecules are highly conserved (Fig. 4, B and C). Unusual aspects of teleost APRIL are the presence of two putative furin cleavage sites and the lack of an identifiable transmembrane (TM) region (Fig. 4 C). Rainbow trout APRIL is weakly expressed in the spleen, gill, intestine, skin and heart tissues (data not shown). Rainbow trout TWEAK has not yet been identified.
A, Diagram of synteny of genes between zebrafish, Tetraodon, and human APRIL/TWEAK-like locus. B, Alignment of TWEAK sequences. Putative TMDs are indicated by solid underlines, cysteines and individual β-strands (A-H) are shaded, and the locations of introns within the gene sequence are denoted by a short, bold underline. C, APRIL amino acid sequence alignment. The furin recognition site in mammalian APRIL molecules is underlined, and the cleavage site is denoted with (▾). This precise cleavage site is not conserved in Atlantic salmon, trout, or catfish APRIL; however, there are two putative recognition sites (Arg-X-Lys-Arg) proximal to this site (double underlined). Abbreviations: Hs, Homo sapiens; Mm, Mus musculus; Tn, T. nigroviridis; Dr, D. rerio; Ss, S. salar; Om, O. mykiss; Ip, Ictalurus punctatus.
RANKL (TNFSF 11)
Single gene sequences were identified from pufferfish, zebrafish, trout, and frog that grouped with mouse and human RANKL TNFSF members with high bootstrap values (Fig. 1). Synteny analysis of Fugu and Tetraodon both showed isocitrate dehydrogenase downstream of RANKL, with a kinase anchor 11 and ATP GTP-A binding motif A located upstream of RANKL, all with the same orientation (Fig. 5,A). Human Chr 13 also had a kinase anchor 11 and ATP GTP-A binding motif genes upstream of human RANKL; however, the ATP GTP-A binding motif was not similar in orientation. The highest identity and similarity percentages for Om_RANKL were with human TRAIL (Table II), contradicting the phylogenetic tree grouping with RANKL orthologs (Fig. 1). However, Tn_RANKL had higher similarity with Hs_RANKL (41.5% similarity and 25.6% identity) than with Hs_TRAIL (38.1% similarity and 25.6% identity). Furthermore, there is conservation of a cysteine in the C β-strand among all RANKL members (Fig. 5,B) that is not found among TRAIL members, while all TRAIL sequences have a conserved cysteine in the E-F loop that coordinates trimerization of the protein (Fig. 6 B). This cysteine is not found in any of the RANKL sequences, although there is a cysteine in the F β-strand of the fish sequences that is in close proximity and may represent an ancestral location. In Tetraodon, both RANKL and BAFF are located on the same chromosome, similar to the organization found in humans. A unique aspect of the teleost RANKL sequences is the large variation in the size of the putative C-D loop, which may require the disulfide bond between the C and F β-strands for stability.
Amino acid alignment, synteny, and expression of RANKL. A, Diagram of synteny of genes between Fugu, Tetraodon, and human RANKL locus. B, RANKL amino acid sequence alignment. Putative TMDs are indicated by solid underlines, cysteines and individual β-strands (A-H) are shaded, and the locations of introns within the gene sequence are denoted by a short, bold underline. Murine cleavage site (!) is marked, whereas associated sequence is marked by a dotted underline (IVGPQR-FSGAPA). Abbreviations: Hs, Homo sapiens; Mm, Mus musculus; Xt, X. tropicalis; Tn, T. nigroviridis; Om, O. mykiss; Fr, F. rubripes; Dr, D. rerio. C, Semiquantitative PCR analysis (primers RANKL F1 and RANKL R2) of RANKL expression across a panel of healthy adult rainbow trout tissues.
Amino acid alignment, synteny, and expression of TRAIL-like gene(s). A, Diagram of synteny of genes between a Fugu, two Tetraodon, and the human TRAIL loci. The TNF ligand members are highlighted in gray boxes, whereas transcriptional orientation is indicated by arrows. B, TRAIL amino acid sequence alignment. Putative TMDs are indicated by solid underlines, cysteines and individual β-strands (A-H) are shaded, and the locations of introns within the gene sequence are denoted by a short, bold underline. Abbreviations: Hs, Homo sapiens; Mm, Mus musculus; Gg, Gallus gallus; Fr, F. rubripes; Tn, T. nigroviridis; Om, O. mykiss; Dr, D. rerio. C, Semiquantitative PCR analysis (primers TRAIL-like F1 and TRAIL-like R1) of TRAIL-like expression across a panel of healthy adult rainbow trout tissues.
The highest constitutive expression of rainbow trout RANKL was observed in the spleen, PBL, skin, heart, and AK. This expression pattern is similar to mammalian RANKL, which is expressed in the spleen, blood, bone, and vasculature. To a lesser extent, trout RANKL-like expression was observed in the gill, PK, liver, and intestine tissues (Fig. 5 C).
TRAIL-like (TNFSF 10)
By phylogenetic analysis, RANKL groups closely with TRAIL. In humans, there is only a single TRAIL gene, while two genes have been identified in chickens. In this study, we identify as many as four TRAIL-like genes in Zebrafish, in agreement with recent sequence and functional analyses by Eimon et al. (23). Synteny analysis of Fugu and Tetraodon both showed serine/threonine protein kinase, cohesin subunit SA stromal Ag SCC3 homolog, propionyl CoA carboxylase, and Zn-finger genes located downstream of Fugu TRAIL-like and Tetraodon TRAIL-like v2 with the same orientation, except for serine/threonine protein kinase (Fig. 6,A). Arylacetamide deacetylase and serine/threonine phosphatase genes were both located upstream of Fugu and Tetraodon TRAIL-like v1 with the same orientation (Fig. 6,A). Human Chr 3 had the arylacetamide deacetylase gene upstream of human TRAIL and in the same orientation as Fugu TRAIL-like and Tetraodon TRAIL-like v1 and TRAIL-like v2. Interestingly, all of the mammalian, bird, frog, and fish sequences contain a cysteine immediately adjacent to the E β-strand (Fig. 6,B). In crystal structures of human TRAIL, this cysteine coordinates a zinc atom involved in trimerization of the molecule. All the TRAIL sequences also contain two cysteines in the stalk region of the molecule immediately adjacent to the TMD. All teleost TRAIL molecules have an extended A-A′ loop that in human TRAIL is involved in receptor binding. This loop may have expanded in comparison to other TNFSF members due to the presence of an intron in the middle of the A-A′ loop, allowing the addition of sequence due to shifting of the splice acceptor and donor sites (Fig. 6 B).
From rainbow trout, an insert of 1396 bases was obtained from clone tcay0005b.j.21 encoding a predicted 291 aa peptide (Table III). This sequence, by phylogenetic analysis, groups most closely with Tetraodon TRAIL-like variant 2 and zebrafish TRAIL-like v3 protein (Fig. 1). The highest constitutive expression of rainbow trout TRAIL-like was observed in the spleen, gills, and PK. To a lesser extent, TRAIL-like expression was observed in the heart, liver, intestine, AK, skin and PBL tissues (Fig. 6 C).
LIGHT (TNFSF 14) and 4-1BBL (TNFSF 9)
In humans, three TNFSF members are clustered on Chr 19: LIGHT, CD27L, and 4-1BBL. We find that two of the three also cluster together in fish. Sequences from Tetraodon, Fugu, and trout grouped with high bootstrap values with mammalian LIGHT (Fig. 1). Synteny analysis of Fugu and Tetraodon both showed complement C3a/C4a/C5a anaphylatoxin, glycosyl transferase, thioredoxin, an olfactomedin-like gene complex, and 4-1BBL downstream of LIGHT, whereas TM EMP 24 domain, IFN-induced GTP-binding MX, D dopachrome tautomerase, and two BTB/POZ domain genes were located upstream of LIGHT with the same orientation (Fig. 7 A). Human Chr 19 also has complement, IFN-induced GTP-binding MX, and TM EMP 24 domain genes; however, they are all located upstream of human LIGHT. CD27L appears to be missing from the Tetraodon and Fugu locus.
Synteny and expression of LIGHT and 4-1BBL. A, Diagram of synteny of genes between Fugu, Tetraodon, and human LIGHT/4-1BBL locus. B, Semiquantitative PCR analysis (nonvariant-specific primers: S. salar LIGHT F1 and S. salar LIGHT R1) of LIGHT expression across a panel of healthy adult rainbow trout tissues. C, LIGHT amino acid sequence alignment. Putative TMDs are indicated by solid underlines, cysteines and individual β-strands (A-H) are shaded, and the locations of introns within the gene sequence are denoted by a short, bold underline. D, 4-1BBL amino acid sequence alignment. Abbreviations: Hs, Homo sapiens; Rn, Rattus norvegicus; Mm, Mus musculus; Om, O. mykiss; Ga, Gasterosteus aculeatus; Pp, Pimephales promelas; Tn, T. nigroviridis.
LIGHT sequences from rainbow trout were found by combining PCR and 5′ and 3′ RACE products generated from splenic tissue (Table III). One PCR product and a 5′ and 3′ RACE product overlapped by 240 identical bp to construct the 1360-bp sequence of LIGHT v1. LIGHT v2 consisted of a PCR and a 5′ RACE product, which overlapped for 318 bp of identical sequence. LIGHT v3 was constructed from a PCR and a 3′ RACE product, which overlapped for 458 bp of identical sequence. Translation of LIGHT v1 and v2 both predicted a 251-aa peptide, whereas LIGHT v3 translated into a 159-aa partial peptide. Interestingly, the conserved cysteine in mammals, located in the EF loop, is present in pufferfish but not in the trout sequences (Fig. 7 C).
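The overlap-based construction of a consensus cDNA from PCR and RACE products can be illustrated with a minimal Python sketch. It is not the procedure used in this study; the function name `merge_by_overlap`, the `min_overlap` parameter, and the toy sequences are assumptions made for illustration, and the sketch accepts only perfectly identical overlaps, as reported for the LIGHT variants.

```python
def merge_by_overlap(left: str, right: str, min_overlap: int = 20) -> str:
    """Join two sequences where a suffix of `left` is identical to a prefix of `right`.

    Hypothetical helper for illustration: real assemblies would also
    consider sequencing errors and reverse-complemented reads.
    """
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    left, right = left.upper(), right.upper()
    longest = min(len(left), len(right))
    # Try the longest candidate overlap first and shrink until one matches.
    for size in range(longest, min_overlap - 1, -1):
        if left[-size:] == right[:size]:
            return left + right[size:]
    raise ValueError(f"no identical overlap of at least {min_overlap} bases")


# Toy fragments (not trout sequence): a 5' product followed by a 3' product.
five_prime = "ACGTACGTTT"
three_prime = "CGTTTGGA"
contig = merge_by_overlap(five_prime, three_prime, min_overlap=4)
```

Requiring a minimum overlap guards against joining fragments on a short chance match; the 240- to 458-bp overlaps reported here are far above any reasonable threshold.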
Mammalian LIGHT has been shown to be expressed on T cells and immature dendritic cells (24, 25) and is instrumental in T cell homeostasis and stimulation of monocyte and neutrophil bactericidal activity via the herpes virus entry mediator receptor (26). The highest constitutive expression of LIGHT in unstimulated rainbow trout was observed in the spleen, PBL, gill, kidney, and intestine (Fig. 7 B). To a lesser extent, LIGHT expression was observed in the liver, skin, and heart (Fig. 7 B). Although functional assays were not conducted, high constitutive LIGHT expression in rainbow trout appears to occur in the primary hemopoietic tissues and tissues with direct contact to the external environment. This may indicate that rainbow trout LIGHT has immunological or organogenesis functions similar to those in mammals. Further work in this area is needed to define the specific roles LIGHT plays in these functions.
Four 4-1BBL orthologs were found from Tetraodon, Fugu, Zebrafish, and Stickleback EST and genomic database searches (Fig. 7, A and D). Further evidence supporting the identification of 4-1BBL orthologs is that we have isolated a rainbow trout gene with similarities to the 4-1BB receptor (G. D. Wiens, unpublished data). Interestingly, phylogenetic analysis suggests that mammalian CD27L is closely related to mammalian 4-1BBL, suggesting that CD27L arose recently by cis-duplication of 4-1BBL (Fig. 1). Alternatively, CD27L may have been lost in teleosts by deletion. Rainbow trout 4-1BBL has not yet been identified.
FasL (TNFSF 6)
Strong synteny was observed between the Tetraodon FasL locus and its human ortholog. Analysis showed that IFN-induced GTP-binding MX, phosphatidylinositol-N-acetylglucosaminyltransferase, and bipartite nuclear localization signal are downstream of Tetraodon FasL, whereas these same genes are upstream of human FasL while still in the same orientation (Fig. 8 A). The locations of cysteines in the C-D and E-F loops are highly conserved between fish and mammalian FasL sequences (Fig. 8 C). Human Chr 1 has two TNFSF ligand members (OX40L and GITRL) downstream of FasL, while orthologs to these two genes appear to be missing from the Tetraodon locus. Upstream of Tetraodon FasL are three RAL genes, fibrinogen, and an unknown gene labeled PF06702. Human orthologs to these Tetraodon genes are located downstream of FasL with similar orientation on Chr 1. It is noteworthy that FasL and LIGHT, which group closely by phylogenetic analysis, both lack the adjacent TNFSF member(s) found in mammals.
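The flanking-gene comparison described for the FasL locus can be expressed as a small Python sketch. The dictionaries below encode only what the text states (which side of FasL each neighbor lies on); the short gene labels, the variable names, and the function `compare_flanks` are our own assumptions for illustration and do not reproduce the synteny pipeline behind Fig. 8 A.

```python
def compare_flanks(locus_a: dict, locus_b: dict) -> dict:
    """Classify flanking genes of two loci by presence and by side of the anchor gene."""
    shared = sorted(set(locus_a) & set(locus_b))
    return {
        "same_side": [g for g in shared if locus_a[g] == locus_b[g]],
        "opposite_side": [g for g in shared if locus_a[g] != locus_b[g]],
        "only_a": sorted(set(locus_a) - set(locus_b)),
        "only_b": sorted(set(locus_b) - set(locus_a)),
    }


# Side of FasL on which each neighbor is found, as described in the text.
# "RAL" stands for the group of three RAL genes; labels are abbreviated by us.
TETRAODON_FASL = {
    "MX": "downstream",
    "PIG-transferase": "downstream",
    "bipartite-NLS": "downstream",
    "RAL": "upstream",
    "fibrinogen": "upstream",
    "PF06702": "upstream",
}
HUMAN_FASL = {
    "MX": "upstream",
    "PIG-transferase": "upstream",
    "bipartite-NLS": "upstream",
    "RAL": "downstream",
    "fibrinogen": "downstream",
    "PF06702": "downstream",
    "OX40L": "downstream",
    "GITRL": "downstream",
}

result = compare_flanks(TETRAODON_FASL, HUMAN_FASL)
```

In this encoding every shared neighbor falls on the opposite side of FasL in the two species, and OX40L and GITRL appear only in the human locus, which mirrors the description above.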
Synteny and expression of FasL. A, Diagram of synteny of genes between Fugu, Tetraodon, and human FasL locus. B, Semiquantitative PCR analysis (primers FasL F1 and FasL R1) of FasL expression across a panel of healthy adult rainbow trout tissues. C, FasL amino acid sequence alignment. Putative TMDs are indicated by solid underlines, cysteines and individual β-strands (A-H) are shaded, and the locations of introns within the gene sequence are denoted by a short, bold underline. Abbreviations: Hs, Homo sapiens; Mm, Mus musculus; Gg, Gallus gallus; Xt, X. tropicalis; Dr, D. rerio; Om, O. mykiss.
Two separate rainbow trout FasL clones were identified (tcbi0027c.i.17, 927 bp; tcba0001c.n.08, 930 bp) that were identical in their coding regions and 3′ UTR, except for the poly(A) tail length (Table III). Multiple attempts at obtaining the 5′ region by RACE were unsuccessful; therefore, nucleotide translation predicted a partial 185-aa peptide for both clones. Both FasL clone inserts contained a stop codon in the 5′ region. With this in mind, these sequences could potentially be pseudogenes or partially spliced cDNAs. Rainbow trout FasL had highest similarity to human and Xenopus LIGHT. However, the trout sequence did share its highest similarity (40%) and identity (28.4%) with zebrafish FasL, whose highest similarity is with human FasL. This underscores the close evolutionary relationship between FasL and LIGHT.
The highest constitutive expression of rainbow trout FasL was observed in the spleen, PBL, gill, and intestine. To a lesser extent, FasL expression was observed in the skin, PK, AK, and heart tissues (Fig. 8 B). Further research is needed to determine, if present, the full rainbow trout FasL sequence and to pinpoint its immunostimulatory and/or apoptotic roles.
Om TNF-New, TNF-α (TNFSF 2), and TL1A-like (TNFSF 15)
While this article was in preparation, a novel TNFSF member was described in teleosts and designated TNF-New (14, 15). Kono et al. (15) postulated that this gene is related to mammalian LT-β. However, in our phylogenetic analysis, the location of TNF-New was unstable, grouping sometimes with TNF-α/LT-α, other times with CD30L/GITRL, and other times with EDA/BAFF/APRIL/BALM sequences. However, we have never observed grouping with mammalian LT-β. For Trout TNF-New, the highest sequence similarity and identity values were with pufferfish and zebrafish TNF-N orthologs (data not shown) and human TNF-α (Table II). Interestingly, Fr_TNF-N was most similar to human LT-α, while zebrafish TNF-New was most similar to human OX40L and then Xenopus TNF-α. It should be noted that the percent sequence identity among fish and human TNFSF is quite low.
Two TNF-New variants were identified from rainbow trout. The initial sequencing of clone 1RT149_E_07 described a partial sequence with errors in the 3′ end. The second variant sequence was found by combining multiple PCR and 3′ RACE products (Table III). PCR was conducted (Om TNF-N F1, R4) to obtain a complete ORF from the total RNA extracted from the kidney of an unstimulated rainbow trout. This PCR product and a 3′ RACE product from splenic tissue overlapped by 459 identical bp and were combined to construct the full cDNA of Om TNF-N v1, totalling 962 bp (Fig. 8 A). The second variant consisted of two PCR products from gill tissue overlapping a 3′ RACE product from splenic tissue to construct a 788-bp partial sequence. The Om TNF-New v2 construct consisted of 180 overlapping nt with an identical 590-bp ORF. Two nucleotide differences were observed between two products at positions 604 and 605 in the 3′ UTR. Fourteen amino acid substitutions were observed when the overlapping amino acid sequences of the two variants were compared (Fig. 9 A). Seven of the 14 substitutions were conserved. These sequences probably represent duplicated genes, which are commonly found in salmonids, since they have undergone recent genome duplication. Recently, we have identified several bacterial artificial chromosome clones that contain both trout TNF-α1 and TNF-New_v1, indicating that the genomic arrangement of these genes is similar between trout and other teleosts (Fig. 9 B and data not shown).
Amino acid alignment, synteny, and expression of TNF-N and multiple sequence alignment of TL1A. A, Multiple alignment of deduced amino acid sequence for TNF superfamily member New (Om TNF-New) from rainbow trout with other teleost homologs. Putative signal peptide is indicated by dotted line and < >. Putative TMDs are indicated by solid underlines, cysteines and individual β-strands (A-H) are shaded, and the locations of introns within the gene sequence are denoted by a short, bold underline. Abbreviations: Dr, D. rerio; Fr, F. rubripes; Om, O. mykiss. B, Synteny between two zebrafish loci, one fugu and human TNF-α locus. C, Graphic representation of semiquantitative PCR analysis (primers designed from Om TNF-N v1: Om TNFSF-N F2 and Om TNFSF-N R2) of Om_TNF-N expression across a panel of healthy adult rainbow trout tissues. D, Multiple amino acid sequence alignment of mammalian, chicken, and fish TL1A and TL1A-like sequences. Abbreviations: Hs, Homo sapiens; Pt, Pan troglodytes; Rn, Rattus norvegicus; Gg, Gallus gallus; Dr, D. rerio.
Interestingly, trout TNF-New v1 contains a cysteine in the putative F β-strand, which is similar to the EDA, BAFF, APRIL, and BALM cysteine organization (Fig. 9 A). This differs from all TNF-α and LT-α sequences we analyzed, which contain conserved cysteines in the C-D and E-F loops, while all mammalian LT-β sequences have a conserved cysteine in the C β-strand similar to RANKL, CD40L, and TWEAK (Supplementary Data 2). Modeling of the secondary structure of trout TNF-New suggested more similarity to TNF-α and LT-α than to LT-β (Supplementary Data 2). Thus, from our analyses, the precise relationship between TNF-New and other mammalian TNFSF members remains uncertain.
The highest constitutive expression of rainbow trout TNF-New, using nonvariant-specific primers, was observed in the PK, PBL, AK, gill, and intestine. To a lesser extent, TNF-N expression was observed in the spleen, skin, and heart tissues (Fig. 9 C). Kono et al. (15) also observed highest expression across a tissue panel in the intestine, gill, and kidney.
Only one fish TL1A ortholog was found in our searches. This ortholog was found in zebrafish and shares two conserved cysteines in the TMD and one cysteine between the TMD and THD regions with mammalian TL1A sequences (Fig. 9 D). Although amino acid sequence similarities indicate a TL1A-like protein, the phylogenetic analysis positioned Dr_TL1A-like with X. tropicalis TNF-α (Fig. 1), indicating that the evolution of these proteins is complex and not easily resolved from the species examined.
Discussion
The TNF superfamily is an ancient family of proteins that, similar to TLRs, can be identified in the Ecdysozoan clade of bilateral animals. Drosophila has one known TNF homolog encoded by the gene, eiger (27, 28). Eiger binds a type I membrane receptor, Wengen, and activates signal transduction components that are similar to the mammalian TNF signal cascade (29, 30). Furthermore, mutations in eiger delay the lethality in Salmonella-infected flies, implicating a functional role for eiger in a condition that resembles TNF-induced metabolic collapse in vertebrates (31). In all TNFSF members, the THD, on the protein C terminus, form a trimer that is structurally similar to the C1q family of proteins (32). The C1q family is an expanding group of proteins, including adiponectin, CORS-26, precerebelin, hibernation protein, type X collagen, type VIII collagen, emilin-1 through 4, and multimerin that have diverse immune and nonimmune functions (32). In this study, we have restricted our analysis to TNFSF members and demonstrate that many mammalian TNFSF members have orthologs in teleosts.
To our knowledge, the phylogenetic tree (Fig. 1) is the most inclusive analysis of fish TNFSF ligands described to date. Fourteen distinct teleost TNFSF members were identified. It is striking that no clear TNFSF orthologs were found for mammalian OX40L, CD27L, CD30L, and GITRL. The fact that none of these genes were found across a number of teleost genome sequences and EST databases lends support to their loss and/or absence. However, given the large number of teleost species (>20,000), it is possible that the TNFSF repertoire differs depending on the fish species examined and the degree of selective gene loss or duplication. For example, in a comparison between zebrafish and Tetraodon gene duplicates, only half remain in either of the zebrafish or Tetraodon genomes, presumably due to gene loss or rearrangement (33). Interestingly, the genome of the chicken is missing orthologs of eight mammalian genes, which are present in three tandem clusters in the mammalian genome (TNF, LT-A, and LT-B; LIGHT, CD27L, and 4-1BBL; APRIL and TWEAK) (34). The portrait of missing teleost TNFSF members is not as clear as in chickens and may, in large part, be attributed to multiple teleost en bloc and genome duplications, which obscure the evolutionary picture. It is also possible that we are unable to recognize these genes due to high sequence divergence. Analysis of the receptors may be required to clarify the status of the missing ligands.
The association of homeobox (HOX) clusters has aided in deciphering the origin of teleost chemokine and chemokine receptor families and has shown that en bloc and tandem duplications have been a common source of these genes (35). A similar, but more indistinct, picture appears to be present with the TNFSF ligands as depicted in our evolutionary model (Fig. 10). Groups of TNFSF ligands are found in tandem, and phylogenetic relationships are present between these tandem clusters, indicating the occurrence of en bloc or genome duplications. For example, mammalian LIGHT, CD27L, and 4-1BBL are found in tandem on Chr 19 of the human genome (Fig. 9). LIGHT and 4-1BBL were found in our search in Tetraodon and Fugu, whereas CD27L was absent. A similar circumstance is present with FasL (likely sharing ancestry with LIGHT), which was found in Tetraodon, zebrafish, and rainbow trout, whereas GITRL and OX40L were absent. Mammalian FasL, GITRL, and OX40L are found in tandem on human Chr 1.
Comparative analysis of human and teleost TNFSF chromosomal location, exon organization, transcriptional direction, and extension of a model of TNFSF evolution proposed by Collette et al. (4). Dashed lines indicate significant phylogenetic relationships, whereas dotted lines indicate weak phylogenetic grouping from Fig. 1. Blocks indicate exons and arrows indicate transcriptional direction. TNFSF members that we were unable to find, or exons in fish that are uncertain, are indicated (?). This model assumes that the blocks of TNFSF genes originally arose from genome-wide or chromosomal duplications of the metazoan ancestor to the early vertebrates. The ancestral TNFSF gene organization before the bony fish-tetrapod split, ∼360–450 mya (48), can be deduced from the common gene organization. Blocks of two genes appear to be the original TNFSF unit, whereas blocks of three TNFSF appear to be a derived characteristic. The most parsimonious explanation suggests that cis-duplication led to the 4-1BBL-CD27L pair and also to the TNF-α-LT-α pair occurring after the fish-tetrapod split. Translocation may have contributed to the placement of BALM next to EDA in teleost genomes and is shown by an arrow. It is possible that the precursor of TNF-New gave rise to GITRL/CD30L as suggested from phylogenetic analysis. Alternatively, TNF-New, which is in the same location (next to TNF-α) and transcriptional orientation as mammalian LT-α, may be a degenerate form of the precursor of LT-α. The evolutionary origin of LT-β is unclear but may have arisen from a translocation of a related TNFSF member, which in teleosts may have included a translocated TNF-α gene or TL1A-like gene. Abbreviations: Chr, chromosome; Dr, D. rerio; Tn, T. nigroviridis; TNFSF, TNF superfamily. Exon size and distances are not drawn to scale and splice variants are not shown.
To further verify that we were not missing fish TNFSF orthologs (CD27L, GITRL, and OX40L) associated with the mammalian LIGHT and FasL loci mentioned above, we scanned the associated fish genomic sequence obtained from the Ensembl Tetraodon database through WU-BLAST (program, blastx; database, nrdb95) to search for potential TNFSF ORFs in all six reading frames (http://dove.embl-heidelberg.de/Blast2/). We were unable to locate any TNFSF members from the genomic sequence between LIGHT and 4-1BBL (Chr 18) and between FasL and Trans. Fac. AP-1 (Chr 1-random). However, the 64,368 bases of genomic sequence analyzed downstream of Tetraodon FasL did contain 3,056 bases of uncharacterized “n” sequence. An interesting observation is that the majority of these absent mammalian members are instrumental in T cell regulation and homeostasis. This may indicate a fundamental difference between the teleost and mammalian immune system and specifically adaptive immunity.
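The idea of searching genomic sequence in all six reading frames can be sketched in Python as follows. This is a simplified illustration and not WU-BLAST blastx, which additionally aligns each translation against a protein database; the function names, the `min_aa` threshold, and the toy input are assumptions. Codons containing uncharacterized "n" bases are translated as X so that such stretches remain visible in the output.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGTN", "TGCAN"))[::-1]


def translate(seq: str) -> str:
    """Translate complete codons; unknown codons (e.g. containing N) become 'X'."""
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3))


def six_frame_orfs(seq: str, min_aa: int = 50) -> list:
    """Return (strand, frame offset, peptide) for stop-free stretches of at least min_aa."""
    seq = seq.upper()
    hits = []
    for strand, template in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            protein = translate(template[offset:])
            # Stop codons ('*') delimit the open stretches in each frame.
            for segment in protein.split("*"):
                if len(segment) >= min_aa:
                    hits.append((strand, offset, segment))
    return hits


# Toy sequence (not Tetraodon sequence) with a tiny threshold for demonstration.
hits = six_frame_orfs("ATGGCCTAAGGG", min_aa=2)
```

Each reported stretch would then be compared with known TNFSF proteins. For scale, the 3,056 uncharacterized bases downstream of Tetraodon FasL amount to roughly 4.7% of the 64,368 bases analyzed, so a short TNFSF exon hidden within that gap cannot be excluded by this kind of search.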
The current search established only one conclusive TNFSF “triplet” of genes (BALM-EDA-CD40L), and this organization is unique to teleosts (Fig. 10). This supports the proposal of Collette et al. (4) that the “ancestral” TNFSF organization is two genes. Recently, further support for this organization was observed in the purple sea urchin genome (36). Four TNFSF genes have been identified, and two of these genes, Sp_TNFSF-like 1 and Sp_TNFSF-like 2, are within 30 kb of one another. The predicted protein sequence of Sp_TNFSF-like 1 groups at the root of the EDA-BAFF-BALM-APRIL sequences, while Sp_TNFSF-like 2 groups weakly with Fr_TNF-New by phylogenetic analysis (data not shown). Further analyses of TNFSF members from additional intermediate species are required to resolve the complex evolution of these ancestral genes.
The location of BALM adjacent to EDA suggests a local duplication or translocation not found in higher vertebrates (Fig. 10). BALM was found to be located on the same chromosome with EDA for Tetraodon (Chr 1) and Fugu (scaffold 663). TNFSF5 (CD40L) was also found on Tetraodon Chr 1, while it is still unclear if Fugu CD40L (scaffold 465) is on the same chromosome with EDA and BALM due to the current inconclusive scaffold organization of the genome. Zebrafish CD40L is located on Chr 14, whereas no data are available on the location of a zebrafish EDA. A tblastn hit of (8.1e-70) was observed in zebrafish with a partial S. salar EDA (TIGR: TC27579) as query. Preliminary searches indicated the possibility of a second triplet consisting of APRIL, TWEAK, and TRAIL-like located on zebrafish Chr 7, which is partially consistent with the organization of APRIL and TWEAK on human Chr 17. However, the more recent Ensembl release 41 for zebrafish (assembly Zv6) shows that TWEAK and TRAIL-like_v2 are located on Chr 7, while APRIL and TRAIL-like_v4 are present on Chr 5. The location of BAFF and RANKL-like on Tetraodon Chr 2 also coincides with Chr 13 in humans (Fig. 10). In summary, gene organization has been conserved for many of the TNFSF members with putative gene translocation, gene loss, and truncated clusters as has been reported for the hox gene family in teleosts (37).
Colosimo et al. (12) characterized threespine stickleback EDA and have started to reveal its role in scale/plate development. They also described a gene within the stickleback EDA locus and, due to its characteristics, called it stickleback TNF (ligand) superfamily member 13b (BAFF). Upon closer observation, we determined this stickleback TNFSF 13b gene to be an ortholog of rainbow trout BALM. Because teleost BALM shows characteristics of both BAFF and APRIL, it may be an ancient precursor or have emerged from a teleost-specific duplication. Although the receptor is at present unknown, it is interesting that BAFF, and also one splice variant of APRIL, bind to three different receptors BCMA, BAFF-R, and TACI (38). Thus, BALM may be the third ligand in this set that was lost during higher vertebrate evolution. Interestingly, rainbow trout BALM expression is highest in the blood and lymphoid organs similar to trout BAFF and APRIL. Further functional analyses are underway to elucidate the receptor specificity and role of these closely related proteins in rainbow trout.
In humans and mice, CD40L is primarily expressed on CD4+ T cells. In the absence of CD40-CD40L interaction between T cells and APCs, macrophages are unable to up-regulate costimulatory molecules, and B cells are unable to proliferate and switch Ig class. The synteny between teleost and human CD40L loci appears to be strong, while CD40-CD40L binding and glycosylation sites are not conserved. Currently, isotype switch regions have not been found in rainbow trout (39), and it has been proposed that progenitor B cells become either of two Ig lineages (40). With this in mind, teleost CD40L may play a limited or alternative role compared with its mammalian orthologs. While our manuscript was in preparation, a Japanese flounder TNFSF protein was reported to be an ortholog of mammalian FasL (41). This conclusion was based on phylogenetic analysis with three mammalian TNFSF members: TNF-α, LT-α, and FasL. Our phylogenetic analyses using all available sequences indicate that this flounder sequence groups strongly with teleost and mammalian CD40L sequences and not with the teleost and mammalian FasL sequences (data not shown), underscoring the limitations of phylogenetic analysis using limited sequences and the importance of including the analysis of gene synteny. Interestingly, the Japanese flounder protein induced apoptosis of HINAE cells derived from Japanese flounder embryos, suggesting a novel bioactivity for teleost CD40L. Possibly, this may relate to the recently uncovered role of murine CD40L in immune homeostasis. Naive CD4+ T cells expressing CD40L have been shown to augment the survival of autoantigen-engaged B cells (42). This homeostatic/apoptotic function may be the primary role of CD40L in lower vertebrates, whereas its role expanded further through mammalian evolution.
Recently, Kono et al. (15) characterized almost identical sequences to rainbow trout TNF-New v1 and TNF-New v2. Through phylogenetic analysis with mammalian and frog TNF-α, LT-α, and LT-β, they propose TNF-New to be the ortholog of mammalian LT-β. We have included all 18 mammalian TNF ligand members and have shown (Fig. 1) that rainbow trout TNF-N v1 groups with Fugu and zebrafish TNF-N, but we have not observed grouping with LT-β in any of the trees constructed. The present results are similar to the phylogenetic analysis findings of Savan et al. (14), who found Fugu and zebrafish TNF-New to form a distinct cluster separated from TNF-α, LT-α, and LT-β. Synteny analysis indicates that the Fugu and zebrafish TNF-New orthologs are located on the same scaffolds as TNF, share synteny with human Chr 6, which contains TNF, LT-A, and LT-B, and share the same transcriptional direction as mammalian LT-A (15). TNF-New shares closest percent amino acid identity with human TNF-α (22%), followed by LT-α (18.5%) and finally LT-β (16.7%). Furthermore, SignalP 3.0 predicts a signal peptide on TNF-New similar to LT-α (data not shown). Thus, our results suggest that rainbow trout TNF-N is also similar to mammalian LT-α. Interestingly, zebrafish have two copies of TNF, one of which is present on Chr 15 next to TNF-New, whereas a second TNF is located by itself on Chr 19 (Fig. 9 B). The TNF gene on zebrafish Chr 19 is surrounded by the largest stretch of MHC-related genes, while a number of other class III genes are found distributed throughout the genome (43). This organization is in abrupt contrast to that of Xenopus, in which the LT-B, TNF, and LT-A genes and the extended MHC are organized in a manner very similar to mammals (44). Functional analysis and examination of this locus in additional transitional vertebrate species are required to resolve the evolutionary relationship between teleost TNF-New and TNF-α, LT-α, and LT-β and the costimulatory ligands OX40L, GITRL, and CD30L.
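Percent amino acid identity values such as those ranked above depend on how alignment columns are counted. The following Python sketch shows one common convention, in which identity is computed over columns where neither sequence has a gap; this convention, the function name `percent_identity`, and the toy alignment are assumptions, since the exact definition used by the alignment software in this study is not restated here.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over alignment columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # gapped columns are excluded from the denominator
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy pairwise alignment (not real TNFSF sequence): 3 matches in 5 ungapped columns.
identity = percent_identity("MKT-LL", "MRTALV")
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, which lowers the value when gaps are frequent; at the 17 to 22% identities reported here, such differences in definition could be enough to change the rank order of closely spaced values.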
Seven HOX clusters have been found in the zebrafish genome, suggesting an additional genome duplication in fish compared with higher vertebrates (35). In salmonids, the situation is even more complex as 14 hox gene clusters have been identified (45), suggesting that salmonids have undergone an additional genome duplication estimated to occur ∼25 million years ago (46). This makes it likely that additional gene duplicates will be identified in trout. The presence of multiple forms of TNF-α, LIGHT, BALM, and TNF-New suggests that redundant and/or divergent systems are tolerated within salmonids. In teleosts, TRAIL-like molecules are particularly complex as we have identified four related genes in Zebrafish in agreement with recently published findings from Eimon et al. (23).
Expression analysis indicates that the majority of the rainbow trout ligands are constitutively expressed in hemopoietic tissue and tissues with direct contact to the aquatic environment. This would indicate that many of the same organogenesis and immunological pathways present in mammals could be functioning in some similar capacity in fish. Notable exceptions are the absence in teleosts of lymph nodes and of a spleen organized into distinct white and red pulp areas. It is possible that the rearrangement or partial duplication of teleost TNF and TNF-New into the LT-B-TNF-LT-A gene locus may have been a driving force in the evolution of these structures in tetrapods.
In summary, in addition to TNF-α, there are at least 13 distinct TNFSF members present in teleosts. We demonstrate for the first time that teleosts possess orthologs of BAFF, APRIL, TWEAK, 4-1BBL, FasL, LIGHT, CD40L, RANKL, and possibly TL1A. Teleosts have a unique TNFSF member (BALM), along with multiple variant forms of mammalian orthologs such as TNF-α, TRAIL, and LIGHT. At this point, we have been unable to find four TNFSF members (CD27L, OX40L, GITRL, and CD30L), which have a propensity to cluster in our analysis as well as in other phylogenetic trees in the literature (4). The majority of these absent ligands are instrumental in T cell activation and homeostasis (1, 3, 4, 47). There are three possible explanations for their absence: 1) it is an artifact of the incomplete assembly of the genomes examined, 2) they may have been lost during teleost evolution, or 3) they may have arisen under pressures of the adaptive immune systems of higher vertebrates. Further analyses are required to distinguish between these possibilities. We believe the current analysis of TNFSF in teleosts sheds new light on teleost and higher vertebrate comparative immunology and is the basis for future functional studies.
Acknowledgments
We thank Katherine Hovatter for her technical expertise in the lab and Drs. C. Rexroad and Y. Palti for supplying EST clones. We thank Dr. Pascal Schneider, University of Lausanne; Dr. Susan Murray, Oregon Health and Science University; Dr. Ram Savan, Laboratory of Experimental Immunology, National Cancer Institute, Center for Cancer Research, National Institutes of Health; and Dr. Hyun Lillehoj, Beltsville Agricultural Research Center, for discussions and critical review of this manuscript. Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture.
Disclosures
The authors have no financial conflict of interest.
Footnotes
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
This research was supported by Agricultural Research Service CRIS Project 1930-32000-002 “Host-Pathogen and Environmental Interactions in Cool and Cold Water Aquaculture.”
Abbreviations used in this paper: TNFSF, tumor necrosis factor superfamily; AK, anterior kidney; APRIL, a proliferation-inducing ligand; BAFF, B cell-activating factor; BALM, BAFF-APRIL-like molecule; EDA, ectodysplasin; EST, expressed sequence tag; FasL, Fas ligand; GITRL, glucocorticoid-induced TNFR-related gene ligand; HOX, homeobox; LIGHT, lymphotoxin-like inducible protein that competes with glycoprotein D for binding herpesvirus entry mediator on T cells; LT, lymphotoxin; ORF, open reading frame; PK, posterior kidney; RANKL, receptor activator of NF-κB ligand; THD, TNF homology domain; TL1A, TNF ligand-related molecule 1; TM, transmembrane; TMD, transmembrane domain; TRAIL, TNF-related apoptosis inducing ligand; TWEAK, TNF-like weak inducer of apoptosis; UTR, untranslated region.
The online version of this article contains supplemental material.