Gnathostome adaptive immunity is defined by the Ag receptors, Igs and TCRs, and the MHC. Cartilaginous fish are the oldest vertebrates with these adaptive hallmarks. We and others have unearthed nonrearranging Ag receptor-like genes in several vertebrates, some of which are encoded in the MHC or in MHC paralogous regions. One of these genes, named UrIg, was detected in the class III region of the shark MHC that encodes a protein with typical V and C domains such as those found in conventional Igs and TCRs. As no transmembrane region was detected in gene models or cDNAs, the protein does not appear to act as a receptor. Unlike some other shark Ig genes, the UrIg V region shows no evidence of RAG-mediated rearrangement, and thus it is likely related to other V genes that predated the invasion of the RAG transposon. The UrIg gene is present in all elasmobranchs and evolves conservatively, unlike Igs and TCRs. Also, unlike Ig/TCR, the gene is not expressed in secondary lymphoid tissues, but mainly in the liver. Recombinant forms of the molecule form disulfide-linked homodimers, which is the form also detected in many shark tissues by Western blotting. mAbs specific for UrIg identify the protein in the extracellular matrix of several shark tissues by immunohistochemistry. We propose that UrIg is related to the V gene invaded by the RAG transposon, consistent with the speculation of emergence of Ig/TCR within the MHC or proto-MHC.

This article is featured in Top Reads, p. 905

Immunoglobulins and TCRs are the Ag receptors that coordinate adaptive immunity in jawed vertebrates (gnathostomes) (1). Ig/TCR genes are generated by RAG-dependent gene rearrangement events during lymphocyte ontogeny. Igs and most γ/δ TCRs recognize free Ag whereas α/β TCRs recognize peptide or lipid Ags bound in the grooves of MHC class I and class II proteins (2). It is universally agreed that Igs and TCRs were generated from a common ancestral Ag receptor (3–5), but speculation abounds on the nature of such a common ancestor because all living gnathostomes have all three types of Ig superfamily (IgSF)–based Ag receptors, and jawless fish and invertebrates lack such receptors.

Cartilaginous fishes are in the oldest vertebrate group with rearranging Ag receptors and the MHC (6). Other hallmarks of adaptive immunity such as primary (thymus) and secondary (spleen) lymphoid tissues, activation-induced cytidine deaminase (AID)–driven somatic hypermutation, and diverse cytokine and chemokine networks, among others, also arose in cartilaginous fish. The only glaring omissions of shark adaptive immunity are class-switch recombination (classically arose in amphibians, although sharks also have a form of switch) (7, 8), germinal centers (arose in reptiles/birds) (9), and lymph nodes (arose in mammals) (10). However, sharks have some primordial features of their adaptive immune system that were lost in most higher vertebrates (and all primates) such as the cluster (split) organization of Ig H chain genes (11), linkage of β2-microglobulin to the MHC (12), somatic hypermutation of TCRα genes during thymic development (13), “germline joining” of V, D, and J segments that can generate functional Ig genes in the genome (14–18), single V domain–containing H chains that make dimers but do not associate with L chains (15, 19, 20), and usage of IgH V regions in TCRs (21, 22). Such ancestral features highlight the shark as an attractive model for immune studies.

Ig/TCR genes arose by an invasion of an IgSF exon by the RAG transposon early in gnathostome evolution (23–25). The IgSF domain used by Ig/TCR is a so-called “VJ domain” in which the last (G) strand of the molecule bears a Gly-X-Gly motif in the N-terminal part of the strand involved in dimerization and a less conserved V(l)TVT motif in the C-terminal part (26, 27). There are not many genes besides Ig/TCR with this motif in the genome, with most of them having probably derived from an ancestral, nonrearranging VJ domain. A recent review compiled all of the reported animal VJ domains, most of which are involved in immunity (27). In addition, the C IgSF domains used by Ig/TCR are likewise special, so-called C1 domains that are the most compact IgSF domains also with distinguishing motifs: FYP in the B strand and C-V-H in the F strand (26, 28). Besides Ig/TCR, C1 domains are found in MHC class I, MHC class II, β2-microglobulin, and a few other molecules involved in immunity (29). Interestingly, several VJ and C1 domain genes are encoded in the MHC or in MHC paralogous regions, prompting the idea from several groups (including ours) that Ig/TCR and class I/II arose from a proto-MHC region (28, 30–32).

While examining the shark MHC we uncovered a nonrearranging Ig-like gene in the class III region that converges on mammalian IgG, near where other VJ exons are found (33). This was an unexpected result and not one based on hypothesis testing, but nevertheless is consistent with an MHC origin of Ag receptors. In this study, we report the basic features of the gene and molecule, which likely serves an innate immune or structural role in sharks. We speculate on this molecule’s evolutionary significance in relationship to Ag receptor emergence.

Wild-caught nurse sharks (Ginglymostoma cirratum) were maintained in artificial seawater at ∼28°C in indoor tanks at the Institute of Marine and Environmental Technology (Baltimore, MD). Animals were anesthetized with MS222 (0.1%) before bleeds were harvested from the caudal vein with 1000 U/ml heparin reconstituted in shark-modified PBS (SPBS), then spun at 300 × g for 10 min to isolate blood plasma and buffy coats. We harvested major organs in SPBS and then euthanized animals according to protocol. All procedures were conducted in accordance with University of Maryland School of Medicine Institutional Animal Care and Use Committee protocols.

While searching for MHC genes in databases from various cartilaginous fish genomes at the National Center for Biotechnology Information Web site (https://www.ncbi.nlm.nih.gov), we discovered the UrIg gene. The presence of UrIg was further confirmed by BLASTp and tblastn searches against various vertebrate species including the nurse shark transcriptomics database.

The UrIg V and C IgSF domains were separately aligned against different sets of IgSF-containing genes (e.g., Igs, TCRs) based on the predicted evolutionary origin and domain similarities using ClustalW. Phylogenetic trees were then constructed using the bootstrapping neighbor-joining method (34) with 500 runs.

First-strand cDNA was made from 500 ng of total RNA from various nurse shark “NJ” tissue using a SuperScript IV first-strand synthesis kit (Invitrogen) following the manufacturer’s protocol. RT-PCR was performed using GoTaq master mix (Promega) with 35 cycles of denature (at 95°C) for 45 s, annealing (see below for each gene) for 45 s, and extension (at 72°C) for 60 s, following 2 min of hot start at 95°C, and followed by a 5-min final extension at 72°C. The primers used to examine the gene expression were as follows: UrIg (C1–C3 IgSF domains) 5′-CCGGAAGAACATCTCGCTGCT-3′ and 5′-CCGGAAGAACATCTCGCTG CT-3′ (at 58°C annealing temperature); nucleoside-diphosphate kinase (NDPK; control) 5′-AACAAGGAACGAACCTTC-3′ and 5′-TCACTCATAGATCCA GTC-3′ (at 50°C annealing temperature). The PCR amplicons were visualized on 1% agarose gel.

The four-domain UrIg molecule (V-C1-C2-C3) was cloned into the phCMV3 vector for mammalian expression, with an N-terminal murine Igκ signal peptide and C-terminal His tag. Protein was produced by transient transfection of expi293F cells (Thermo Fisher Scientific) following the manufacturer’s standard protocols. Protein was purified from the medium by affinity chromatography with Ni Sepharose excel histidine-tagged protein purification resin (Cytiva). After binding, resin was washed with 20 mM Tris (pH 7.5) and then with 500 mM imidazole, 20 mM Tris, 500 mM NaCl (pH 7.5) for elution. Protein was then dialyzed overnight at 4°C against 1× PBS/1 mM EDTA. After dialysis, NDSB-201 (MilliporeSigma) was added to a final concentration of 200 mM and protein was concentrated to ∼1 mg/ml before size-exclusion chromatography on a Superdex 200 16/60 column (Cytiva) with running buffer Dulbecco’s PBS. Peak fractions were pooled, and samples were then concentrated and examined by SDS-PAGE, and for immunization of mice according to Institutional Animal Care and Use Committee protocols.

mAbs were produced as previously described (35). Briefly, mice were immunized s.c. to 50 μg of UrIg protein emulsified in IFA and boosted 3 wk later with 50 μg in IFA. Three days later, splenocytes were fused with myeloma line X63. Two weeks later, hybridomas were tested for reactivity by ELISA on 96-well plates coated with recombinant UrIg (1 μg/ml). Positive clones were expanded, and supernatants were used for immunohistochemistry and Western blotting.

Collagen adhesin (CNA)35 synthetic DNA with optimized codons for Escherichia coli (Integrated DNA Technologies) was cloned in-frame with the human collagen XVIII trimerization domain sequence (36) into the pET23-His-Trx-thr vector using BamHI and EcoRI sites (37). The resultant construct encoded a thrombin-cleavable, His-tagged thioredoxin domain added to the N terminus of the collagen XVIII trimerization domain (18TD) fused with CNA35. The plasmid sequence was verified by Sanger sequencing.

The protein was expressed in the BL21(DE3) strain of E. coli using 1 mM IPTG induction for 3 h at 37°C. Cells were collected by centrifugation and disrupted by ultrasonication in 20 mM Tris-HCl (pH 8) on ice. Insoluble material was removed by centrifugation at 10,000 × g at 4°C for 30 min and supernatant was adjusted to include 50 mM sodium phosphate (pH 8), 150 mM NaCl, and 5 mM imidazole. The His-tagged chimera protein was affinity purified using a Ni-NTA column (Qiagen) by elution using the same buffer supplemented with increasing imidazole concentrations. The fractions with the protein of interest were pooled and dialyzed against 50 mM Tris-HCl, 100 mM NaCl (pH 8.3), after which 10 mM CaCl2 and 0.01% sodium azide were added to the dialyzed protein sample to facilitate thrombin cleavage and prevent bacterial growth, respectively. Every week 4 U/ml thrombin was added for a total time of 4 wk to complete the thrombin cleavage at room temperature. His-tagged thioredoxin was separated from (18TD-CNA35)3 using a Ni-NTA column (Qiagen), but this time the trimeric CNA35 was found in the flowthrough. The (18TD-CNA35)3 was dialyzed against 20 mM Tris HCl (pH 8), loaded onto a Q-Sepharose column (Cytiva), and eluted at ∼10–20 mM NaCl gradient. The fractions were pooled and run over the size-exclusion chromatography using the Superdex S200 Increase column (Cytiva) equilibrated with the coupling buffer (100 mM sodium carbonate [pH 8.8]).

Conjugation with Alexa Fluor 488 N-hydroxysuccinimide ester (Thermo Fisher Scientific) was performed at 2 mg/ml protein concentration at 4°C overnight using 1 mg of dye/10 mg of protein. The labeled protein was separated from unreacted dye by desalting into PBS buffer. The stock solution was adjusted to 1 mg/ml. The working dilution for staining was 1:1000.

Fresh-frozen nurse shark tissues in Tissue-Tek OCT compound (Sakura) were cryosectioned at 7-μm thickness and briefly fixed in cold acetone. Slides could be stored at −80°C for several days. Slides were thawed, rehydrated in PBS for 5 min, and made permeable in PBS supplemented with 0.05% Tween 20 (PBST). Slides were blocked with 10% FCS in PBS for 45 min at 4°C, then incubated with mAb supernatant for 5 min at 4°C. After washing with PBST, slides were incubated with the secondary Ab, goat anti-mouse IgG–Alexa Fluor 488, for 45 min at 4°C, washed, and mounted with ProLong gold with DAPI mounting media. Images were taken with a Nikon Eclipse E800 and analyzed with Mosaic software.

For double immunofluorescence staining with CNA and mAb, kidney sections (7 μm) were mounted on glass slides, air dried for 15–30 min at room temperature, and fixed in acetone at −20°C for 10 min. Sections were washed for 5 min with 50 mM Tris/150 mM NaCl/0.1% Tween 20 buffer (TBST). Blocking was performed with 10% goat serum for 1 h (Invitrogen, 50062Z) followed by incubation with primary anti-UrIg mAb O-1 (1:500 dilution in 1% goat serum TBST, Fig. 7) overnight in a cold room. Following three 15-min washes with TBST, secondary Ab conjugated with Alexa Fluor 568 and CNA conjugated with Alexa Fluor 488 were incubated on sections for 1.5 h at room temperature. Sections were then washed and mounted with antifade mounting solution with DAPI. Images were taken with a Nikon Eclipse Ti microscope and analyzed with GIMP software.

Nurse shark tissues were dissected from animals in SPBS and minced into small pieces. Approximately 500 mg of tissue was dissociated with the frosted ends of microscope slides in 5 ml of 2% Nonidet P-40/PBS lysis buffer containing protease inhibitors. Lysates were kept on ice for 30 min and then were cleared of nucleic acid and debris by centrifugation at 600 × g for 15 min. Then, 30 μl of lysates was mixed with an equal volume of Laemmli sample buffer and subjected to SDS-PAGE. Gels were transferred to polyvinylidene difluoride as described (38) and cut into strips for incubation with 500 μl of mAb supernatant or antisera at 1:1000. Bands were revealed with a Vectastain ELITE ABC kit (Novus Biologicals) using precipitable diaminobenzidine as substrate.

AlphaFold-Multimer (39) was used to prepare a structural model for the full UrIg sequence. The relaxed predicted structure with highest confidence was then submitted to the GLYCAM-Web server (40) for addition of carbohydrate for N-linked glycan sites. Any modeled glycans with steric clashes were then adjusted manually to avoid clashing using Coot (41).

Cartilaginous fish databases were scanned for MHC-linked genes in the National Center for Biotechnology Information Web site. One gene in the MHC class III region of catshark (Scyliorhinus canicula) was annotated as a V region of the Igκ L chain (LOC119976294), but upon inspection it had a VJ exon followed by a C1 exon. The gene is encoded near the cluster of TNF genes, not far from other MHC-linked genes encoding VJ domains, the NK receptor NCR3 (33). Using this gene fragment, we searched other shark genomic and transcriptomic databases and found the ortholog in various shark genomes as well as full-length transcriptomes (GIWU01140354 and GIWU01140356) from a nurse shark liver database. In most cartilaginous fish species, the gene is composed of a leader segment, one VJ exon, followed by three C domain exons (Fig. 1A–D). Upon examination of genomic sequences including nurse shark (position 341176–354343 in JAHRHZ010000005 (Supplemental Fig. 1)), we found that the leader exon is “split” as is found for all Ig/TCR V genes (a short leader exon followed by an intron and then a short leader segment 5′ of the VJ exon, Fig. 1A), and each IgSF domain is encoded by a separate exon (Fig. 1B–D, Supplemental Fig. 1). No transmembrane exons were found in the genome, nor were any transcripts found with a transmembrane region. Furthermore, there is no secretory tail at the end of the C3 exon and thus no identifiable cryptic splice site that would allow for alternative splicing, as is found for all vertebrate H chain mRNA (42). Besides the VC1C2C3 mRNA transcript, there is a shorter transcript in which the C2 domain exon is spliced out (Supplemental Fig. 2). In all species in which larger genomic contigs were available, the gene mapped to the class III region of MHC (Fig. 1E).

FIGURE 1.

Basic features of shark UrIg. (A) UrIg is an ancient VJ domain. The UrIg VJ domain is aligned to nurse shark Ag receptor VJ domains: four types of TCR (α, β, γ, δ), IgM, IgW, IgNAR, and the three types of L chains (κ, λ, σ); the germline-joined Hydrolagus IgM V (Hyco-gjH) and single domain Hydrolagus IgM V (Hyco-SD) are also aligned. Numbering and strand positions are based on the nurse shark UrIg mature protein sequence and AlphaFold prediction. Accession numbers for sequences are as follows: nurse shark: UrIg (GIWU01140354.1), IgM (AAA50817), IgW (AAB08972), IgNAR (AAB42621), Igλ (sequence taken from Igκ [L16765]), Igσ (EF114759), TCRα (ADW95866), TCRβ (ADW95893), TCRδ (ADW95945), and TCRγ (QNT23252); and spotted ratfish (Hydrolagus colliei): Hyco-SD (HHC1; sequence taken from Ref. 63) and Hyco-gjV (AAC12891). Conserved amino acids in VJ domains are shown below the figure. Residues that interact between V domains of bona fide Ig are bolded below the figure and in the IgM, IgW, and Hydrolagus V sequences. The potential glycosylation site in UrIg is underlined. CDR1, CDR2, and CDR3 are shaded. Only the CDR3 for UrIg is shown because the other CDR3s for Ig/TCR are somatically generated. The one free cysteine (Cys-113) in UrIg is shaded red. The arrow marking the leader segment indicates the intron of the “split leader.” Dashes indicate gaps in amino acid sequences. (B) UrIg C domains bear classical C1 domain features found in Ig/TCR/MHC. The three C domains of UrIg are noted with the conserved residues in blue (especially FYP and C-V-H). Strand positions are based on the AlphaFold prediction. Potential glycosylation sites are underlined. (C) Schematic model of the UrIg monomer. Potential glycosylation sites are shown as lollipops. (D) Genomic (intron/exon) organization of nurse shark UrIg. Genomic DNA (position 341176–354343 in JAHRHZ010000005) was aligned against a full-length transcript GIWU01140354 using Splign. Each colored box is a coding region, and the untranslated regions (UTRs) are in an open box. (E) UrIg maps to the shark MHC in the class III region. UrIg is shown in blue; the other VJ gene in the MHC, NCR3, is shown in green; class I/II is shown light peach; other genes including class III genes are in open boxes. Mapping is not to scale, but simply shows relative locations. Wider boxes indicate clusters of multigenes.

FIGURE 1.

Basic features of shark UrIg. (A) UrIg is an ancient VJ domain. The UrIg VJ domain is aligned to nurse shark Ag receptor VJ domains: four types of TCR (α, β, γ, δ), IgM, IgW, IgNAR, and the three types of L chains (κ, λ, σ); the germline-joined Hydrolagus IgM V (Hyco-gjH) and single domain Hydrolagus IgM V (Hyco-SD) are also aligned. Numbering and strand positions are based on the nurse shark UrIg mature protein sequence and AlphaFold prediction. Accession numbers for sequences are as follows: nurse shark: UrIg (GIWU01140354.1), IgM (AAA50817), IgW (AAB08972), IgNAR (AAB42621), Igλ (sequence taken from Igκ [L16765]), Igσ (EF114759), TCRα (ADW95866), TCRβ (ADW95893), TCRδ (ADW95945), and TCRγ (QNT23252); and spotted ratfish (Hydrolagus colliei): Hyco-SD (HHC1; sequence taken from Ref. 63) and Hyco-gjV (AAC12891). Conserved amino acids in VJ domains are shown below the figure. Residues that interact between V domains of bona fide Ig are bolded below the figure and in the IgM, IgW, and Hydrolagus V sequences. The potential glycosylation site in UrIg is underlined. CDR1, CDR2, and CDR3 are shaded. Only the CDR3 for UrIg is shown because the other CDR3s for Ig/TCR are somatically generated. The one free cysteine (Cys-113) in UrIg is shaded red. The arrow marking the leader segment indicates the intron of the “split leader.” Dashes indicate gaps in amino acid sequences. (B) UrIg C domains bear classical C1 domain features found in Ig/TCR/MHC. The three C domains of UrIg are noted with the conserved residues in blue (especially FYP and C-V-H). Strand positions are based on the AlphaFold prediction. Potential glycosylation sites are underlined. (C) Schematic model of the UrIg monomer. Potential glycosylation sites are shown as lollipops. (D) Genomic (intron/exon) organization of nurse shark UrIg. Genomic DNA (position 341176–354343 in JAHRHZ010000005) was aligned against a full-length transcript GIWU01140354 using Splign. Each colored box is a coding region, and the untranslated regions (UTRs) are in an open box. (E) UrIg maps to the shark MHC in the class III region. UrIg is shown in blue; the other VJ gene in the MHC, NCR3, is shown in green; class I/II is shown light peach; other genes including class III genes are in open boxes. Mapping is not to scale, but simply shows relative locations. Wider boxes indicate clusters of multigenes.

Close modal

The VJ domain bears the classic YYC in the F strand and GX(C)G and LTVK motifs in the G strand (Fig. 1A). Residues that interact between VH and VL are not well conserved in the UrIg V domain, suggesting that the V-UrIg may not form a closed dimer (bold residues below the alignment and in the IgM/IgW sequences in Fig. 1A). All three C domains are card-carrying C1 IgSF domains bearing the cardinal FYP and C-V-H motifs described in the Introduction (Fig. 1B). The overall four-domain structure suggests that this molecule is a type of Ig and therefore we named it UrIg (Ig-original). Unlike Ig/TCR, there is no cysteine in the A strand of the first C domain to make a disulfide bond with an L chain cysteine (also true of IgNAR [43]), so if the molecule associates with L chains it must occur via noncovalent bonding. There is only one free cysteine in the entire molecule, found in the VJ domain GXG motif in the G strand, which may form a disulfide bond between UrIg H chains (Fig. 1A, Cys-113 in red). The molecule has six potential N-linked glycosylation sites (underlined in Fig. 1A, 1B). From these sequence analyses we predicted that UrIg forms a disulfide-linked homodimer, perhaps with free V domains as for IgNAR and camelid IgG (discussed below) (44). This speculation is tested below with recombinant UrIg and with UrIg found in nurse shark tissues.

The entire UrIg protein sequence is only 30% identical to other shark IgH genes such as IgM (11, 14), IgNAR (19), and IgW (45–47), showing that UrIg diverged from other Ag receptors long ago. Domain-by-domain bioinformatics (e.g., BLAST) searches showed that the VJ domain is just as related to Vs of IgL chains, TCRs, and IgH chains with no clear orthology, which is confirmed by the phylogenetic tree (Fig. 2A). The UrIg VJ domain is more like Ig/TCR V domains than the other VJ gene encoded in the class III region of the MHC, NKp30 (encoding gene: NCR3), which forms the root of the VJ tree (33, 48).

FIGURE 2.

Phylogenetic tree analyses of UrIg domains demonstrate the ancient origin of UrIg. A neighbor-joining phylogenetic tree was performed with 500 bootstrap runs. The original tree is shown. Scale bars indicate the number of amino acid changes per site. (A) Tree of VJ domains from nurse shark Ig/TCR and NKp30. UrIg VJ from two highly divergent elasmobranchs, that is, nurse shark, Ginglymostoma cirratum (Gici), and little skate, Leucoraja erinacea (Leer), are included. Ratfish (Hydrolagus colliei) IgM (Hyco-SD) and germline-joined IgM (Hyco-gH) are included. CDR3s were removed from the sequences to construct the tree. (B) Relationships of UrIg C domains to nurse shark Ig/TCR (and Hydrolagus IgM) C domains. Positions of the three UrIg C domains are shown with arrows. Because clustering was more informative, we did not root these trees. Additional accession numbers for these analyses are as follows: V: nurse shark IgS′ (Sp) (AAV34678), nurse shark NCR3 genes (MT914154–914161); C: Hyco-IgM (AAC12891), TCRα (ADW95866; JAHRHZ010000115.1), TCRβ (ADW95903), TCRδ (ADW95942), TCRγ (ADW95913).

FIGURE 2.

Phylogenetic tree analyses of UrIg domains demonstrate the ancient origin of UrIg. A neighbor-joining phylogenetic tree was performed with 500 bootstrap runs. The original tree is shown. Scale bars indicate the number of amino acid changes per site. (A) Tree of VJ domains from nurse shark Ig/TCR and NKp30. UrIg VJ from two highly divergent elasmobranchs, that is, nurse shark, Ginglymostoma cirratum (Gici), and little skate, Leucoraja erinacea (Leer), are included. Ratfish (Hydrolagus colliei) IgM (Hyco-SD) and germline-joined IgM (Hyco-gH) are included. CDR3s were removed from the sequences to construct the tree. (B) Relationships of UrIg C domains to nurse shark Ig/TCR (and Hydrolagus IgM) C domains. Positions of the three UrIg C domains are shown with arrows. Because clustering was more informative, we did not root these trees. Additional accession numbers for these analyses are as follows: V: nurse shark IgS′ (Sp) (AAV34678), nurse shark NCR3 genes (MT914154–914161); C: Hyco-IgM (AAC12891), TCRα (ADW95866; JAHRHZ010000115.1), TCRβ (ADW95903), TCRδ (ADW95942), TCRγ (ADW95913).

Close modal

Phylogenetic trees of the C domains (Fig. 2B) showed that the UrIg C1 domain weakly clusters with (and may be ancestral to) the IgM C1 domains of nurse shark and the holocephalan Hydrolagus as well as the nurse shark IgW C1 domain. Interestingly, the UrIg C2 domain clustered weakly (and also may be ancestral to) the C-terminal domains of IgM (C4), IgNAR (C5), and IgW (C6). The UrIg C3 domain was not closely related to any IgH/IgL/TCR domain cluster. In summary, although the domains of UrIg are clearly VJ and C1 domains similar to those seen in Ag receptors, there is no high relatedness to any known Ig or TCR. Furthermore, although the bootstrap values are very low, a weak case can be made that UrIg domains are older than all other cartilaginous fish (and thus all other vertebrate) Ig/TCR domains.

We briefly point out other features of the C domain tree, which were noted previously but now are definitive, perhaps reinforced with the addition of the UrIg domains. The relationships between the IgNAR, IgW, and IgM C domains (45, 47), as well as duplication by convergent evolution of the C1 domains from C2 domains in IgNAR (19) and the H chain–only class of Hydrolagus IgM (15, 20), are robustly confirmed by this new tree.

Nurse shark UrIg was used as bait to search for the gene in all other cartilaginous fish databases and was detected in all elasmobranchs (sharks and rays) (Fig. 3). The alignment demonstrated that UrIg is highly conserved in all species (60–97% overall; 70–95% VJ exon), reaching back at least 300 million years. Note that the VJ domain is more conserved than the C domains, and that “CDR3” has been particularly well conserved during 300 million years of evolution. Such conservation suggests that UrIg is under strong negative selection, perhaps for binding to an evolutionarily conserved epitope. These characteristics are clearly unlike bona fide Igs and even other IgSF molecules and suggest that UrIg serves an innate immune or even a nonimmune function in sharks.

FIGURE 3.

UrIg evolves conservatively among different cartilaginous fish. We identified cartilaginous fish UrIg from various databases. Although most of them are gene models (i.e., predicted genes), Clustal Omega multiple sequence alignment (https://www.ebi.ac.uk) revealed high conservation to the nurse shark UrIg transcripts. Dashes and dots indicate absence and identical amino acid sequences compared with nurse shark UrIg, respectively. CDR, glycosylation sites, and Cys-113 are underlined. Species abbreviations and accession numbers (when available) are: Caca, great white shark, Carcharodon carcharias (NC_054485.1)*; Chpl, whitespotted bamboo shark, Chiloscyllium plagiosum (GCC18650.1); Chpu, brownbanded bamboo shark, Chiloscyllium punctatum (BEZZ01003013); Gici, nurse shark, Ginglymostoma cirratum (GIWU01140354.1, GIWU01140356.1); Heze, zebra bullhead shark, Heterodontus zebra (GGGL01499922.1); Leer, little skate, Leucoraja erinacea (JAMYXN010000071.1)*; Leuoc, winter skate, Leucoraja ocellata (GEZH01007499+GEZH01041997); Rhty, whale shark, Rhincodon typus (XP_048472529.1); Scca, small-spotted catshark, Scyliorhinus canicula (NC_052158+CACTIS020018579)*; Sqac, spiny dogfish, Squalus acanthias (HAGU01063221.1); Teca, Pacific electric ray, Tetronarce californica (GFBV01010368). An asterisk (*) indicates that sequences were manually curated from the genome assembly and/or transcripts databases.

FIGURE 3.

UrIg evolves conservatively among different cartilaginous fish. We identified cartilaginous fish UrIg from various databases. Although most of them are gene models (i.e., predicted genes), Clustal Omega multiple sequence alignment (https://www.ebi.ac.uk) revealed high conservation to the nurse shark UrIg transcripts. Dashes and dots indicate absence and identical amino acid sequences compared with nurse shark UrIg, respectively. CDR, glycosylation sites, and Cys-113 are underlined. Species abbreviations and accession numbers (when available) are: Caca, great white shark, Carcharodon carcharias (NC_054485.1)*; Chpl, whitespotted bamboo shark, Chiloscyllium plagiosum (GCC18650.1); Chpu, brownbanded bamboo shark, Chiloscyllium punctatum (BEZZ01003013); Gici, nurse shark, Ginglymostoma cirratum (GIWU01140354.1, GIWU01140356.1); Heze, zebra bullhead shark, Heterodontus zebra (GGGL01499922.1); Leer, little skate, Leucoraja erinacea (JAMYXN010000071.1)*; Leuoc, winter skate, Leucoraja ocellata (GEZH01007499+GEZH01041997); Rhty, whale shark, Rhincodon typus (XP_048472529.1); Scca, small-spotted catshark, Scyliorhinus canicula (NC_052158+CACTIS020018579)*; Sqac, spiny dogfish, Squalus acanthias (HAGU01063221.1); Teca, Pacific electric ray, Tetronarce californica (GFBV01010368). An asterisk (*) indicates that sequences were manually curated from the genome assembly and/or transcripts databases.

Close modal

Igs are expressed at high levels in the spleen and epigonal of cartilaginous fish, and at lower levels in the gut, liver, and kidney (49, 50). TCRs are also expressed highly in the spleen and thymus, as in all other vertebrates. In contrast, UrIg is expressed at high levels in the liver and somewhat lower levels in the kidney, with no or little mRNA expression in other tissues, including immune organs (Fig. 4). It is not known whether the gene is expressed in hepatocytes or in infiltrating liver leukocytes.

FIGURE 4.

UrIg is expressed in the liver and kidney. cDNA generated from nurse shark “NJ” total RNA was used for PCR amplification of UrIg. Primers were cast at the 5′ of the C1 domain and 3′ of the C3 domain to amplify the 792-bp C regions (arrow marked with C1–C3). These primers also amplify the C2-less short, spliced form (C2-less) expressed at very low levels. The uniform band at the bottom of the gel is presumably primer dimers, which also appear in the control (NTC, nontemplate control) lane. NDPK was used as a PCR control because it is nearly ubiquitously expressed. Size marker: phiX174 HaeIII.

FIGURE 4.

UrIg is expressed in the liver and kidney. cDNA generated from nurse shark “NJ” total RNA was used for PCR amplification of UrIg. Primers were cast at the 5′ of the C1 domain and 3′ of the C3 domain to amplify the 792-bp C regions (arrow marked with C1–C3). These primers also amplify the C2-less short, spliced form (C2-less) expressed at very low levels. The uniform band at the bottom of the gel is presumably primer dimers, which also appear in the control (NTC, nontemplate control) lane. NDPK was used as a PCR control because it is nearly ubiquitously expressed. Size marker: phiX174 HaeIII.

Close modal

Bona fide Igs are dimers of IgH/IgL dimers. The H chains are disulfide bonded to each other and to L chains. There is, however, only one free cysteine in UrIg, in the G strand of the VJ domain (Fig. 1A, Cys-113), suggesting that H chain dimers form but there is no disulfide-bonded L chain. AlphaFold (relaxed) predicted the structure of UrIg with highest confidence placed the V-V domains so that a disulfide bond would be formed between the Cys residues 113 and 113′ (marked red in Fig. 5A). In contrast, if the V-V dimer domains were modeled as a standard Fab or L chain dimer of V domains, the free Cys would be on the distal sides of that dimer, so it is unlikely that the V-V will pair like a typical Fab. The AlphaFold model of the C2–C3 region predicts an Fc-like (CH2–CH3) arrangement of the domains, and the N-linked glycan site at position 291 in UrIg (penultimate sugar in Fig. 1C) is very close to that of position 297 in a human IgG Fc, where the equivalent glycan is attached. In mammals this glycan is crucial for IgG effector functions (51). Thus, we predict that the UrIg C2–C3 domains will pair similar to mammalian IgG C2–C3. The other glycans (underlined in Fig. 1A, 1B) are modeled to be exposed to the solvent, most conspicuously at the unique hinge region between the VJ and C1 domain; this sugar likely protects UrIg from protease attack (note that UrIg has a hinge-like region at the N terminus of the C1 domain [APSVSPPL]).

FIGURE 5.

UrIg AlphaFold protein models. (A) Complete dimeric molecule. Glycans were added to the model with GLYCAM-Web followed by manual adjustments to avoid steric clashes. The predicted disulfide bond at position Cys-113 in the V domain is shown with a yellow schematic. (B) V domain with CDR side chains indicated. The model of the UrIg V domain is shown with CDR-like loops colored yellow (CDR1), blue (CDR2). and magenta (CDR3). Side chains are shown for the unpaired Cys in the G strand and putative N-linked Asn in the AB loop.

FIGURE 5.

UrIg AlphaFold protein models. (A) Complete dimeric molecule. Glycans were added to the model with GLYCAM-Web followed by manual adjustments to avoid steric clashes. The predicted disulfide bond at position Cys-113 in the V domain is shown with a yellow schematic. (B) V domain with CDR side chains indicated. The model of the UrIg V domain is shown with CDR-like loops colored yellow (CDR1), blue (CDR2). and magenta (CDR3). Side chains are shown for the unpaired Cys in the G strand and putative N-linked Asn in the AB loop.

Close modal

The V domain was modeled as well, with amino acid side chains on the CDR (Fig. 5B). Note the large number of aromatic residues in the loops that are modeled to interact (two Phes in CDR3, Trp in CDR2, three Tyrs). We emphasize that this is a model and we are not certain whether the CDR3 points up or folds over and perhaps buries those aromatic residues.

The UrIg four-domain H chain, with a C-terminal His-tag, was produced in mammalian expi293F cells. That UrIg was secreted readily into the supernatant suggested that it folded correctly and likely does not have to associate with another partner, for example, Ig L chains. The calculated protein molecular mass for this construct was 50,730 g/mol. There are six potential N-linked glycosylation sites per monomer (one in V, three in C1, two in C2, Fig. 1A–C), suggesting an approximate molecular mass increase between 9,000 and 12,500, depending on the occupancy and type of carbohydrate at each site. Thus, we expected a monomer to have a molecular mass of ∼60,000–63,500 and a dimer of ∼120,000–127,000. As mentioned, there is one Cys in the V domain (residue 113, sequential numbering) that modeling predicts to be surface exposed, and thus the possibility of dimerization via this residue. SDS-PAGE of purified UrIg showed a band slightly larger than 100,000 for the nonreduced sample and between 50,000 and 75,000 for the reduced sample (Fig. 6A). The nonreduced sample eluted in size-exclusion chromatography was close to the molecular mass standard, γ-globulin (∼158,000) (Fig. 6B). Thus, the secreted, recombinant UrIg forms a disulfide-linked dimer.

FIGURE 6.

Recombinant UrIg forms a secreted dimer. (A) Coomassie-stained gel with molecular mass markers in Kd (Bio-Rad, Precision Plus dual color), followed by: lane 1, media after depletion by Ni Excel resin (nonreducing); lane 2, high-salt wash of Ni Excel resin before elution (nonreducing); lane 3, eluted UrIg (nonreducing); lane 4, eluted UrIg (reducing). (B) Size-exclusion chromatography (Superdex 200, 16/60) of UrIg after affinity purification. UrIg is shown in blue, molecular mass standards are shown in red, with milliliters on the x-axis and milli-absorbance units (mAu) on the y-axis. Elution positions (milliliters) are shown at the top of each standard peak. UrIg (blue) runs at the approximate weight expected for a dimer. Molecular mass standards (red) (Bio-Rad) are as follows: vitamin B12, 100.9 ml, 1,350 Da; myoglobin, 83.7 ml, 17,000 Da; OVA, 71.9 ml, 44,000 Da; γ-globulin, 56.8 ml, 158,000 Da; thyroglobulin, 42.8 ml, 330,000 Da.

FIGURE 6.

Recombinant UrIg forms a secreted dimer. (A) Coomassie-stained gel with molecular mass markers in Kd (Bio-Rad, Precision Plus dual color), followed by: lane 1, media after depletion by Ni Excel resin (nonreducing); lane 2, high-salt wash of Ni Excel resin before elution (nonreducing); lane 3, eluted UrIg (nonreducing); lane 4, eluted UrIg (reducing). (B) Size-exclusion chromatography (Superdex 200, 16/60) of UrIg after affinity purification. UrIg is shown in blue, molecular mass standards are shown in red, with milliliters on the x-axis and milli-absorbance units (mAu) on the y-axis. Elution positions (milliliters) are shown at the top of each standard peak. UrIg (blue) runs at the approximate weight expected for a dimer. Molecular mass standards (red) (Bio-Rad) are as follows: vitamin B12, 100.9 ml, 1,350 Da; myoglobin, 83.7 ml, 17,000 Da; OVA, 71.9 ml, 44,000 Da; γ-globulin, 56.8 ml, 158,000 Da; thyroglobulin, 42.8 ml, 330,000 Da.

Close modal

The purified recombinant UrIg was used to immunize mice to produce antisera and mAbs. A panel of mAbs was generated and tested on the recombinant protein by ELISA. The antisera and several of the mAbs recognized the recombinant UrIg molecule at ∼120 kDa under nonreducing conditions by Western blotting, and the antisera, but not the mAbs, also recognized the reduced UrIg at 60 kDa (Fig. 7).

FIGURE 7.

UrIg mAb specificity. (A and B) Western blotting test of UrIg-specific mAbs and antisera tested on recombinant UrIg protein under reducing (A) and nonreducing (B) conditions. The last lane on each blot is the UrIg protein stained by Coomassie Blue. All of the mAbs react with UrIg by ELISA (native), but only a subset (O-1, O-2, O-3, O-8, O-9, O-11, O-13) reacts with denatured and native UrIg. Antisera (from two mice) reacted with monomeric and dimeric UrIg, whereas the mAbs reacted only with the dimer.

FIGURE 7.

UrIg mAb specificity. (A and B) Western blotting test of UrIg-specific mAbs and antisera tested on recombinant UrIg protein under reducing (A) and nonreducing (B) conditions. The last lane on each blot is the UrIg protein stained by Coomassie Blue. All of the mAbs react with UrIg by ELISA (native), but only a subset (O-1, O-2, O-3, O-8, O-9, O-11, O-13) reacts with denatured and native UrIg. Antisera (from two mice) reacted with monomeric and dimeric UrIg, whereas the mAbs reacted only with the dimer.

Close modal

Tissue lysates from all nurse shark tissues were prepared and tested for the presence of UrIg. Although mRNA expression is mostly limited to the liver (Fig. 4), we were able to detect UrIg protein in many tissues under nonreducing conditions at 120 kDa (Fig. 8). Interestingly, in the five brain tissues, that is, telencephalon (cerebrum), diencephalon (optic lobes), mesencephalon (midbrain), metencephalon (cerebellum), and myelencephalon (medulla oblongata), UrIg was found predominantly as a monomer.

FIGURE 8.

UrIg is present in many nurse shark tissues. Western blotting analysis with mouse antisera on 14 shark tissues demonstrated that UrIg protein is found as a dimer in most tissues. Gel of denatured shark lysates was run under nonreducing conditions. Note that UrIg is predominantly a monomer in the five regions of shark brain: cerebrum, optic lobe, midbrain, cerebellum, medulla oblongata. WBCs were from the buffy coat of centrifuged blood.

FIGURE 8.

UrIg is present in many nurse shark tissues. Western blotting analysis with mouse antisera on 14 shark tissues demonstrated that UrIg protein is found as a dimer in most tissues. Gel of denatured shark lysates was run under nonreducing conditions. Note that UrIg is predominantly a monomer in the five regions of shark brain: cerebrum, optic lobe, midbrain, cerebellum, medulla oblongata. WBCs were from the buffy coat of centrifuged blood.

Close modal

It was a puzzle that UrIg is expressed as mRNA by the liver, but the protein can easily be detected in many tissues by Western blotting (Fig. 8). We tested the exact tissue localization by immunohistochemistry of the spleen and found that the mAbs and antisera stained in a “stick-like” pattern around the splenic white pulp and around the vasculature, as expected of staining of the extracellular matrix (ECM) (Fig. 9A). A mAb specific for IgNAR was used to show the staining of lymphocytes in the white pulp (Fig. 9B). Staining of other tissues on the Western blot such as gill (Fig. 9D) and others (not shown) also showed stick-like staining of defined areas. To test whether the mAbs truly stained ECM, we costained kidney sections with an UrIg-specific mAb O-1 (in red, see Western blot for mAb O-1 reactivity in Fig. 7) and a reagent that detects collagen (CNA in green).

FIGURE 9.

UrIg is found in the extracellular matrix of nurse shark tissues. (AC) Staining of spleen. (A) Immunofluorescence staining of nurse shark spleen with the UrIg-specific mAb O-1 staining (in green) of extracellular matrix surrounding the white pulp in a “stick-like” fashion. Central arteriole of the white pulp is shown with an arrow. Merged images of mAb staining (green) and DAPI (blue). (B) IgNAR-specific mAb showing staining of lymphocytes in the splenic white pulp and scattered plasma cells in the red pulp. (C) Spleen with no primary Ab. Merged images of mAb staining (green) and DAPI (blue) are shown. Arrows point to central arteries. (D and E) Staining of gill with UrIg-specific mAb O-1 (D) and no primary Ab (E). UrIg staining is mainly at the insertion sites of the gill filaments and weak staining of the filaments themselves. Merged images of mAb staining (green) and DAPI (blue) are shown. (FH) Immunofluorescence staining of renal sections. (F) mAb UrIg staining alone shows extracellular localization of UrIg outside of tubules and blood vessels. Blue DAPI staining shows tubular nuclei. (G and H) UrIg/collagen double stains. To visualize collagen, matrix (18TD-CNA35)3 conjugated with Alexa Fluor 488 that was used, which binds to any triple collagen helix. Double immunofluorescence staining for UrIg (red) and collagen (green). UrIg rod-like parallel structures are cross-linking collagen layers. Original magnifications, ×600 (F and G) and ×800 (H). Scale bar in (G), 50 mm.

FIGURE 9.

UrIg is found in the extracellular matrix of nurse shark tissues. (AC) Staining of spleen. (A) Immunofluorescence staining of nurse shark spleen with the UrIg-specific mAb O-1 staining (in green) of extracellular matrix surrounding the white pulp in a “stick-like” fashion. Central arteriole of the white pulp is shown with an arrow. Merged images of mAb staining (green) and DAPI (blue). (B) IgNAR-specific mAb showing staining of lymphocytes in the splenic white pulp and scattered plasma cells in the red pulp. (C) Spleen with no primary Ab. Merged images of mAb staining (green) and DAPI (blue) are shown. Arrows point to central arteries. (D and E) Staining of gill with UrIg-specific mAb O-1 (D) and no primary Ab (E). UrIg staining is mainly at the insertion sites of the gill filaments and weak staining of the filaments themselves. Merged images of mAb staining (green) and DAPI (blue) are shown. (FH) Immunofluorescence staining of renal sections. (F) mAb UrIg staining alone shows extracellular localization of UrIg outside of tubules and blood vessels. Blue DAPI staining shows tubular nuclei. (G and H) UrIg/collagen double stains. To visualize collagen, matrix (18TD-CNA35)3 conjugated with Alexa Fluor 488 that was used, which binds to any triple collagen helix. Double immunofluorescence staining for UrIg (red) and collagen (green). UrIg rod-like parallel structures are cross-linking collagen layers. Original magnifications, ×600 (F and G) and ×800 (H). Scale bar in (G), 50 mm.

Close modal

Most UrIg staining was observed in the anterior and middle portions of kidneys surrounding tubules (Fig. 9F). No staining was detected in the glomeruli (not shown). Staining was extracellular surrounding tubules and apparently forming rod-like structures with an orientation perpendicular to the tubular cell membrane. To visualize collagen matrix, we developed a trimeric form of a natural collagen binding protein (CNA35) conjugated to a fluorescent dye (see Materials and Methods). CNA35 monomer itself was demonstrated to have much better specific binding to collagen than existing fluorescent techniques currently used for collagen visualization (52, 53). Generation of multivalent CNA35 dendrimers (2–4 mers) by native chemical ligation remarkably enhanced the affinity and attenuated the dissociation kinetics (54). Inspired by these multivalent CNA35 probes, we designed our own system for recombinant production of trimeric CNA35 in bacteria. The CNA35 sequence was genetically fused to a trimerization domain of collagen XVIII (18TD), which has a picomolar trimerization potential (55). The resulting hybrid protein molecule (18TD-CNA35)3 was conjugated to Alexa Fluor 488 and used for costaining of tissues. The UrIg (red) is seen “stitching” layered collagen structures in shark kidneys (Fig. 9F–H). Note that in all tissues the UrIg is found associated with collagen, but in defined areas that require further study.

The data, taken together, suggest that the UrIg expressed by the liver is transported by the blood or by other cells (note that PBLs [WBCs] were Western blot positive; Fig. 8) throughout the body and deposited in the ECM.

The UrIg sequence suggested that the molecule is an IgH chain that made a disulfide-linked dimer via a unique S-S bond and is not associated with Ig L chains. This prediction was borne out in the production of the recombinant molecule and by Western blotting of shark tissues. An AlphaFold model predicts that the three C1 domains form a structure highly reminiscent of IgG, even at the level of the canonical glycan in the UrIg (and IgGH) C2 domain that provides flexibility to the Fc region and is vital for effector function (51).

As mentioned in the Introduction, cartilaginous fish IgH genes are in the so-called “cluster organization” with multiple V-D-D-J-C genes that undergo rearrangement within a cluster during B cell ontogeny (11, 56). The phenomenon of germline joining is the apparent consequence of RAG activity in germ cells that can initiate rearrangement within these clusters (14–17). Most of the clusters having such germline joining that have been detected result in pseudogenes, but there are two examples of functional IgH and IgL genes generated by this mechanism in the nurse shark, and one example in Hydrolagus. In all cases, it is clear that the process was “recent,” as the joined genes are closely related to non–germline-joined clusters; furthermore, in the two nurse shark joins there are few bases in the CDR3-encoding regions suggestive of little TdT activity in the germline-joined events in germ cells. UrIg, in contrast, is highly conserved in all elasmobranchs, going back 300 million years (Fig. 3). The CDR3 of UrIg is quite long and relatively hydrophobic. Furthermore, the C domains are not closely related to any of the shark Ig C regions of IgM, IgNAR, or IgW, unlike the three germline-joined cartilaginous fish Ig genes (Fig. 2). Thus, although it is possible that there was a germline-joining event prior to the emergence of cartilaginous fish (or elasmobranchs), it is more likely that UrIg emergence preceded the development of rearranging Ig/TCR; this hypothesis is also suggested by the trees, in which the UrIg C domains seem ancestral. Using the same logic, it is unlikely that UrIg resulted from a reverse-transcribed IgH gene (retroposon) that inserted into the genome (also the transcript would have had to retain all of the introns).

While the modeling of the C domains was relatively straightforward, this was not true of the VJ domains. We think it most likely that the Cys-113 in the center of the Gly-X-Gly motif of the G strand, almost unique among VJ domains (27), provides the disulfide bond covalently linking the UrIg chains. However, do the UrIg V domains join together similar to VH/VL to make a conventional binding site or do the two domains bind Ag independently, similar to IgNAR (43)? The conserved amino acid residues known to interact between VH/VL and TCRα/β to facilitate dimer formation are not well conserved in the UrIg VJ, but the AlphaFold model does suggest a V-V association. Further studies of the natural UrIg will allow discrimination of these two possibilities.

UrIg is transcribed in the liver yet is found as protein in ECM throughout the shark body (Figs. 8, 9). First, is UrIg produced by hepatocytes or infiltrating hematopoietic cells? We should be able to address this question with in situ hybridization studies in combination with immunohistochemistry using existing mAbs. So far, in situ hybridization has not been sensitive enough to detect the UrIg-expressing cells in the liver, perhaps because of overall low expression by hepatocytes. Second, it will be a challenge to understand how UrIg is transported to the ECM throughout the body, yet with existing reagents we should be able to address this question in the future as well. Lastly, and most importantly, what is the function of UrIg in the ECM? Does it perform an innate immune function such as acting as a secreted pattern recognition receptor, or could it have an ECM supporting function? Based on its MHC linkage in the class III region, we propose that it does have an innate immune function; such a function would also make sense if UrIg (or an UrIg-like molecule) were co-opted to function in the adaptive immune system after the RAG transposon invasion.

Based on the presence of VJ and C1 domains in the MHC and MHC paralogous regions, Du Pasquier et al. (26) suggested that Ig/TCR emerged from the MHC or MHC precursor. Subsequently, other groups, including our own, also provided evidence for such a scenario (27, 30–32). UrIg is encoded in the class III region of the MHC, near to the TNF gene cluster (57) and close to other VJ single-exon genes such as NCR3 (Fig. 1E). The MHC class III region is well known for the presence of genes involved in innate immunity and inflammatory responses, and immune genes in the class III region clearly were present in the proto-MHC prior to the emergence of adaptive immunity (28, 58, 59). Thus, we think it plausible that UrIg is related to the VJ gene that was invaded by the RAG transposon, perhaps most related. Although UrIg is clearly Ig-like in overall structure, the VJ domain is not more related to VH, VL, TCRα, or TCRβ, suggesting that it is either highly derived, or it emerged before the split of Ig and TCR. To date, we have not detected UrIg in agnathans or deuterostomes that arose from ancestors prior to the genome-wide duplications that occurred in vertebrates (30). There are, however, VJ domains present in agnathans and invertebrates, yet little is known about their genetic history or functions (26, 60–62). Studies of these molecules, and further studies of the structure and ligand identification of UrIg, will be crucial in piecing together the origins of adaptive immunity.

Note added in proof

While our article was being revised, another article reported the same shark sequence, Kondo et al. (64).

The authors have no financial conflicts of interest.

We thank Erik Cruz for technical support and Marc Elslinger for help with the AlphaFold modeling.

This work was supported by National Institute of Allergy and Infectious Diseases Grants R01AI140326 and R01AI170844.

The online version of this article contains supplemental material.

AID

activation-induced cytidine deaminase

CNA

collagen adhesin

ECM

extracellular matrix

IgSF

Ig superfamily

SPBS

shark-modified PBS

1
Flajnik
,
M. F.
2018
.
A cold-blooded view of adaptive immunity
.
Nat. Rev. Immunol.
18
:
438
453
.
2
Pishesha
,
N.
,
T. J.
Harmand
,
H. L.
Ploegh
.
2022
.
A guide to antigen processing and presentation
.
Nat. Rev. Immunol.
22
:
751
764
.
3
Davis
,
M. M.
,
P. J.
Bjorkman
.
1988
.
T-cell antigen receptor genes and T-cell recognition. [Published erratum appears in 1988 Nature 335: 744.]
Nature
334
:
395
402
.
4
Kaufman
,
J.
2018
.
Unfinished business: evolution of the MHC and the adaptive immune system of jawed vertebrates
.
Annu. Rev. Immunol.
36
:
383
409
.
5
Litman
,
G. W.
,
J. P.
Rast
,
S. D.
Fugmann
.
2010
.
The origins of vertebrate adaptive immunity
.
Nat. Rev. Immunol.
10
:
543
553
.
6
Flajnik
,
M. F.
2014
.
Re-evaluation of the immunological Big Bang
.
Curr. Biol.
24
:
R1060
R1065
.
7
Good
,
R. A.
,
B. W.
Papermaster
.
1964
.
Ontogeny and phylogeny of adaptive immunity
.
Adv. Immunol.
27
:
1
115
.
8
Zhu
,
C.
,
V.
Lee
,
A.
Finn
,
K.
Senger
,
A. A.
Zarrin
,
L.
Du Pasquier
,
E.
Hsu
.
2012
.
Origin of immunoglobulin isotype switching
.
Curr. Biol.
22
:
872
880
.
9
Flajnik
,
M. F.
2002
.
Comparative analyses of immunoglobulin genes: surprises and portents
.
Nat. Rev. Immunol.
2
:
688
698
.
10
Neely
,
H. R.
,
M. F.
Flajnik
.
2016
.
Emergence and evolution of secondary lymphoid organs
.
Annu. Rev. Cell Dev. Biol.
32
:
693
711
.
11
Hinds
,
K. R.
,
G. W.
Litman
.
1986
.
Major reorganization of immunoglobulin VH segmental elements during vertebrate evolution
.
Nature
320
:
546
549
.
12
Ohta
,
Y.
,
T.
Shiina
,
R. L.
Lohr
,
K.
Hosomichi
,
T. I.
Pollin
,
E. J.
Heist
,
S.
Suzuki
,
H.
Inoko
,
M. F.
Flajnik
.
2011
.
Primordial linkage of β2-microglobulin to the MHC
.
J. Immunol.
186
:
3563
3571
.
13
Ott
,
J. A.
,
C. D.
Castro
,
T. C.
Deiss
,
Y.
Ohta
,
M. F.
Flajnik
,
M. F.
Criscitiello
.
2018
.
Somatic hypermutation of T cell receptor α chain contributes to selection in nurse shark thymus
.
eLife
7
:
e28477
.
14
Kokubu
,
F.
,
K.
Hinds
,
R.
Litman
,
M. J.
Shamblott
,
G. W.
Litman
.
1988
.
Complete structure and organization of immunoglobulin heavy chain constant region genes in a phylogenetically primitive vertebrate
.
EMBO J.
7
:
1979
1988
.
15
Rast
,
J. P.
,
C. T.
Amemiya
,
R. T.
Litman
,
S. J.
Strong
,
G. W.
Litman
.
1998
.
Distinct patterns of IgH structure and organization in a divergent lineage of chrondrichthyan fishes
.
Immunogenetics
47
:
234
245
.
16
Lee
,
S. S.
,
D.
Fitch
,
M. F.
Flajnik
,
E.
Hsu
.
2000
.
Rearrangement of immunoglobulin genes in shark germ cells
.
J. Exp. Med.
191
:
1637
1648
.
17
Rumfelt
,
L. L.
,
D.
Avila
,
M.
Diaz
,
S.
Bartl
,
E. C.
McKinney
,
M. F.
Flajnik
.
2001
.
A shark antibody heavy chain encoded by a nonsomatically rearranged VDJ is preferentially expressed in early development and is convergent with mammalian IgG
.
Proc. Natl. Acad. Sci. USA
98
:
1775
1780
.
18
Kokubu
,
F.
,
R.
Litman
,
M. J.
Shamblott
,
K.
Hinds
,
G. W.
Litman
.
1988
.
Diverse organization of immunoglobulin VH gene loci in a primitive vertebrate
.
EMBO J.
7
:
3413
3422
.
19
Greenberg
,
A. S.
,
D.
Avila
,
M.
Hughes
,
A.
Hughes
,
E. C.
McKinney
,
M. F.
Flajnik
.
1995
.
A new antigen receptor gene family that undergoes rearrangement and extensive somatic diversification in sharks
.
Nature
374
:
168
173
.
20
Anumukonda
,
K.
,
M.
Francis
,
P.
Currie
,
F.
Tulenko
,
E.
Hsu
.
2022
.
Heavy chain-only antibody genes in fish evolved to generate unique CDR3 repertoire
.
Eur. J. Immunol.
52
:
247
260
.
21
Criscitiello
,
M. F.
,
M.
Saltis
,
M. F.
Flajnik
.
2006
.
An evolutionarily mobile antigen receptor variable region gene: doubly rearranging NAR-TcR genes in sharks
.
Proc. Natl. Acad. Sci. USA
103
:
5036
5041
.
22
Ott
,
J. A.
,
Y.
Ohta
,
M. F.
Flajnik
,
M. F.
Criscitiello
.
2021
.
Lost structural and functional inter-relationships between Ig and TCR loci in mammals revealed in sharks
.
Immunogenetics
73
:
17
33
.
23
Sakano
,
H.
,
K.
Hüppi
,
G.
Heinrich
,
S.
Tonegawa
.
1979
.
Sequences at the somatic recombination sites of immunoglobulin light-chain genes
.
Nature
280
:
288
294
.
24
Huang
,
S.
,
X.
Tao
,
S.
Yuan
,
Y.
Zhang
,
P.
Li
,
H. A.
Beilinson
,
Y.
Zhang
,
W.
Yu
,
P.
Pontarotti
,
H.
Escriva
, et al
.
2016
.
Discovery of an active RAG transposon illuminates the origins of V(D)J recombination
.
Cell
166
:
102
114
.
25
Flajnik
,
M. F.
2016
.
Evidence of G.O.D.’s miracle: unearthing a RAG transposon
.
Cell
166
:
11
12
.
26
Du Pasquier
,
L.
,
I.
Zucchetti
,
R.
De Santis
.
2004
.
Immunoglobulin superfamily receptors in protochordates: before RAG time
.
Immunol. Rev.
198
:
233
248
.
27
Dornburg
,
A.
,
J. A.
Yoder
.
2022
.
On the relationship between extant innate immune receptors and the evolutionary origins of jawed vertebrate adaptive immunity
.
Immunogenetics
74
:
111
128
.
28
Zucchetti
,
I.
,
R.
De Santis
,
S.
Grusea
,
P.
Pontarotti
,
L.
Du Pasquier
.
2009
.
Origin and evolution of the vertebrate leukocyte receptors: the lesson from tunicates
.
Immunogenetics
61
:
463
481
.
29
Williams
,
A. F.
,
A. N.
Barclay
.
1988
.
The immunoglobulin superfamily—domains for cell surface recognition
.
Annu. Rev. Immunol.
6
:
381
405
.
30
Flajnik
,
M. F.
,
M.
Kasahara
.
2010
.
Origin and evolution of the adaptive immune system: genetic events and selective pressures
.
Nat. Rev. Genet.
11
:
47
59
.
31
Fu
,
Y.
,
Z.
Yang
,
J.
Huang
,
X.
Cheng
,
X.
Wang
,
S.
Yang
,
L.
Ren
,
Z.
Lian
,
H.
Han
,
Y.
Zhao
.
2019
.
Identification of two nonrearranging IgSF genes in chicken reveals a novel family of putative remnants of an antigen receptor precursor
.
J. Immunol.
202
:
1992
2004
.
32
Ohta
,
Y.
,
M.
Kasahara
,
T. D.
O’Connor
,
M. F.
Flajnik
.
2019
.
Inferring the “primordial immune complex”: origins of MHC class I and antigen receptors revealed by comparative genomics
.
J. Immunol.
203
:
1882
1896
.
33
Kinlein
,
A.
,
M. E.
Janes
,
J.
Kincer
,
T.
Almeida
,
H.
Matz
,
J.
Sui
,
M. F.
Criscitiello
,
M. F.
Flajnik
,
Y.
Ohta
.
2021
.
Analysis of shark NCR3 family genes reveals primordial features of vertebrate NKp30
.
Immunogenetics
73
:
333
348
.
34
Kumar
,
S.
,
G.
Stecher
,
M.
Li
,
C.
Knyaz
,
K.
Tamura
.
2018
.
MEGA X: Molecular Evolutionary Genetics Analysis across computing platforms
.
Mol. Biol. Evol.
35
:
1547
1549
.
35
Flajnik
,
M. F.
,
S.
Ferrone
,
N.
Cohen
,
L.
Du Pasquier
.
1990
.
Evolution of the MHC: antigenicity and unusual tissue distribution of Xenopus (frog) class II molecules
.
Mol. Immunol.
27
:
451
462
.
36
Boudko
,
S. P.
,
T.
Sasaki
,
J.
Engel
,
T. F.
Lerch
,
J.
Nix
,
M. S.
Chapman
,
H. P.
Bächinger
.
2009
.
Crystal structure of human collagen XVIII trimerization domain: a novel collagen trimerization Fold
.
J. Mol. Biol.
392
:
787
802
.
37
Boudko
,
S. P.
,
K. D.
Zientek
,
J.
Vance
,
J. L.
Hacker
,
J.
Engel
,
H. P.
Bächinger
.
2010
.
The NC2 domain of collagen IX provides chain selection and heterotrimerization
.
J. Biol. Chem.
285
:
23721
23731
.
38
Flajnik
,
M. F.
,
E.
Taylor
,
C.
Canel
,
D.
Grossberger
,
L.
Du Pasquier
.
2015
.
Reagents specific for MHC class I antigens of Xenopus
.
Am. Zool.
31
:
580
591
.
39
Evans
,
R.
,
M.
O’Neill
,
A.
Pritzel
,
N.
Antropova
,
A.
Senior
,
T.
Green
,
A.
Žídek
,
R.
Bates
,
S.
Blackwell
,
J.
Yim
, et al
.
2022
.
Protein complex prediction with AlphaFold-Multimer
.
bioRxiv
:
2021.2010.2004.463034
.
40
Kirschner
,
K. N.
,
A. B.
Yongye
,
S. M.
Tschampel
,
J.
González-Outeiriño
,
C. R.
Daniels
,
B. L.
Foley
,
R. J.
Woods
.
2008
.
GLYCAM06: a generalizable biomolecular force field. Carbohydrates
.
J. Comput. Chem.
29
:
622
655
.
41
Emsley
,
P.
,
B.
Lohkamp
,
W. G.
Scott
,
K.
Cowtan
.
2010
.
Features and development of Coot
.
Acta Crystallogr. D Biol. Crystallogr.
66
:
486
501
.
42
Early
,
P.
,
J.
Rogers
,
M.
Davis
,
K.
Calame
,
M.
Bond
,
R.
Wall
,
L.
Hood
.
1980
.
Two mRNAs can be produced from a single immunoglobulin μ gene by alternative RNA processing pathways
.
Cell
20
:
313
319
.
43
Roux
,
K. H.
,
A. S.
Greenberg
,
L.
Greene
,
L.
Strelets
,
D.
Avila
,
E. C.
McKinney
,
M. F.
Flajnik
.
1998
.
Structural analysis of the nurse shark (new) antigen receptor (NAR): molecular convergence of NAR and unusual mammalian immunoglobulins
.
Proc. Natl. Acad. Sci. USA
95
:
11804
11809
.
44
Flajnik
,
M. F.
,
N.
Deschacht
,
S.
Muyldermans
.
2011
.
A case of convergence: why did a simple alternative to canonical antibodies arise in sharks and camels?
PLoS Biol.
9
:
e1001120
.
45
Greenberg
,
A. S.
,
A. L.
Hughes
,
J.
Guo
,
D.
Avila
,
E. C.
McKinney
,
M. F.
Flajnik
.
1996
.
A novel “chimeric” antibody class in cartilaginous fish: IgM may not be the primordial immunoglobulin
.
Eur. J. Immunol.
26
:
1123
1129
.
46
Berstein
,
R. M.
,
S. F.
Schluter
,
S.
Shen
,
J. J.
Marchalonis
.
1996
.
A new high molecular weight immunoglobulin class from the carcharhine shark: implications for the properties of the primordial immunoglobulin
.
Proc. Natl. Acad. Sci. USA
93
:
3289
3293
.
47
Anderson
,
M. K.
,
S. J.
Strong
,
R. T.
Litman
,
C. A.
Luer
,
C. T.
Amemiya
,
J. P.
Rast
,
G. W.
Litman
.
1999
.
A long form of the skate IgX gene exhibits a striking resemblance to the new shark IgW and IgNARC genes
.
Immunogenetics
49
:
56
67
.
48
Flajnik
,
M. F.
,
T.
Tlapakova
,
M. F.
Criscitiello
,
V.
Krylov
,
Y.
Ohta
.
2012
.
Evolution of the B7 family: co-evolution of B7H6 and NKp30, identification of a new B7 family member, B7H7, and of B7’s historical relationship with the MHC
.
Immunogenetics
64
:
571
590
.
49
Rumfelt
,
L. L.
,
M.
Diaz
,
R. L.
Lohr
,
E.
Mochon
,
M. F.
Flajnik
.
2004
.
Unprecedented multiplicity of Ig transmembrane and secretory mRNA forms in the cartilaginous fish
.
J. Immunol.
173
:
1129
1139
.
50
Miracle
,
A. L.
,
M. K.
Anderson
,
R. T.
Litman
,
C. J.
Walsh
,
C. A.
Luer
,
E. V.
Rothenberg
,
G. W.
Litman
.
2001
.
Complex expression patterns of lymphocyte-specific genes during the development of cartilaginous fish implicate unique lymphoid tissues in generating an immune repertoire
.
Int. Immunol.
13
:
567
580
.
51
Bournazos
,
S.
,
J. V.
Ravetch
.
2017
.
Fcγ receptor function and the design of vaccination strategies
.
Immunity
47
:
224
233
.
52
Krahn
,
K. N.
,
C. V.
Bouten
,
S.
van Tuijl
,
M. A.
van Zandvoort
,
M.
Merkx
.
2006
.
Fluorescently labeled collagen binding proteins allow specific visualization of collagen in tissues and live cell culture
.
Anal. Biochem.
350
:
177
185
.
53
Boerboom
,
R. A.
,
K. N.
Krahn
,
R. T.
Megens
,
M. A.
van Zandvoort
,
M.
Merkx
,
C. V.
Bouten
.
2007
.
High resolution imaging of collagen organisation and synthesis using a versatile collagen specific probe
.
J. Struct. Biol.
159
:
392
399
.
54
Breurken
,
M.
,
E. H.
Lempens
,
E. W.
Meijer
,
M.
Merkx
.
2011
.
Semi-synthesis of a protease-activatable collagen targeting probe
.
Chem. Commun. (Camb.)
47
:
7998
8000
.
55
Annunen
,
S.
,
J.
Körkkö
,
M.
Czarny
,
M. L.
Warman
,
H. G.
Brunner
,
H.
Kääriäinen
,
J. B.
Mulliken
,
L.
Tranebjaerg
,
D. G.
Brooks
,
G. F.
Cox
, et al
.
1999
.
Splicing mutations of 54-bp exons in the COL11A1 gene cause Marshall syndrome, but other mutations cause overlapping Marshall/Stickler phenotypes
.
Am. J. Hum. Genet.
65
:
974
983
.
56
Malecek
,
K.
,
V.
Lee
,
W.
Feng
,
J. L.
Huang
,
M. F.
Flajnik
,
Y.
Ohta
,
E.
Hsu
.
2008
.
Immunoglobulin heavy chain exclusion in the shark
.
PLoS Biol.
6
:
e157
.
57
Redmond
,
A. K.
,
R.
Pettinello
,
F. K.
Bakke
,
H.
Dooley
.
2022
.
Sharks provide evidence for a highly complex TNFSF repertoire in the jawed vertebrate ancestor
.
J. Immunol.
209
:
1713
1723
.
58
Abi-Rached
,
L.
,
A.
Gilles
,
T.
Shiina
,
P.
Pontarotti
,
H.
Inoko
.
2002
.
Evidence of en bloc duplication in vertebrate genomes
.
Nat. Genet.
31
:
100
105
.
59
Suurväli
,
J.
,
L.
Jouneau
,
D.
Thépot
,
S.
Grusea
,
P.
Pontarotti
,
L.
Du Pasquier
,
S.
Rüütel Boudinot
,
P.
Boudinot
.
2014
.
The proto-MHC of placozoans, a region specialized in cellular stress and ubiquitination/proteasome pathways
.
J. Immunol.
193
:
2891
2901
.
60
Chen
,
R.
,
L.
Zhang
,
J.
Qi
,
N.
Zhang
,
L.
Zhang
,
S.
Yao
,
Y.
Wu
,
B.
Jiang
,
Z.
Wang
,
H.
Yuan
, et al
.
2018
.
Discovery and analysis of invertebrate IgVJ-C2 structure from amphioxus provides insight into the evolution of the Ig superfamily
.
J. Immunol.
200
:
2869
2881
.
61
Pancer
,
Z.
,
W. E.
Mayer
,
J.
Klein
,
M. D.
Cooper
.
2004
.
Prototypic T cell receptor and CD4-like coreceptor are expressed by lymphocytes in the agnathan sea lamprey
.
Proc. Natl. Acad. Sci. USA
101
:
13273
13278
.
62
Haruta
,
C.
,
T.
Suzuki
,
M.
Kasahara
.
2006
.
Variable domains in hagfish: NICIR is a polymorphic multigene family expressed preferentially in leukocytes and is related to lamprey TCR-like
.
Immunogenetics
58
:
216
225
.
63
Lee
,
S. S.
,
D.
Tranchina
,
Y.
Ohta
,
M. F.
Flajnik
,
E.
Hsu
.
2002
.
Hypermutation in shark immunoglobulin light chain genes results in contiguous substitutions
.
Immunity
16
:
571
582
.
64
Kondo
,
H.
,
H.
Kamei
,
F.
Murotani
,
R.
Nozaki
,
R.
Yazawa
,
R.
Tani
,
K.
Koiwai
, and
I.
Hirono
.
2023
.
A novel liver-specific immunoglobulin heavy chain-like gene in a cartilaginous fish
.
Fish Shellfish Immunol. Rep.
17
:
100102
.

Supplementary data