Note to users. If you're seeing this message, it means that your browser cannot find this page's style/presentation instructions -- or possibly that you are using a browser that does not support current Web standards. Find out more about why this message is appearing, and what you can do to make your experience of our site the best it can be.
Diversity and Function of Adaptive Immune Receptors in a Jawless Vertebrate
Matthew N. Alder,1Igor B. Rogozin,2Lakshminarayan M. Iyer,2Galina V. Glazko,3Max D. Cooper,1Zeev Pancer4*
Instead of the immunoglobulin-type antigen receptors of jawedvertebrates, jawless fish have variable lymphocyte receptors(VLRs), which consist of leucine-rich repeat (LRR) modules.Somatic diversification of the VLR gene is shown here to occurthrough a multistep assembly of LRR modules randomly selectedfrom a large bank of flanking cassettes. The predicted concavesurface of the VLR is lined with hypervariable positively selectedresidues, and computational analysis suggests a repertoire ofabout 1014 unique receptors. Lamprey immunized with anthraxspores responded with the production of soluble antigen-specificVLRs. These findings reveal that two strikingly different modesof antigen recognition through rearranged lymphocyte receptorshave evolved in the jawless and jawed vertebrates.
1 Howard Hughes Medical Institute, Departments of Medicine, Microbiology, Pediatrics, and Pathology, University of Alabama at Birmingham, Birmingham, AL 35294, USA. 2 National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA. 3 Stowers Institute for Medical Research, 1000 East 50th Street, Kansas City, MO 64110, USA. 4 Center of Marine Biotechnology, University of Maryland Biotechnology Institute, Baltimore, MD 21202, USA.
* To whom correspondence should be addressed. E-mail: pancer{at}comb.umbi.umd.edu.
An adaptive immune system based on lymphocytes bearing clonallydiverse antigen-specific receptors first appeared at the dawnof vertebrate evolution 500 million years ago. Within less than40 million years in the Cambrian, both jawless and jawed vertebratesevolved mechanisms of lymphocyte receptor diversification thatwere radically different. Thus, jawed vertebrates rearrangeimmunoglobulin and T cell receptor (TCR) variable, diverse,and joining gene segments (VDJs) to generate highly diverserepertoires of T and B lymphocyte antigen receptors (1, 2).In contrast, lamprey and hagfish, jawless fish representativesof the oldest vertebrate taxon, assemble their VLRs from modularLRR units (3, 4). In the lamprey, a single incomplete germlineVLR gene generates a diverse repertoire of cell surface receptorsthrough somatic rearrangement of LRR cassettes that flank thegene. Each lymphocyte thus assembles a VLR gene of unique sequence.Hagfish have two germline VLR genes, called VLR-A and VLR-B,that can generate equivalently diverse receptor repertoires(4). On the basis of the existence of a sizable repertoire ofdiverse lymphocyte receptors, we hypothesized that VLRs mayserve as jawless fish equivalents of the anticipatory antigenreceptors of jawed vertebrates.
The potential diversity of lamprey VLRs was estimated by analysisof 517 unique VLR sequences, including 129 previously reportedsequences (3) and 388 new sequences derived mostly from animalsimmunized with the Bacillus anthracis spore coat (5). Analysisof the aligned VLR diversity regions revealed mixed clustersof sequences, with no exclusive clustering of VLRs from animalsimmunostimulated with particular antigens. The alignment wasthen converted into a matrix consisting of the individual typesof constituent LRR modules (Fig. 1A). This included the 30 to38 residue N-terminal LRR (LRRNT), 18-residue first LRR (LRR1),24-residue LRRs (LRRVs), 13-residue connecting peptide (CP),and 48- to 65-residue C-terminal LRR (LRRCT). Noting that theterminal 24-residue LRR module adjacent to the CP had a distinctsequence signature in 98% of the cases (fig. S1) (5), we designatedthis as the LRRV-end (LRRVe).
Fig. 1. Lamprey VLR diversity and gene rearrangement intermediates. (A) VLR scheme: signal peptide (SP), LRRNT, first LRR1, variable LRRV, end LRRVe, CP, and LRRCT (see text). Germline VLR-encoded portions of LRRNT and LRRCT are hatched. (B) Germline VLR gene rearrangement intermediates. Examples of LRR modules inserted from flanking cassettes into the germline gene: extensions of the VLR gene 5' LRRNT (F.1 + R.1 amplicons); replacements and extensions of the VLR gene 5' LRRCT (F.1 + R.2); and extensions of the VLR gene 3 ' LRRCT (F.2 + R.3). Most insertions terminate with an incomplete LRR. Position of forward (F) and reverse (R) primers indicated; black, cDNA clones; red, genomic clones; red line in Int.36 indicates a 78-nucleotide noncoding DNA flanking the LRRVe. (C) 3D model of VLR diversity region. Positively selected solvent-exposed residues on the concave surface are represented by colored spheres: red, LRRNT; yellow, LRR1; blue, LRRV; white, LRRVe; green, ß strands; magenta, helices.
[View Larger Version of this Image (66K GIF file)]
The data set was screened for repetitive occurrence of eachtype of LRR module, singly or as recurring pairs (Tables 1 and2). Most pairs of adjoining LRRVs or LRRVe's were only observedonce, but in some cases, repetitious pairs of LRRNT-LRR1 andCP-LRRCT were identified. These may represent VLRs that wereassembled from multimodule genomic cassettes, such as one LRR1-LRRV-LRRVtriplet previously identified in the VLR locus (3), or VLRsselected for certain structural conformations. However, 94%of the LRRNT-LRR1 and CP-LRRCT pairs are either unique or consistof the same pair of adjoining modules occurring three timesor less in the VLR data set, and the pairing occurrence followsa random Poisson distribution (6). Most hagfish VLR-A moduleswere also found in random combinations (n = 139; tables S1 andS2), whereas the VLR-B sample (n = 70) was too small for reliableanalysis. The potential diversity of the VLR repertoire wastherefore calculated by considering individual LRR modules asindependent recombination units. For the lamprey, we predicta potential repertoire of up to 1014 unique VLRs and up to 1017for the hagfish VLR-A (5).
Table 1. Distribution of unique and repeated adjoining pairs of LRR modules among 517 unique VLR sequences (the modules are shown in Fig. 1A).
Number of pairs
Adjoining pairs of LRR modules
LRRNT LRR1
LRR1 LRRV
LRRV LRRV
LRRV LRRVe
LRRVe CP
CP LRRCT
1
287
390
270
388
449
308
2
22
5
3
16
29
3
7
2
3
8
4
4
1
4
5
5
8
6
1
7
2
1
8
2
2
9
2
1
10
2
11
1
1
13
1
20
1
21
1
22
1
1
Table 2. Different LRR modules and those found only once in adjoining pairs among 517 unique VLR sequences. The distribution of LRRV modules per transcript is shown separately.
Different LRR modules and uniquely paired combinations
The number of LRR cassettes flanking the germline VLR gene isunknown. Thus far, 32 unique germline LRR modules have beenidentified in the partially sequenced lamprey VLR locus (3),and only 15 of these were identical to one of the 1568 modulesfrom the VLR data set. To estimate the number of LRRV modulesin flanking cassettes at the VLR locus, we used Monte Carlosimulations to predict at 95% confidence level an upper boundestimate of 1500 lamprey LRRVs and 2400 LRRVs for the hagfishVLR-A (5). These data suggest that the rearrangement processthat yields mature VLR genes occurs by random selection of eachmodule type from a large pool of genomic LRR modules.
The lamprey germline VLR gene of 13 kb consists of three codingregions separated by two intervening sequences: (i) the signalpeptide and 5' portion of LRRNT, (ii) the 5' portion of LRRCT,and (iii) the 3' portion of LRRCT and the stalk region (Fig. 1B)(3). Previously, we identified germline VLR transcriptsfrom lamprey and hagfish lymphocytes, which indicated VLR genetranscription before or during the rearrangement process (4).We therefore preferentially cloned those rare cDNA ampliconsthat retained portions of the intervening sequences. For thiswe used polymerase chain reaction (PCR) primer combinations,wherein one annealed within an intervening region and the otherin a coding portion, followed by selection for amplicons shorterthan the length expected for germline VLR transcripts. Ampliconsgenerated from genomic DNA (gDNA) were also analyzed. Among37 unique rearrangement intermediates, we identified cloneswith large DNA deletions at different locations in the interveningregions (nine cDNA clones, two gDNA). In addition to deletions,some clones revealed modular LRR insertions within the germlineVLR (24 cDNA clones, 2 gDNA). Deletion of coding portions ofthe germline gene were observed in five cDNA clones, as in no.36 where the germline 5' LRRCT is missing. In other cases, thegerm lineencoded 5' LRRCT was replaced with unique 5'LRRCT from the flanking cassettes (four cDNA clones, two gDNA).The insertions in two clones included noncoding DNA, as shownfor the 78-nucleotide insertion flanking the terminal LRR modulein no. 36. All of the LRR modules were inserted in-frame withgerm lineencoded elements, but in most cases, the insertionsterminated with an incomplete LRR module (92%). Insertion ofthe LRRV modules into the germline VLR occurs through multipleindependent events as indicated by (i) the variable numbersof LRRVs in the rearrangement intermediates and as many as eightin some of the VLR transcripts, whereas only singlet or doubletLRRV cassettes have been identified in the VLR locus; (ii) therarity of repetitive adjoining LRRV modules (Tables 1 and 2);and (iii) the random Poisson distribution of the number of LRRVmodules per transcript (table S5). Among the 32 LRR modulesidentified in the intermediate clones, only 4 matched any ofthe 32 known germline modules in the VLR locus. Consensus sequencesthat could mediate rearrangement of the LRR cassettes via recombinaseactivity were not found. The mechanism for the stepwise VLRrearrangement process remains unknown, but the final maturationstep into functional VLR genes may involve recombination betweenthe ends of the partially rearranged germline gene, therebyeliminating the remaining intervening sequences and any incompletemodules.
A hallmark of genes undergoing positive Darwinian selectionis the prevalence of codons with nonsynonymous nucleotide substitutions(Ka), which alter the encoded residue, over codons with synonymoussubstitutions (Ks). For instance, multiple alleles of the polymorphicmajor histocompatability complex antigen-presenting moleculesdiffer by only a few positively selected residues located inthe diverse antigen-presentation sites (7). In B lymphocytes,however, somatic hypermutation of immunoglobulin genes followedby a selection stage can also result in prevalence of nonsynonymousmutations. We therefore analyzed the distribution of nucleotidesubstitutions in all the related VLR sequences of identicallength that differ by 1 to 21 nucleotides (n = 20; two tripletsand seven pairs). In most cases, the substitutions clustereddiscretely in one or more of the LRR modules in a nonrandomdistribution (P < 0.01) (8). Only in one case were "mutations"randomly scattered throughout the VLR diversity region (P =0.37). Hence, the presence of one or more unique LRR modulesdistinguishes most of the VLR sequences, indicating that somatichypermutation is not a significant contributing factor in VLRdiversification. This conclusion is supported by the findingof recurring identical LRR modules among VLRs collected fromdifferent animals (Table 2) and by the observation that scaffoldresidues in the LRR modules are highly conserved, for example,10 out of 24 residues are invariant in 90 to 100% of the LRRVemodules (fig. S1).
To identify regions in the VLR that may be undergoing positiveselection, we used a three-dimensional (3D) model of the lampreyVLR (Fig. 1C) to predict the position of solvent-exposed andburied residues in the VLR. The residues in each VLR were thendivided into three categories: (i) solvent-exposed residueson the concave VLR surface; (ii) solvent-exposed residues elsewhere;and (iii) buried residues. Analysis of nucleotide substitutionrevealed a rate significantly higher for nonsynonymous substitutionsonly in the concave VLR surface. A concentration of nonsynonymoussubstitutions was also found on the concave surface of hagfishVLR-A and VLR-B (Table 3; fig. S2). The invariant scaffold residueswithin each LRR module are interspersed with hypervariable sites(fig. S1), which indicates that some of these sites may be underpositive selection (7, 9, 10). Positive selection can be distinguishedby the ratio of Ka to Ks substitutions: a ratio >1 indicatespositive selection, a ratio <1 indicates purifying selection,and a near 1 ratio indicates neutral evolution (9).
Table 3. Average Ks and Ka among solvent-exposed and buried residues of the lamprey VLR (n = 517), hagfish VLR-A (n = 139), and hagfish VLR-B (n = 70). A ratio of Ka/Ks >1 indicates positive selection; Ka/Ks < 1 indicates purifying selection; and Ka/Ks 1 indicates neutral evolution. For Ks and Ka, standard error in parentheses.
Site class
Ks
Ka
Mode of selection
Lamprey VLR
Exposed residues on concave VLR surface
0.28 (0.03)
0.44 (0.05)
Positive selection (Z = 2.61, p = 0.004)
Exposed residues elsewhere on VLR surface
0.25 (0.02)
0.21 (0.03)
Neutral evolution (Z = 1.55, p = 0.12)
Buried residues
0.21 (0.02)
0.12 (0.02)
Purifying selection (Z = 3.43, p = 0.001)
Hagfish VLR-A
Exposed residues on concave VLR surface
0.37 (0.05)
0.53 (0.05)
Positive selection (Z = 3.63, p < 0.001)
Exposed residues elsewhere on VLR surface
0.26 (0.03)
0.29 (0.03)
Neutral evolution (Z = 0.37, p = 0.90)
Buried residues
0.25 (0.03)
0.10 (0.02)
Purifying selection (Z = 4.77, p < 0.001)
Hagfish VLR-B
Exposed residues on concave VLR surface
0.35 (0.04)
0.65 (0.02)
Positive selection (Z = 8.37, p < 0.001)
Exposed residues elsewhere on VLR surface
0.30 (0.03)
0.17 (0.03)
Purifying selection (Z = 3.75, p < 0.001)
Buried residues
0.32 (0.03)
0.09 (0.02)
Purifying selection (Z = 8.72, p < 0.001)
Using both maximum parsimony (11) and maximum likelihood (12,13) for independent calculations, we identified one to six sitesthat could be confidently considered as having been under positiveselection in all six module types, with the exception of thehagfish VLR-A LRRCT and VLR-B CP (tables S3 and S4). The positivelyselected sites predicted by both methods were mapped onto lampreyand hagfish VLR models (Fig. 1C; fig. S2). In each LRR moduletype, except for the CP, one to three of the positively selectedresidues are solvent exposed on strands of the central ßsheet that forms the concave surface of the VLR model, for example,codons 7 and 9 in lamprey LRRV (table S4). Another set of positivelyselected sites localize at one or both ends of the LRRNT andLRRCT. A conservative estimate of the combinatorial diversitythat can be generated by the positively selected solvent-exposedresidues on the concave VLR surface is 5 x 107 for the lamprey,7.1 x 1013 for the hagfish VLR-A, and 1.5 x 106 for VLR-B. Notably,in many LRR-containing proteins, the concave surface is theligand-binding interface (1419).
The remarkable diversity of the VLR repertoire suggested thatthese may serve as lymphocyte antigen receptors in lamprey immunity.To assess the VLR's role in antigen recognition, we injectedanimals with anthrax spore coat (exosporium) as a particulateimmunogen bearing an immunodominant antigen for mice, the collagen-likeBclA glycoprotein (20). We then examined cellular and humoralresponses after exosporia injections at weekly intervals. Flowcytometric analysis, using a VLR-specific antibody against theconserved stalk, indicated a dramatic increase in large lymphocytesamong the VLR-positive cells. Compared with unstimulated animals,the fraction of large VLR-positive lymphocytes increased duringthe 8-week stimulation period from 4 to 93% in the blood, from11 to 90% in the kidney, and from 7 to 76% in the typhlosole,the major hematopoietic tissue in larvae. Mitogenic activityof the exosporium may have induced the dramatic activation ofVLR-bearing lymphocytes, as in lamprey stimulated with a mixtureof antigens and mitogens (3). Plasma VLR concentrations in 8-weekimmunized animals were increased by 8- to 10-fold over preimmunizationlevels (5). An ELISA assay, used to measure levels of solubleanthrax-reactive VLR, revealed a progressive increase in sporerecognition over the immunization period (Fig. 2A). VLR specificitywas indicated by selective reactivity with B. anthracis versusB. subtilis spores, a related bacterium used as a control. BclAantigenspecific VLRs also increased in plasma samplesfrom immunized animals (Fig. 2B), and longer immunization periodsled to progressively higher levels of BclA-specific VLRs. Thesedata indicate that lampreys are capable of humoral responsesto anthrax exosporium by producing increasing levels of solubleBclA-specific VLRs.
Fig. 2. Antigen recognition by lamprey VLR. Immune responses after weekly injections of anthrax spore coats at 4, 6, and 8 weeks. (A) Plasma VLR reactivity with B. anthracis spores compared with B. subtilis (control); plasma dilution 1:200. (B) Plasma VLR recognition of the spore coat protein BclA; two individuals per time point; control, plasma from 8-week stimulated larva reacted with unrelated protein.
[View Larger Version of this Image (15K GIF file)]
In summary, our data indicate that jawless fish generate a verylarge repertoire of VLRs, comparable to the predicted diversityof 1014 mammalian antibody repertoire (21, 22). These repertoireswould clearly be sufficient to recognize a wide range of antigenicdeterminants, yet this remarkable extent of receptor diversityin both jawless and jawed vertebrates is intriguing given thatthe available repertoire is limited by the presence of lessthan 10 million lymphocytes in lamprey larvae and in jawed vertebraterepresentatives like the zebrafish (23). Apart from antibodies,TCRs, and VLRs, such a spectacularly complex repertoire hasonly been reported for the 1013 C-type lectin fold variantsin the receptor of the Bordetella bacteriophage (24).
Analysis of intermediates in the VLR gene assembly process indicatesa multistep mechanism for insertion of various LRR modules fromflanking cassettes into the framework germline gene. These areincorporated precisely in-frame with the coding regions in theincomplete VLR and in tandem with previously inserted LRR modules.The molecular machinery used in assembly of mature VLR genesis clearly an interesting arena for future investigation, andour prediction that an array of 1500 to 2400 diverse LRR modulesin agnathan genomes provides the primary source of VLR diversitywill be tested when the sea lamprey genome sequencing projectis completed.
Most important, the present studies indicate that lamprey canuse their VLRs for specific recognition of particulate and solubleprotein antigens in a humoral response. Within 4 weeks of anthraximmunization, soluble anthrax-specific VLRs were abundant inthe circulation, and these included VLRs that recognize theexosporium BclA protein. Our data thus strongly suggest convergentevolution of remarkably different strategies for generatinganticipatory lymphocyte receptors in jawless and jawed vertebrates.
20. C. Steichen, P. Chen, J. F. Kearney, C. L. Turnbough Jr., J. Bacteriol.185, 1903 (2003).[Abstract/Free Full Text]
21. T. Gojobori, M. Nei, Mol. Biol. Evol.3, 156 (1986).[Abstract]
22. C. A. Janeway Jr., P. Traver, M. Walport, M. J. Shlomchik, Immunobiology: The Immune System in Health and Disease (Garland Publishing, New York, 2001).
25. We thank C. L. Turnbough and J. F. Kearney labs for advice and anthrax reagents; L. Gartland for the VLR-specific antibody, G. L. Gartland for help with the FACS; L. Aravind, M. F. Flajnik, A. N. Haines, and M. Criscitiello for critical comments. I.B.R. and L.M.I. were supported by the NLM/NIH/DHHS Intramural Research Program; M.D.C. is a HHMI Investigator; Z.P. was funded by NSF-MCB-0317460; contribution 05-122 from COMB. Reported sequences were deposited in GenBank: DQ150997 to DQ151421.
Characterization of arrangement and expression of the T cell receptor {gamma} locus in the sandbar shark.
H. Chen, S. Kshirsagar, I. Jensen, K. Lau, R. Covarrubias, S. F. Schluter, and J. J. Marchalonis (2009)
PNAS
106, 8591-8596
|Abstract »|Full Text »|PDF »
The Sea Lamprey Petromyzon marinus: A Model for Evolutionary and Developmental Biology.
N. Nikitina, M. Bronner-Fraser, and T. Sauka-Spengler (2009)
CSH Protocols
2009, pdb.emo113
|Abstract »|Full Text »
Antigen Recognition by Variable Lymphocyte Receptors.
B. W. Han, B. R. Herrin, M. D. Cooper, and I. A. Wilson (2008)
Science
321, 1834-1837
|Abstract »|Full Text »|PDF »
Marine Invertebrate Genome Sequences and Our Evolving Understanding of Animal Immunity.
Structure and specificity of lamprey monoclonal antibodies.
B. R. Herrin, M. N. Alder, K. H. Roux, C. Sina, G. R. A. Ehrhardt, J. A. Boydston, C. L. Turnbough Jr, and M. D. Cooper (2008)
PNAS
105, 2040-2045
|Abstract »|Full Text »|PDF »
Structural Diversity of the Hagfish Variable Lymphocyte Receptors.
H. M. Kim, S. C. Oh, K. J. Lim, J. Kasamatsu, J. Y. Heo, B. S. Park, H. Lee, O. J. Yoo, M. Kasahara, and J.-O. Lee (2007)
J. Biol. Chem.
282, 6726-6732
|Abstract »|Full Text »|PDF »
Lamprey TLRs with Properties Distinct from Those of the Variable Lymphocyte Receptors.
A. Ishii, A. Matsuo, H. Sawa, T. Tsujita, K. Shida, M. Matsumoto, and T. Seya (2007)
J. Immunol.
178, 397-406
|Abstract »|Full Text »|PDF »
Fc Receptor Homolog 3 Is a Novel Immunoregulatory Marker of Marginal Zone and B1 B Cells.
W.-J. Won, J. B. Foote, M. R. Odom, J. Pan, J. F. Kearney, and R. S. Davis (2006)
J. Immunol.
177, 6815-6823
|Abstract »|Full Text »|PDF »
Genomic insights into the immune system of the sea urchin..
J. P. Rast, L. C. Smith, M. Loza-Coll, T. Hibino, and G. W. Litman (2006)
Science
314, 952-956
|Abstract »|Full Text »|PDF »
Unexpected diversity displayed in cDNAs expressed by the immune cells of the purple sea urchin, Strongylocentrotus purpuratus.
D. P. Terwilliger, K. M. Buckley, D. Mehta, P. G. Moorjani, and L.C. Smith (2006)
Physiol Genomics
26, 134-144
|Abstract »|Full Text »|PDF »