Abstract
Ab structure prediction has made great strides, but accurately modeling CDR H3 loops remains elusive. Unlike the other five CDR loops, CDR H3 does not adopt canonical conformations and must be modeled de novo. During Antibody Modeling Assessment II, we found that biasing simulations toward kinked conformations enables generating low–root mean square deviation models (Weitzner et al. 2014. Proteins 82: 1611–1623), and since then, we have presented new geometric parameters defining the kink conformation (Weitzner et al. 2015. Structure 23: 302–311). In this study, we use these parameters to develop a new biasing constraint. When applied to a benchmark set of high-quality CDR H3 loops, the average minimum root mean square deviation sampled is 0.93 Å, compared with 1.34 Å without the constraint. We then test the performance of the constrained de novo method for homology modeling and rigid-body docking and present the results for 1) the Antibody Modeling Assessment II targets, 2) the 2009 RosettaAntibody benchmark set, and 3) the high-quality set.
Introduction
The adaptive immune system in vertebrates is capable of raising Abs against a countless number of Ags. Additionally, engineered Abs form various sources are used as therapeutic molecules (1, 2) and biosensors (3–5). To optimize specific modes of interactions, rational engineering techniques must be developed. Rational engineering of Abs requires accurate structural models, but crystallization is not always practical or even possible. Additionally, expressing a large library of mutants to assess the energetic implications of specific mutations is time consuming, resource intensive, and, in some cases, technically challenging. Computational methods, namely Ab homology modeling, are poised to enable the realization of rational design.
In addition to biotechnology applications, advances in next-generation sequencing techniques (6–8) have made it possible to routinely elucidate the sequences of 104–105 Abs per individual. Such a vast and complete snapshot of an individual’s Ab repertoire is ripe for extracting an unprecedented amount of immunological information (9). However, sequence analysis and structural analysis will need to be used in concert to produce a fully formed view. The sheer number of sequences necessitates the use of computational structure prediction methods.
Our goal is to develop methods to improve Ab structure prediction. Generating accurate structural models is critical for producing inputs for additional structure-based simulations such as docking (10), as well as for computational prediction of binding affinities and dissociation free energies (11, 12). The modeling tool we use, develop, and study in this paper is RosettaAntibody.
RosettaAntibody’s approach to modeling (13) is to break the structure into eight distinct structural components: the H and L chain frameworks, CDR loops L1–3, and CDR loops H1–3. Because the non-H3 CDR loops adopt canonical conformations (14, 15), accurate backbone conformations for them can usually be found in known structures. RosettaAntibody exploits this by selecting templates from curated structural databases using BLAST (16) bit score for CDRs L1–3, H1, and H2 and the framework regions. Each structural component is defined such that they have overlapping residues that can then be superposed to create a grafted model. An initial VL–VH orientation is also selected from databases, and the grafted H and L chains are each superposed to the corresponding chain in the orientation template. After this, the CDR H3 loop is modeled de novo while sampling the VL–VH orientation.
We have presented the performance of RosettaAntibody in Antibody Modeling Assessment II (AMA-II) (17, 18). With few exceptions, RosettaAntibody selects templates for the framework regions and the non-H3 CDR loops that have backbone atomic coordinates with subangstrom optimal root mean square deviations (RMSDs) from the native structure. Similarly, the other participants in the assessment were generally able to construct models with low RMSDs in the framework and non-H3 CDR loops (19–23). The most difficult aspect of Ab homology remains accurately predicting the VL–VH orientation and the CDR H3 conformation.
Because the CDR H3 loop lies at the interface between the H and L chains, incorrect VL–VH orientations can frustrate identifying correct CDR H3 conformations. In the time that has elapsed since AMA-II was conducted, progress has been made in predicting VL–VH orientation (24) from sequence by training a random forest model on a set of fingerprint residues at the VL–VH interface using ABangle’s six degree-of-freedom description of orientation (25). Further improvement has been achieved with an ensemble approach to modeling VL–VH orientation called multiple-template grafting (26). Similarly, effort has been made to develop a CDR H3-specific loop modeling routine (21, 27, 28). Successful predictions require extremely accurate atomic coordinates for the rest of the FV (22, 27), which may make these tools better suited for refining crystal structures with poor electron density around the CDR H3 loop than for homology modeling. Another method relies on restraining the dihedral angles of the three N-terminal and four C-terminal residues of the H3 loop based on distributions observed in crystal structures (28).
A large majority, >80% of known structures, of CDR H3 loops have a C-terminal kink (15, 29, 30–34), a structure that resembles a β bulge in that it disrupts the hydrogen bonding pattern along a β strand and causes the backbone to twist. In AMA-II we found that producing low-RMSD models required filtering out nonkinked H3 conformations. However, the scores of the kinked structures, which can be thought of as their free energies of folding, are higher (worse) than some of the extended structures that Rosetta produced. Shirai et al. (22) also used a filter to ensure the predicted H3 loops had appropriate base geometries. In response to these findings, we developed new geometric parameters that describe the kink, in particular that the C-terminal loop residues form a pseudo dihedral angle, α101 (100X–103 using Chothia numbering), of 39° and a pseudo bond angle, τ101 (100X–102 using Chothia numbering), of 101° (2). Along with this enhanced description of the kink, we hypothesized that the kink serves to drive CDR H3 diversity and is stabilized by tertiary interactions and not by the sequence of the loop itself. This hypothesis is supported by the recent work by Teplyakov et al. (35) in which four H chains and four L chains were combined pairwise to create 16 Abs. All 16 Abs had the same CDR H3 loop, but the structure of the loop varied considerably.
De novo loop modeling has endured as a challenging problem in part because of the large number of df that need to be sampled, the closure requirement, and the challenges associated with accurately ranking different structures. Additionally, side-chain interactions often play key roles in stabilizing observed loop conformations, potentially complicating low-resolution searches.
CDR H3 loops, in addition to having a variety of lengths, are anchored on two adjacent β strands that are disrupted by the C-terminal kink. In previous work, we found that the simulation tends to continue the β strands well into the loop region, possibly due to the formation of favorable backbone–backbone hydrogen bonds in the low-resolution stage of the search (1).
Complicating the task even further is the most common source of the reference coordinates: crystal structures. Crystals are crowded environments in which each protein molecule is surrounded by several others; crystal contacts may influence the observed conformation within the asymmetric unit. Without the existence of a crystal structure of the same protein in more than one distinct crystal form, it cannot be determined whether crystal contacts perturb the conformation of any region of the protein.
Finally, it is not justified to always assume that loop modeling must search for a single set of coordinates. Proteins in physiological conditions are not completely rigid, and estimating the conformational entropy of a loop requires supplying a model to describe the modes of flexibility accessible to the loop (36). Nevertheless, the possible existence of multiple degenerate-energy conformations cannot be dismissed.
In this study, we use the parameters defined in our previous work (2) to constrain the kink during the course of a simulation. The constraint is tested by predicting H3 conformations on the crystal framework structure across the set of benchmark Ab structures. To limit the uncertainty in the crystallographic coordinates, we have constructed a set of high-resolution H3 loops (described in 2Materials and Methods). Because our long-term goal is to predict entire FVs from sequence, we also assess CDR H3 modeling on homology modeled frameworks. Finally, we test the ability to dock an Ab with a modeled H3 loop. The results enable us to address long-standing questions about CDR H3 modeling, including the limitations of sampling and ranking candidate conformations, and whether an improved representation of the kink can enable accurate CDR H3 loop structure prediction. Finally, to show the improvements we have made since the initial development of RosettaAntibody, we report the performance of the updated version for all 52 of the structures included in the 2009 study. To place our work into the context of other Ab structure prediction tools, we model the targets from AMA-II.
Materials and Methods
Data set construction
A set of FVs with accurate CDR H3 coordinates was constructed by querying the backend databases of PyIgClassify (37) for structures with a resolution of 2.5 Å or better, a maximum R value of 0.2, a B factor of ≤80.0 Å2 for every atom in the structure, only one copy of the FV in the asymmetric unit, and CDR H3 loop lengths ranging from 9 to 20 residues using the Honegger–Plückthun-based definition (38). To ensure the set has diverse chemical environments, no two H chain CDR loops are permitted to be identical in sequence. The structures were further filtered to remove Abs from species other than humans and mice as well as modified residues (namely pyroglutamic acid, a cyclized form of glutamine or glutamic acid). The resulting set of structures contains 49 FVs and is summarized in Table I.
PDB Code . | R Value . | Resolution (Å) . | Max. B Factor (Å2) . | Max. H3 B Factor (Å2) . | Species . | CDR H3 Length . | L Chain Isotype . | Fragment . | pH . |
---|---|---|---|---|---|---|---|---|---|
1x9qa | 0.193 | 1.5 | 32.57 | 21.22 | Human | 9 | κ | scFV–Ag | 4.6 |
2d7t | 0.191 | 1.7 | 49.16 | 47.73 | Human | 9 | κ | FV | 9.3 |
3hc4 | 0.162 | 1.62 | 41.11 | 33.5 | Human | 9 | κ | Fab | 8 |
1mlb | 0.181 | 2.1 | 49.63 | 34.78 | Mouse | 9 | κ | Fab | — |
2e27a | 0.198 | 1.7 | 51.38 | 31.14 | Mouse | 9 | κ | FV–Ag | 5.7 |
3g5y | 0.199 | 1.59 | 40.08 | 35.9 | Mouse | 9 | κ | Fab–Ag | — |
3m8oa | 0.154 | 1.55 | 55.22 | 24.48 | Human | 10 | κ | Fab | 7 |
1jpt | 0.182 | 1.85 | 49.59 | 29.78 | Humanized mouse | 10 | κ | Fab | 4.6 |
3e8u | 0.188 | 2.1 | 32.5 | 27.11 | Mouse | 10 | κ | Fab–Ag | — |
1mqk | 0.136 | 1.28 | 51.66 | 37.15 | Mouse | 11 | κ | FV | 6 |
1nlb | 0.197 | 1.6 | 44.78 | 41.47 | Mouse | 11 | κ | Fab | 9 |
2adf | 0.192 | 1.9 | 35.22 | 20.68 | Mouse | 11 | κ | Fab–Ag | 4.6 |
2fbj | 0.194 | 1.95 | 60.06 | 41.76 | Mouse | 11 | κ | Fab–Ag | — |
2w60 | 0.171 | 1.5 | 54.6 | 36.38 | Mouse | 11 | κ | Fab | 8 |
3gnm | 0.189 | 2.1 | 46.42 | 43.99 | Mouse | 11 | κ | Fab | 4 |
3hnt | 0.199 | 1.8 | 54.27 | 28.18 | Mouse | 11 | κ | Fab–Ag | 7.1 |
3v0w | 0.184 | 1.73 | 60.85 | 60.81 | Mouse | 11 | κ | Fab–Ag | 4.6 |
1mfa | 0.166 | 1.7 | 69.57 | 41.64 | Mouse | 11 | λ | FV–Ag | — |
3mxw | 0.181 | 1.83 | 72.85 | 31.98 | Mouse | 12 | κ | Fab–Ag | — |
2xwt | 0.179 | 1.9 | 44.78 | 29.3 | Human | 12 | λ | Fab–Ag | 5 |
1dlf | 0.183 | 1.45 | 42.75 | 24.85 | Mouse | 12 | κ | FV | 5.25 |
2ypv | 0.183 | 1.8 | 75.36 | 37.38 | Mouse | 12 | κ | Fab–Ag | 8.5 |
3ifl | 0.18 | 1.5 | 34.75 | 22.57 | Mouse | 12 | κ | Fab–Ag | 9 |
3liza | 0.178 | 1.8 | 52.33 | 43.85 | Mouse | 12 | κ | Fab–Ag | 7.2 |
3oz9 | 0.192 | 1.6 | 55.29 | 31.28 | Mouse | 12 | κ | Fab | 8.5 |
3umt | 0.177 | 1.8 | 56.76 | 39.55 | Mouse | 12 | κ | scFV | 9.5 |
4h0h | 0.197 | 2 | 65.19 | 37.96 | Mouse | 12 | κ | scFV | 6.5 |
4h20 | 0.197 | 1.9 | 45.61 | 20.26 | Mouse | 12 | κ | Fab | 7.4 |
4hpy | 0.171 | 1.5 | 55.43 | 42.96 | Human | 13 | λ | Fab–Ag | 6.5 |
2v17 | 0.16 | 1.65 | 37.77 | 23.54 | Mouse | 13 | κ | Fab–Ag | 7.5 |
3t65 | 0.194 | 1.45 | 63.85 | 34.13 | Mouse | 13 | κ | Fab–Ag | 8 |
1oaq | 0.16 | 1.5 | 44.45 | 43.75 | Mouse | 13 | λ | FV | 5 |
2vxv | 0.155 | 1.49 | 37.34 | 29.51 | Human | 14 | κ | Fab | 10.5 |
3eo9 | 0.191 | 1.8 | 45.44 | 27.45 | Humanized mouse | 14 | κ | Fab | 5 |
3p0y | 0.182 | 1.8 | 76.35 | 25.77 | Human | 14 | κ | Fab–Ag | 8 |
1jfq | 0.196 | 1.9 | 57.19 | 40.56 | Mouse | 14 | κ | Fab | 7.5 |
2r8s | 0.196 | 1.95 | 63.07 | 34.21 | Human (library) | 14 | κ | Fab–Ag | 5.9 |
3i9g | 0.192 | 1.9 | 53.87 | 29.27 | Humanized mouse | 14 | κ | Fab–Ag | 6 |
3giz | 0.198 | 2.2 | 52.12 | 52.12 | Human | 15 | κ | Fab | 6.3 |
3go1 | 0.192 | 1.89 | 37.78 | 31.35 | Human | 16 | λ | Fab–Ag | 6.5 |
1fns | 0.172 | 2 | 49.64 | 20.8 | Mouse | 16 | κ | Fab–Ag | — |
1seqb | 0.194 | 1.78 | 68.08 | 68.08 | Mouse | 16 | κ | Fab | 7.5 |
1gig | 0.195 | 2.3 | 53.69 | 34.01 | Mouse | 16 | λ | Fab | — |
3mlr | 0.181 | 1.8 | 42.1 | 37.7 | Human | 17 | λ | Fab–Ag | 5.5 |
4nzu | 1.37 | 1.2 | 69.1 | 32.8 | Human | 18 | κ | Fab | 4 |
3lmj | 0.194 | 2.2 | 58.27 | 58.27 | Human | 18 | λ | Fab | 7.5 |
4f57 | 0.188 | 1.7 | 64.7 | 40.56 | Human | 18 | λ | Fab | 6.5 |
2fb4 | 0.189 | 1.9 | 39.92 | 39.2 | Human | 19 | λ | Fab | — |
3nps | 0.19 | 1.5 | 49.76 | 31.95 | Human | 19 | λ | Fab–Ag | — |
PDB Code . | R Value . | Resolution (Å) . | Max. B Factor (Å2) . | Max. H3 B Factor (Å2) . | Species . | CDR H3 Length . | L Chain Isotype . | Fragment . | pH . |
---|---|---|---|---|---|---|---|---|---|
1x9qa | 0.193 | 1.5 | 32.57 | 21.22 | Human | 9 | κ | scFV–Ag | 4.6 |
2d7t | 0.191 | 1.7 | 49.16 | 47.73 | Human | 9 | κ | FV | 9.3 |
3hc4 | 0.162 | 1.62 | 41.11 | 33.5 | Human | 9 | κ | Fab | 8 |
1mlb | 0.181 | 2.1 | 49.63 | 34.78 | Mouse | 9 | κ | Fab | — |
2e27a | 0.198 | 1.7 | 51.38 | 31.14 | Mouse | 9 | κ | FV–Ag | 5.7 |
3g5y | 0.199 | 1.59 | 40.08 | 35.9 | Mouse | 9 | κ | Fab–Ag | — |
3m8oa | 0.154 | 1.55 | 55.22 | 24.48 | Human | 10 | κ | Fab | 7 |
1jpt | 0.182 | 1.85 | 49.59 | 29.78 | Humanized mouse | 10 | κ | Fab | 4.6 |
3e8u | 0.188 | 2.1 | 32.5 | 27.11 | Mouse | 10 | κ | Fab–Ag | — |
1mqk | 0.136 | 1.28 | 51.66 | 37.15 | Mouse | 11 | κ | FV | 6 |
1nlb | 0.197 | 1.6 | 44.78 | 41.47 | Mouse | 11 | κ | Fab | 9 |
2adf | 0.192 | 1.9 | 35.22 | 20.68 | Mouse | 11 | κ | Fab–Ag | 4.6 |
2fbj | 0.194 | 1.95 | 60.06 | 41.76 | Mouse | 11 | κ | Fab–Ag | — |
2w60 | 0.171 | 1.5 | 54.6 | 36.38 | Mouse | 11 | κ | Fab | 8 |
3gnm | 0.189 | 2.1 | 46.42 | 43.99 | Mouse | 11 | κ | Fab | 4 |
3hnt | 0.199 | 1.8 | 54.27 | 28.18 | Mouse | 11 | κ | Fab–Ag | 7.1 |
3v0w | 0.184 | 1.73 | 60.85 | 60.81 | Mouse | 11 | κ | Fab–Ag | 4.6 |
1mfa | 0.166 | 1.7 | 69.57 | 41.64 | Mouse | 11 | λ | FV–Ag | — |
3mxw | 0.181 | 1.83 | 72.85 | 31.98 | Mouse | 12 | κ | Fab–Ag | — |
2xwt | 0.179 | 1.9 | 44.78 | 29.3 | Human | 12 | λ | Fab–Ag | 5 |
1dlf | 0.183 | 1.45 | 42.75 | 24.85 | Mouse | 12 | κ | FV | 5.25 |
2ypv | 0.183 | 1.8 | 75.36 | 37.38 | Mouse | 12 | κ | Fab–Ag | 8.5 |
3ifl | 0.18 | 1.5 | 34.75 | 22.57 | Mouse | 12 | κ | Fab–Ag | 9 |
3liza | 0.178 | 1.8 | 52.33 | 43.85 | Mouse | 12 | κ | Fab–Ag | 7.2 |
3oz9 | 0.192 | 1.6 | 55.29 | 31.28 | Mouse | 12 | κ | Fab | 8.5 |
3umt | 0.177 | 1.8 | 56.76 | 39.55 | Mouse | 12 | κ | scFV | 9.5 |
4h0h | 0.197 | 2 | 65.19 | 37.96 | Mouse | 12 | κ | scFV | 6.5 |
4h20 | 0.197 | 1.9 | 45.61 | 20.26 | Mouse | 12 | κ | Fab | 7.4 |
4hpy | 0.171 | 1.5 | 55.43 | 42.96 | Human | 13 | λ | Fab–Ag | 6.5 |
2v17 | 0.16 | 1.65 | 37.77 | 23.54 | Mouse | 13 | κ | Fab–Ag | 7.5 |
3t65 | 0.194 | 1.45 | 63.85 | 34.13 | Mouse | 13 | κ | Fab–Ag | 8 |
1oaq | 0.16 | 1.5 | 44.45 | 43.75 | Mouse | 13 | λ | FV | 5 |
2vxv | 0.155 | 1.49 | 37.34 | 29.51 | Human | 14 | κ | Fab | 10.5 |
3eo9 | 0.191 | 1.8 | 45.44 | 27.45 | Humanized mouse | 14 | κ | Fab | 5 |
3p0y | 0.182 | 1.8 | 76.35 | 25.77 | Human | 14 | κ | Fab–Ag | 8 |
1jfq | 0.196 | 1.9 | 57.19 | 40.56 | Mouse | 14 | κ | Fab | 7.5 |
2r8s | 0.196 | 1.95 | 63.07 | 34.21 | Human (library) | 14 | κ | Fab–Ag | 5.9 |
3i9g | 0.192 | 1.9 | 53.87 | 29.27 | Humanized mouse | 14 | κ | Fab–Ag | 6 |
3giz | 0.198 | 2.2 | 52.12 | 52.12 | Human | 15 | κ | Fab | 6.3 |
3go1 | 0.192 | 1.89 | 37.78 | 31.35 | Human | 16 | λ | Fab–Ag | 6.5 |
1fns | 0.172 | 2 | 49.64 | 20.8 | Mouse | 16 | κ | Fab–Ag | — |
1seqb | 0.194 | 1.78 | 68.08 | 68.08 | Mouse | 16 | κ | Fab | 7.5 |
1gig | 0.195 | 2.3 | 53.69 | 34.01 | Mouse | 16 | λ | Fab | — |
3mlr | 0.181 | 1.8 | 42.1 | 37.7 | Human | 17 | λ | Fab–Ag | 5.5 |
4nzu | 1.37 | 1.2 | 69.1 | 32.8 | Human | 18 | κ | Fab | 4 |
3lmj | 0.194 | 2.2 | 58.27 | 58.27 | Human | 18 | λ | Fab | 7.5 |
4f57 | 0.188 | 1.7 | 64.7 | 40.56 | Human | 18 | λ | Fab | 6.5 |
2fb4 | 0.189 | 1.9 | 39.92 | 39.2 | Human | 19 | λ | Fab | — |
3nps | 0.19 | 1.5 | 49.76 | 31.95 | Human | 19 | λ | Fab–Ag | — |
All structures have R values < 0.2, resolution > 2.5 Å, and maximum B factors < 80.0 Å2. To limit variables being considered, the CDR H3 loops are restricted to a range from 9 to 20 residues in length and are only derived from human and mouse Abs. Abs crystallized with a bound Ag have an annotation ending with “–Ag” in the Fragment column. A dash (—) in the pH column indicates that no pH information was included with the coordinates.
Extended base geometry.
Unclear base geometry.
Max., maximum.
Kink constraint
In de novo loop modeling simulations, it is impossible to exhaustively sample all of the structural df. To increase the likelihood of generating CDR H3 models with near-native structures, we constrain (39) two parameters: 1) τ101, the Cα–Cα–Cα pseudo bond angle for the three C-terminal residues; and 2) α101, the Cα–Cα–Cα–Cα pseudo dihedral angle for the three C-terminal residues in the CDR H3 loop and one adjacent residue in the H chain framework.
Because an objective of this study was to determine whether Rosetta can correctly identify native H3 conformations, it is important not to overconstrain any of the simulations. With this in mind, a FLAT_HARMONIC potential, which has a region wherein no penalty is applied, is a natural choice (Fig. 1A). The FLAT_HARMONIC potential is of the form:
where μ is the mean, t (tolerance) is the distance from μ with no penalty, and ξ is the scaling factor that controls the penalty that is applied.
For the kink parameters α101 and τ101, we designed a penalty schedule with no penalty when the value is within 1.0 σ of the mean and a penalty of 1.0 at 3.0 σ, yielding ξ = 2t. This schedule encourages Rosetta to generate models with kinked H3 loops without forcing the geometry toward the mean values of both parameters. Using the values determined previously (2), the kink constraint for AHo-numbered Abs is written for input into Rosetta as:
# alpha: pseudo dihedral - last 3 residues in H3 and the following W
# mean: 38.85 degrees; SD: 11.75 degrees (in radians)
Dihedral CA 136H CA 137H CA 138H CA 139H FLAT_HARMONIC 0.678 0.41 0.205
# tau: pseudo bond angle of the last 3 residues in H3
# mean: 100.9 degrees; SD: 5.57 degrees (in radians)
Angle CA 136H CA 137H CA 138H FLAT_HARMONIC 1.761 0.194 0.0972
Fig. 1B shows a contour plot of the combined value of the τ101 and α101 constraints, with each line representing an increase in score of 2.0 Rosetta energy units. Fig. 1B also shows the regions of τ101 and α101 that define kinked (orange; ±3.0 σ of the mean of both parameters), unclear (gray; ±3.0 σ of mean of one of the parameters), and extended (white; beyond 3.0 σ of both parameters) conformations. The output score is always derived from the unbiased score function, even when the constraint is employed in the simulation. These definitions are used throughout this study.
De novo loop structure prediction
Next-generation kinematic closure (KIC) (40), or NGK, has been developed to further improve the performance of loop modeling in Rosetta. Similar to KIC, NGK generates diverse loop conformations by drawing φ and ψ torsion angle pairs from Ramachandran distributions for all but three residues in the loop; the torsion angles of the remaining residues are determined analytically by polynomial resultants (41). The approach employed by NGK enhances KIC by using neighbor-dependent Ramachandran maps (42), explicitly sampling ω backbone dihedral angles as well as using a simulated annealing strategy for repulsive and Ramachandran score terms in the all-atom stage. On the same set of loops used to benchmark KIC, NGK generates substantially more near-native models (43), which is why it is used as the starting point in this study. During loop modeling, the backbone coordinates for the non-loop regions of the protein are held fixed.
The flags to run a standard NGK simulation are:
./loopmodel.macosclangrelease
-native input_file.pdb
-s input_file.pdb
-nstruct 500
-loops:loop_file h3.loops
-loops:remodel perturb_kic
-loops:refine refine_kic
-loops:outer_cycles 5
-kic_bump_overlap_factor 0.36
-legacy_kic false
-kic_min_after_repack true
-corrections:score:use_bicubic_interpolation false
-loops:kic_omega_sampling
-loops:kic_rama2b
-allow_omega_move
-loops:ramp_fa_rep
-loops:ramp_rama
-ex1
-ex2
-extrachi_cutoff 0
where h3.loops contains
# FORMAT JSON
{”LoopSet”: [{
“start”: { “resSeq”: 107, “iCode”: ” “, “chainID”: “H” },
“stop”: { “resSeq”: 138, “iCode”: ” “, “chainID”: “H” },
“extras”: { “extend”: true },
}]
}
We refer readers to the NGK publication (43) for a discussion of the various flags associated with it. NGK simulations with constraints use the above command line with the addition of the following flags to specifcy constraints in both the low-resolution and full atom (fa) stages:
-constraints:cst_file kink.constraint
-constraints:cst_weight 1.0
-constraints:cst_fa_file kink.constraint
-constraints:cst_fa_weight 1.0
where the file kink.constraint contains the constraints as shown above.
Scaled scores
To compare the results of simulations of different targets, the scores of a set of candidate models are scaled such that a value of 1.0 corresponds to the 95th percentile of scores and a value of 0.0 corresponds to the 5th percentile. After this normalization is completed, the score of the refined native structure is computed on the same scale.
Discrimination score
The discrimination score is used to measure how “funnel-like” a score versus the RMSD plot is, with a lower value being indicative of a more successful simulation. In this study, the reference for RMSD calculations is the set of atomic coordinates of the crystal structure subjected to refinement as described below. As defined by Conway et al. (44), the discrimination score is calculated as:
where r is the RMSD cutoff in Å, Si are the dimensionless scaled scores calculated as described above, and the discrimination score, D, is the sum of the score differences of the best scoring models above and below the seven RMSD cutoffs.
Preparation of input structures
The raw crystallographic coordinates of protein structures often do not score favorably within Rosetta, typically because the packed environment of a crystal leads to side-chain conformations that would be undesirable in solution, and some contacts are classified as steric clashes by Rosetta. To compensate, crystal structures must be relaxed, that is, optimized with respect to the Rosetta scoring function. Relaxation will result in small changes to the atomic coordinates with significant improvements in the score; however, it is important that the backbone coordinates do not vary much, especially in the case of loop modeling. To ensure that only small changes to the backbone are allowed, all coordinates are constrained to their starting positions with a spring potential. The command line used for constrained relax is:
./relax.macosclangrelease
-s input.pdb
-nstruct 500
-relax:constrain_relax_to_start_coords
-relax:coord_constrain_sidechains
-relax:ramp_constraints false
-ex1
-ex2
-use_input_sc
Once the crystallographic coordinates have been optimized, the entire structure can be subjected to fixed-backbone side-chain optimization to further lower the score of the reference structure for loop modeling and to better approximate the side-chain conformations in the free, unbound conformation. The command line used for fixed-backbone side-chain optimization is:
./relax.macosclangrelease
-s lowest_scoring_model_from_previous_simulation.pdb
-nstruct 100
-relax:bb_move false
-ex1
-ex2
-extrachi_cutoff 0
The low-scoring model from this calculation is used as the input structure in the subsequent calculations.
Critical assessment of predicted interactions criteria
For a model to be considered a high-quality prediction by critical assessment of predicted interactions (CAPRI) metrics (45), the fraction of native residue–residue contacts recovered (fnat) must be ≥0.5 and the interface RMSD (I_RMSD) or ligand RMSD (L_RMSD) must be ≤1.0 Å. Medium-quality predictions must have fnat of ≥0.3 and L_RMSD of ≤5.0 Å or I_RMSD of ≤2.0 Å, whereas acceptable predictions have fnat of ≥0.1 and L_RMSD of ≤10.0 Å or I_RMSD of ≤4.0 Å. Models that have fnat of ≤0.1 or L_RMSD of ≥10.0 Å and I_RMSD of ≥4.0 Å are considered incorrect.
Results
Kink constraint
Fig. 1 shows the functional form of the constraint used to bias de novo CDR H3 loop modeling simulations toward generating kinked conformations. See the 2Materials and Methods section.
High-quality CDR H3 loop benchmark set
Fig. 2 shows an example CDR H3 loop with the electron density map for the H3 residues shown in a gray mesh over the residues represented in sticks. At this level of detail, all of the side-chain coordinates are well defined, and the map even shows a hole in aromatic residues. The level of agreement between the electron density map and the coordinates and the lack of ambiguity in the atomic coordinates suggest that this loop is in a stable conformation in the crystal, making it a prime candidate for loop modeling experiments. We constructed a set of 49 high-quality CDR H3 structures as described in 2Materials and Methods. The other loops in the set have similarly well-defined electron density. A limitation of this set is that several structures are in their bound form (annotations ending with “–Ag” in the Fragment column of Table I), and the unbound forms may differ or even be flexible. This danger is mitigated by the fact that usually these loops deviate <1.0 Å RMSD between the bound and unbound forms (46).
Table I lists all of the loops in the set and includes information on the quality and content of the crystal structure, the species from which the Ab was derived, the length of the loop, and the L chain isotype. In the set, 24 of the 49 structures are crystallized in the bound conformation with their Ag, 40 of the 49 structures are Fabs, 6 are FVs, and the remaining 3 are scFVs. Eighteen of the structures are of human Abs, and 11 have λ L chains, making this a diverse set of structures.
The definition of the bounds of the CDR H3 loop differs from the Chothia-based definition (47) used within RosettaAntibody (15), and instead it is based on the structure-based Honegger–Plückthun definition (38) used by North et al. (17) and in our previous work on the kink geometry (2). Both definitions end on Chothia residue number 102, but the Chothia-based definition begins at residue 95 whereas the Honegger–Plückthun-based definition begins at residue 93, making the Honegger–Plückthun CDR H3 loops two residues longer than Chothia loops. In this set, the median and mode of the loop lengths are both 12 residues.
Unconstrained de novo modeling of CDR H3 loops
To establish a baseline, we first assessed whether Rosetta methods create native-like H3 kink structures using established methods. In AMA-II (1) NGK was used to model CDR H3 loops on a crystallographic framework. In these cases, a filter was employed to favor kinked structures, using the θbase (α101 in this work) definition developed by Shirai et al. (30, 31) and refined by Kuroda et al. (32). Without this penalty, very few kinked structures were produced. From these results it remained unclear whether the primary limitation for producing accurate CDR H3 models lies in generating low-RMSD conformations (sampling) or in ranking the candidate models effectively (scoring). Since then, we established the geometric parameters α101 and τ101 to describe the C-terminal kink in CDR H3 loops (2). We now use these parameters to probe model sets to determine whether Rosetta modeling failures are in sampling, scoring, or both.
Fig. 3 shows the results of a de novo CDR H3 modeling simulation on an anti-citrullinated collagen type II Ab (Protein Data Bank [PDB] accession code 2w60) (48). In Fig. 3A, a score versus RMSD plot (funnel plot) shows the models ranked by the scaled score and colored by their base geometry. The kinked models (orange points) make up a small fraction of the structures produced; however, they have lower scores than extended structures at the same RMSD value. The top-ranked models have very low RMSDs, but only three such models were produced. Nonetheless, because the score function successfully separates the near-native and nonnative conformations, the discrimination score is −0.5710 (negative discrimination scores indicate success). The scaled score of the refined, native (x-ray) structure is −1.4061, meaning that none of the predicted structures approaches near-native scores.
Fig. 3B shows the τ101 and α101 values for the models (black) and the crystal structure (red). The gray bars demarcate ±3.0 σ from the mean of the distribution of each parameter in kinked Abs as found in our previous work (2). This plot shows a clear preference for NGK to produce H3 loops in the extended conformation, likely because it can form backbone–backbone hydrogen bonds by extending β strands of the framework in the low-resolution stage of modeling.
Supplemental Figs. 1 and 2 show funnel plots and τ101 versus α101 plots for the rest of the structures in the data set. Across the whole set, kinked models represent a small fraction of the models that are produced, but those models tend to have lower RMSDs.
Across the set of Abs, the native conformation has a significantly better score than the best decoys that are being produced by NGK (Supplemental Table I): the average scaled native score for kinked targets is −0.9480 with a SD of 0.5033 (Table II).
Simulation . | Min. RMSD (Å) . | Scaled Native Score . | Top 10 RMSDs (Å) . | RMSD of Top 10 Scored (Å) . | RMSD of Top 1 Scored (Å) . |
---|---|---|---|---|---|
Unconstrained | 1.336 | −0.948 | 2.018 | 3.6433 | 3.2174 |
Constrained | 0.9332 | −0.5264 | 1.2473 | 2.2179 | 2 |
Combined | 1.3789 | −0.2396 | 1.8398 | 3.4148 | 2.7564 |
Simulation . | Min. RMSD (Å) . | Scaled Native Score . | Top 10 RMSDs (Å) . | RMSD of Top 10 Scored (Å) . | RMSD of Top 1 Scored (Å) . |
---|---|---|---|---|---|
Unconstrained | 1.336 | −0.948 | 2.018 | 3.6433 | 3.2174 |
Constrained | 0.9332 | −0.5264 | 1.2473 | 2.2179 | 2 |
Combined | 1.3789 | −0.2396 | 1.8398 | 3.4148 | 2.7564 |
The minimum (Min.) RMSD, scaled native score, average of the 10 lowest RMSDs, average RMSD of the 10 top-scoring models, and the RMSD of the top-ranked model are shown for unconstrained NGK, constrained NGK, and the combined NGK plus CCD methods (see Supplemental Fig. 6 for an example). Each value is the average of the values of the 44 kinked targets in the benchmark set. NGK with constraints performs best over the whole set. Only the scaled native score in the combined simulations have superior values; however, this affects the ability of the score function to discriminate between near-native and nonnative conformations. Per-target values for unconstrained NGK can be found in Supplemental Table I, and per-target values for constrained NGK can be found in Supplemental Table II.
The negative discrimination scores indicate that lower scoring conformations are correlated with lower RMSDs, whereas large, negative scaled native scores indicate that the native conformations score substantially more favorably than do the predicted conformations. In this case, the negative discrimination score tells us that if we were to sort our models by score and pick the best scoring models, we would indeed select the lowest RMSD models. The large, negative scaled native tells us, however, that the score gap between the refined native and the 5th percentile is, on average, equivalent to the difference between the 5th and 95th percentile of the models. Taken together, these pieces of information suggest that conformational sampling must be improved or directed.
Constrained de novo modeling of CDR H3 loops
Because the predicted kinked structures have low RMSDs and relaxation of native CDR H3 structures can find substantially lower scores (Supplemental Fig. 3), we tested whether biasing the simulation toward kinked conformations would increase the number of low-scoring and low-RMSD models produced in the course of the simulation. As described in 2Materials and Methods, we use the parameters of the kink described in our previous work to develop a kink constraint that can be employed during a simulation (Fig. 1). Because the constraint potential is smooth and continuous, the conformation of a structure can be minimized with the constraint enabled.
Fig. 4 shows the results of the constrained NGK simulation for anti-citrullinated collagen type II Ab (2w60) (48). The τ101 versus α101 plot (Fig. 4B) shows that the constraint successfully biases the simulation to mostly produce kinked structures. At the same time, many models are not kinked, which indicates that the simulations are not being overconstrained.
Fig. 4A shows a funnel plot for the constrained NGK simulation of 2w60. The fraction of near-native structures has increased dramatically, demonstrating that generating more kinked structures is critical for successful CDR H3 predictions. The dashed horizontal line indicates the scaled score of the native structure, which was below the plotted bounds on the unconstrained plot. That is, the models now have scores near that of the refined, native structure. Therefore, the geometry of the models generated with constraints is more favorable than the geometry of the models generated by the unconstrained method.
Because many more models between 1.0 and 3.0 Å RMSD are generated and those models score more favorably, the discrimination score for the constrained simulation, −0.1721, is worse than the unconstrained case. Fig. 4A shows that for many of the models with RMSDs of <2.0 Å, there is a model with RMSD 3.5 Å that scores as well. This result underscores the importance of producing many models even when using constrained NGK.
Supplemental Figs. 4 and 5 show funnel plots and τ101 versus α101 plots for the rest of the structures in the high-resolution data set. Across the data set, with the exception of five targets (one of which has an unclear base geometry), the scaled native scores appear within the plot bounds, indicating that the models achieve scores close to those of the native structures. Supplemental Fig. 4 shows that the four extended loops in the benchmark (1x9q, 2e27, 3liz, 3m8o) have sets of models that are predominantly kinked. Although this is problematic, it appears that, with the exception of 1x9q, these particular targets were also not modeled successfully by unconstrained NGK, which confounds any analysis to determine whether the constraint penalty could be overcome when appropriate. Supplemental Table II shows numerical results for each target.
The average RMSD of the 10 top-scoring models is lower with constraints in 40 of the 49 targets, and 30 targets have a lower RMSD of the top-scoring model. Without constraints, the average RMSD of the 10 top-scoring models is <1.0 Å for three targets; this number increases to nine targets when using constraints. When considering only the top-scoring model, 7 targets without constraints have RMSDs of <1.0 Å, whereas 13 targets with constraints have subangstrom RMSDs. Table III shows the lowest RMSD CDR H3 loop produced with and without the constraint for each target in the benchmark set.
Target . | CDR H3 Length . | Min. RMSD Unconstrained (Å) . | Min. RMSD Constrained (Å) . |
---|---|---|---|
1mlb | 9 | 1.0512 | 0.8055 |
2d7t | 9 | 0.9517 | 0.8841 |
3g5y | 9 | 0.7744 | 0.4713 |
3hc4 | 9 | 0.7802 | 0.7545 |
1x9qa | 9 | 0.1933 | 2.0763 |
2e27a | 9 | 1.3989 | 2.5348 |
1jpt | 10 | 0.9972 | 0.7319 |
3e8u | 10 | 0.6829 | 0.263 |
3m8oa | 10 | 0.8086 | 1.1371 |
1mfa | 11 | 1.204 | 0.3159 |
1mqk | 11 | 0.8169 | 0.7269 |
1nlb | 11 | 0.4001 | 0.3155 |
2adf | 11 | 1.4002 | 0.5459 |
2fbj | 11 | 0.97 | 0.8883 |
2w60 | 11 | 0.445 | 0.3128 |
3gnm | 11 | 0.7956 | 0.5754 |
3hnt | 11 | 1.2357 | 0.9713 |
3v0w | 11 | 0.5647 | 0.4142 |
1dlf | 12 | 1.0057 | 0.8847 |
2xwt | 12 | 0.3709 | 0.3651 |
2ypv | 12 | 1.1427 | 0.5321 |
3ifl | 12 | 0.7096 | 0.5588 |
3mxw | 12 | 1.1074 | 0.9767 |
3oz9 | 12 | 0.573 | 0.5176 |
3umt | 12 | 1.2153 | 1.0396 |
4h0h | 12 | 1.1919 | 0.6141 |
4h20 | 12 | 0.8145 | 0.8209 |
3liza | 12 | 1.3761 | 2.007 |
1oaq | 13 | 1.0488 | 0.8637 |
2v17 | 13 | 0.7926 | 0.8039 |
3t65 | 13 | 2.7751 | 0.9007 |
4hpy | 13 | 0.497 | 0.5774 |
1jfq | 14 | 1.8429 | 0.563 |
2r8s | 14 | 0.6297 | 0.6658 |
2vxv | 14 | 1.1711 | 1.1523 |
3eo9 | 14 | 3.6195 | 1.2092 |
3i9g | 14 | 0.7581 | 0.6414 |
3p0y | 14 | 0.4399 | 0.3675 |
3giz | 15 | 1.7031 | 0.862 |
1fns | 16 | 1.9961 | 1.2479 |
1gig | 16 | 2.5965 | 1.9806 |
3go1 | 16 | 1.495 | 1.6585 |
1seqb | 16 | 3.1272 | 2.8814 |
3mlr | 17 | 2.4555 | 2.2999 |
3lmj | 18 | 1.8205 | 2.1115 |
4f57 | 18 | 1.7381 | 1.3136 |
4nzu | 18 | 3.297 | 2.0598 |
2fb4 | 19 | 3.0953 | 1.8331 |
3nps | 19 | 3.8113 | 2.6624 |
Target . | CDR H3 Length . | Min. RMSD Unconstrained (Å) . | Min. RMSD Constrained (Å) . |
---|---|---|---|
1mlb | 9 | 1.0512 | 0.8055 |
2d7t | 9 | 0.9517 | 0.8841 |
3g5y | 9 | 0.7744 | 0.4713 |
3hc4 | 9 | 0.7802 | 0.7545 |
1x9qa | 9 | 0.1933 | 2.0763 |
2e27a | 9 | 1.3989 | 2.5348 |
1jpt | 10 | 0.9972 | 0.7319 |
3e8u | 10 | 0.6829 | 0.263 |
3m8oa | 10 | 0.8086 | 1.1371 |
1mfa | 11 | 1.204 | 0.3159 |
1mqk | 11 | 0.8169 | 0.7269 |
1nlb | 11 | 0.4001 | 0.3155 |
2adf | 11 | 1.4002 | 0.5459 |
2fbj | 11 | 0.97 | 0.8883 |
2w60 | 11 | 0.445 | 0.3128 |
3gnm | 11 | 0.7956 | 0.5754 |
3hnt | 11 | 1.2357 | 0.9713 |
3v0w | 11 | 0.5647 | 0.4142 |
1dlf | 12 | 1.0057 | 0.8847 |
2xwt | 12 | 0.3709 | 0.3651 |
2ypv | 12 | 1.1427 | 0.5321 |
3ifl | 12 | 0.7096 | 0.5588 |
3mxw | 12 | 1.1074 | 0.9767 |
3oz9 | 12 | 0.573 | 0.5176 |
3umt | 12 | 1.2153 | 1.0396 |
4h0h | 12 | 1.1919 | 0.6141 |
4h20 | 12 | 0.8145 | 0.8209 |
3liza | 12 | 1.3761 | 2.007 |
1oaq | 13 | 1.0488 | 0.8637 |
2v17 | 13 | 0.7926 | 0.8039 |
3t65 | 13 | 2.7751 | 0.9007 |
4hpy | 13 | 0.497 | 0.5774 |
1jfq | 14 | 1.8429 | 0.563 |
2r8s | 14 | 0.6297 | 0.6658 |
2vxv | 14 | 1.1711 | 1.1523 |
3eo9 | 14 | 3.6195 | 1.2092 |
3i9g | 14 | 0.7581 | 0.6414 |
3p0y | 14 | 0.4399 | 0.3675 |
3giz | 15 | 1.7031 | 0.862 |
1fns | 16 | 1.9961 | 1.2479 |
1gig | 16 | 2.5965 | 1.9806 |
3go1 | 16 | 1.495 | 1.6585 |
1seqb | 16 | 3.1272 | 2.8814 |
3mlr | 17 | 2.4555 | 2.2999 |
3lmj | 18 | 1.8205 | 2.1115 |
4f57 | 18 | 1.7381 | 1.3136 |
4nzu | 18 | 3.297 | 2.0598 |
2fb4 | 19 | 3.0953 | 1.8331 |
3nps | 19 | 3.8113 | 2.6624 |
Extended base geometry.
Unclear base geometry.
Min., minimum.
Homology modeling with constraints
We tested the constrained method on crystallographic frameworks to isolate the loop modeling problem, but our ultimate objective remains predicting Ab structures from sequence. To assess the effect of the constraint in the context of homology modeled frameworks, we modified RosettaAntibody to apply constraints to the de novo loop modeling phase and enabled the neighbor-dependent Ramachandran map sampling from NGK.
Fig. 5 shows cumulative density estimates for CDR H3 loop RMSDs modeled using RosettaAntibody with a kink filter (gray curve) and with the new kink constraint (orange curve) for 2w60. Both methods can generate low-RMSD models of the H3 loop, but with the kink constraint, 1106 of the 2000 models have H3 RMSD of <2.0 Å as opposed to only 796 with the filter.
The enrichment of low-RMSD models shows that the kink constraint leads to sampling improvements even in cases where RosettaAntibody is already successful. Comparable results can be achieved while generating fewer models, and additional simulation time can be spent performing other stages of modeling, that is, VL–VH optimization.
With this result, we produced homology models for the entire benchmark set, including using the new multitemplate VL–VH orientation method (27). Supplemental Table III shows a summary of the H3 accuracy on a homology-modeled framework for each target in the set. RosettaAntibody generates subangstrom predictions for 26 targets across a variety of loop lengths. The top-scoring model has an H3 RMSD of <1.0 Å for five targets. If an ensemble of the top 10 models is considered, as it would be in the case of EnsembleDock (49), 12 targets have subangstrom H3 predictions. Notably, all of the loops that were predicted within 1.0 Å are ≤14 residues.
Docking with modeled CDR H3 loops
Successful docking is highly dependent on having accurate models of the bound conformation of each binding partner. To test whether the H3 loop conformations predicted using constrained NGK are accurate enough for binding, we focused on 2adf, which is crystallized with its Ag. The CDR H3 loop in 2adf is 11 residues and the constrained NGK simulation has a discrimination score of −0.1758, indicating successful CDR H3 prediction (Supplemental Table II). We selected the 10 top-scoring models as an ensemble to dock to the bound form of the Ag using EnsembleDock (49). EnsembleDock functions by cycling through a set of distinct backbone conformations after each rigid-body move during the low-resolution stage of docking. Each member of the ensemble is scored, and the best scoring conformation observed in the low-resolution stage is the starting point for all-atom refinement. EnsembleDock does not resample any of the loop conformations during docking, resulting in all non-H3 RMSDs being 0.0 Å. The kink constraint is not used in this simulation.
The EnsembleDock results are shown in Fig. 6. Points are colored to indicate the CAPRI quality rating (50) of each model, with gray points corresponding to incorrect structures, orange to acceptable quality, red to medium-quality, and blue to high-quality models (see 2Materials and Methods). The fnat metric that is used to compute the CAPRI quality is not restricted to the H3 loop, and thus the values we report also consider the contributions of non-H3 CDR loops to the interface. The 10 top models by interface score encompass one incorrect model, four acceptable models, one medium-quality model, and four high-quality models.
The 10 models used in the ensemble have scores ranging from −594.19 to −586.90, and the average H3 RMSD is 1.48 Å (Table IV). The top-ranked model has a loop RMSD of 1.53 Å, and the eighth structure in the set has an H3 RMSD of 0.75 Å. As shown in Table IV, the 10 top models do not converge on a single member of the ensemble, showing that considering several models simultaneously is a path forward.
Rank . | Interface Score . | Interface RMSD (Å) . | Ligand RMSD (Å) . | fnat . | H3 RMSD (Å) . | CAPRI Rating . |
---|---|---|---|---|---|---|
1 | −8.79 | 0.43 | 4 | 0.91 | 1.48 | High |
2 | −8.51 | 0.54 | 1.78 | 0.86 | 0.75 | High |
3 | −8.22 | 0.74 | 2.57 | 0.89 | 1.59 | High |
4 | −7.67 | 0.43 | 1.6 | 0.91 | 1.53 | High |
5 | −5.54 | 2.72 | 9.25 | 0.26 | 1.69 | Acceptable |
6 | −5.55 | 3.5 | 9.49 | 0.29 | 0.75 | Acceptable |
7 | −5.53 | 2.28 | 7.56 | 0.57 | 1.54 | Medium |
8 | −5.27 | 13.75 | 25.24 | 0.11 | 1.54 | Incorrect |
9 | −5.22 | 5.27 | 7.33 | 0.2 | 1.69 | Acceptable |
10 | −5.18 | 2.76 | 9.38 | 0.34 | 1.53 | Acceptable |
Rank . | Interface Score . | Interface RMSD (Å) . | Ligand RMSD (Å) . | fnat . | H3 RMSD (Å) . | CAPRI Rating . |
---|---|---|---|---|---|---|
1 | −8.79 | 0.43 | 4 | 0.91 | 1.48 | High |
2 | −8.51 | 0.54 | 1.78 | 0.86 | 0.75 | High |
3 | −8.22 | 0.74 | 2.57 | 0.89 | 1.59 | High |
4 | −7.67 | 0.43 | 1.6 | 0.91 | 1.53 | High |
5 | −5.54 | 2.72 | 9.25 | 0.26 | 1.69 | Acceptable |
6 | −5.55 | 3.5 | 9.49 | 0.29 | 0.75 | Acceptable |
7 | −5.53 | 2.28 | 7.56 | 0.57 | 1.54 | Medium |
8 | −5.27 | 13.75 | 25.24 | 0.11 | 1.54 | Incorrect |
9 | −5.22 | 5.27 | 7.33 | 0.2 | 1.69 | Acceptable |
10 | −5.18 | 2.76 | 9.38 | 0.34 | 1.53 | Acceptable |
Interface score is calculated as the score of the unbound partners subtracted from the score of the complex. Interface RMSD is the RMSD of the backbone atoms of the residues within 8.0 Å of a residue on the other docking partner. Ligand RMSD is the backbone atom RMSD of the Ag after superposing the Ab to the native structure. fnat is the fraction of native residue–residue contacts recovered, where contacting residues are defined as residues on opposite binding partners within 5.0 Å of each other. The H3 RMSD column shows the RMSD of the CDR H3 loop of the model that was ultimately selected by EnsembleDock to generate the docked model. Interestingly, a different member of the ensemble was used in each of the high-quality docked models.
Big data (set)
To assess the progress we have made in the development of RosettaAntibody, we benchmarked the performance for two additional sets of structures: 1) the 10 non-rabbit AMA-II targets; and 2) the 54 targets in the initial RosettaAntibody study. Supplemental Table IV shows the summary of the H3 predictions in the context of a homology-modeled framework for the AMA-II targets. In AMA-II, RosettaAntibody produced subangstrom models for two targets (1). The average minimum RMSD of the 10 targets is 1.28 Å. The new constrained method successfully produces subangstrom predictions for 2 of the 10 targets; however, these models are not ranked in a set of 10 top-scoring structures. When considering the 10 top-scoring models, 6 of 10 targets are predicted within 2.0 Å RMSD. In AMA-II we were able to rank subangstrom predictions in the top-scoring models for two targets. The apparent degradation is due to our use of homology filters in this study to prevent RosettaAntibody from selecting the now-available crystal structures as templates. Additionally, the kink geometry can deviate from native values of τ101 and α101 with little change in overall RMSD, but this deficiency would expose itself during downstream simulations.
The 2009 RosettaAntibody benchmark set provides a more comprehensive view of improvements to de novo CDR H3 structure prediction on homology-modeled frameworks. Supplemental Table V shows the per-target summary of modeling and includes the lowest RMSD achieved for the target in the 2009 study (15). There is a sampling improvement in 49 targets, with average (including the targets with larger RMSDs) improved RMSDs of 0.1 Å for very short loops (4–6 residues), 0.6 Å for short loops (7–9 residues), 0.7 Å for medium loops (10–11 residues), 1.3 Å for long loops (12–14 residues), and 1.1 Å for very long loops (17–22 residues). Three of the five targets with degraded performance had RMSDs of 0.2 Å higher (1dqq: length 5, 0.2 Å → 0.4 Å; 1z3g: length 6, 1.8 Å → 2.0 Å; 1bql: length 7, 1.6 Å → 1.8 Å), one increased by 0.4 Å (2h2h: length 12, 1.6 Å → 2.0 Å), and one increased by 0.6 Å (1bj1: length 14, 1.6 Å → 2.2 Å). Interestingly, 24 of the targets have a lower RMSD model in the 10 top-scoring models than was even sampled in 2009. The smaller change in performance for shorter loops is due to the fact that they were predicted to high accuracy in the initial study, and can only improve marginally, whereas longer loops improve more substantially.
Discussion
Ab structure prediction has made great strides, but accurately modeling CDR H3 loops remains elusive due to its variable length, sequence, and structure across Abs. We found that state-of-the-art de novo loop prediction methods fail to generate near-native conformations based on both RMSD and the kink geometry parameters, τ101 and α101. In our previous work (2), we hypothesized that the kink creates the observed structural diversity of CDR H3 loops. In this study, we developed a constraint based on these parameters and tested it on a set of high-quality CDR H3 structures. The effectiveness of this constraint for enabling high-resolution H3 predictions suggests that creating models with native-like values for α101 and τ101 drives better results for the whole loop. We found that this constraint is also effective in improving CDR H3 predictions on homology-modeled frameworks and producing H3 models of sufficient quality to successfully dock to Ag.
Part of this study required constructing a set of high-resolution CDR H3 loops from crystal structures. Not all CDR H3 loops meet the strict quality cutoffs that were used in this study. It is possible that some of the loops that meet these criteria are simply more stable or rigid than some other H3 loops. If that is the case, these loops may be easier modeling targets. Regardless, because the atomic coordinates of these loops are well defined, structural comparisons between models and the loops have clear meaning.
Although the atomic coordinates may be well defined, the static conformation found in crystal structures may not tell a complete story, and this limitation may be particularly pronounced in loop regions. However, we do not think that all loop regions are extremely flexible; indeed, such a view could lead one to conclude that the entire objective of loop structure prediction is poorly defined at best and a misguided exercise in futility at worst. In the cases of Abs that have been crystallized in both the bound and free forms (predominantly Abs with peptide Ags), CDR loop conformations do not change substantially (typically <1.0 Å RMSD; 37% of CDR H3 loops deviate >1.0 Å) (46) and, thus, can be treated as rigid. Additional evidence of CDR loop rigidity comes from studies that compare flexibility estimates of naive and mature Abs and find that one of the effects of somatic hypermutation is a decrease in loop flexibility (51–53). The degree to which these trends hold for Abs with protein Ags, or vary as a function of the length of the CDR H3 loop, remains to be established.
Although there are not enough loops at each length to draw conclusions about prediction performance as a function of length, the longest loop where the average RMSD of the 10 top-scoring models is subangstrom without constraints is 13 residues, and the longest with constraints is 14. This difference seems small, but the RMSD of the top-scoring model for some longer loops reveals the extent to which the kink constraint improves the performance of de novo loop modeling. For example, for the 19-residue CDR H3 loop in 2fb4, the RMSD of 14.67 Å without constraints is reduced to 3.63 Å (Table III). Although this is a substantial improvement, these models are unlikely to successfully dock. Further improvements might arise from using more cycles of NGK for longer loops, generating more models, or incorporating knowledge of additional local structures (e.g., β turns or additional constraints to the terminal residues of the loop) (29) in addition to the kink into the simulation.
A challenge remains in addressing nonkinked CDR H3 loops. In this study, we applied the kink constraint to all CDR H3 loops without considering whether the native loop has a kink. We made this decision for a few reasons. Most importantly, the vast majority of CDR H3 loops are kinked, and we remain predominantly interested in developing a method to predict those conformations reliably. Second, all attempts to classify the base geometry of the H3 loop by sequence alone have not held up as more structures have been solved, leaving us with no way to determine when we should not apply the constraint. While carrying out this work, we developed several alternate hypotheses to account for nonkinked H3 loops, including using the constraint stochastically with a probability in accordance with the observed populations of kinked loops (that is, each model would have an 85% chance of being constrained). Unfortunately, Rosetta was unable to generate low-RMSD conformations of the nonkinked H3 loops in our data set even in the absence of the constraint. Because of this, there is no clear way to test alternatives to the approach described in this study at this time.
The sampling performed in the low-resolution stage may lose information of some important interactions that are mediated by side chains. Some modeling methods have been developed that operate in all-atom mode throughout the entire simulation. One such method available in Rosetta is stepwise assembly (54), which builds the loop one residue at a time. Although this method has shown promise, it is extremely computationally expensive and is therefore not yet well suited to Ab homology modeling tasks that rebuild the H3 loop while simultaneously sampling VL–VH orientation. Nevertheless, an all-atom loop-modeling routine may enable Rosetta to capture critical side-chain interactions.
In fact, an anti-dansyl Ab, which has been crystallized at two different pH values (pH 5.25, PDB accession code 1dlf; pH 6.75, 2dlf), has a histidine in its CDR H3 loop (55). Nakasako et al. (55) found that the structure of the Ab remained the same except for the CDR H3 loop, which undergoes a pH-dependent conformational change, presumably controlled by the protonation state of the histidine within the loop. Preliminary tests using pH-aware loop prediction and side-chain packing failed to capture the effect of the protonation state (56), again highlighting the need for side-chain accuracy and increased score function detail.
Although this study mostly focused on modeling CDR H3 loops on the crystallographic framework, the ultimate test of the utility of a new loop modeling method in the context of Ab modeling is predicting CDR H3 conformations on a homology framework. We tested the new method on a homology modeled framework by comparing the distribution of CDR H3 RMSDs from the standard method, which uses a filter based on α101, and the new constrained method, which evaluates a potential based on both τ101 and α101. The new constrained method produces a substantially larger fraction of low-RMSD models, which should enable the development of new protocols that focus more time on other aspects of Ab modeling, for example, VL–VH orientation optimization (27).
The new method can generate subangstrom loop predictions for 20 of the 40 very short, short, and medium-length loops in the 2009 benchmark set. Although progress is being made, long loops still pose a challenge; on a homology framework, the best sampled model for all 10 of the long loops in the 2009 benchmark set lies within 1.0–2.0 Å RMSD.
Another goal for Ab structure prediction is to generate models of sufficient quality to be used in downstream applications, namely Ab–Ag docking. To assess the quality of the predicted H3 conformations, we used EnsembleDock with a set of the 10 top-scoring models and the bound conformation of the Ag. The simulation correctly predicts the conformation of the complex. This case is idealized in the sense that the CDR H3 loop was modeled on the crystal framework and the bound form of the Ag was used, but the successful simulation marks a necessary milestone in obtaining reliable blind docked complex predictions from sequence.
The success of both the homology modeling and docking simulations is encouraging, and these simulations should serve as the starting point for future development of improved Ab–Ag docking methods. In summary, a structure-based kink constraint has enabled accurate de novo structure prediction of the most diverse region of Abs. We found that performance degrades on homology modeled frameworks, which reinforces the importance of the environment of the loop. Coupled with the chain pairings from Teplyakov et al. (35), there is support for the hypothesis we introduced previously: the C-terminal kink is the key to CDR H3’s diversity and is stabilized by the environment of the loop (2).
Appendix
Determining the parameters for the flat harmonic potential
Because the penalty should begin after 1.0 σ, t can be set to σ. Now we solve for ξ to produce the desired penalty schedule. First, we solve for ξ at 3.0 σ as follows:
and then plug in ξ = 2t and evaluate the penalty at 2.0 σ to check the intermediate value
and we find that setting ξ to 2 σ will exactly produce a reasonable penalty schedule with the useful feature of being a factor of four larger at 3.0 σ than at 2.0 σ.
Mixing cyclic coordinate descent and NGK
In addition to KIC and NGK, Rosetta also has implementations of loop modeling methods that use cyclic coordinate descent (CCD) (57) to ensure the continuity of the loop. CCD closure calculates the dihedral angles required to minimize the gap between the loop end points for a single position in the loop, and iterates over each residue and until the loop is closed. In contrast to NGK, CCD-based methods can close loops while maintaining the overall conformation of the starting, open loop. Generally, the CCD-based methods use fragments from known structures for coarse-grained sampling and small perturbations of the conformation of each residue for refinement (58). Because of their object-oriented design, any of the loop modeling methods in Rosetta can be mixed and matched within a single simulation. To couple the more conservative CCD refinement with the aggressive NGK sampling, a combined NGK–CCD simulation can be run. The combined simulation uses the same command line as the constrained NGK simulation, but with
-loops:refine refine_kic changed to -loops:refine refine_ccd.
./loopmodel.macosclangrelease
-native input_file.pdb
-s input_file.pdb
-nstruct 500
-loops:loop_file h3.loops
-loops:remodel perturb_kic
-loops:refine refine_ccd
-loops:outer_cycles 5
-kic_bump_overlap_factor 0.36
-legacy_kic false
-kic_min_after_repack true
-corrections:score:use_bicubic_interpolation false
-loops:kic_omega_sampling
-loops:kic_rama2b
-allow_omega_move
-loops:ramp_fa_rep
-loops:ramp_rama
-ex1
-ex2
-extrachi_cutoff 0
Acknowledgements
We thank Andrew P. Leaver-Fay for cutting the two-body Ramachandran sampling method memory requirements from 5 GB to 160 MB, Nick Marze for running the new method on the 2009 RosettaAntibody benchmark set, as well as the Rosetta developers for the continued development of Rosetta. This work used the Extreme Science and Engineering Discovery Environment, as well as the Maryland Advanced Research Computing Center.
Footnotes
This work was supported by National Institutes of Health Grant R01 GM078221 (to J.J.G.). Andrew P. Leaver-Fay is supported by National Institutes of Health Grant R01 GM73151, the Extreme Science and Engineering Discovery Environment is supported by National Science Foundation Grant ACI-1053575, and the Maryland Advanced Research Computing Center is supported by the State of Maryland.
The online version of this article contains supplemental material.
Abbreviations used in this article:
- AMA-II
Antibody Modeling Assessment II
- CAPRI
critical assessment of predicted interactions
- CCD
cyclic coordinate descent
- fa
full atom
- I_RMSD
interface root mean square deviation
- KIC
kinematic closure
- L_RMSD
ligand root mean square deviation
- NGK
next-generation KIC
- PDB
Protein Data Bank
- RMSD
root mean square deviation.
References
Disclosures
The authors have no financial conflicts of interest.