In Silico Modeling and Functional Interpretations of Cry1Ab15 Toxin from Bacillus thuringiensis BtB-Hm-16

The theoretical homology-based structural model of the Cry1Ab15 δ-endotoxin produced by Bacillus thuringiensis BtB-Hm-16 was predicted using the Cry1Aa template (resolution 2.25 Å). The Cry1Ab15 model resembles the template structure, sharing a common three-domain extended conformation responsible for pore formation and specificity determination. The novel structural differences found are the presence of β0 and α3 and the absence of α7b, β1a, α10a, α10b, β12, and α11a, while α9 is located spatially downstream. Validation with SUPERPOSE and PROCHECK showed that 98% of the modeled residues fold in a favourable and stable orientation, with a total energy Z-score of −6.56; the constructed model has an RMSD of only 1.15 Å. This additional 3D structure information will be helpful in the design of domain-swapping experiments aimed at improving toxicity and will help in elucidating the common mechanism of toxin action.


Introduction
Bacillus thuringiensis (Bt), a soil bacterium, produces a proteinaceous toxin generally referred to as insecticidal crystal protein. This toxin belongs to a large family with a target spectrum spanning insects, nematodes, flatworms, and protozoa [1]. In nature, Cry toxins are produced as crystalline protoxins (hence the name Cry protein) within Bt sporangia. After ingestion by a susceptible insect larva, these protoxins are solubilized and proteolytically cleaved into an active toxin fragment that binds to at least one of four different types of high-affinity receptors and is later inserted into the brush border epithelium. Insertion of the toxin creates pores in the cell membrane, causing leakage of cellular electrolytes. This disruption causes cell lysis and, finally, larval death [2]. So far, Cry1 toxins have been used extensively in insect control studies, either as transgenic spores or as spray formulations. As for the three-dimensional crystal structure of the Cry1 family of proteins, a few of the toxins in solution have been analysed by X-ray diffraction crystallography [3][4][5][6][7][8][9], and a few have been predicted by homology modelling [10][11][12]. These toxins all have different toxicity spectra; in spite of this, they show a similar tertiary organization. This property motivates the elucidation of the three-dimensional structures of the remaining reported Cry1 family members, so that unifying mechanisms underlying toxicity can be established. Many templates for protein structure prediction are currently available from the Protein Data Bank (PDB) [13], and their number is constantly increasing. Template-based modeling has become so accurate that the resulting models can, in many cases, be used to provide molecular replacement information.
Therefore, in this paper, as a further step in structure elucidation, the model of the Cry1Ab15 toxin is reported, based on the hypothesis of structural similarity with the Cry1Aa toxin [7]. This model supports existing hypotheses of receptor insertion and provides a starting point for domain-swapping and mutagenesis experiments among different Cry toxins.
database. The sequence accession number was AAO13302. It was ascertained that the three-dimensional structure of the protein was not available in the Protein Data Bank; hence, the present exercise of developing the three-dimensional model was undertaken.

Template Selection and Structure Prediction.
Homology modeling is an effective approach for predicting the three-dimensional structure of a protein, provided that an experimentally obtained three-dimensional structure of a homologous protein is available. Any experimentally determined structure of a homologous protein can serve as a template. Since template selection is an important factor affecting model quality, a suitable template was searched for using mGenTHREADER [14], an online tool that searches for similar sequences on the basis of both sequence and structural similarity. The target protein was a stretch of 577 amino acids. From the homology search, Cry1Aa (PDB: 1CIY, resolution 2.25 Å) was selected as the template protein. The amino acid sequence alignment between the target (Cry1Ab15) and the template protein was then derived using the MEGA4 software [15]. Finally, the three-dimensional structure of the target protein was predicted by supplying the alignment file to the MODELLER software [16].

Homology Modeling of Cry1Ab15.
Refinement of possible outliers and side-chain static constraints in the developed model was performed on the Summa Lab server [17], and the selected theoretical model was further subjected to a series of tests to evaluate its consistency and reliability. Backbone conformation was evaluated by inspecting the Psi/Phi Ramachandran plot from the RAMPAGE web server [18]. The energy criterion was evaluated with the ProSA web server [19], which compares the model against the potential of mean force derived from a large set of NMR and X-ray crystallographic protein structures of similar size. Deviations between the target and template protein structures were calculated as root mean square deviations (RMSD) with the SUPERPOSE web server [20]. The comparative analysis showed the generated model to be superimposable on the template. Secondary structure was visualized using PDBsum [21], and amino acid sequence alignments were generated with the SAS software [22] (Figure 1). Models were visualized with UCSF Chimera [23] and PyMOL [24] on a personal computer with an Intel quad-core processor and four gigabytes of random access memory. Figures and electrostatic potential calculations were generated with PyMOL 0.99rc6. The final model was submitted to the PMDB database [25] and received the protein model databank (PMDB) identifier PM0076556.
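The RMSD reported by a superposition server can be illustrated with a short calculation. The following is a minimal sketch, assuming the two structures have already been superimposed and that their atoms (for example, alpha carbons) are listed in matched order; the function name `rmsd` and the sample coordinates are hypothetical and are not taken from the Cry1Ab15 model. It does not perform the optimal superposition step itself.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two matched coordinate lists.

    Assumes both structures are already superimposed and that the i-th
    atom of coords_a corresponds to the i-th atom of coords_b.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        # Squared Euclidean distance between the paired atoms.
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Hypothetical coordinates in angstroms, for illustration only.
template_ca = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
model_ca = [(0.0, 0.0, 0.0), (1.0, 0.0, 2.0)]
print(rmsd(template_ca, model_ca))
```

A lower value indicates closer agreement between the paired atoms; which atoms are paired (backbone, alpha carbons, or all heavy atoms) changes the result, so the atom selection should always be stated with the number.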

Results and Discussion
Sequence alignment showed 88.3% identity (Smith-Waterman score: 3356; Z-score: 3981.3; E-value: 6.4e-215) between Cry1Ab15 and Cry1Aa. A model tends to be reliable if the sequence identity between the template and the target protein is above 40%, whereas reliability is low when identity falls below 20% [26]. The identity in the present case is sufficiently high to carry out theoretical modeling of the Cry1Ab15 toxin stretch of residues 84-661 (Figure 1). Sequence alignment of domain I, domain II, and domain III was straightforward within the possible limits of flanking domains. Domain III is quite well conserved on both the N-terminal and C-terminal sides. Domain I is composed of residues 86-341 and consists of 9 α-helices and two small β-strands. All the helices in the Cry1Ab15 model were slightly longer than those in Cry1Aa (Table 1).

Figure 1: Amino acid sequence alignment of Cry1Ab15 with Cry1Aa (1ciy:A). Residues highlighted in red represent helix, those in blue strand, those in green turn, and those in black coil. The alignment was generated using the SAS software.
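To make the identity figure concrete, the sketch below shows one common way of computing percent identity from a pairwise alignment. It is an illustration under stated assumptions, not the procedure used by the alignment software cited here: identity is taken as identical columns divided by columns in which neither sequence has a gap, and other definitions (for example, dividing by the full alignment length) give different values. The function name and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over alignment columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # gapped columns are excluded from the denominator
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


# Toy aligned fragments (hypothetical, not Cry toxin sequences).
print(percent_identity("MKT-AL", "MRTQAL"))  # 4 identical of 5 compared columns: 80.0
```

Under the thresholds quoted above, a value computed this way above 40% would fall in the range where a template is generally considered reliable.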
The amphiphilicity (Hopp and Woods) values indicated the exposed nature of a few of the helices of domain I (α1, α2a, α2b, α3, and α6). These values correspond well with the accessibility calculated with Swiss PDB, except for α1, which is packed against domain II (Figure 2). It is possible that this helix has some mobility, particularly since one of the cutting sites for gut proteases is located close to the middle of this helix [27]. On the other hand, membrane insertion and pore formation are thought to occur through elements of domain I, which is composed of a bundle of six amphipathic helices surrounding the highly hydrophobic helix α5 [7]. Spectroscopic studies with synthetic peptides corresponding to domain I helices revealed that α4 and α5 have the greatest propensity for insertion into artificial membranes, although insertion and pore formation were more efficient when α4 and α5 were connected by a segment analogous to the α4-α5 loop of the toxin [28,29].

Figure 2: Two-dimensional structure annotation showing the sequential arrangement of helices and sheets in the Cry1Ab15 toxin molecule, generated using PDBsum (http://www.ebi.ac.uk/pdbsum/). Helices are drawn as spirals and labeled H1, H2, and so on; strands are drawn as arrows and labeled by their sheets (A and B); the marked motifs are beta turns and gamma turns, and the bent tube shape denotes a beta hairpin.
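The Hopp and Woods values mentioned above come from a per-residue hydrophilicity scale that is usually averaged over a sliding window. The sketch below illustrates such a profile. It is an assumption-laden example: the window of six residues follows the common convention for this scale, the function name and the test sequence are hypothetical, and the text does not state how the amphiphilicity values for the domain I helices were actually derived from the scale.

```python
# Hopp and Woods hydrophilicity values (one-letter amino acid codes).
# Positive values are hydrophilic, negative values hydrophobic.
HOPP_WOODS = {
    "A": -0.5, "R": 3.0, "N": 0.2, "D": 3.0, "C": -1.0,
    "Q": 0.2, "E": 3.0, "G": 0.0, "H": -0.5, "I": -1.8,
    "L": -1.8, "K": 3.0, "M": -1.3, "F": -2.5, "P": 0.0,
    "S": 0.3, "T": -0.4, "W": -3.4, "Y": -2.3, "V": -1.5,
}


def hopp_woods_profile(sequence, window=6):
    """Sliding-window mean hydrophilicity; one value per window position."""
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [HOPP_WOODS[res] for res in seq]  # KeyError on non-standard residues
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


# Hypothetical peptide: a charged stretch followed by a hydrophobic stretch.
profile = hopp_woods_profile("KKKKKKLLLLLL")
print(len(profile), profile[0], profile[-1])
```

Windows with high mean values point to segments likely to be solvent exposed, which is the sense in which the helices listed above are described as exposed.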
A particularly large number of single-site mutations that alter amino acids in these helices and strongly reduce the toxicity and pore-forming ability of the toxin have been characterized [30][31][32][33]. Also, a site-directed chemical modification study has provided strong evidence that α4 lines the lumen of the pores formed by the toxin [34]. Recent studies have established that toxin activity is especially sensitive to modifications not only in the charged residues of α4 [33] but also in most of its hydrophilic residues [30]. Furthermore, the loss of activity of most of these mutants did not result from altered selectivity or pore size, but from a reduced pore-forming capacity of the toxin [34]. The charge distribution pattern in the Cry1Ab15 theoretical model corresponds to a negatively charged patch along β4 and β13 (Figures 3 and 4) of domains II and III, respectively. The Cry1Ab15 domain I model agrees well with the data of Gerber and Shai [29], who suggested that α4 and α5 insert into the membrane in an antiparallel manner as a helical hairpin. According to the surface electrostatic potential of helices α4 and α5, there appears to be a neutral region in the middle of the helices, which probably indicates, if the umbrella model is taken to be correct, that both helices cross the membrane with their polar sides exposed to the solvent, as suggested by the mutagenesis experiments of Girard et al. [31] with the Cry1Ac toxin. This region is also the most conserved among the Cry toxins. Girard et al. [31] demonstrated that mutations in the base of helix α3 and in the loop between α3 and α4 that alter the balance of negatively charged residues may cause loss of toxicity. Mutations in helices α2 and α6 and in the surface residues of α3 have no important effect on toxicity, whereas helices α4 and α5 seem to be very sensitive to mutations.
Helix α1 probably does not play an important part in toxin activity after cleavage of the protoxin. It is possible that mutations aimed at increasing the amphiphilicity of these helices will improve the pore-forming activity of Cry1Ab15-type toxins. The structure of domain I of the toxin, the effect of site-directed mutagenesis in this domain on toxin activity, and the studies with hybrid toxins [35][36][37] all suggest that domain I, or parts of it, inserts into the membrane and forms a pore. This idea is further supported by studies showing that truncated proteins corresponding to domain I of the CryIA(c) δ-endotoxin [38] form ion channels in model lipid membranes similar to those formed by the intact toxins. After receptor binding, the network of contacts between α7, the helix at the interface between the pore-forming domain and the receptor-binding domain, and the α5, α6, and, presumably, α4 helices may assist the insertion of the α4-α5 hairpin into the membrane by unpacking the helical bundle that exists in the non-membrane-bound form of the toxin. This hypothesis might account for the observation that α7 mutants are susceptible to proteolysis by either trypsin or midgut juice [39]. Our model also supports the notion that the α4-α5 hairpin is the major structural component in the lining of the pores formed by the δ-endotoxin. Therefore, it may be possible to create toxin variants with better membrane permeability potential by stabilizing the antiparallel hairpin structure through cross-linking of α4 with α5. This postulation is important because mutations within transmembrane segments of proteins usually decrease or have no effect on the biological activities of these proteins. Thus, it is conceivable that the introduction of several salt bridges or other bonds between the α4 and α5 helices, or the stabilization of the α4-α5 hairpin by the creation of bridging interactions between the α3-α4 and α5-α6 loops, may result in significantly enhanced toxic activity. Other studies also support the umbrella-like model for domain I insertion into membranes [34,40,41].

Figure 5: Superimposed backbone 3D structures of Cry1Aa1 (green) and Cry1Ab15 (red), shown in front, back, side-back, and side views. The RMSD for backbone and alpha carbons is 1.15 Å. The image was generated using the SUPERPOSE software (http://wishart.biology.ualberta.ca/SuperPose/).

As for other Cry toxins, domain II of the Cry1Ab15 toxin consists of three Greek key beta sheets arranged in a beta prism topology. It comprises residues 350-508, one helix (α8), and 11 β-strands (Table 1). In the three-domain Cry toxins, specificity is mostly attributed to their capacity to bind to certain proteins located on the surface of the intestinal membrane through specific segments of domains II and III, composed mainly of β-sheets [42,43].
Loop β4-β5 is mostly hydrophilic, and the charged residues located at the tip of the loop are probably important determinants of insect specificity. As in loop β2-β3, a few glycine residues are also present before a negatively charged residue, supporting the hypothesis that the correct orientation of charged residues in the specificity loops could be important in receptor recognition. Mutations in defined regions of the Cry1Aa toxin have identified residues 365-371 (equivalent to residues in the Cry1Ab15 β6-β7 loop) as essential for binding to the membrane of midgut cells of Bombyx mori [35,44]. In the Cry1Ab15 model, this region is shorter than its counterpart in Cry1Aa. Loop β2-β3 also seems able to modulate the toxicity and specificity of Cry1C [45]. The dual specificity of Cry2Aa for Lepidoptera and Diptera has been mapped to residues 307-382, which correspond in the Cry1Ab15 theoretical model to sheet 1, strand β6, and loop β6-β7. Domain III comprises residues 471-608 and shows high conservation of residues; the only important modification is a 3-residue deletion between β16 and β17. Several studies indicate that site mutations in conserved blocks reduce toxicity and alter channel properties, at least in Cry1Ac [7] and Cry1Aa [42,46], and divergence in the block 5 element [8,41] suggests an alternative mechanism of membrane permeabilization. Finally, the recognition of artefacts and errors in experimental and theoretical structures remains a problem in the field of structure modeling. A structural comparison of the Cry1Aa toxin with the theoretical model of the Cry1Ab15 protein indicates correspondence with the general model for a Cry protein; the superimposed backbone traces showed low RMS deviations (Figure 5).
The comparison between the overall energy of the developed structure and those of experimentally determined structures in the ProSA database validated the developed model as folded close to experimentally determined, natural structures (Figure 6). The Ramachandran plot analysis (Figure 7) supported this conclusion by showing that most of the residues (98%) have φ and ψ angles in the core and allowed regions, except for nine residues that fell in the outlier region. Most bond lengths, bond angles, and torsion angles were in the range of values expected for a naturally folded protein (Figure 7).
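The backbone angles plotted in a Ramachandran analysis are dihedral angles defined by four consecutive backbone atoms: φ uses C of the previous residue, N, Cα, and C, while ψ uses N, Cα, C, and N of the next residue. The sketch below shows how such an angle can be computed from atomic coordinates. It is a generic geometric illustration, not the implementation used by RAMPAGE or PROCHECK, and the function names and test coordinates are hypothetical.

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a, b):
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle in degrees defined by four points."""
    b0 = _sub(p0, p1)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)
    norm = math.sqrt(_dot(b1, b1))
    if norm == 0.0:
        raise ValueError("the two central points must be distinct")
    axis = (b1[0] / norm, b1[1] / norm, b1[2] / norm)
    # Components of b0 and b2 perpendicular to the central bond axis.
    v = _sub(b0, tuple(_dot(b0, axis) * c for c in axis))
    w = _sub(b2, tuple(_dot(b2, axis) * c for c in axis))
    x = _dot(v, w)
    y = _dot(_cross(axis, v), w)
    return math.degrees(math.atan2(y, x))


# Hypothetical coordinates: the fourth point is rotated a quarter turn
# about the central bond relative to the first point.
print(dihedral((1.0, 0.0, 0.0), (0.0, 0.0, 0.0), (0.0, 0.0, 1.0), (0.0, 1.0, 1.0)))
```

Applying this function to each residue's backbone atoms yields the (φ, ψ) pairs that are then classified into core, allowed, and outlier regions.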

Conclusions
In conclusion, the evidence presented here, based on the identification of structurally equivalent residues of Cry1Aa in the Cry1Ab15 toxin through homology modeling, indicates that, owing to the high amino acid homology between these two toxins, they share a common three-dimensional structure. In Cry1Aa and Cry1Ab15, the most variable regions are in the loops of domain II, which is responsible for the specificity of these toxins. Structural comparison indicates a correspondence to the general model for a Cry protein (an α + β structure with three domains); the few differences are the presence of β0 and α3 and the absence of α7b, β1a, α10a, α10b, β12, and α11a, while α9 is located spatially downstream. This is the first model of a Cry1Ab15 protein, and its importance can be appreciated since the members of this group of toxins are potentially important entomopathogenic candidates.

Conflict of Interests
The author declares that he has no direct financial relation with the commercial identities mentioned in the paper that might lead to a conflict of interests including the necessary citation.