CHAPTER 30
Protein Structure Prediction Using Threading Method

CS Mukhopadhyay and HK Manku

School of Animal Biotechnology, GADVASU, Ludhiana

30.1 INTRODUCTION

Threading or fold recognition is the method for protein tertiary structure prediction, and is implemented when no significant homologs are searched. The underlying theory assumes the existence of a limited number of distinct protein folds. The threading method thus searches the structural analogs of the query sequence in a library of representative structures through comparative fold recognition. The threading approach relies on fitting the target peptide sequence to one or more entries of a library of protein structures (e.g., protein domains, peptide chains or conserved protein cores). The best fit is evaluated on the basis of an appropriate energy function (determined by a biocomputational algorithm) to determine the best possible templates. In this way, folds are recognized simultaneously, and the whole structure can be predicted.

30.2 OBJECTIVE

To predict the tertiary structure of a protein sequence using the fold recognition method.

30.3 PROCEDURE

3D structure prediction using the threading method is explained by using the RaptorX server:

  1. Query submission: Submit a query sequence for protein structure prediction via the URL http://raptorx.uchicago.edu/StructurePrediction/predict/.
  2. Click on “Submit a new job”.
  3. Create a job identification title so that you can access your results page in the future.
  4. Enter a valid email ID.
  5. Paste your query sequence or upload the sequence in FASTA format, and submit your query.
Home window of RaptorX for job submission, with job name, email Id, amino acid sequence in FASTA format encircled, with arrows pointing the Choose File and Submit buttons at the bottom.

FIGURE 30.1 Home window of RaptorX for job submission.

30.4 RESULTS AND INTERPRETATION

The Results window displays the results as follows:

Section 1 shows the amino acid residues of the query sequence which are modeled and non‐modeled. Modeled residues are labeled as 1, whereas non‐modeled residues are labeled as 0 (shown in (A) of Figure 30.2). Modeled residues are considered as a functional part of the protein, and non‐modeled residues are considered as side loops. Moreover, the result shows the best template selection by GDT (Global Distance Test) score. For a protein with > 100 residues, GDT > 50 is a good indicator (shown in (B) of Figure 30.2). In this example protein sequence, accession no. P79362 in SwissProt has been taken for tertiary structure prediction.

Image described by caption and surrounding text.

FIGURE 30.2 (A) The modeled and non‐modeled residues; (B) 3D cartoon view of selected template; (C) the target‐template alignment view.

30.4.1 The target‐template alignment view

This alignment (shown in (C) of Figure 30.2) is used for constructing the 3D model. The chemical nature of the residues in the alignment dictates the colors of each position:

  1. Blue = Acidic
  2. Magenta = Basic
  3. Red = Hydrophobic
  4. Green = Hydroxyl + Amine.

30.4.2 Tertiary structure prediction

The predicted results indicated that one domain that had the best matching template; 1n5hA had the P‐value of 6.78e–09. The P‐value is an indicator of the goodness of the best template representing the predicted secondary structure. Overall, 64% of the residues were modeled.

Image described by caption.

FIGURE 30.3 3D cartoon view of tertiary structure predicted by RaptorX server.

30.4.3 Secondary structure prediction

The user can switch between the pairs of models (three state and eight state models), or use both of them for secondary structure prediction. Hovering over a residue will display the exact distribution of secondary structure classes in a pop‐up box appearing next to the residue. The three‐state secondary structure types are also abbreviated as “H”, “E”, and “C”, which represent helix, beta sheet, and loop, respectively.

This results window shows the three‐class secondary structure result. A single block contains ten residues, which together form a histogram‐like shape in a block, where each bar indicates the percentage of helix, coil, and beta strand. For example, P at the 135th position contributes 15.5% helix, 18.8% beta strand, and 67.8% coil (as shown in (A) in Figure 30.4). Similarly, an eight‐class secondary structure result has been demonstrated, showing the percentage of alpha helix, isolated beta bridge, extended strand in the beta ladder, 3‐helix, 5‐helix, hydrogen bonded turn, bend, and coil (as shown in (B) in Figure 30.4).

Image described by caption and surrounding text.

FIGURE 30.4 3 Class SS3 and 8 Class SS8 secondary structural element contribution to the 3D structure.

In addition to the structural element prediction, RaptorX also predicts the percentages of the residues which contribute to the disordered conformational randomness of the secondary structure. Moreover, it gives the percentages of the residues which are involved in the formation of the solvent‐accessible surface. Maximum exposure of the residue means a greater contribution towards the solvent‐accessible surface. For example, I at the 122nd position is 81.8% buried, 15.4% in the medium region, and 2.7% exposed. This means that this residue makes the smallest contribution towards the formation of the solvent‐accessible surface.

3-Class SS3 (A) and 8-Class SS8 (B) secondary structure with percentage of helix, beta, α-helix, isolated beta bridge extended strand in beta ladder, 3‐helix, 5‐helix, hydrogen bonded turn, bend, and coil.

FIGURE 30.5 Conformationally ordered and disordered contribution of the residues in the 2D and 3D structure (C). Contribution of each residue in solvent accessibility (D).

The threading method is beneficial for the identification of the conserved domains. The predicted tertiary structure can be further analyzed for model quality validation using Procheck.

30.5 QUESTIONS

  1. 1. Why choose a threading method for tertiary structure prediction when the homology modeling method is already available?
  2. 2. Which template will be the best among the top five templates?
  3. 3. What is solvent accessibility?
  4. 4. Predict the tertiary structure prediction for AB973433 protein using the fold recognition method.
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.138.204.195