CS Mukhopadhyay and RK Choudhary
School of Animal Biotechnology, GADVASU, Ludhiana
Prediction of the tertiary structure of a peptide sequence is done by either comparative modeling (when the experimentally determined structure of a homologous protein is available) or ab initio approaches (when the structure of the homologous protein is not available).
To predict the tertiary structure of a peptide using the homology modeling approach with the online tool SWISS‐MODEL.
Let us say we are interested in determining (predicting) the structure of pancreatic ribonuclease of buffalo (Bubalus bubalis), which has not been experimentally determined and reported. Hence, we need to search for the reported structure from a related species (e.g., cattle), so that this can be used as a template for homology modeling.
The sequence of the 124‐residue‐long bubaline pancreatic ribonuclease (obtained from PDB) is:
>tr|Q95NE6|Q95NE6_BUBBU Pancreatic ribonuclease (Fragment) OS = Bubalusbubalis PE = 3 SV = 1
K E T A A A K F Q R Q H M D S S T S S A S S S N Y C N Q M M K S R S M T S D R C K P V N T F V H E S L A D V Q A V C S Q K N V A C K N G Q T N C Y Q S Y S T M S I T D C R E T G S S K Y P N C A Y K T T Q A N K H I I V A C E G N P Y V P V H F D A S V
Download a reported structure of a related peptide (in a closely related species, with at least 70% sequence similarity) from a protein data bank (PDB). In the present example, we will download the .pdb file of the experimentally determined structure (X‐ray diffraction, resolution of 1.95 Å) of bovine pancreatic ribonuclease (PDB ID: 4RTE).
Determining the percentage similarity between the sequences (template vs. query sequences of pancreatic ribonucleases):
The alignment result (percent identity matrix) clearly shows that there is a 95.97% match between the two sequences. Hence, the template of the target protein from bovine origin is fit to be used for homology modeling of the structure of bubaline pancreatic RNase.
The URL is http://swissmodel.expasy.org/workspace/index.php?func=show_workspace.
The first‐time user needs to create a new workspace and then start using the tool. A returning user needs to open the existing “workspace”, and then enter the email ID and project title.
Click on “Modelling” at the top of the page, then select “Automated Mode” from the drop‐down menu.
Paste the amino acid sequence (up to 1000 residues) into the sequence box, or enter the PDB ID of the peptide in the specified box. Here we will upload the template file 4RTE.pdb.
Click on “Submit Modeling Request”. You will need to wait until the work is done. The result will be emailed to the specified email address. Download the predicted structure in *.pdb format.
The output page is self‐explanatory. The predicted model can be viewed on a Java‐enabled system by clicking on the image. The output page also mentions the following outputs:
This is the length of the input sequence that has been used for predicting its structure. Sequence identity (95.97%) and E‐value (4.47e‐48) are also given to indicate the degree of homology between the query and template sequences.
QMEAN values (QMEAN4 and QMEAN6) are determined by the program to assess the overall quality of the model and the legitimacy of the bonds between amino acids in the predicted structure. The quality scores are also determined using methods like atomic empirical mean force potential (ANOLEA) and GROMOS. The detail on these terms can be obtained at the SWISS‐MODEL help page: http://swissmodel.expasy.org/workspace/index.php?func=special_help&=#A
The pairwise alignment between the query and template sequences is displayed, along with the secondary structure information in the last row in each alignment block.
It is useful if the ligand is included in the model during prediction. Otherwise, the program predicts the possible ligand interacting sites on the predicted structure.
This is a relevant section to get information about the steps during homology modeling. If the template model is not supporting enough, the program uses an ab initio approach to model the query sequence. This log also indicates which portions have been built ab initio. In our example, the “final total energy” of the predicted model is –7759.335 kJ/mol.
Information about template search and use are given here.
The results obtained should be judged for the quality of the predicted structure using the following main parameters:
Finally, after obtaining the model, the predicted structure must be validated using suitable tools (see Chapter 32) such as ProCheck, WhatCheck, What If, and so on. The correctness of the dihedral angle of rotation (peptide torsion angles Phi(φ) and Psi(ψ)) of the amino acid residues can be checked using Ramachandran’s Plot (RaptorX or any suitable tool).
>Query
M A L K S L V L L S L L V L V L L L V R V Q P S L G K E T A A A K F E R Q H M D S S T S A A S S S N Y C N Q M M K S R N L T K D R C K P V N T F V H E S L A D V Q A V C S Q K N V A C K N G Q T N C Y Q S Y S T M S I T D C R E T G S S K Y P N C A Y K T T Q A N K H I I V A C E G N P Y V P V H F D A S V
3.149.242.9