CHAPTER 36
Genome Annotation in Prokaryotes

CS Mukhopadhyay and RK Choudhary

School of Animal Biotechnology, GADVASU, Ludhiana

36.1 INTRODUCTION

The genomic machinery harbors several signals, elements and conserved regions, namely, promoter signals, transcription start and termination signals (start codon “ATG” and termination codons: Ochre: “TAA”, Amber: “TAG”, Opal: “TGA”), codons, exons, intervening introns (delimited by exon–intron boundaries), etc. Use of suitable software enables the researchers to identify and annotate the various regions of the genome. In this chapter, we will use the GeneMark program to learn gene finding and genome annotation in prokaryotes.

36.2 OBJECTIVE

To annotate the partial genome of a prokaryotic organism, using the GeneMark.hmm (Lukashin and Borodovsky, 1998) online tool.

36.3 PROCEDURE

  1. Download the nucleotide sequence from Nucleotide database (NCBI) and save it in Notepad in FASTA format:
    1. The genomic sequence of Yersinia pestis (AL590842.1: Yersinia pestis CO92 complete genome) has been downloaded from NCBI (http://www.ncbi.nlm.nih.gov/nuccore/AL590842.1).
    2. Now the sequence is to be saved (in FASTA format) in a .txt file (Notepad), using the “send to” option available on the right‐hand side of the top of the page.
  2. Open GeneMark.hmm for Prokaryotes (Version 2.8) (Figure 36.1), using the URL: http://exon.gatech.edu/gmhmmp.cgi (optionally, you can go directly to the homepage of GeneMark: http://opal.biology.gatech.edu/GeneMark/).
  3. Click “GeneMark hmm” on the left‐hand side of the shortcut menu (arrow) and select “prokaryotes”.
  4. Browse the sequence file by clicking on the “Browse…” button, or paste the sequence into the box provided.
  5. Select the species from the drop‐down option. If the exact species is not there, select a similar type of species.
  6. Check the boxes located at the bottom, as per the output requirements:
    1. to obtain the in silico translations of the predicted genes;
    2. to get the nucleotide sequences of the predicted genes;
    3. to produce on‐screen PDF graphics;
    4. to generate PostScript graphics (via email).
  7. Click “Start GeneMark.hmm” to start gene searching under the “Action” tab (See Figure 36.2).
Homepage of the GeneMark online tool displaying the gene predictions in bacteria, archaea, metagenomes, and metatranscriptomes, in eukaryotes, in transcripts, and in viruses, phages, and plasmids.

FIGURE 36.1 Homepage of the GeneMark online tool.

Browse GeneMark.hmm prokaryotic manual with sections for Input sequence and Select species, Action, Options, and Advanced options.

FIGURE 36.2 Specifying the parameters in GeneMark.hmm for prokaryotes for gene finding and annotation.

36.4 INTERPRETATION OF GENEMARK OUTPUT

  1. The program gives the results in tabular format: Gene (as serial number), Strand (positive or negative strand), Left End and Right End (start and end nucleotide number of the gene), Gene length, and Class.
  2. This is followed by the predicted sequence of the translated amino acids and the nucleotide sequence of the gene (if the options “Translate predicted genes into proteins” and “Sequences of predicted genes” have been checked) (see Figure 36.3).
Output pages of the GeneMark online tool for prokaryotic gene prediction, displaying the coordinates of predicted genes and the nucleotide sequence.

FIGURE 36.3 The output pages of the GeneMark online tool for prokaryotic gene prediction.

36.5 QUESTIONS

  1. 1. Download the given sequence form NCBI, predict the possible genes and annotate them:
    1. BX248333.1
    2. NZ_KK354537.1
  2. 2. Compare the genome annotation results of RAST and GeneMark using the sequence from NCBI BX248333.1 (accession number).
  3. 3. What are the salient points to be considered while annotating a given DNA sequence of a prokaryote?
  4. 4. How can we identify the novel genes which are missed by a genome annotation/prediction tool?
  5. 5. Please annotate the cloning vector pRB223, using suitable tools.
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.118.166.94