This is the Title of the Book, eMatter Edition
Copyright © 2012 O’Reilly & Associates, Inc. All rights reserved.
Chapter 5
Previous chapters explored what biological sequences are, how they are aligned, and
how similarity is measured. This chapter discusses BLAST itself. What is BLAST?
The simple answer is that it is a set of programs that search sequence databases for
statistically significant similarities. How BLAST finds those similarities is
harder to explain: a search involves multiple steps and many controlling
parameters. Understanding the theoretical framework will help you design and
interpret BLAST experiments, and give you a foundation for troubleshooting when
your search produces unexpected results.
The Five BLAST Programs
The five traditional BLAST programs are BLASTN, BLASTP, BLASTX, TBLASTN, and
TBLASTX. BLASTN compares nucleotide sequences to one another (hence the N). The
other four compare protein sequences, translating a nucleotide query or
database into protein where needed (see Table 5-1).
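To make "nucleotide translated into protein" concrete, here is a minimal Python sketch of a conceptual translation in all six reading frames (three per strand). It illustrates the idea and is not BLAST's own code. It assumes the standard genetic code, and the function names and the "+1" to "-3" frame labels are chosen here for illustration only.

```python
# Standard genetic code, with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Translation table used to complement each base.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate DNA codon by codon; unknown codons become 'X'.

    Trailing bases that do not fill a whole codon are ignored.
    """
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translations(dna):
    """Return a dict mapping a frame label ('+1'..'-3') to its translation."""
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


if __name__ == "__main__":
    for label, protein in six_frame_translations("ATGGCCTAA").items():
        print(label, protein)
```

Each of the six strings can then be compared as a protein sequence, which is conceptually what the translating programs do with a nucleotide query or database.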
Table 5-1. Traditional BLAST programs
Program   Database                Query                   Typical uses
BLASTN    Nucleotide              Nucleotide              Mapping oligonucleotides, cDNAs, and PCR products to a
                                                          genome; screening repetitive elements; cross-species
                                                          sequence exploration; annotating genomic DNA; clustering
                                                          sequencing reads; vector clipping
BLASTP    Protein                 Protein                 Identifying common regions between proteins; collecting
                                                          related proteins for phylogenetic analyses
BLASTX    Protein                 Nucleotide translated   Finding protein-coding genes in genomic DNA; determining
                                  into protein            if a cDNA corresponds to a known protein
TBLASTN   Nucleotide translated   Protein                 Identifying transcripts, potentially from multiple
          into protein                                    organisms, similar to a given protein; mapping a protein
                                                          to genomic DNA
TBLASTX   Nucleotide translated   Nucleotide translated   Cross-species gene prediction at the genome or transcript
          into protein            into protein            level; searching for genes missed by traditional methods
                                                          or not yet in protein databases
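The Database and Query columns of Table 5-1 can be read as a lookup from sequence types to candidate programs. The following Python sketch encodes only those two columns. The names PROGRAMS and programs_for are hypothetical helpers introduced here, not part of any BLAST distribution.

```python
# Sequence types per program, taken from the Database and Query columns of
# Table 5-1. "Translated" entries are nucleotide sequences on input.
# Each value is (database type, query type, database translated?, query translated?).
PROGRAMS = {
    "BLASTN":  ("nucleotide", "nucleotide", False, False),
    "BLASTP":  ("protein",    "protein",    False, False),
    "BLASTX":  ("protein",    "nucleotide", False, True),
    "TBLASTN": ("nucleotide", "protein",    True,  False),
    "TBLASTX": ("nucleotide", "nucleotide", True,  True),
}


def programs_for(query_type, database_type):
    """Return the programs whose input types match the given sequences.

    Both arguments must be "nucleotide" or "protein".
    """
    valid = {"nucleotide", "protein"}
    if query_type not in valid or database_type not in valid:
        raise ValueError("sequence types must be 'nucleotide' or 'protein'")
    return [
        name
        for name, (db, query, _db_translated, _query_translated) in PROGRAMS.items()
        if query == query_type and db == database_type
    ]


if __name__ == "__main__":
    # A nucleotide query against a nucleotide database matches two programs:
    # BLASTN compares the nucleotides directly, TBLASTX translates both sides.
    print(programs_for("nucleotide", "nucleotide"))
    print(programs_for("nucleotide", "protein"))
```

Only the nucleotide-versus-nucleotide case is ambiguous. There the choice between BLASTN and TBLASTX depends on whether the comparison should be made at the nucleotide or the protein level, as the Typical uses column suggests.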