This is the Title of the Book, eMatter Edition
Copyright © 2012 O’Reilly & Associates, Inc. All rights reserved.
168
|
Chapter 10: Installation and Command-Line Tutorial
Executables
Let’s assume the tarball has been downloaded to /usr/pkg/wu-blast, and you nor-
mally keep your executables in /usr/local/bin. Issue the following commands to put
the executables in your path.
ln -s /usr/pkg/wu-blast/blasta /usr/local/bin/blastn
ln -s /usr/pkg/wu-blast/blasta /usr/local/bin/blastp
ln -s /usr/pkg/wu-blast/blasta /usr/local/bin/blastx
ln -s /usr/pkg/wu-blast/blasta /usr/local/bin/tblastn
ln -s /usr/pkg/wu-blast/blasta /usr/local/bin/tblastx
ln -s /usr/pkg/wu-blast/xdformat /usr/local/bin
ln -s /usr/pkg/wu-blast/xdget /usr/local/bin
Note, unlike the NCBI program blastall, blasta can not be executed by its own name,
but only through aliases.
Table 10-2. WU-BLAST files and directories
Name Description
blasta The WU-BLAST executable. Unlike the free version, which comes with five different BLAST
executables, the licensed version has only one.
blastn, blastp, blastx, tblastn,
tblastx
Symbolic links (aliases) to blasta. blasta figures out what kind of program to run based on
the name of the symbolic link.
xdformat Executable for formatting both nucleotide and protein databases.
xdget Executable that allows you to retrieve sequences by accession number from a WU-BLAST
database.
nrdb, patdb Programs used to create nonredundant databases. nrdb keeps only unique sequences and
concatenates the descriptions of identical sequences. patdb goes a little further and
removes sequences that are perfect substrings of other sequences.
gb2fasta, gt2fasta, pir2fasta,
sp2fasta
Programs to convert GenBank, SwissProt, and PIR files to FASTA files. gb2fasta extracts the
nucleotides, and gt2fasta extracts the proteins.
filter Directory containing the complexity filtering programs used by WU-BLAST (seg, dust, and
xnu).
matrix Directory containing two subdirectories, aa and nt, which contain, respectively, the amino
acid and nucleotide scoring matrices. The amino acid matrices like BLOSUM 62 are singular
files, but the nucleotide matrices exist in two forms, with the extension 4.2 or 4.4 that cor-
responds to 4- and 16-symbol matrices.
setdb, pressdb Executable used to format protein and nucleotide databases. The xdformat executable
replaces these programs, but they are included for those who prefer the old interface or
require compatibility with older executables.
wu-blastall, wu-formatdb Perl scripts that mimic the NCBI-BLAST command-line interface while executing the WU-
BLAST counterparts.
sysblast Configuration file that allows administrators to enforce system-level resource limitations
on BLAST jobs.