Technical requirements

The sample data you'll need is available from this book's GitHub repository at https://github.com/PacktPublishing/R-Bioinformatics-Cookbook. If you want to use the code examples as they are written, then you will need to make sure that this data is in a sub-directory of whatever your working directory is.

Here are the R packages that you'll need. In general, you can install these with install.packages("package_name"). The packages listed under Bioconductor need to be installed with the dedicated installer. If you need to do anything further, installation will be described in the recipes in which the packages are used:

  •  Bioconductor
    • Biobase 
  • caret
  • class
  • dplyr
  • e1071
  • factoextra
  • fakeR
  • magrittR
  • randomForest
  • RColorBrewer

Bioconductor is huge and has its own installation manager. You can install the manager with the following code:

if (!requireNamespace("BiocManager"))
    install.packages("BiocManager")

Then, you can install the packages with this code:

BiocManager::install("package_name") 
 Further information is available at https://www.bioconductor.org/install/.

Normally, in R, a user will load a library and use the functions directly by name. This is great in interactive sessions but it can cause confusion when many packages are loaded. To clarify which package and function I'm using at a given moment, I will occasionally use the packageName::functionName() convention. 

Sometimes, in the middle of a recipe, I'll interrupt the code so you can see some intermediate output or the structure of an object that's important to understand. Whenever that happens, you'll see a code block where each line begins with ## (double hash) symbols. Consider the following command:

letters[1:5]

This will give us the following output:

## a b c d e

Note that the output lines are prefixed with ##.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.144.237.77