Data pre-processing, as the name implies, involves transforming and cleaning the data to make it suitable for machine learning. There are various pre-processing methods, and a few of the more common ones are illustrated here.
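Note that data pre-processing should be performed as part of the cross-validation step. That is, pre-processing should not be applied once to the full data set beforehand, but rather within each resampling iteration of the model-building process, so that any parameters it estimates (such as means and standard deviations) come only from the data used for training. Otherwise, information from the held-out data leaks into the model and the performance estimate becomes optimistic. This will be explained in more detail later.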
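As a rough illustration of pre-processing inside cross-validation, the sketch below standardizes a single numeric feature within each fold. It is a minimal example under stated assumptions, not the method used later in this text: the helper names `k_fold_indices` and `standardize_within_folds`, the toy `values` list, and the choice of centering and scaling as the pre-processing step are all hypothetical and chosen only for illustration.

```python
from statistics import mean, pstdev


def k_fold_indices(n_samples, k):
    """Split range(n_samples) into k contiguous folds of near-equal size."""
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in fold_sizes:
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def standardize_within_folds(values, k=3):
    """For each fold, estimate mean/std on the training part only,
    then apply those same parameters to the training and held-out parts."""
    results = []
    for held_out in k_fold_indices(len(values), k):
        held_out_set = set(held_out)
        train = [v for i, v in enumerate(values) if i not in held_out_set]
        test = [values[i] for i in held_out]

        # Parameters come from the training data alone.
        mu = mean(train)
        sigma = pstdev(train) or 1.0  # guard against constant training data

        results.append({
            "mu": mu,
            "sigma": sigma,
            "train_scaled": [(v - mu) / sigma for v in train],
            "test_scaled": [(v - mu) / sigma for v in test],
        })
    return results


# Hypothetical toy feature, used only to exercise the functions above.
values = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
folds = standardize_within_folds(values, k=3)
```

Each fold ends up with its own `mu` and `sigma`, so the held-out values are scaled with parameters they did not contribute to. In practice a library pipeline would usually handle this bookkeeping; the hand-written version is shown only to make the order of operations explicit.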