Identifying the most important variables in data with random forests

We've already seen the random forests algorithm in use in this chapter, in the Predicting classes with random forests recipe, where we used it for class prediction and regression. Here, we're going to use it for a different purposeā€”to try and work out which of the variables in a dataset contribute most to the classification or regression accuracy of the trained model. This requires only a simple change to the code we already have and a new function or two.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.147.55.42