Summary

In this chapter, we discussed machine learning and why it is everywhere. We also looked at supervised and unsupervised machine learning. We discussed the reason that machine learning creates its own vocabulary and how it relates to the statistical vocabulary.

Many other methods were discussed:

  • tree models, their strengths and weakness
  • essemble methods such as random forests
  • hierarchical and k-means clustering

Finally, the chapter introduced feedforward neural networks with R using the h2o package. In the next chapter, we will cover ways in which we can evaluate the quality and characteristics of various datasets in an effort to find insights from the data.

Get Hands-On Data Science with R now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.