Chapter 3. An Introduction to Hadoop's Architecture and Ecosystem

From this chapter onwards, we start with the implementation aspects of Machine learning. Let's start learning the platform of choice—a platform that can scale to Advanced Enterprise Data needs (big data needs of Machine learning in specific)—Hadoop.

In this chapter, we cover Hadoop platform and its capabilities in addressing large-scale loading, storage, and processing challenges for Machine learning. In addition to an overview of Hadoop Architecture, its core frameworks, and the other supporting ecosystem components, also included here is a detailed installation process with an example deployment approach. Though there are many commercial distributions of Hadoop, our focus in this ...

Get Practical Machine Learning now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.