Chapter 3. HDInsight Service on Azure

Microsoft Azure is a flexible cloud platform that enables enterprises of any size to quickly rent resources on demand to build, deploy, and manage a wide range of IT applications. HDInsight is a 100 percent Apache Hadoop-based cloud service that leverages the Azure platform.

In this chapter, we will discuss how to use the HDInsight service on the Azure cloud to build a Data Lake that can scale to petabytes on demand. We will cover the following topics:

  • Registering for an Azure account
  • Provisioning an HDInsight cluster
  • HDInsight Management dashboard
  • Exploring a cluster using the remote desktop
  • Deleting the HDInsight cluster
  • HDInsight Emulator for development

Registering for an Azure account

The first step to access ...

