Azure HDInsight

Azure HDInsight is a service which deploys Hadoop components in the form of clusters in the cloud. Apache Hadoop is a very popular open source framework for processing and analyzing big data. The Hadoop components on Azure make it very easy to process data because Azure handles the creation of the clusters.

HDInsight offers the following cluster types:

  • Apache Hadoop: Hadoop can process data in parallel using MapReduce, a programming language that can process data efficiently.
  • Apache Spark: Spark can process data in parallel as well by using in-memory processing for better performance. You can use it with SQL, streaming data, and machine learning solutions.
  • Apache HBase: This offers a NoSQL database on Hadoop which can be ...

Get Architecting Microsoft Azure Solutions - Exam Guide 70-535 now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.