Delve into the key concepts of Hadoop and get a thorough understanding of the Hadoop ecosystem
This book jumps into the world of Hadoop ecosystem components and its tools in a simplified manner, and provides you with the skills to utilize them effectively for faster and effective development of Hadoop projects.
Starting with the concepts of Hadoop YARN, MapReduce, HDFS, and other Hadoop ecosystem components, you will soon learn many exciting topics such as MapReduce patterns, data management, and real-time data analysis using Hadoop. You will also get acquainted with many Hadoop ecosystem components tools such as Hive, HBase, Pig, Sqoop, Flume, Storm, and Spark.
By the end of the book, you will be confident to begin working with Hadoop straightaway and implement the knowledge gained in all your real-world scenarios.
What You Will Learn
Get introduced to Hadoop, big data, and the pillars of Hadoop such as HDFS, MapReduce, and YARN
Understand different use cases of Hadoop along with big data analytics and real-time analysis in Hadoop
Explore the Hadoop ecosystem tools and effectively use them for faster development and maintenance of a Hadoop project
Demonstrate YARN's capacity for database processing
Work with Hive, HBase, and Pig with Hadoop to easily figure out your big data problems
Gain insights into widely used tools such as Sqoop, Flume, Storm, and Spark using practical examples
Downloading the example code for this book. You can download the example code files for all Packt books you have purchased from your account at http://www.PacktPub.com. If you purchased this book elsewhere, you can visit http://www.PacktPub.com/support and register to have the files e-mailed directly to you.