Hadoop Blueprints

Book Description

Use Hadoop to solve business problems by learning from a rich set of real-life case studies

About This Book

  • Solve real-world business problems using Hadoop and other Big Data technologies

  • Build efficient data lakes in Hadoop, and develop systems for various business cases like improving marketing campaigns, fraud detection, and more

  • Packed with six case studies to get you going with Hadoop for Business Intelligence

    Who This Book Is For

    If you are interested in building efficient business solutions using Hadoop, this is the book for you. This book assumes that you have basic knowledge of Hadoop, Java, and any scripting language.

    What You Will Learn

  • Learn about the evolution of Hadoop as the big data platform

  • Understand the basics of Hadoop architecture

  • Build a 360-degree view of your customer using Sqoop and Hive

  • Build and run classification models on Hadoop using BigML

  • Use Spark and Hadoop to build a fraud detection system

  • Develop a churn detection system using Java and MapReduce

  • Build an IoT-based data collection and visualization system

  • Get to grips with building a Hadoop-based Data Lake for large enterprises

  • Learn about the coexistence of NoSQL and In-Memory databases in the Hadoop ecosystem

    In Detail

    If you have a basic understanding of Hadoop and want to put your knowledge to use to build fantastic Big Data solutions for business, then this book is for you. Build six real-life, end-to-end solutions using the tools in the Hadoop ecosystem, and take your knowledge of Hadoop to the next level.

    Start off by understanding the various business problems that can be solved using Hadoop, and get acquainted with the common architectural patterns used to build Hadoop-based solutions. Build a 360-degree view of the customer by working with different types of data, and build an efficient fraud detection system for a financial institution. You will also develop a system in Hadoop to improve the effectiveness of marketing campaigns. Build a churn detection system for a telecom company, develop an Internet of Things (IoT) system to monitor the environment in a factory, and build a data lake, all using the concepts and techniques covered in this book.

    The book covers other technologies and frameworks like Apache Spark, Hive, Sqoop, and more, and how they can be used in conjunction with Hadoop. You will be able to try out the solutions explained in the book and use the knowledge gained to extend them further in your own problem space.

    Style and approach

    This is an example-driven book in which each chapter covers a single business problem and describes its solution by explaining the structure of the dataset and the tools required to process it. Every project is demonstrated step by step and explained in an easy-to-understand manner.

    Downloading the example code for this book: you can download the example code files for all Packt books you have purchased from your account. If you purchased this book elsewhere, you can register on the Packt website to have the code files sent to you.

    Table of Contents

    1. Hadoop Blueprints
      1. Hadoop Blueprints
      2. Credits
      3. About the Authors
      4. About the Reviewers
        1. Why subscribe?
      6. Preface
        1. What this book covers
        2. What you need for this book
        3. Who this book is for
        4. Conventions
        5. Reader feedback
        6. Customer support
          1. Downloading the example code
          2. Errata
          3. Piracy
          4. Questions
      7. 1. Hadoop and Big Data
        1. The beginning of the big data problem
          1. Limitations of RDBMS systems
          2. Scaling out a database on Google
          3. Parallel processing of large datasets
        2. Building open source Hadoop
        3. Enterprise Hadoop
          1. Social media and mobile channels
          2. Data storage cost reduction
          3. Enterprise software vendors
          4. Pure Play Hadoop vendors
          5. Cloud Hadoop vendors
        4. The design of the Hadoop system
          1. The Hadoop Distributed File System (HDFS)
            1. Data organization in HDFS
            2. HDFS file management commands
            3. NameNode and DataNodes
            4. Metadata store in NameNode
            5. Preventing a single point of failure with Hadoop HA
            6. Checkpointing process
            7. Data Store on a DataNode
            8. Handshakes and heartbeats
        5. MapReduce
          1. The execution model of MapReduce Version 1
          2. Apache YARN
        6. Building a MapReduce Version 2 program
          1. Problem statement
          2. Solution workflow
            1. Getting the dataset
            2. Studying the dataset
            3. Cleaning the dataset
            4. Loading the dataset on the HDFS
            5. Starting with a MapReduce program
              1. Installing Eclipse
            6. Creating a project in Eclipse
            7. Coding and building a MapReduce program
            8. Run the MapReduce program locally
            9. Examine the result
            10. Run the MapReduce program on Hadoop
              1. Further processing of results
        7. Hadoop platform tools
          1. Data ingestion tools
          2. Data access tools
          3. Monitoring tools
          4. Data governance tools
        8. Big data use cases
          1. Creating a 360-degree view of a customer
          2. Fraud detection systems for banks
          3. Marketing campaign planning
          4. Churn detection in telecom
          5. Analyzing sensor data
          6. Building a data lake
        9. The architecture of Hadoop-based systems
          1. Lambda architecture
        10. Summary
      8. 2. A 360-Degree View of the Customer
        1. Capturing business information
          1. Collecting data from data sources
          2. Creating a data processing approach
          3. Presenting the results
        2. Setting up the technology stack
          1. Tools used
          2. Installing Hortonworks Sandbox
          3. Creating user accounts
          4. Exploring HUE
          5. Exploring MySQL and the Hive command line
          6. Exploring Sqoop at the command line
        3. Test driving Hive and Sqoop
          1. Querying data using Hive
          2. Importing data in Hive using Sqoop
        4. Engineering the solution
          1. Datasets
            1. Loading customer master data into Hadoop
            2. Loading web logs into Hadoop
            3. Loading tweets into Hadoop
          2. Creating the 360-degree view
          3. Exporting data from Hadoop
        5. Presenting the view
          1. Building a web application
          2. Installing Node.js
          3. Coding the web application in Node.js
        6. Summary
      9. 3. Building a Fraud Detection System
        1. Understanding the business problem
        2. Selecting and cleansing the dataset
          1. Finding relevant fields
        3. Machine learning for fraud detection
          1. Clustering as an unsupervised machine learning method
        4. Designing the high-level architecture
          1. Introducing Apache Spark
            1. Apache Spark architecture
            2. Resilient Distributed Datasets
              1. Transformation functions
              2. Actions
            3. Test driving Apache Spark
            4. Calculating the yearly average stock prices using Spark
          2. Apache Spark 2.X
          3. Understanding MLlib
          4. Test driving K-means using MLlib
        5. Creating our fraud detection model
          1. Building our K-means clustering model
            1. Processing the data
        6. Putting the fraud detection model to use
          1. Generating a data stream
          2. Processing the data stream using Spark streaming
          3. Putting the model to use
          4. Scaling the solution
          5. Summary
      10. 4. Marketing Campaign Planning
        1. Creating the solution outline
        2. Supervised learning
        3. Tree-structure models for classification
        4. Finding the right dataset
        5. Setting up the solution architecture
          1. Coupon scan at POS
          2. Join and transform
          3. Train the classification model
          4. Scoring
          5. Mail merge
        6. Building the machine learning model
          1. Introducing BigML
          2. Model building steps
          3. Sign up as a user on BigML site
          4. Upload the data file
          5. Creating the dataset
          6. Building the classification model
          7. Downloading the classification model
        7. Running the Model on Hadoop
        8. Creating the target list
        9. Post campaign activities
        10. Summary
      11. 5. Churn Detection
        1. A business case for churn detection
        2. Creating the solution outline
          1. Building a predictive model using Hadoop
          2. Bayes' Theorem
          3. Playing with the Bayesian predictor
          4. Running a Node.js-based Bayesian predictor
          5. Understanding the predictor code
          6. Limitations of our solution
        3. Building a churn predictor using Hadoop
          1. Synthetic data generation tools
          2. Preparing a synthetic historical churn dataset
          3. The processing approach
          4. Running the MapReduce program
          5. Understanding the frequency counter code
          6. Putting the model to use
          7. Integrating the churn predictor
        4. Summary
      12. 6. Analyze Sensor Data Using Hadoop
        1. A business case for sensor data analytics
        2. Creating the solution outline
        3. Technology stack
          1. Kafka
          2. Flume
          3. HDFS
            1. Hive
            2. OpenTSDB
          4. HBase
          5. Grafana
        4. Batch data analytics
          1. Loading streams of sensor data from Kafka topics to HDFS
          2. Using Hive to perform analytics on inserted data
          3. Data visualization in MS Excel
        5. Stream data analytics
          1. Loading streams of sensor data
          2. Data visualization using Grafana
        6. Summary
      13. 7. Building a Data Lake
        1. Data lake building blocks
          1. Ingestion tier
          2. Storage tier
          3. Insights tier
          4. Ops facilities
          5. Limitation of open source Hadoop ecosystem tools
        2. Hadoop security
          1. HDFS permissions model
            1. Fine-grained permissions with HDFS ACLs
        3. Apache Ranger
          1. Installing Apache Ranger
          2. Test driving Apache Ranger
          3. Define services and access policies
          4. Examine the audit logs
          5. Viewing users and groups in Ranger
          6. Data Lake security with Apache Ranger
        4. Apache Flume
          1. Understanding the Design of Flume
          2. Installing Apache Flume
          3. Running Apache Flume
        5. Apache Zeppelin
          1. Installation of Apache Zeppelin
          2. Test driving Zeppelin
          3. Exploring data visualization features of Zeppelin
            1. Define the gold price movement table in Hive
            2. Load gold price history in the Table
            3. Run a select query
            4. Plot price change per month
            5. Running the paragraph
            6. Zeppelin in Data Lake
        6. Technology stack for Data Lake
        7. Data Lake business requirements
          1. Understanding the business requirements
          2. Understanding the IT systems and security
          3. Designing the data pipeline
          4. Building the data pipeline
          5. Setting up the access control
            1. Synchronizing the users and groups in Ranger
            2. Setting up data access policies in Ranger
            3. Restricting the access in Zeppelin
          6. Testing our data pipeline
          7. Scheduling the data loading
          8. Refining the business requirements
          9. Implementing the new requirements
            1. Loading the stock holding data in Data Lake
            2. Restricting the access to stock holding data in Data Lake
            3. Testing the Loaded Data with Zeppelin
          10. Adding stock feed in the Data Lake
            1. Fetching data from Yahoo Service
            2. Configuring Flume
            3. Running Flume as Stock Feeder to Data Lake
            4. Transforming the data in Data Lake
          11. Growing Data Lake
        8. Summary
      14. 8. Future Directions
        1. Hadoop solutions team
          1. The role of the data engineer
          2. Data science for non-experts
          3. From the data science model to business value
        2. Hadoop on Cloud
          1. Deploying Hadoop on cloud servers
            1. Using Hadoop as a service
        3. NoSQL databases
          1. Types of NoSQL databases
          2. Common observations about NoSQL databases
          3. In-memory databases
          4. Apache Ignite as an in-memory database
          5. Apache Ignite as a Hadoop accelerator
          6. Apache Spark versus Apache Ignite
        4. Summary