Introduction to Apache NiFi (Hortonworks DataFlow - HDF 2.0)

Video description

Apache NiFi was initially used by the NSA so they could move data at scale and was then open sourced. Being such a hot technology, Onyara (the company behind it) was then acquired by Hortonworks, one of the main backers of the big data project Hadoop, and then Hadoop Data Platform. Apache NiFi is now used in many top organisations that want to harness the power of their fast data by sourcing and transferring information from and to their database and big data lakes. It is a key tool to learn for the analyst and data scientists alike. Its simplicity and drag and drop interface make it a breeze to use! You can start building flows between Kafka and ElasticSearch, an FT,P and MongoDB, and so much more! Your imagination is the limit This course will take you through the Apache NiFi technology. It will help you understand its fundamental concepts, with theory lessons that walk you through the core concepts of Apache NiFi. You will also have hands-on labs to get started and build your first data flows. You will learn how to set up your connectors, processors, and how to read your FlowFiles to make the most of what NiFi has to offer. The most important configuration options will be demonstrated so you will be able to get started in no time. We will also analyse a template picked from the web and understand how to debug your flows as well as route your data to different processors based on outcomes through relationships. We will finally learn about the integrations between NiFi and Apache Kafka or MongoDB. Lots of learning ahead!

What You Will Learn

  • Install and configure Apache NiFi.
  • Design Apache NiFi architecture.
  • Master core functionalities like FlowFile, FlowFile processor, connection, flow controller, process groups, and so on.
  • Use NiFi to stream data between different systems at scale.
  • Monitor Apache NiFi.
  • Integrate NiFi with Apache Kafka

Audience

Beginners who want to get started on learning Apache NiFi. Architects who want to get an overview of Apache NiFi.

About The Author

Stéphane Maarek: Stéphane Maarek is a solutions architect, consultant, and software developer who has a particular interest in all things related to big data and analytics. He is also a bestseller instructor on Udemy for his courses on Apache Kafka, Apache NiFi, and AWS Lambda. He loves Apache Kafka and regularly contributes to the Apache Kafka project.

Stéphane has also written a guest blog post that was featured on the Confluent website, the company behind Apache Kafka. He is also an AWS Certified Solutions Architect and has many years of experience with technologies such as Apache Kafka, Apache NiFi, Apache Spark, Hadoop, PostgreSQL, Tableau, Spotfire, Docker, Ansible, and more.

Product information

  • Title: Introduction to Apache NiFi (Hortonworks DataFlow - HDF 2.0)
  • Author(s): Stéphane Maarek
  • Release date: May 2018
  • Publisher(s): Packt Publishing
  • ISBN: 9781789346084