Chapter 3. Working with Big Data and Cloud Sources

In this chapter, we will cover:

  • Loading data into Salesforce.com
  • Getting data from Salesforce.com
  • Loading data into Hadoop
  • Getting data from Hadoop
  • Loading data into HBase
  • Getting data from HBase
  • Loading data into MongoDB
  • Getting data from MongoDB

Introduction

While flat files and databases are the most common type of source that developers using Kettle interact with, there are many other types of data sources that are capable of being used. Data warehouses are now starting to leverage the capabilities of tools such as Hadoop, NoSQL databases, and cloud services such as Amazon Web Services and SalesForce.

In this chapter, you will learn to interact with these Big Data sources in Kettle. The recipes in this ...

Get Pentaho Data Integration Cookbook Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.