Getting data from MongoDB

Getting data out of MongoDB is a bit trickier than loading data into the NoSQL database. Fortunately, we can filter the source document store so that only the subset of data we need is returned.
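
To make the idea of filtering concrete, here is a minimal sketch in Python of the two JSON documents a MongoDB filter usually consists of: a query that selects matching documents and a projection that limits the returned fields. The field names playerID, yearID, and HR are assumptions based on the column names of the Lahman batting table, and the build_subset_query helper is hypothetical; adjust both to match how the data was actually loaded.

```python
import json


def build_subset_query(min_year, min_home_runs):
    """Return a MongoDB-style query document and a field projection.

    The field names (yearID, HR, playerID) are assumed, not confirmed
    by the recipe; rename them to match your collection.
    """
    # Select only documents at or above both thresholds.
    query = {
        "yearID": {"$gte": min_year},
        "HR": {"$gte": min_home_runs},
    }
    # Return only three fields and suppress MongoDB's generated _id.
    projection = {"playerID": 1, "yearID": 1, "HR": 1, "_id": 0}
    return query, projection


query, projection = build_subset_query(2000, 40)

# Serialize both documents to JSON text.
query_json = json.dumps(query)
projection_json = json.dumps(projection)
print(query_json)
print(projection_json)
```

The resulting JSON strings could be adapted for any place that accepts a MongoDB query and field list. Keeping the thresholds as parameters makes it easy to experiment with different subset sizes.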

Getting ready

We will pull a subset of the batting dataset from Lahman's Baseball Database, which was loaded in the recipe Loading data into MongoDB. It also helps to read up on MongoDB's data model; the MongoDB website provides a good overview at http://docs.mongodb.org/manual/core/data-modeling/.

How to do it...

  1. Open a new transformation.
  2. Under the Big Data category, select the MongoDb input step and bring it over to the canvas.
  3. Open the step and add the MongoDB instance connection information to the ...
