Summary

In this chapter, we explored the basic operations and functions of MongoDB directly from the Mongo shell and from a graphic interface (Robomongo). We also performed a data preparation of a CSV dataset with OpenRefine, and turned it into a well-formatted JSON dataset. Finally, we presented an introduction to data processing with the aggregation framework, which is a faster alternative to MapReduce for common aggregations. We introduced the basic operators used in the pipelines and the expressions supported by the aggregation framework.

In the next chapter, we will explore the MapReduce functionality of MongoDB, and we will create a word cloud in D3 with the most frequent words in positive tweets.

Get Practical Data Analysis - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.