Summary

In this chapter, we explored the basic concepts of the MapReduce programming model and how to implement common operations such as grouping, aggregation, counting, and summing in MongoDB.

MapReduce is a powerful tool for log analysis and data processing. We also learned how to add simple but effective aggregation capabilities to Python code using PyMongo and Jupyter.
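
As a brief recap of the model itself, the following is a minimal pure-Python sketch of the map, group, and reduce steps applied to counting and summing. It is not the chapter's code and does not touch MongoDB or PyMongo; the `map_reduce` helper, the mapper and reducer names, and the sample log records are all hypothetical and exist only for illustration.

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Run a tiny in-memory map/group/reduce pass (illustrative helper)."""
    groups = defaultdict(list)
    # Map phase: each record emits zero or more (key, value) pairs.
    for record in records:
        for key, value in mapper(record):
            # Group phase: collect the values emitted under the same key.
            groups[key].append(value)
    # Reduce phase: collapse each key's list of values into one result.
    return {key: reducer(key, values) for key, values in groups.items()}


# Hypothetical log records, made up for this sketch.
logs = [
    {"status": 200, "bytes": 512},
    {"status": 404, "bytes": 128},
    {"status": 200, "bytes": 1024},
]


def count_mapper(record):
    # Emit 1 per record so the reducer can count records per status.
    yield record["status"], 1


def bytes_mapper(record):
    # Emit the byte size so the reducer can sum bytes per status.
    yield record["status"], record["bytes"]


def sum_reducer(key, values):
    return sum(values)


counts = map_reduce(logs, count_mapper, sum_reducer)
total_bytes = map_reduce(logs, bytes_mapper, sum_reducer)
print(counts)
print(total_bytes)
```

The same reducer serves both counting and summing; only the value emitted by the mapper changes, which is the core idea behind the grouping and aggregation operations summarized above.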

In the next chapter, we will explore an online Python tool for data analysis and development called Wakari, and a data analysis library named Pandas.
