  • Sagar Mainkar thinks this is interesting:

The output from the map function is processed by the MapReduce framework before being sent to the reduce function. This processing sorts and groups the key-value pairs by key.
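The sort-and-group step described in the passage can be sketched in a few lines of plain Python. This is only an in-memory illustration, not Hadoop code: the function names `shuffle` and `reduce_max` and the sample (year, temperature) pairs are made up for this example. In a real cluster the framework also partitions the map output across reducers and merges sorted files from many map tasks, and it makes no promise about the order of values within a key.

```python
from itertools import groupby
from operator import itemgetter


def shuffle(map_output):
    """Sort (key, value) pairs by key, then collect the values for each key.

    Hypothetical helper: a single-process stand-in for the processing the
    framework performs between the map and reduce functions.
    """
    # Sort by key only, so that equal keys become adjacent.
    ordered = sorted(map_output, key=itemgetter(0))
    # groupby only merges adjacent items, which is why the sort comes first.
    return [
        (key, [value for _, value in group])
        for key, group in groupby(ordered, key=itemgetter(0))
    ]


def reduce_max(key, values):
    """Example reduce function (an assumption): keep the largest value per key."""
    return key, max(values)


# Made-up map output: (year, temperature) pairs in arbitrary order.
map_output = [(1950, 0), (1950, 22), (1950, -11), (1949, 111), (1949, 78)]

grouped = shuffle(map_output)
results = [reduce_max(key, values) for key, values in grouped]

print(grouped)
print(results)
```

Each reduce call receives one key together with all of the values emitted for that key, which is the shape the grouping step produces.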


Source: Hadoop: The Definitive Guide, 4th Edition


What is shuffling?