Summary

In this chapter, you learned how to set up a Hadoop environment. This required you to install Linux and Docker, download an image from Hortonworks, and learn the ropes of that environment. Much of this chapter was spent on the environment and on how to perform a spatial query using the GUI tools provided. This is because the Hadoop environment is complex, and without a solid grasp of it, it would be hard to use it effectively with Python. Lastly, you learned how to use HDFS and Hive in Python. The Python libraries for working with Hadoop, Hive, and HDFS are still developing. This chapter provided you with a foundation so that when these libraries improve, you will have enough knowledge of Hadoop and the accompanying ...

