Installing rhdfs
The rhdfs
package is the interface between R and HDFS, which allows users to access HDFS from an R console. Similar to rmr2
, one should install rhdfs
on every task node, so that one can access HDFS resources through R. In this recipe, we will introduce how to install rhdfs
on the Cloudera QuickStart VM.
Getting ready
Ensure that you have completed the previous recipe by starting the Cloudera QuickStart VM and connecting the VM to the Internet, so that you can proceed with downloading and installing the rhdfs
package.
How to do it...
Perform the following steps to install rhdfs
:
- First, you can download
rhdfs 1.0.8
from GitHub. You may need to update the link if Revolution Analytics upgrades the version ofrhdfs
:$wget --no-check-certificate ...
Get R: Recipes for Analysis, Visualization and Machine Learning now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.