How to do it...

You will need one of the following command-line tools curl or wget to retrieve specified data:

  1. You can start by downloading the dataset using either of the following three commands. The first one is as follows:
wget http://archive.ics.uci.edu/ml/machine-learning-databases/wine-quality/winequality-white.csv

You can also use the following command:

curl http://archive.ics.uci.edu/ml/machine-learning-databases/wine-quality/winequality-white.csv -o winequality-white.csv

This command is the third way to do the same:

http://archive.ics.uci.edu/ml/machine-learning-databases/wine-quality/winequality-white.csv
  1. Now we begin our first steps of data exploration by seeing how the data in winequality-white.csv is formatted:
head -5 winequality-white.csv ...

Get Apache Spark 2.x Machine Learning Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.