- You can start with downloading the dataset using either of the following commands:
wget http://files.grouplens.org/datasets/movielens/ml-1m.zip
You can also use the following command:
curl http://files.grouplens.org/datasets/movielens/ml-1m.zip -o ml-1m.zip
- Now you need to decompress the ZIP:
unzip ml-1m.zipcreating: ml-1m/inflating: ml-1m/movies.datinflating: ml-1m/ratings.datinflating: ml-1m/READMEinflating: ml-1m/users.dat
The command will create a directory named ml-1m with data files decompressed inside.
- Change into the directory m1-1m:
cd m1-1m
- Now we begin our first steps of data exploration by verifying how the data in movies.dat is formatted:
head -5 movies.dat1::Toy Story (1995)::Animation|Children's|Comedy ...