Data preparation

This section shows how to prepare the data for use in recommender models. There are two steps:

  1. Select the relevant data.
  2. Normalize the data.

Selecting the most relevant data

When we explored the data, we noticed that the table contains:

  • Movies that have been viewed only a few times. With so few ratings, the ratings of these movies might be biased.
  • Users who rated only a few movies. With so little data per user, their ratings might be biased as well.

We need to determine the minimum number of users per movie and the minimum number of movies per user. The right thresholds come from iterating over the entire process: preparing the data, building a recommendation model, and validating it. Since we are implementing the model for the first time, we can start from a rule of thumb. After having built the models, ...
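
The selection step described above can be sketched in a few lines. This is a minimal illustration in plain Python, not the book's own code: the function name select_relevant, the parameter names, the (user, movie, rating) tuple layout, the toy data, and the threshold values are all assumptions made for the example. It assumes each user rates a given movie at most once.

```python
from collections import Counter


def select_relevant(ratings, min_movies_per_user, min_users_per_movie):
    """Keep ratings whose user and movie both meet a minimum count.

    ratings: list of (user, movie, rating) tuples, one per rating,
    with each (user, movie) pair appearing at most once.
    Counts are taken once on the full input, so after filtering a
    kept user or movie may fall below its threshold.
    """
    movies_per_user = Counter(user for user, _, _ in ratings)
    users_per_movie = Counter(movie for _, movie, _ in ratings)
    return [
        (user, movie, rating)
        for user, movie, rating in ratings
        if movies_per_user[user] >= min_movies_per_user
        and users_per_movie[movie] >= min_users_per_movie
    ]


# Toy data, invented for illustration only.
ratings = [
    ("u1", "m1", 5), ("u1", "m2", 3), ("u1", "m3", 4),
    ("u2", "m1", 4), ("u2", "m2", 2),
    ("u3", "m1", 1),
]

# Hypothetical rule-of-thumb thresholds, sized for the toy data.
filtered = select_relevant(ratings, min_movies_per_user=2, min_users_per_movie=2)
print(filtered)
```

With these toy thresholds, user u3 (one rating) and movie m3 (one rating) are dropped. On a real data set the thresholds would be much larger, and they are exactly the values to revisit once the models have been built and validated.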
