O'Reilly logo
  • Sanghyun Lee thinks this is interesting:

If you were training on all the data and making predictions on some of that training data, you’d be cheating, as the model is more likely to make good predictions if it’s seen the data while training.

From

Cover of Real-World Machine Learning

Note

what does that mean? got it! i need to remove testing data for training