  • Sanghyun Lee thinks this is interesting:

If you were training on all the data and making predictions on some of that training data, you’d be cheating, as the model is more likely to make good predictions if it’s seen the data while training.


Source: Real-World Machine Learning


What does that mean? Got it! I need to keep the testing data out of the data used for training.
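To make the highlighted point concrete, here is a minimal sketch in Python (standard library only). Everything in it is an assumption made for illustration and not taken from the book: the synthetic noisy data, the hypothetical helpers `split_holdout`, `predict_1nn`, and `accuracy`, and the choice of a 1-nearest-neighbor model, which simply memorizes its training rows.

```python
import random


def split_holdout(rows, test_fraction=0.3, seed=0):
    """Shuffle a copy of rows and split it into disjoint train and test lists."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def predict_1nn(train, x):
    """Return the label of the training row whose feature is closest to x."""
    nearest = min(train, key=lambda row: abs(row[0] - x))
    return nearest[1]


def accuracy(train, rows):
    """Fraction of rows whose label the 1-NN model built on train predicts."""
    correct = sum(predict_1nn(train, x) == y for x, y in rows)
    return correct / len(rows)


# Synthetic data (an assumption for this sketch): the label is "x > 0.5",
# but roughly 30% of the labels are flipped, so no model can be perfect
# on rows it has never seen.
rng = random.Random(42)
data = []
for _ in range(200):
    x = rng.random()
    label = x > 0.5
    if rng.random() < 0.3:
        label = not label
    data.append((x, label))

train_rows, test_rows = split_holdout(data)

# Cheating: "train" on ALL the data, then predict on rows that were part of it.
cheating_acc = accuracy(data, test_rows)

# Honest: train only on train_rows, then predict on the held-out test_rows.
honest_acc = accuracy(train_rows, test_rows)

print(f"evaluated on seen data:     {cheating_acc:.2f}")
print(f"evaluated on held-out data: {honest_acc:.2f}")
```

In the cheating case every test row is also a training row, so the memorizing model finds the row itself as its nearest neighbor and gets it right regardless of label noise. The held-out score is the one that says something about new data.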