As we can see from the plot, using only a single neighbor, each point in the training set has an obvious influence on the predictions, and the predicted values go through all of the data points. This leads to a very unsteady prediction. Considering more neighbors leads to smoother predictions, but these do not fit the training data as well.
Cons of decision trees
Share this highlighthttp://www.safaribooksonline.com/a/introduction-to-machine/20171323/