After each training step, we can see that the value of the loss function decreases


...but only if the backpropagation algorithm works properly and we do not get caught in a local minimum.
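
As a rough illustration of this behaviour, here is a minimal sketch in plain Python. It assumes a toy problem that is not from the text above: fitting y = w * x with mean squared error and hand-written gradient descent. The function name `train`, the data, and the learning rate are all made up for the example. Because this toy loss is convex, it has no local minima to get caught in, so the sketch only shows the "works properly" case.

```python
def train(xs, ys, lr=0.05, steps=10):
    """Fit y = w * x with plain gradient descent; return (w, losses)."""
    w = 0.0          # hypothetical starting weight
    losses = []
    n = len(xs)
    for _ in range(steps):
        # Mean squared error for the current weight.
        loss = sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / n
        losses.append(loss)
        # Gradient of the loss with respect to w (the "backward" part).
        grad = sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / n
        # One training step: move against the gradient.
        w -= lr * grad
    return w, losses


# Toy data generated by y = 2 * x (an assumption for illustration only).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]

w, losses = train(xs, ys)
for step, loss in enumerate(losses):
    print(f"step {step:2d}  loss {loss:.6f}")
```

With these settings the printed loss shrinks at every step and `w` approaches 2. A wrong gradient or a non-convex loss surface could break that pattern, which is the caveat stated above.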