Other sources for machine learning data:
- SMS spam data: http://www.dt.fee.unicamp.br/~tiago/smsspamcollection/
- Financial dataset from Lending Club https://www.lendingclub.com/info/download-data.action
- Research data from Yahoo http://webscope.sandbox.yahoo.com/index.php
- Amazon AWS public dataset http://aws.amazon.com/public-data-sets/
- Labeled visual data from Image Net http://www.image-net.org
- Census datasets http://www.census.gov
- Compiled YouTube dataset http://netsg.cs.sfu.ca/youtubedata/
- Collected rating data from the MovieLens site http://grouplens.org/datasets/movielens/
- Enron dataset available to the public http://www.cs.cmu.edu/~enron/
- Dataset for the classic book elements of statistical learning http://statweb.stanford.edu/~tibs/ElemStatLearn/data.html ...