Review questions and exercises

  1. What does unsupervised learning mean?
  2. What is the major difference between unsupervised learning and supervised learning?
  3. How do we install the Python package sklearn?
  4. Discuss the relationship between distance and clustering classification.
  5. How do we define the distance between two objects?
  6. For non-numeric values, how do we define a distance between two members?
  7. For R, we can find a set of packages related to unsupervised learning in a task view called Cluster. Is there any task view, or similar super package, for Python?
  8. First, generate the following set of random numbers:
> set.seed(12345)
> n <- 30
> nGroup <- 4
> x <- matrix(rnorm(n * nGroup), nrow = nGroup)

Then, based on the various definitions of distance, estimate the ...
