DBSCAN

DBSCAN inherits the idea that data can be represented as multidimensional points. Again, sticking with a two-dimensional example, this is in rough steps how DBSCAN works:

  1. Pick a point that has not been visited before.
  2. Draw a circle with the point as the center. The radius of the circle is epsilon.
  3. Count how many other points fall into the circle. If there are more than a specified threshold, we mark all the points as being part of the same cluster.
  4. Recursively do the same for each point in this cluster. Doing so expands the cluster.
  5. Repeat these steps.

I highly encourage you to do this on dotted paper and try to draw this out yourself. Start by plotting random points, and use pencils to draw circles on paper. This will give you an ...

Get Go Machine Learning Projects now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.