Clustering is the process of grouping the data into classes or clusters so that objects within a cluster have high similarity in comparison to one another, but are very dissimilar to objects in other clusters. Dissimilarities are assessed based on the attribute values describing the objects.
There are a large number of clustering algorithms. The major methods can be classified into the following categories:
- Partitioning methods: A partitioning method constructs K partitions of the data, which satisfy both of the following requirements:
- Each group must contain at least one object.
- Each object must belong to exactly one group. Given the initial K number of partitions to construct, the method creates initial ...