Skip to main content

Get full access to R: Recipes for Analysis, Visualization and Machine Learning and 60K+ other titles, with a free 10-day trial of O'Reilly.

There are also live events, courses curated by job role, and more.

Start your free trial

Performing cluster analysis using hierarchical clustering

The hclust function in the package stats helps us perform hierarchical clustering.

Getting ready

If you have not already downloaded the data files for this chapter, do it now and ensure that the auto-mpg.csv file is in R's working directory.

We will hierarchically cluster the data based on the variables mpg, cylinders, displacement, horsepower, weight, and acceleration.

How to do it...

To perform cluster analysis using hierarchical clustering, follow these steps:

Read the data:
```
> auto <- read.csv("auto-mpg.csv")
```
Define a convenience function to standardize the relevant variables and append the resulting variables to the original data:
```
rdacb.scale.many <- function (dat, column_nos) { nms <- names(dat) ...
```

Get R: Recipes for Analysis, Visualization and Machine Learning now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Don’t leave empty-handed

Get Mark Richards’s Software Architecture Patterns ebook to better understand how to design components—and how they should interact.

It’s yours, free.

Get it now

Cover of Software Architecture Patterns

Check it out now on O’Reilly

Dive in for free with a 10-day trial of the O’Reilly learning platform—then explore all the other resources our members count on to build skills and solve problems every day.

Start your free trial Become a member now