Part II. Wrangle

In this part of the book, youâll learn about data wrangling, the art of getting your data into R in a useful form for visualization and modeling. Data wrangling is very important: without it you canât work with your own data! There are three main parts to data wrangling:

This part of the book proceeds as follows:

In ChapterÂ 7, youâll learn about the variant of the data frame that we use in this book: the tibble. Youâll learn what makes them different from regular data frames, and how you can construct them âby hand.â
In ChapterÂ 8, youâll learn how to get your data from disk and into R. Weâll focus on plain-text rectangular formats, but will give you pointers to packages that help with other types of data.
In ChapterÂ 9, youâll learn about tidy data, a consistent way of storing your data that makes transformation, visualization, and modeling easier. Youâll learn the underlying principles, and how to get your data into a tidy form.

Data wrangling also encompasses data transformation, which youâve already learned a little about. Now weâll focus on new skills for three specific types of data you will frequently encounter in practice:

ChapterÂ 10 will give you tools for working with multiple interrelated datasets.
ChapterÂ 11 will introduce regular expressions, a powerful tool for manipulating strings.
ChapterÂ 12 will show ...

Get R for Data Science now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

R for Data Science by Hadley Wickham, Garrett Grolemund

Part II. Wrangle

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly