Summary

In this chapter, you learned what the terms KDD and data mining can mean. You also learned several ways of retrieving text from the web, and even how to get a dwarf name for yourself. Beyond that, you learned how to run a term frequency analysis and a clustering analysis. To wrap up, here is what we did with Twitter data:

  • Cleaned and transformed data
  • Ran a term frequency analysis
  • Drew lollipop and word cloud charts to aid interpretation
  • Built a hierarchical clustering from the term frequencies
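
As a quick refresher on the first two steps in the list above, here is a minimal, language-neutral sketch of cleaning text and counting term frequencies. It is written in Python with only the standard library, not in the R code used in the chapter, and everything in it is an assumption made for illustration: the sample tweets are invented, the stop-word list is deliberately tiny, and the `clean` helper is hypothetical.

```python
import re
from collections import Counter

# Hypothetical sample tweets, invented for illustration only.
tweets = [
    "Learning R for data mining! https://example.com #rstats",
    "Data mining with R is fun @someone",
    "Text mining and data science #rstats",
]

# Tiny illustrative stop-word list; a real analysis would use a fuller one.
STOP_WORDS = {"for", "with", "is", "and", "the", "a"}


def clean(text):
    """Lowercase a tweet, strip URLs, mentions, hashtags and punctuation,
    then return the remaining words that are not stop words."""
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)  # drop URLs
    text = re.sub(r"[@#]\w+", " ", text)       # drop mentions and hashtags
    text = re.sub(r"[^a-z\s]", " ", text)      # keep letters and whitespace
    return [word for word in text.split() if word not in STOP_WORDS]


# Term frequency: how often each cleaned word appears across all tweets.
term_freq = Counter(word for tweet in tweets for word in clean(tweet))

for term, count in term_freq.most_common(3):
    print(term, count)
```

The resulting counts are the kind of input that the charts and the hierarchical clustering in the remaining steps would work from.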

There is much more we could do with data retrieved from Twitter, such as the following:

  • Topic modeling
  • Sentiment analysis
  • Follower analysis
  • Retweet analysis, which might help you get more retweets
  • Favorite analysis ...
