O'Reilly logo

Learning Hadoop 2 by Garry Turkington, Gabriele Modena

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 8. Data Lifecycle Management

Our previous chapters were quite technology focused, describing particular tools or techniques and how they can be used. In this and the next chapter, we are going to take a more top-down approach whereby we will describe a problem space you are likely to encounter and then explore how to address it. In particular, we'll cover the following topics:

  • What we mean by the term data life cycle management
  • Why data life cycle management is something to think about
  • The categories of tools that can be used to address the problem
  • How to use these tools to build the first half of a Twitter sentiment analysis pipeline

What data lifecycle management is

Data doesn't exist only at a point in time. Particularly for long-running ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required