O'Reilly logo

Agile Data Science by Russell Jurney

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 8. Exploring Data with Reports

Figure 8-1. Level 3: Exploring with Reports

Introduction

In the next step, our third agile sprint, we’ll extend our chart pages into full-blown reports. In this step: charts become interactive, static pages become dynamic, and our data becomes explorable through networks of linked, related entities with charts. These are the characteristics of the ‘reports’ stage of the data value pyramid.

Example Code

Code examples for this chapter are available at https://github.com/rjurney/Agile_Data_Code/tree/master/ch09. Clone the repository and follow along!

git clone https://github.com/rjurney/Agile_Data_Code.git

Building Reports with Multiple Charts

To build a report we need to compose multiple views on the same entity. The charts we made last chapter will serve us well as we increase interactivity to create reports. Lets create an email address entity page and add a tag cloud for related emails to give us something more of a report.

Lets start by creating a relation that shows the most related email addresses. Check out ch09/pig/related_email_addresses.pig.

/* Set Home Directory - where we install software */ %default HOME `echo \$HOME/Software/` /* Avro uses json-simple, and is in piggybank until Pig 0.12, where AvroStorage and TrevniStorage are Pig builtins */ REGISTER $HOME/pig/build/ivy/lib/Pig/avro-1.5.3.jar REGISTER $HOME/pig/build/ivy/lib/Pig/json-simple-1.1.jar ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required