The path from journeyman to master is long. In the case of data visualization, the path has been well marked by many accomplished designers and cognitive scientists who have been doing great work for decades. We gladly follow in their footsteps, and we hope you will, too.
In these pages, however, our goal is not so much to take you to the summit as to start you down the pathâand that the path is quite rewarding to travel. Our goal is to give you confidence as you begin your journey.
Many statisticians and practitioners with excellent coding and data munging skills are nevertheless stuck in a rut of common formats and default settings, which lead to mundane, suboptimal visualizations. But the domain of hand-crafted, fine-tuned, noteworthy visualizations is not limited to âcreativeâ types; it is accessible with a bit of guidance.
The truth is, there is plenty of room for artistry and creativity in data visualization. But success is built upon a linear process that encodes information for visual transmission and subsequent decoding by wetwareâthe readerâs brain. One aim in writing this book is to introduce you to this process, including some basic concepts and best practices, so that your message may be transmitted with minimal interference.
It is a process. And design is something youâre probably already doing, whether youâre designing applications, frameworks, graphics, or something else. Design is simply a process of organized thinking, planning, and executing. You are making design choices, intentional or not. Of course, intentional choices have a better chance of being useful than arbitrary or accidental choices.[1] This book is a road map to those choices: it is meant to make you aware of the choices you get to make, and to help you make useful, intentional design decisions at every turn.
This high-level road map is one we havenât seen presented anywhere else. It will give you the general lay of the land. It is a set of steps and rules to follow that will get you 80% of the way to turning out great work. Weâll introduce many questions youâll need to ask yourself, and point you in the direction of some answers. The nuanced details of those answers have already been addressed by others, and we hope you will continue down the path with further guidance from our colleagues and mentors (see the Reading List in Appendix A).
Note that Appendix A also covers some of the many tools available for creating data visualizations, and we hope you will peruse them. But youâll find a discussion of tools intentionally missing from the rest of the book, because the topic at hand is, âWhat problem are you solving?â (and the questions youâre answering), rather than, âWhat tools are you using?â Design and implementation are two separate things.
As in any creative discipline, the best data visualizations are forged by breaking some of the rules. But rules must be broken with intention. One must learn the rules (well, more like guidelines) before one is entitled to break them. With that in mind, we present for your consideration our process for the visual encoding of information.
This book is organized into two major parts, which can loosely be thought of as practical theoretical foundations and applied suggestions, respectively.
In Part I, we discuss different kinds of visualization (including infographics and visual art) and explore the influences at work in each one. The goal is to help you become a more savvy consumer of visualizations, as well as a more organized thinker when creating your own visual work.
In Chapter 1, we introduce some ways of classifying and describing different styles of visualization, so that you can begin to think about and describe what youâre designing.
In Chapter 2, we introduce the three fundamental influences to the visualization productâthe designer, the reader, and the dataâand describe how each should shape what is eventually created.
In Part II, we apply these concepts to the design process. The goal is to help you think in a linear way about how to select and apply appropriate encodings for your data.
In Chapter 3, we focus on getting to a clear understanding of your goalsâand defining the requisite supporting dataâso that you can implement them most effectively.
In Chapter 4, we lay out heuristics for understanding the shape of your data and choosing compatible visual properties and structures with which to encode it.
In Chapter 5, we dive deep into the property of spatial positionâaxes and placementâone of the most important properties youâll need to select. We also discuss using different visualization structures.
In Chapter 6, we look at best practices and offer specific suggestions for encoding many specific different data types with visual properties. We also present warnings against common pitfalls and dark patterns.
Finally, the Appendices are full of resources and references meant to help you put your skills into practice and expand your knowledge beyond this volume.
Appendix A contains a list of tools to help you get started, as well as a suggested reading list to expand your knowledge and understanding of design concepts.
Appendix B is a list of the questions and decisions youâll confront as part of the design process. We hope youâll read the entire book, and then use this section as a refresher whenever you design a new visualization.
In this book, weâll use some specific terms to describe your data and visual encodings. Here is a handy glossary for quick reference.
Chart: Something that shows qualitative information (e.g., flow charts). |
Data dimensions: One single channel of data. A stock graph may comprise four properties: date, price, company, and market cap. Each is a unique dimension of the data, which can be encoded separately, with a different visual property. |
Data visualization: Visualizations that are algorithmically generated and can be easily regenerated with different data, are usually data-rich, and are often aesthetically shallow. |
Designer: The creator of a visualization; any reader of this book. |
Encoding: The visual property (noun) applied to a dimension of data that encodes (verb) the information into a visual medium for decoding by the readerâs brain. |
Explanatory visualization: Data visualizations that are used to transmit information or a point of view from the designer to the reader. Explanatory visualizations typically have a specific âstoryâ or information that they are intended to transmit. |
Exploratory visualization: Data visualizations that are used by the designer for self-informative purposes to discover patterns, trends, or sub-problems in a dataset. Exploratory visualizations typically donât have an already-known story. |
Graph: Something that shows quantitative information (e.g., pie graphs and bar graphs). |
Infographic: Visualizations that are manually generated around specific data, tend to be data-shallow, and are often aesthetically rich. |
Reader: The consumer of a visualization, often someone other than the designer. The reader has information needs that are meant to be satisfied by the visualization. |
Visual property: A characteristic that you can see. Color, size, location, thickness, and line weight are all visual properties. |
Variability of a property or data dimension: Within a visual property or single data dimension, what values are present or allowed, and how they change. Integers vary discretely; position can vary continuously. Categories are finite (and discrete, though maybe hierarchical); numbers are infinite. |
The following figures are reprinted by kind permission:
Figure 1-3. Flint Hahn (2010). Copyright © 2010, Flint Hahn. Permission to reproduce the likeness of Burning Man and the mark âBurning Manâ granted by Burning Man.
Figure 1-5. Nora Ligorano and Marshall Reese (2011). Copyright © 2011, Ligorano/Reese. http://ligoranoreese.net/fiber-optic-tapestry
Figure 4-1. European Soil Bureau. Copyright © 1995â2011, European Union. Used with stated authorization to reproduce, with acknowledgment. http://eusoils.jrc.ec.europa.eu/
Figure 4-2. Center for International Earth Science Information Network (CIESIN) (2007). Copyright © 2007, The Trustees of Columbia University in the City of New York. Columbia University. Population, Landscape, and Climate Estimates (PLACE). Used under the Creative Commons Attribution License. http://sedac.ciesin.columbia.edu/place/
Figure 4-5. Tableau Software Public Gallery. Copyright © 2003â2011 Tableau Software. http://www.tableausoftware.com/learn/gallery/company-performance
Figure 4-6. Christian Caron (2011). Copyright © 2011, Christian Caron.
Figure 4-10. Michael Dayah (1997). Copyright © 1997 Michael Dayah. http://www.ptable.com
Figure 4-15. Robert Palmer (2010). Copyright © 2010, Robert Palmer. http://rp-network.com/
Figure 5-1 and Figure 5-2. Photo credits to: Annette Crimmins, Sias van Schalkwyk, Janni Due, Dimitri Castrique, and Grethe Boe.
Figure 5-6. Nelson Minar (2011). Copyright © 2011 Daedalus Bits, LLC. http://windhistory.com/
Figure 5-7. Craig Robinson (2011). Copyright © 2011, Craig Robinson. http://www.flipflopflyin.com/flipflopflyball/info-majorleagueparks.html
Figure 5-8. Tableau Software Public Gallery. Copyright © 2003â2011 Tableau Software. http://www.tableausoftware.com/learn/gallery/federal-stimulus-cost
Figure 6-3. Spective® Colour System is the evolved color selection method created by Tony Scauzillo-Golden in 2010 while improving upon existing design industry standard color UIs. Please visit TSGâs Spective Productions website for further details. Spective® is registered under United States Patent Reg. No. 3,896,334.
Figure 6-11. Jess Bachman (2011). Copyright © 2011, Jess Bachman. http://www.smarter.org/research/apples-to-oranges/
The images in this book were designed to be viewed in color and at full size. If youâre reading the print edition or on a mobile device, please download the color figures from the website for this book to see them in their full glory.
This book is here to help you get your job done. If you reference limited parts of it in your work or writings, we appreciate, but do not require, attribution. An attribution usually includes the title, author, publisher, and ISBN. For example: âDesigning Data Visualizations by Noah Iliinsky and Julie Steele (OâReilly). Copyright 2011 Julie Steele and Noah Iliinsky, 978-1-449-31228-2.â
If you feel your use of examples or quotations from this book falls outside fair use or the permission given above, feel free to contact us at permissions@oreilly.com.
Note
Safari Books Online is an on-demand digital library that lets you easily search over 7,500 technology and creative reference books and videos to find the answers you need quickly.
With a subscription, you can read any page and watch any video from our library online. Read books on your cell phone and mobile devices. Access new titles before they are available for print, and get exclusive access to manuscripts in development and post feedback for the authors. Copy and paste code samples, organize your favorites, download chapters, bookmark key sections, create notes, print out pages, and benefit from tons of other time-saving features.
OâReilly Media has uploaded this book to the Safari Books Online service. To have full digital access to this book and others on similar topics from OâReilly and other publishers, sign up for free at http://my.safaribooksonline.com.
Please address comments and questions concerning this book to the publisher:
OâReilly Media, Inc. |
1005 Gravenstein Highway North |
Sebastopol, CA 95472 |
800-998-9938 (in the United States or Canada) |
707-829-0515 (international or local) |
707-829-0104 (fax) |
We have a web page for this book, where we list errata, examples, and any additional information. You can access this page at:
http://www.oreilly.com/catalog/9781449312282 |
To comment or ask technical questions about this book, send email to:
bookquestions@oreilly.com |
For more information about our books, courses, conferences, and news, see our website at http://www.oreilly.com.
Find us on Facebook: http://facebook.com/oreilly
Follow us on Twitter: http://twitter.com/oreillymedia
Watch us on YouTube: http://www.youtube.com/oreillymedia
We wish to thank all the amazing visual thinkers and designers who came before us, who work with us, who inspire us. Many of them are cited in the Reading List in Appendix A, and many others have kindly given us permission to reprint their work here. We also wish to thank the good folks at OâReilly who helped this project come to life.
In particular, Noah would like to thank professor David Farkas, as one of the many origins of this book can be traced to Daveâs suggestion to âtake a quarter and look at some diagrams, and see what happens,â many years ago.
Finally, big thanks and hugs to our friends, and especially our family members, who have loved, supported, fed, and encouraged us over the hectic writing period. Martin, Miller, Amy, Juliebot WCE, Nathan, Noël, Tanya, Edd, and Anna: thanks for keeping us going and for cheering us on.
[1] Suh: The Principles of Design (Oxford University Press); Schon: The Reflective Practitioner (Basic Books).
Get Designing Data Visualizations now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.