Data Visualization: Design Considerations



- Use color meaningfully: only use color when it communicates something about the data.

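- Choose the right color scheme for your data: categorical (distinct groups), diverging (values on either side of a meaningful midpoint), or sequential (values ordered from low to high).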

- For categorical data, avoid using too many different colors - no more than 6 colors is best; 12 colors max.

- For sequential data, avoid rainbow palettes; use a single ramp that runs from white to a highly saturated color.

- Consider the format of your visualization: will it be displayed on a projector, printed, or photocopied in grayscale?

- Be mindful that some of your audience may have a color vision deficiency. There are tools to help you choose or test color schemes that remain readable for color-deficient vision. You may also want to consider the cultural connotations of particular colors.
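The sequential and grayscale guidelines above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`sequential_ramp`, `relative_luminance`) and the example end color `#08519c` are hypothetical choices, the ramp is a plain linear blend in sRGB, and the luminance formula is the WCAG-style relative luminance. A ramp whose luminance falls steadily from step to step should still read as ordered when copied in grayscale.

```python
def hex_to_rgb(color):
    """Convert '#rrggbb' to a tuple of three 0-255 integers."""
    color = color.lstrip("#")
    return tuple(int(color[i:i + 2], 16) for i in (0, 2, 4))


def sequential_ramp(end_hex, steps):
    """Blend linearly from white to end_hex (hypothetical helper)."""
    if steps < 2:
        raise ValueError("a ramp needs at least two steps")
    end = hex_to_rgb(end_hex)
    ramp = []
    for k in range(steps):
        t = k / (steps - 1)  # 0.0 at white, 1.0 at the saturated end color
        rgb = tuple(round(255 + (c - 255) * t) for c in end)
        ramp.append("#{:02x}{:02x}{:02x}".format(*rgb))
    return ramp


def relative_luminance(hex_color):
    """WCAG-style relative luminance: 0.0 is black, 1.0 is white."""
    def linear(channel):
        c = channel / 255
        return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4

    r, g, b = (linear(c) for c in hex_to_rgb(hex_color))
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


ramp = sequential_ramp("#08519c", 5)  # example end color, an assumption
lums = [relative_luminance(c) for c in ramp]
# True when every step is darker than the one before it
survives_grayscale = all(a > b for a, b in zip(lums, lums[1:]))
print(ramp, survives_grayscale)
```

A check like this does not replace a dedicated color-deficiency simulator; it only confirms that lightness alone carries the ordering.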

Color Schemes


Labels, Legends, and Other Chart Elements


- Use sans-serif fonts; avoid all caps; make sure the font size is large enough to be read in the intended format (print, screen, etc.).

Simplicity and Clarity

- Use clear language and avoid acronyms in your title, legend, and labels.

- Can you omit the legend and instead label bars or lines directly? Note that if you only have one data category, there is no need for a legend. 

- Can you omit gridlines and/or the box around the chart area? Or at least lighten their color so they don't detract from the data?

Shape and Size

- Think about the aspect ratio and what is most appropriate for your data, not just what fits on the page.

- "Banking to 45 degrees" is the idea that line charts are most readable when the aspect ratio is chosen so that the line segments have an average slope of about 45 degrees. This idea has been debated; however, it is likely still a good idea to aim for 45 degrees unless there is good reason not to.
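As a rough sketch of how banking to 45 degrees can be turned into a number, the Python below uses one simple variant, median-absolute-slope banking: it picks the height-to-width ratio at which the median absolute slope of the drawn line segments equals 1. The function name `banked_aspect_ratio` is hypothetical, and the sketch assumes the data fill the plot area exactly; other banking methods exist and give different answers.

```python
from statistics import median


def banked_aspect_ratio(xs, ys):
    """Return height/width so the median absolute on-screen slope is 1."""
    x_range = max(xs) - min(xs)
    y_range = max(ys) - min(ys)
    if x_range == 0 or y_range == 0:
        raise ValueError("data must vary in both x and y")

    slopes = []
    points = list(zip(xs, ys))
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        # Rescale each step to the unit square the plot area occupies
        dx = (x1 - x0) / x_range
        dy = (y1 - y0) / y_range
        if dx != 0 and dy != 0:  # skip vertical and flat segments
            slopes.append(abs(dy / dx))
    if not slopes:
        raise ValueError("no sloped segments to bank")

    # Drawn slope = aspect * unit-square slope, so solve aspect * median = 1
    return 1 / median(slopes)


# A zigzag series: steep in the unit square, so the chart should be wide
ratio = banked_aspect_ratio([0, 1, 2, 3], [0, 1, 0, 1])
print(ratio)  # about 0.333, i.e. roughly three times as wide as tall
```

For a figure 9 inches wide, that ratio suggests a plot area about 3 inches tall. Treat the result as a starting point rather than a rule.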

Data Visualization Dos and Don'ts


It is important to use consistent scale divisions when graphing data that involve continuous series.

Example: If your data are grouped into specific spans of time, the spans should be equal. The histogram on the left has unequal divisions, while the histogram on the right has equal divisions.

[Figure: histograms showing unequal and equal horizontal scale divisions]
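The equal-division rule for histograms can be illustrated with a short Python sketch. The helper names (`equal_width_edges`, `has_equal_widths`, `count_in_bins`) and the sample values are hypothetical and chosen only for illustration; the sketch builds evenly spaced bin edges, verifies that a set of edges is evenly spaced, and counts values per bin.

```python
from bisect import bisect_right
from math import isclose


def equal_width_edges(low, high, n_bins):
    """Return n_bins + 1 evenly spaced bin edges from low to high."""
    width = (high - low) / n_bins
    return [low + i * width for i in range(n_bins + 1)]


def has_equal_widths(edges):
    """True when every bin spans the same distance."""
    widths = [b - a for a, b in zip(edges, edges[1:])]
    return all(isclose(w, widths[0]) for w in widths)


def count_in_bins(values, edges):
    """Count values per bin; bins are [left, right), the last includes its right edge."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        if v < edges[0] or v > edges[-1]:
            continue  # ignore values outside the binned range
        i = min(bisect_right(edges, v) - 1, len(counts) - 1)
        counts[i] += 1
    return counts


values = [1, 2, 2, 3, 7, 8, 9, 10]       # made-up sample data
edges = equal_width_edges(0, 10, 5)      # 0, 2, 4, 6, 8, 10
print(has_equal_widths(edges))           # True
print(has_equal_widths([0, 1, 5, 10]))   # False: unequal spans
print(count_in_bins(values, edges))      # [1, 3, 0, 1, 3]
```

If unequal spans are unavoidable, one common remedy is to plot each count divided by its bin width so that bar area, not bar height, represents the count.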


Vertical axes should generally begin at the origin (zero). Visualizations in which the vertical axis does not begin at the origin can give a misleading picture of the meaning of the data.
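A small arithmetic sketch shows why a non-zero baseline can mislead. The function name `apparent_ratio` and the values 50, 55, and 45 are hypothetical; the code compares how much taller one bar looks than another when the axis starts at a given baseline.

```python
def apparent_ratio(smaller, larger, baseline=0.0):
    """Ratio of drawn bar heights when the vertical axis starts at baseline."""
    if baseline >= smaller:
        raise ValueError("baseline must be below both values")
    return (larger - baseline) / (smaller - baseline)


# Two made-up values that differ by 10 percent
print(apparent_ratio(50, 55))               # 1.1 with the axis starting at zero
print(apparent_ratio(50, 55, baseline=45))  # 2.0 with the axis starting at 45
```

With the axis truncated at 45, the second bar is drawn twice as tall as the first even though the underlying value is only 10 percent larger.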

