DSCI 531: Data Visualization I
The design and implementation of static figures across all phases of data analysis, from ingest and cleaning to description and inference. How to make principled and effective choices with respect to marks, spatial arrangement, and colour.
Time: 11:10am-12:30pm Wed/Mon, Nov 16-Dec 12 2016
Labs: Thursday afternoons 2-4pm, ESB 1042
Quizzes: Thu Dec 1 2-2:30pm, Thu Dec 15 2-2:30pm
Location: SPPH 143
Slack channel: https://ubc-mds.slack.com/messages/531_viz-1
Instructor: Tamara Munzner @munzner
Teaching fellow: Tiffany Timbers @tiffany
TA: Bo Chang @bchang
Schedule
Lecture | Date | Day | Topic |
---|---|---|---|
1 | 2016-11-16 | Wed | Introduction to visualization, Data and tasks, Marks and channels |
2 | 2016-11-21 | Mon | Visualization design exercise |
3 | 2016-11-23 | Wed | Table data part 1 |
4 | 2016-11-28 | Mon | Table data part 2 |
5 | 2016-11-30 | Wed | Spatial data |
6 | 2016-12-05 | Mon | Colours |
7 | 2016-12-07 | Wed | Network data |
8 | 2016-12-12 | Mon | Rules of thumb |
Time: 11:00am-12:20pm
Labs
Assignment | Lab topic | Due Date | Material |
---|---|---|---|
1 html, 1 md | User testing for Visualizations | 2016-11-23 09:00 | tutorial html, tutorial md |
2 html, 2 md | Visualizing tabular data using Python's matplotlib and pandas
|
2016-12-01 09:00 | tutorial html, tutorial Rmd |
3 html, 3 md | Exploring colour and visualizing spatial data in R with ggplot2 's geom_polygon and ggmap
|
2016-12-08 09:00 | |
4 html, 4 md | Visualizing network data in R | 2016-12-13* 09:00 | |
* note that this is a Tuesday! |
Quizzes
Time | Date | Location | |
---|---|---|---|
1 html, 1 ipynb | 14:00 - 14:30 | 2016-12-01 | ESB 1042 |
2 html, 2 ipynb | 15:00 - 15:30 | 2016-12-14 | ESB 1042 |
Solutions
- Lab 1 html | Lab 1 Rmd
- Lab 2 html | Lab 2 ipynb
- Lab 3 html | Lab 3 Rmd
- Lab 4 html | Lab 4 Rmd
- Quiz 1 html | Quiz 1 ipynb
- Quiz 2 html | Quiz 2 ipynb
Prerequisites
- DSCI 511 (Programming for Data Science)
- DSCI 521 (Computing Platforms for Data Science)
Reference Material
-
Overall
- Munzner, Tamara. Visualization Analysis and Design, CRC Press, 2014.]ebook link, Tamara's book page
- Introduction to Scientific Python (Lecture 5: Data Visualization)
-
Week 1: Lecture 1, Wed Nov 16. Intro, Data and tasks, Marks and channels
- Slides: Lecture 1 1up, 16up, keynote
- Further reading Lecture 1 Visualization Analysis and Design (VAD) Chapter 1, VAD Chapter 2, VAD Chapter 3, VAD Chapter 4, VAD Chapter 5
-
Week 1: Lecture 2, Mon Nov 21. Visualization design exercise
-
Week 2: Lectures 3/4, Wed Nov 23 & Mon Nov 28. Table Data
- Slides: Lectures 3/4 1up PDF, 16up PDF, keynote
- Further reading: VAD Chapter 7
- Pointers from discussion
- Stacked Graphs: NYTimes Interactive Ebb and Flow of the movies, Stacked Graphs -- Geometry and Aesthetics paper, Byron paper page including links to Processing + D3 software
- Connected Scatterplots paper, Haroz paper page including demo and data
- Matrix reordering: Simple Algorithms for Network Visualization: A Tutorial, McGuffin paper page with links to errata, Java demos, Javascript code
- Matrix reordering survey: Seriation and matrix reordering methods: An historical overview. Liiv
-
Week 3: Lectures 5/6, Wed Nov 30 & Mon Dec 5. Spatial Data, Color
- Slides: Lectures 5/6 1up PDF, 16up PDF, keynote
- Further reading: VAD Chapter 8, VAD Chapter 10
- Surprise maps slides, blog post, paper
- Pointers from discussion
- Are Maps of Financial Variables just Population Maps? Ben Jones Tableau Public demo
- Cartogram examples from Worldmapper, 538 hexbinned
- Thematic maps with glyphs example from Cleveland & McGill, Graphical Perception paper
- Marching cubes isosurfaces explanation
- ColorBrewer demo
- alternatives to color coding: Taxonomy-Based Glyph Design paper
- alternatives to color coding: MizBee paper, paper page with videos and source
- Design Study Methodology paper page
- Perceptual Guidelines for Creating Rectangular Treemaps paper page
- Sizing the Horizon paper page
- D3 Gallery from Viau
- Bostock's D3 bl.ocks
- Data vis jobs list
-
Week 4: Lectures 7/8, Wed Dec 7 & Mon Dec 12. Network data, Rules of thumb.
- Slides: Lectures 7/8 1up PDF, 16up PDF, keynote
- Further reading: VAD Chapter 9, VAD Chapter 6
- Suggested reading: The Non-Designer’s Design Book, 4th ed. Robin Williams, Peachpit Press, 2015.
- Artery Viz slides, paper
- Pointers from discussion
- NYT 3D Economic Future Chart
- Lasseter Siggraph 87: Principles of Traditional Animation Applied to 3D Computer Animation paper
- complex shape transformation example: Outside In video
- blink comparator showing Pluto data
- change blindness demo videos
Learning Outcomes
By the end of the course, students are expected to be able to:
- Analyze existing static visual encodings in terms of marks and channels, spatial arrangement, and color broken down into luminance, saturation, and hue.
- Design new static visual encodings that use space and color channels appropriately according to principles of perceptual effectiveness and by matching channel type to attribute type for quantitative versus categorical attributes.
- Implement static visual encodings using existing toolkits and libraries.
- Describe and manipulate table, network, and spatial data; transform data into a form suitable for the intended abstract task of the visualization user.
- Explain whether a visual encoding is perceptually appropriate for a specific combination of task and data.
Lecture Learning Objectives
-
Introduction to visualization, Data and task abstractions, Marks and channels
By the end of the lecture, students are expected to be able to:
- Discuss situations where visualization is appropriate versus inappropriate.
- Choose appropriate methods of validation for each of the four levels of visualization in the nested model.
- Analyze visual encodings by breaking them down according marks and channels used, including whether the combination of channels is integral or separable.
- Apply the principles of expressiveness and effectiveness to choose appropriate visual channels for specific data types.
- Distinguish between contexts where visual popout will occur versus those where serial search must be used to find specific items.
-
Visualization design exercise
By the end of the lecture, students are expected to be able to:
- Generate multiple design proposals for a given data-task abstraction combination.
- Analyze the suitability of a given design in terms of both data and task abstractions.
- Table data part 1
By the end of the lecture, students are expected to be able to:
- Analyze spatial arrangements of table data according to key and value variables.
- Analyze the scalability of simple visual encodings of table data.
-
Table data part 2
By the end of the lecture, students are expected to be able to:
- Analyze visual encodings in terms of rectilinear, radial, and parallel axes and discuss their strengths and weaknesses.
- Analyze the scalability of complex visual encodings of table data.
-
Spatial data
By the end of the lecture, students are expected to be able to:
- Discuss the similarities and differences between choropleth maps and heatmaps.
- Choose appropriate normalization for population maps by deriving new attributes.
- Explain the differences between isosurfaces and direct volume rendering.
- Present three strategies for visually encoding vector field and discuss their similarities and differences.
-
Colour
By the end of the lecture, students are expected to be able to:
- Explain the discriminability limits on the use of categorical colors and discuss appropriate alternative designs.
- Explain the appropriate use of the color channels of hue, saturation, and luminance according to attribute types.
- Explain the drawbacks of rainbow colormaps for ordered data and discuss appropriate alternative designs.
-
Network data
By the end of the lecture, students are expected to be able to:
- Explain which combinations of data and tasks are appropriate for node-link versus adjacency matrix representations of networks.
- Explain the strengths and weaknesses of containment versus connection for showing network structure.
- Analyze the scalability of force-directed placement and discuss appropriate alternatives.
-
Rules of Thumb
By the end of the lecture, students are expected to be able to:
- Discuss the costs and benefits of 3D visual representations for both spatial and nonspatial data
- Discuss the costs and benefits of 2D visual representations of network data.
- Apply basic graphic design principles about spatial layout.
- Discuss the tradeoffs between resolution and immersion.