Jeanette Bautista, Micheline Manske
Department of Computer Science
University of British Columbia
2366 Main Mall
Vancouver, BC, V6T 1Z4, Canada
{bautista, manske}@cs.ubc.ca
Selecting a university is a difficult decision with many variables to consider such as cost of living, quality of the school and the requirements to get in. Humans are not very effective at reasoning about many variables at one time. We would like to design an interface that supports the task of selecting a college or university from among many choices. Our system is targeted at high school students, their parents, high school guidance councilors, and anyone else for whom ranking schools is an important task.
We have chosen as our test data the USNEWS data set taken from the 1995 U.S. News & World Report’s Guide to America’s Best Colleges. The data set is available from the StatLib website at http://lib.stat.cmu.edu/datasets/colleges.
The data set consists of information about 1300 colleges and universities in the United States for the 1993-94 school year. The variables for each school are listed in Table 1.
Variables for the USNEWS data set | |
College name | % new students from top 25% of HS class |
State | Number of full-time undergraduates |
Public/private indicator | Number of part-time undergraduates |
Average Math SAT score | In-state tuition |
Average Verbal SAT score | Out-of-state tuition |
Average Combined SAT score | Room and board costs |
Average ACT score (qualifying test) | Room costs |
First quartile Math SAT | Board costs |
Third quartile Math SAT | Additional fees |
First quartile Verbal SAT | Estimated book costs |
Third quartile Verbal SAT | Estimated personal spending |
First quartile ACT | % of faculty with Ph.D.’s |
Third quartile ACT | % of faculty with terminal degrees |
Number of applications received | Student/faculty ratio |
Number of applications accepted | Instructional expenditure per student |
Number of new students enrolled | Graduation rate |
% new students from top 10% of HS class |   |
Examination of Table 1 reveals that there is an inherent hierarchy in the variables. One interpretation of this hierarchy is shown as a tree in Figure 1 (note that some nodes have been added which do not directly correspond to variables in the original data set). We wish to exploit the hierarchical nature of the data to reduce the perception of an overwhelming number of variables by selectively hiding lower level variables during exploration by the user.
Figure 1: USNEWS data set arranged hierarchically
A potential screen layout of the interface is shown in Figure 2. The bar on the right contains the sliders for filtering data points. The filtered schools will be displayed in the box on the upper left. When a data point in the display is clicked on, detailed information for that particular school is displayed in the box below.
Figure 2: Potential screen layout of interface
We are considering two options for the hierarchical sliders. The first are the version that are shown in Figure 2; using the standard windows treeview implementation. They are shown in more detail in Figure 3(a). The second option is based on the Treemaps approach [3], where the tree of variables (with a slider for each variable) is mapped to a box which uses all of the available screen space. The Treemap implementation is shown in Figure 3(b).
Figure 3: The two potential approaches for displaying sliders hierarchically. 3(a) The standard windows treeview 3(b) The Treemap approach.
The filtered data will initially be displayed as a scatterplot, with the variables on the axes determined by the user. One extension of our work may be to display the filtered data using ValueCharts[4], an interactive visualization for displaying and comparing options based on up to five weighted variables.
Another further extension may be to extend the interface to enable it to work with any data set that can be arranged as hierarchical trees with depth less than five.
The interface will be written in Java using the InfoVis Tookit[5]. The InfoVis Toolkit is an interactive graphics tookit which supports scatterplots, treemaps, and interactive sliders. Neither author has used the toolkit before.
As a possible extension mentioned in the Proposed Solution section, we may incorporate ValueCharts[4]. The code for ValueCharts is being provided by one of the authors (Giuseppe Carenini).
As well, we have not decided on the extension that we will apply to the initial design interface. We feel that the most relevant problems will become more obvious as we get into the project. Some possible ideas for extensions are to analyze the InfoVis Toolkit, to incorporate ValueCharts into the visualization, to carry out a small study to comparing our UFinder to similar decision-support visualizations, to extend the interface to allow users to compare two data points directly, to extend the interface to work with missing data fields, or to extend the design so that it can accept any generic data set with a hierarchical nature.
Date | Milestone |
March 7 |   •    Familiar with the InfoVis Toolkit.
  •    Meet with Giuseppe Carenini to discuss extensions of ValueCharts |
March 15 |   •    Preliminary interface done (without functionality)
  •    Class presentation |
March 29 |   •    Scatterplot working
  •    Extension chosen |
April 5 |   •    Implementation complete |
April 19 |   •    Extension complete |
[1] | Christopher Williamson, and Ben Shneiderman. The Dynamic HomeFinder: Evaluating dynamic queries in a real-estate information exploration system, In Proc. ACM SIGIR'92, pp338-346.   |
[2] | Christopher Ahlberg and Ben Shneiderman. Visual Information Seeking: Tight Coupling of Dynamic Query Filters with Starfield Displays, In Proc. ACM CHI 1994, pp 313-317.   |
[3] | Brian Johnson and Ben Shneiderman. Treemaps: a space-filling approach to the visualization of hierarchical in-formation structures. In Proc. of the 2nd International IEEE Visualization Conference, pages 284–291, October 1991.   |
[4] | Giuseppe Carenini and John Lloyd. ValueCharts: Analyzing Linear Models Expressing Preferences and Evaluation. In publication.   |
[5] | Jean-Daniel Fekete. The InfoVis Toolkit. Version 0.6alpha2, 2004. http://www.lri.fr/~fekete/InfovisToolkit/   |