Final Project Proposal - ShadyStats533C: Information Visualization |
Mike Cora November 4, 2005 {mcora} @ cs ubc ca |
|||||||||||||||
Domain |
In this time of blazing fast graphics hardware, surround sound processors, and oversaturated gaming industry, a video game’s AI and gameplay are playing larger and larger roles setting it apart from the rest. Statistical analysis is almost a given when designing a tuneable AI system. Stats should be gathered throughout the development cycle: both during experimentation with different algorithms and gameplay mechanisms, and during feature development and final bug fixing. An info visualization tool to facilitate the interpretation of these statistics can only have a positive effect on the final quality of the game. |
|||||||||||||||
Dataset |
The dataset used was generated from Nintendo’s Mario Super Strikers, a 5-on-5 over-the-top arcade soccer game to be released on December 5th - go play it, it's fun! :) It includes statistics from about 2400 AI vs. AI games played at six different dates over the span of the last 2 months of development, consisting of 15 metrics (dimensions), times 2 for Home/Away. That would make 2400 games * 15 * 2 = 72,000 numbers, a pretty modest dataset, but still large enough to overwhelm an Excel spreadsheet: I have designed and implemented the AI system which generated the dataset during this past year, so I have a fair bit of expertise with the domain, however, I have never done any sort of statistical analysis (and even less multidimensional analysis). |
|||||||||||||||
Tasks |
The tool proposed should aid in the statistical analysis of multi-dimensional data gathered by the AI, with the following goals in mind:
|
|||||||||||||||
The Tool: ShadyStats |
The tool would load the dataset and make use of a variety of multidimensional visualization methods. Some specific implementation goals are:
ShadyStats shall be written in Java because of the availability of information visualization libraries. The ones I will most likely use or at least try out are: InfoVis Toolkit, Prefuse, and ILOG, using the Eclipse IDE (which I have never used before). Following is a super rough layout of the proposed tool. The sample parallel coordinate image was taken from Hierarchical Parallel Coordinates for Visualizing Large Multivariate Data Sets Ying-Huey Fua, Matthew O. Ward, and Elke A. Rundensteiner, IEEE Visualization '99. I will attempt to implement some of the techniques described in this paper, to reduce the clutter associated with parallel graphs. |
|||||||||||||||
Usage Scenario |
The AI programmer has made some recent changes to formation positioning and passing code. A good baseline tuning has been established a week prior and stats were gathered. He wants to see if his recent changes could have introduced any imbalances when compared to the baseline. The ShadyStats maintains a history of previous datasets, so the baseline is already in the list. The new dataset is loaded, and selected from the Datasets checklist along with the baseline dataset. Filtering and highliting options are set for emphasizing differences between datasets (rather than comparing between different dimensions). The baseline is displayed as green, and the new dataset as red (actual colours to be determined). Things look alright at the lower difficulty levels, however the new passing code caused them to pass more times at the higher difficulty levels when compared to the baseline, apparent when aggregating the parallel coordinates based on difficulty level. Another apparent change is that the other team's pass intercepts shot up also, reducing the effectiveness of the offensive gameplay. The user selects the # of passes, and pass intercepts dimensions and creates a scatterplot to get a better view of the relationship. |
|||||||||||||||
Milestones |
|