Difference: DuckyThesisProposalNotes (16 vs. 17)

Revision 172006-11-21 - DuckySherwood

Line: 1 to 1
 
META TOPICPARENT name="DuckyHomework"

Ducky Thesis Proposal Notes

Line: 13 to 13
 
    • P1.1: We do not have a shared vocabulary for discussing different techniques that developers use when developing code with IDEs. (?)
  • P2: We do not know which techniques are the most productive.
  • P3: We do not know how to teach/train developers how to be more productive.
Added:
>
>
  • P4: Coding video is expensive, annoying, and it is difficult to be consistent in coding (especially when one doesn't know what one is looking for)
 

Givens (right word?):

Robillard et al. showed:
Line: 26 to 27
 
  • H3: Data mining software can discover interesting interaction patterns in navigation tasks.
  • H4: Data mining software can discover interesting interaction patterns in more general code-development tasks.
  • H5: Success in coding tasks correlates with which interaction patterns the developer uses.
Added:
>
>
  • H6: Patterns are similar across tools.
  • H7: Patterns are similar across languages.
 
Added:
>
>
I plan to do H1, H2, and H3. I hope to also do H4.
 

Literature Review

Deleted:
<
<
@@@ A presentation of the relevant literature and the theoretical framework.
 
  • Robillard et al
  • Murphy/Kersten/Findlater
  • BSD et al (unpublished)
Changed:
<
<
  • ?
>
>
  • Jonathan Sillito's questions
  • Andrew Ko ISCE05
 

Proposed data-gathering methods

I will use data collected by BSD which contains a replication of the first part of Robillard et al's study, where professional programmers search for specific interesting methods in the code.

For further work, I have access to

Changed:
<
<
  • many individual logs of traces of a small number of developers either fixing one well-described bug or adding a well-defined feature, with the code available
  • many individual logs of traces of a large number of developers working on unknown material, without the code available
>
>
  • from the Mylar bugzilla: many individual compressed logs of traces of a small number of developers either fixing one well-described bug or adding a well-defined feature, with the context
  • from the glob: many individual logs of traces of a large number of developers working on unknown material, without the code available
 
Changed:
<
<
We do not have data corresponding to the second part of Robillard et al's study, where professional programmers attempt to add a feature. I might need to run a user study replicating that part.
>
>
We do not have data corresponding to the second part of Robillard et al's study, where a small number of professional programmers all do the same task. I might need to run a user study replicating that part.
 

Proposed analysis methods

Line: 53 to 57
 
  1. eyeballs (better word?) -- I will examine the data visually, with filters as appropriate to change how the data is visualized
  2. protein-motif finding algorithm -- I will use a modified protein motif-finding algorithm to search for common patterns, and judgement to select interesting ones.
  3. data-visualization and mining tools, e.g. YALE -- I will use data mining and visualization tools to search through the patterns.
Added:
>
>
  1. (potentially, tho hopefully not) write my own algorithm and/or modify an existing algorithm
 
Changed:
<
<
Having found patterns, I will write code to recognize those patterns.

@@@ not sure what to put for what statistical tests I will use

>
>
Having found patterns, I will write code to
  • recognize those patterns
  • (optionally?) to generate reports with
    • what patterns were found
    • description of the pattern distribution compared with the broader population's distribution
    • ?proscriptive advice on more effective techniques for coding?
 
 
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2025 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback