Frequency Distribution of Error Messages
Which programming error messages are the most common? We investigate this question, motivated by writing error explanations better suited to novices. We consider two very large data sets, one in Python and the other in Java, both combining syntax and run-time errors. We group essentially identical messages and then determine the most common ones. In both data sets, we find that the error message frequencies empirically resemble Zipf-Mandelbrot distributions. We use a maximum-likelihood approach to select the distribution parameters. This gives one possible way to contrast languages or compilers quantitatively.
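As a rough illustration of the fitting step, the following minimal sketch assumes the usual Zipf-Mandelbrot form, in which the message of rank k has probability proportional to (k + q)^(-s), and picks the exponent s and shift q that maximize the likelihood of the observed counts. The function names and the coarse grid search are assumptions made for this example and are not taken from the paper, which may use a different optimizer.

```python
import math


def zm_log_likelihood(ranked_counts, s, q):
    """Log-likelihood of rank-ordered counts under a Zipf-Mandelbrot law.

    Assumed model: P(rank k) is proportional to (k + q) ** (-s), k = 1..N.
    """
    n_ranks = len(ranked_counts)
    # Unnormalized log-probability of each rank.
    log_weights = [-s * math.log(k + q) for k in range(1, n_ranks + 1)]
    # Log of the normalizing constant (sum of all unnormalized weights).
    log_norm = math.log(sum(math.exp(w) for w in log_weights))
    return sum(c * (w - log_norm) for c, w in zip(ranked_counts, log_weights))


def fit_zipf_mandelbrot(counts, s_grid, q_grid):
    """Return the (s, q) pair on the grid with the highest likelihood.

    Hypothetical helper: a grid search stands in for a proper optimizer.
    """
    ranked = sorted(counts, reverse=True)  # rank 1 = most frequent message
    best = max(
        (zm_log_likelihood(ranked, s, q), s, q)
        for s in s_grid
        for q in q_grid
    )
    return best[1], best[2]


if __name__ == "__main__":
    # Synthetic counts for illustration only, not data from the study.
    demo_counts = [1000 * (k + 2.0) ** -1.5 for k in range(1, 51)]
    print(fit_zipf_mandelbrot(demo_counts, [0.5, 1.0, 1.5, 2.0], [0.0, 1.0, 2.0, 3.0]))
```

Comparing the fitted (s, q) pairs across data sets is one way to read the abstract's suggestion of contrasting languages or compilers quantitatively. A finer grid or a numerical optimizer would be needed for real data.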
Frequency Distribution of Error Messages (plateau2015-pritchard.pdf), 960 KiB
David Pritchard studied computer science and mathematics at MIT and the University of Waterloo, obtaining his PhD in 2010. He taught at Waterloo, EPFL (Switzerland), Princeton University, and the University of Southern California, while developing free software for students to practice and learn introductory programming online. He is currently employed at Google Los Angeles, and continues to volunteer for the Computer Science Circles project, which is hosted by the Centre for Education in Mathematics and Computing in Waterloo.
Mon 26 Oct. Times are displayed in time zone: Eastern Time (US & Canada).
10:30 - 12:00
Time | Length | Type | Title | Track | Speakers | Attachments
10:30 | 20m | Talk | Frequency Distribution of Error Messages | PLATEAU | David Pritchard (University of Waterloo, Canada) | File Attached
10:50 | 20m | Talk | An Evaluation of the DiaSuite Toolset by Professional Developers | PLATEAU | Milan Kabáč (University of Bordeaux / Inria Bordeaux / LaBRI), Nic Volanschi (Inria Bordeaux), Charles Consel (University of Bordeaux) | File Attached
11:10 | 20m | Talk | Aiding Programmers using Lightweight Integrated Code Visualization | PLATEAU | | File Attached
11:30 | 10m | Talk | Towards moldable development tools | PLATEAU | Andrei Chiş (University of Bern, Switzerland), Tudor Gîrba (tudorgirba.com, Switzerland), Oscar Nierstrasz (University of Bern, Switzerland) | Pre-print, Media Attached, File Attached
11:40 | 13m | Talk | Understanding the Effects of Code Presentation | PLATEAU | | File Attached