On the Relationship between the Vocabulary of Bug Reports and Source Code

29th IEEE International Conference on Software Maintenance, pages 452--455 (2013)

Abstract

Text retrieval (TR) techniques have been widely used to support concept and bug location. When locating bugs, developers often formulate queries based on the bug descriptions. More than that, a large body of research uses bug descriptions to evaluate bug location techniques using TR. The implicit assumption is that the bug descriptions and the relevant source code files share important words. In this paper, we present an empirical study that explores this conjecture. We found that bug reports share more terms with the patched classes than with the other classes in the system. Furthermore, we found that the class names are more likely to share terms with the bug descriptions than other code locations, while more verbose parts of the code (e.g., comments) will share more words. We also found that the shared terms may be better predictors for bug location than some TR techniques.
