
Data mining source code for locating software bugs: A case study in telecommunication industry

B. Turhan, G. Kocak, and A. Bener. Expert Systems with Applications, 36 (6): 9986 - 9990 (2009)
DOI: http://dx.doi.org/10.1016/j.eswa.2008.12.028

Abstract

In a large software system knowing which files are most likely to be fault-prone is valuable information for project managers. They can use such information in prioritizing software testing and allocating resources accordingly. However, our experience shows that it is difficult to collect and analyze fine-grained test defects in a large and complex software system. On the other hand, previous research has shown that companies can safely use cross-company data with nearest neighbor sampling to predict their defects in case they are unable to collect local data. In this study we analyzed 25 projects of a large telecommunication system. To predict defect proneness of modules we trained models on publicly available NASA MDP data. In our experiments we used static call graph based ranking (CGBR) as well as nearest neighbor sampling for constructing method level defect predictors. Our results suggest that, for the analyzed projects, at least 70% of the defects can be detected by inspecting only (i) 6% of the code using a Naïve Bayes model, (ii) 3% of the code using the CGBR framework.
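
As a rough illustration of the idea the abstract describes (filtering cross-company training data by nearest neighbor sampling, then training a Naïve Bayes defect predictor), the sketch below may help. It is not the authors' implementation: the two numeric features, all data values, the choice of k = 2, the Euclidean distance, and every function name are assumptions made up for this example.

```python
import math

def nearest_neighbor_sample(cross_rows, local_rows, k=2):
    """For each local row, keep its k nearest cross-company rows
    (Euclidean distance) and return the union of the kept rows."""
    chosen = set()
    for x in local_rows:
        ranked = sorted(range(len(cross_rows)),
                        key=lambda i: math.dist(cross_rows[i][0], x))
        chosen.update(ranked[:k])
    return [cross_rows[i] for i in sorted(chosen)]

def fit_gaussian_nb(rows):
    """rows: list of (features, label). Returns, per class, a tuple of
    (prior, feature means, feature variances)."""
    model = {}
    for label in {y for _, y in rows}:
        feats = [f for f, y in rows if y == label]
        n = len(feats)
        means = [sum(col) / n for col in zip(*feats)]
        # A small constant keeps variances strictly positive.
        variances = [sum((v - m) ** 2 for v in col) / n + 1e-6
                     for col, m in zip(zip(*feats), means)]
        model[label] = (n / len(rows), means, variances)
    return model

def predict(model, x):
    """Return the class with the highest Gaussian log-posterior for x."""
    def log_post(prior, means, variances):
        total = math.log(prior)
        for v, m, s2 in zip(x, means, variances):
            total += -0.5 * math.log(2 * math.pi * s2) - (v - m) ** 2 / (2 * s2)
        return total
    return max(model, key=lambda label: log_post(*model[label]))

# Hypothetical cross-company data: ((size, complexity), defective?)
cross = [((10, 1), 0), ((12, 2), 0), ((15, 2), 0), ((200, 30), 1),
         ((220, 35), 1), ((180, 25), 1), ((5000, 400), 1), ((1, 1), 0)]
# Hypothetical unlabeled local modules, described by the same two features
local = [(11, 2), (210, 32)]

sample = nearest_neighbor_sample(cross, local, k=2)
model = fit_gaussian_nb(sample)
labels = [predict(model, x) for x in local]
```

In this toy setup the filter discards cross-company rows that are far from every local module (for example the very large outlier), so the classifier is trained only on data resembling the local project.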


Links and resources

BibTeX key: Turhan20099986
entry type: article
year: 2009
journal: Expert Systems with Applications
number: 6
pages: 9986 - 9990
volume: 36
issn: 0957-4174
DOI: http://dx.doi.org/10.1016/j.eswa.2008.12.028
url: http://www.sciencedirect.com/science/article/pii/S0957417408009275

Cite this publication

@article{Turhan20099986,
  abstract = {In a large software system knowing which files are most likely to be fault-prone is valuable information for project managers. They can use such information in prioritizing software testing and allocating resources accordingly. However, our experience shows that it is difficult to collect and analyze fine-grained test defects in a large and complex software system. On the other hand, previous research has shown that companies can safely use cross-company data with nearest neighbor sampling to predict their defects in case they are unable to collect local data. In this study we analyzed 25 projects of a large telecommunication system. To predict defect proneness of modules we trained models on publicly available Nasa {MDP} data. In our experiments we used static call graph based ranking (CGBR) as well as nearest neighbor sampling for constructing method level defect predictors. Our results suggest that, for the analyzed projects, at least 70% of the defects can be detected by inspecting only (i) 6% of the code using a Naïve Bayes model, (ii) 3% of the code using {CGBR} framework.},
  added-at = {2015-09-17T19:32:54.000+0200},
  author = {Turhan, Burak and Kocak, Gozde and Bener, Ayse},
  biburl = {https://www.bibsonomy.org/bibtex/221827d34b38c4e4669b7542153b9f6bc/burak.turhan},
  description = {Data mining source code for locating software bugs: A case study in telecommunication industry},
  doi = {http://dx.doi.org/10.1016/j.eswa.2008.12.028},
  interhash = {67ef04d8e77ecc845e3bb4736b22e54e},
  intrahash = {21827d34b38c4e4669b7542153b9f6bc},
  issn = {0957-4174},
  journal = {Expert Systems with Applications},
  keywords = {myown},
  number = 6,
  pages = {9986 - 9990},
  timestamp = {2015-09-17T19:32:54.000+0200},
  title = {Data mining source code for locating software bugs: A case study in telecommunication industry},
  url = {http://www.sciencedirect.com/science/article/pii/S0957417408009275},
  volume = 36,
  year = 2009
}
