Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2020390.2020402acmotherconferencesArticle/Chapter ViewAbstractPublication PagespromiseConference Proceedingsconference-collections
research-article

Selecting discriminating terms for bug assignment: a formal analysis

Published: 20 September 2011 Publication History

Abstract

Background. The bug assignment problem is the problem of triaging new bug reports to the most qualified developer. The qualified developer is the one who has enough knowledge in a specific area that is relevant to the reported bug. In recent years, bug triaging has received a considerable amount of attention from researchers. In previous work, bugs were represented as vectors of terms extracted from the bug reports' description. Once the bugs are represented as vectors in the terms space, traditional machine learning techniques are employed for the bug assignment. Most of the previous algorithms are marred by low accuracy values. Aims. This paper formulates the bug assignment problem as a classification task, and then examines the impact of several term selection approaches on the classification effectiveness. Method. Three variants selection methods that are based on the Log Odds Ratio (LOR) score are compared against methods that are based on the Information Gain (IG) score and Latent Semantic Analysis (LSA). The main difference in the methods that are based on the LOR score is in the process of selecting the terms. Results. Term selection techniques that are based on the Log Odds Ratio achieved up to 30% improvement in the precision and up to 5% higher in recall compared to other term selection methods such as Latent Semantic Analysis and Information Gain. Conclusions. Experimental results showed that the effectiveness of bug assignment methods is directly affected by the selected terms that are used in the classification methods.

References

[1]
J. Anvik. Automating bug report assignment. In Proceedings of the 28th international conference on Software engineering, ICSE '06, pages 937--940, New York, NY, USA, 2006. ACM.
[2]
J. Anvik, L. Hiew, and G. C. Murphy. Coping with an open bug repository. In ETX, pages 35--39, 2005.
[3]
J. Anvik, L. Hiew, and G. C. Murphy. Who should fix this bug? In ICSE, pages 361--370, 2006.
[4]
R. Bekkerman and J. Allan. Using bigrams in text categorization. Department of Computer Science University of Massachusetts Amherst, 1003:1--10, 2003.
[5]
B. W. Boehm and V. R. Basili. Software defect reduction top 10 list. IEEE Computer, 34(1):135--137, 2001.
[6]
D. Cubranic and G. C. Murphy. Automatic bug triage using text categorization. In SEKE, pages 92--97, 2004.
[7]
S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harshman. Indexing by latent semantic analysis. Journal of The American Society For Information Science, 41(6):391--407, 1990.
[8]
N. Friedman, D. Geiger, and M. Goldszmidt. Bayesian network classifiers. Mach. Learn., 29(2--3):131--163, 1997.
[9]
G. Jeong, S. Kim, and T. Zimmermann. Improving bug triage with bug tossing graphs. In Proceedings of the 7th joint meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering. ACM, August 2009.
[10]
R. M. Mainur, R. Guenther, and Z. Thomas. Optimized assignment of developers for fixing bugs an initial evaluation for eclipse projects. In ESEM '09: Proceedings of the 2009 3rd International Symposium on Empirical Software Engineering and Measurement, pages 439--442, Washington, DC, USA, 2009. IEEE Computer Society.
[11]
C. D. Manning, P. Raghavan, and H. Schtze, Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA, 2008.
[12]
A. K. McCallum. Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering, 1996. http://www.cs.cmu.edu/mccallum/bow.
[13]
C. V. Ramamoorthy and W. T. Tsai. Advances in software engineering. IEEE Computer, 29(10):47--58, 1996.
[14]
I. Witten, E. Frank, L. Trigg, M. Hall, G. Holmes, and S. Cunningham. Weka: Practical machine learning tools and techniques with java implementations. ICONIP/ANZIIS/ANNES, 99:192--196, 1999.

Cited By

View all
  • (2022)Taxonomy of bug tracking process smellsInformation and Software Technology10.1016/j.infsof.2022.106972150:COnline publication date: 1-Oct-2022
  • (2021)Analytical Study on Bug Triaging PracticesResearch Anthology on Recent Trends, Tools, and Implications of Computer Programming10.4018/978-1-7998-3016-0.ch049(1068-1094)Online publication date: 2021
  • (2021)Issue Auto-Assignment in Software Projects with Machine Learning Techniques2021 IEEE/ACM 8th International Workshop on Software Engineering Research and Industrial Practice (SER&IP)10.1109/SER-IP52554.2021.00018(65-72)Online publication date: Jun-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
Promise '11: Proceedings of the 7th International Conference on Predictive Models in Software Engineering
September 2011
145 pages
ISBN:9781450307093
DOI:10.1145/2020390
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 September 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. bug assignment
  2. bug reports
  3. classification
  4. machine learning

Qualifiers

  • Research-article

Funding Sources

Conference

Promise '11

Acceptance Rates

Promise '11 Paper Acceptance Rate 15 of 35 submissions, 43%;
Overall Acceptance Rate 98 of 213 submissions, 46%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Taxonomy of bug tracking process smellsInformation and Software Technology10.1016/j.infsof.2022.106972150:COnline publication date: 1-Oct-2022
  • (2021)Analytical Study on Bug Triaging PracticesResearch Anthology on Recent Trends, Tools, and Implications of Computer Programming10.4018/978-1-7998-3016-0.ch049(1068-1094)Online publication date: 2021
  • (2021)Issue Auto-Assignment in Software Projects with Machine Learning Techniques2021 IEEE/ACM 8th International Workshop on Software Engineering Research and Industrial Practice (SER&IP)10.1109/SER-IP52554.2021.00018(65-72)Online publication date: Jun-2021
  • (2021)Towards a Taxonomy of Bug Tracking Process Smells: A Quantitative Analysis2021 47th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)10.1109/SEAA53835.2021.00026(138-147)Online publication date: Sep-2021
  • (2020)Analytical Study on Bug Triaging PracticesCognitive Analytics10.4018/978-1-7998-2460-2.ch088(1698-1725)Online publication date: 2020
  • (2020)Software Defect Prediction Using Heterogeneous Ensemble Classification Based on Segmented PatternsApplied Sciences10.3390/app1005174510:5(1745)Online publication date: 3-Mar-2020
  • (2020)Vocabulary and time based bug‐assignment: A recommender system for open‐source projectsSoftware: Practice and Experience10.1002/spe.283050:8(1539-1564)Online publication date: 27-Apr-2020
  • (2020)Guidelines for evaluating bug‐assignment researchJournal of Software: Evolution and Process10.1002/smr.225032:9Online publication date: 3-Sep-2020
  • (2018)A Conceptual Tool to Improve the Management of Software DefectsBusiness Modeling and Software Design10.1007/978-3-319-94214-8_35(443-451)Online publication date: 30-Jun-2018
  • (2018)The Application of Change Indicators in Mining Software RepositoriesTrends and Advances in Information Systems and Technologies10.1007/978-3-319-77712-2_40(418-428)Online publication date: 17-May-2018
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media