Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2020390.2020401acmotherconferencesArticle/Chapter ViewAbstractPublication PagespromiseConference Proceedingsconference-collections
research-article

Studying the fix-time for bugs in large open source projects

Published: 20 September 2011 Publication History
  • Get Citation Alerts
  • Abstract

    Background: Bug fixing lies at the core of most software maintenance efforts. Most prior studies examine the effort needed to fix a bug (fix-effort). However, the effort needed to fix a bug may not correlate with the calendar time needed to fix it (fix-time). For example, the fix-time for bugs with low fix-effort may be long if they are considered to be of low priority.
    Aims: We study the fix-time for bugs in large open source projects.
    Method: We study the fix-time along three dimensions: (1) the location of the bug (e.g., which component), (2) the reporter of the bug, and (3) the description of the bug. Using these three dimensions and their associated attributes, we examine the fix-time for bugs in two large open source projects: Eclipse and Mozilla, using a random forest classifier.
    Results: We show that we can correctly classify ~65% of the time the fix-time for bugs in these projects. We perform a sensitivity analysis to identify the most important attributes in each dimension. We find that the time of the filing of a bug and its location are the most important attributes in the Mozilla project for determining the fix-time of a bug. On the other hand, the fix-time in the Eclipse project is highly dependant on the severity of the bug. Surprisingly, the priority of the bug is not an important attribute when determining the fix-time for a bug in both projects.
    Conclusion: Attributes affecting the fix-time vary between projects and vary over time within the same project.

    References

    [1]
    J. Anvik, L. Hiew, and G. C. Murphy. Who should fix this bug? In Proceedings of the 28th International Conference on Software Engineering, pages 361--370, Shanghai, China, May 2006.
    [2]
    N. Bettenburg, S. Just, A. Schröter, C. Weiß, R. Premraj, and T. Zimmermann. What Makes a Good Bug Report. Technical report, Saarland University, 2008.
    [3]
    I. T. Bowman, R. C. Holt, and N. V. Brewster. Reconstructing Ownership Architectures To Help Understand Software Systems. In Proceedings of the 7th International Workshop on Program Comprehension, Pittsburgh, USA, May 1999.
    [4]
    L. Breiman. Bagging Predictors. Machine Learning, 26(1):123--140, 1996.
    [5]
    L. Breiman. Random Forests. Machine Learning, 45(1):5--32, 2001.
    [6]
    M. E. Conway. How do comittees invent? 14(4):28--31, 1968.
    [7]
    D. Cubranic and G. C. Murphy. Automatic bug triage using text categorization. In Proceedings of the Sixteenth International Conference on Software Engineering and Knowledge Engineering, pages 92--97, Banff, Canada, 2004.
    [8]
    J. Frederick P. Brooks. The Mythical Man-Month. Addison Wesley Professional, 1974.
    [9]
    T. L. Graves, A. F. Karr, J. S. Marron, and H. P. Siy. Predicting fault incidence using software change history. IEEE Transactions on Software Engineering, 26(7):653--661, 2000.
    [10]
    A. E. Hassan and K. Zhang. Using decision trees to predict the certification result of a build. In ASE '06: Proceedings of the 21st IEEE/ACM International Conference on Automated Software Engineering, pages 189--198, Washington, DC, USA, 2006. IEEE Computer Society.
    [11]
    P. Hooimeijer and W. Weimer. Modeling bug report quality. In Proceedings of the twenty-second IEEE/ACM international conference on Automated software engineering, pages 34--43, New York, NY, USA, 2007. ACM.
    [12]
    J. Quinlan. Programs for Machine Learning. Morgan Kaufmann, 1993.
    [13]
    J. P. Kincaid, R. P. Fishburne, R. L. Rogers, and B. S. Chissom. Derivation of new readability formulas (automated readability index, fog count and flesch reading ease formula) for navy enlisted personnel. Research Branch Report 8-75, Saarland University, 1975.
    [14]
    M. Leszak, D. E. Perry, and D. Stoll. Classification and evaluation of defects in a project retrospective. The Journal of Systems and Software, 61(3):173--187, 2002.
    [15]
    J. Munson and T. Khoshgoftaar. The Detection of Fault-Prone Programs. IEEE Transactions on Software Engineering, 18(5):423--433, 1992.
    [16]
    N. Ohlsson and H. Alberg. Predicting Fault-Prone Software Modules in Telephone Switches. IEEE Transactions on Software Engineering, 22(12):886--894, dec 1996.
    [17]
    L. D. Panjer. Predicting eclipse bug lifetimes. In Proceedings of the Fourth International Workshop on Mining Software Repositories, May 2007.
    [18]
    D. E. Perry and C. S. Steig. Software Faults in Evolving a Large, Real-Time System: a Case Study. In Proceedings of the 4th European Software Engineering Conference, Garmisch, Germany, Sept. 1993.
    [19]
    J. Rice. Mathematical Statisitcs and Data Analysis. Duxbury press, 1995.
    [20]
    J. S. Shirabad. Supporting Software Maintenance by Mining Software Update Records. PhD thesis, University of Ottawa, 2003.
    [21]
    Q. Song, M. Shepperd, M. Cartwright, and C. Mair. Software defect association mining and defect correction effort prediction. IEEE Transactions on Software Engineering, 32(2):69--82, feb 2006.
    [22]
    C. Weiß, R. Premraj, T. Zimmermann, and A. Zeller. How long will it take to fix this bug? In Proceedings of the Fourth International Workshop on Mining Software Repositories, May 2007.
    [23]
    H. Zeng and D. Rine. Estimation of software defects fix effort using neural networks. In Annual International Computer Software And Applications Conference, Sept 2004.

    Cited By

    View all
    • (2024)Graph Neural Network vs. Large Language Model: A Comparative Analysis for Bug Report Priority and Severity PredictionProceedings of the 20th International Conference on Predictive Models and Data Analytics in Software Engineering10.1145/3663533.3664042(2-11)Online publication date: 10-Jul-2024
    • (2024)Smarter Project Selection for Software Engineering ResearchProceedings of the 20th International Conference on Predictive Models and Data Analytics in Software Engineering10.1145/3663533.3664037(12-21)Online publication date: 10-Jul-2024
    • (2024)CEDAR: Continuous Testing of Deep Learning Libraries2024 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)10.1109/SANER60148.2024.00044(371-382)Online publication date: 12-Mar-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    Promise '11: Proceedings of the 7th International Conference on Predictive Models in Software Engineering
    September 2011
    145 pages
    ISBN:9781450307093
    DOI:10.1145/2020390
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 September 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. bug fix-time
    2. empirical software engineering
    3. mining software repositories

    Qualifiers

    • Research-article

    Conference

    Promise '11

    Acceptance Rates

    Promise '11 Paper Acceptance Rate 15 of 35 submissions, 43%;
    Overall Acceptance Rate 98 of 213 submissions, 46%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)54
    • Downloads (Last 6 weeks)11
    Reflects downloads up to 10 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Graph Neural Network vs. Large Language Model: A Comparative Analysis for Bug Report Priority and Severity PredictionProceedings of the 20th International Conference on Predictive Models and Data Analytics in Software Engineering10.1145/3663533.3664042(2-11)Online publication date: 10-Jul-2024
    • (2024)Smarter Project Selection for Software Engineering ResearchProceedings of the 20th International Conference on Predictive Models and Data Analytics in Software Engineering10.1145/3663533.3664037(12-21)Online publication date: 10-Jul-2024
    • (2024)CEDAR: Continuous Testing of Deep Learning Libraries2024 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)10.1109/SANER60148.2024.00044(371-382)Online publication date: 12-Mar-2024
    • (2024)An empirical study on the potential of word embedding techniques in bug report management tasksEmpirical Software Engineering10.1007/s10664-024-10510-329:5Online publication date: 25-Jul-2024
    • (2024)On the value of instance selection for bug resolution prediction performanceJournal of Software: Evolution and Process10.1002/smr.2710Online publication date: 2-Jul-2024
    • (2023)Large-Scale Identification and Analysis of Factors Impacting Simple Bug Resolution Times in Open Source Software RepositoriesApplied Sciences10.3390/app1305315013:5(3150)Online publication date: 28-Feb-2023
    • (2023)There’s no Such Thing as a Free Lunch: Lessons Learned from Exploring the Overhead Introduced by the Greenkeeper Dependency Bot in NpmACM Transactions on Software Engineering and Methodology10.1145/352258732:1(1-40)Online publication date: 13-Feb-2023
    • (2023)Investigating the Impact of Bug Dependencies on Bug-Fixing Time Prediction2023 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM)10.1109/ESEM56168.2023.10304804(1-12)Online publication date: 26-Oct-2023
    • (2023)Understanding and predicting incident mitigation timeInformation and Software Technology10.1016/j.infsof.2022.107119155:COnline publication date: 1-Mar-2023
    • (2023)A multi-model framework for semantically enhancing detection of quality-related bug report descriptionsEmpirical Software Engineering10.1007/s10664-022-10280-w28:2Online publication date: 11-Feb-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media