Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1985441.1985475acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
short-paper

An empirical analysis of the FixCache algorithm

Published: 21 May 2011 Publication History

Abstract

The FixCache algorithm, introduced in 2007, effectively identifies files or methods which are likely to contain bugs by analyzing source control repository history. However, many open questions remain about the behaviour of this algorithm. What is the variation in the hit rate over time? How long do files stay in the cache? Do buggy files tend to stay buggy, or can they be redeemed? This paper analyzes the behaviour of the FixCache algorithm on four open source projects. FixCache hit rate is found to generally increase over time for three of the four projects; file duration in cache follows a Zipf distribution; and topmost bug-fixed files go through periods of greater and lesser stability over a project's history.

References

[1]
Apache httpd. https://github.com/apache/httpd.
[2]
A. Bachmann, C. Bird, F. Rahman, P. Devanbu, and A. Bernstein. The missing links: Bugs and bug-fix commits. FSE 2010.
[3]
V. Basili, L. Briand, W. Melo. A validation of object-oriented design metrics as quality indicators. IEEE TSE, 22(10):751--761, 2002.
[4]
C. Bird, A. Bachmann, E. Aune, J. Duffy, A. Bernstein, V. Filkov, and P. Devanbu. Fair and balanced?: Bias in bug-fix datasets. FSE 2009.
[5]
C. Bird, P. Rigby, E. Barr, D. Hamilton, D. German, and P. Devanbu. The promises and perils of mining Git. MSR 2009.
[6]
E. Engstrm, P. Runeson, and G. Wikstrand. An empirical evaluation of regression testing based on fix-cache recommendations. Int'l Conf. Software Testing, Verification & Validation (ICST) 2010.
[7]
A. Hassan R. Holt. The top ten list: Dynamic fault prediction. ICSM 2005.
[8]
S. Kim, T. Zimmermann, E. Whitehead Jr, and A. Zeller. Predicting faults from cached history. ICSE 2007.
[9]
N. Nagappan, T. Ball, and A. Zeller. Mining metrics to predict component failures. ICSE 2006.
[10]
T. Ostrand, E. Weyuker, and R. Bell. Predicting the location and number of faults in large software systems. IEEE TSE, 31(4):340--355, 2005.
[11]
Postgres. http://www.github.com/postgres/postgres.
[12]
G. Robles, S. Koch, J. M. Gonzlez-Barahona, and J. Carlos. Remote analysis and measurement of libre software systems by means of the CVSAnalY tool. RAMSS 2004.
[13]
J. Zliwerski, T. Zimmermann, and A. Zeller. When do changes induce fixes? MSR 2005.
[14]
Voldemort. https://github.com/voldemort/voldemort
[15]
Z. Wang. Fix Cache Based Regression Test Selection. Master's thesis, Chalmers Univ. of Technology, Univ. of Gothenburg, 2010.
[16]
G. Wikstrand, R. Feldt, J. Gorantla, W. Zhe, and C. White. Dynamic regression test selection based on a file cache an industrial evaluation. ICST 2009.

Cited By

View all
  • (2022)Pitfalls and guidelines for using time-based Git dataEmpirical Software Engineering10.1007/s10664-022-10200-y27:7Online publication date: 6-Oct-2022
  • (2021)Escaping the Time Pit: Pitfalls and Guidelines for Using Time-Based Git Data2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR)10.1109/MSR52588.2021.00022(85-96)Online publication date: May-2021
  • (2021)Onboarding vs. Diversity, Productivity, and QualityProceedings of the 43rd International Conference on Software Engineering10.1109/ICSE43902.2021.00097(1033-1045)Online publication date: 22-May-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MSR '11: Proceedings of the 8th Working Conference on Mining Software Repositories
May 2011
260 pages
ISBN:9781450305747
DOI:10.1145/1985441
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 May 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. fixcache
  2. software bug prediction

Qualifiers

  • Short-paper

Conference

ICSE11
Sponsor:
ICSE11: International Conference on Software Engineering
May 21 - 22, 2011
HI, Waikiki, Honolulu, USA

Upcoming Conference

ICSE 2025

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 22 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Pitfalls and guidelines for using time-based Git dataEmpirical Software Engineering10.1007/s10664-022-10200-y27:7Online publication date: 6-Oct-2022
  • (2021)Escaping the Time Pit: Pitfalls and Guidelines for Using Time-Based Git Data2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR)10.1109/MSR52588.2021.00022(85-96)Online publication date: May-2021
  • (2021)Onboarding vs. Diversity, Productivity, and QualityProceedings of the 43rd International Conference on Software Engineering10.1109/ICSE43902.2021.00097(1033-1045)Online publication date: 22-May-2021
  • (2018)An empirical study of software change classification with imbalance data‐handling methodsSoftware: Practice and Experience10.1002/spe.260648:11(1968-1999)Online publication date: 29-Jun-2018
  • (2016)Potential-fault cache-based regression test selection2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)10.1109/AICCSA.2016.7945658(1-8)Online publication date: Nov-2016
  • (2015)Towards a Model-based Software Mining InfrastructureACM SIGSOFT Software Engineering Notes10.1145/2693208.269322440:1(1-8)Online publication date: 6-Feb-2015
  • (2012)An Empirical Analysis of Software Changes on Statement Entity in Java Open Source ProjectsInternational Journal of Open Source Software and Processes10.4018/jossp.20120401024:2(16-31)Online publication date: 1-Apr-2012
  • (2011)BugCache for inspectionsProceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on Foundations of software engineering10.1145/2025113.2025157(322-331)Online publication date: 5-Sep-2011

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media