Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

A Hitchhiker's guide to statistical tests for assessing randomized algorithms in software engineering

Published: 01 May 2014 Publication History

Abstract

Randomized algorithms are widely used to address many types of software engineering problems, especially in the area of software verification and validation with a strong emphasis on test automation. However, randomized algorithms are affected by chance and so require the use of appropriate statistical tests to be properly analysed in a sound manner. This paper features a systematic review regarding recent publications in 2009 and 2010 showing that, overall, empirical analyses involving randomized algorithms in software engineering tend to not properly account for the random nature of these algorithms. Many of the novel techniques presented clearly appear promising, but the lack of soundness in their empirical evaluations casts unfortunate doubts on their actual usefulness. In software engineering, although there are guidelines on how to carry out empirical analyses involving human subjects, those guidelines are not directly and fully applicable to randomized algorithms. Furthermore, many of the textbooks on statistical analysis are written from the viewpoints of social and natural sciences, which present different challenges from randomized algorithms. To address the questionable overall quality of the empirical analyses reported in the systematic review, this paper provides guidelines on how to carry out and properly analyse randomized algorithms applied to solve software engineering tasks, with a particular focus on software testing, which is by far the most frequent application area of randomized algorithms within software engineering. Copyright © 2012 John Wiley & Sons, Ltd.

References

[1]
Harman M, McMinn P. A theoretical and empirical study of search based testing: local, global and hybrid search. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 2: pp.226-247.
[2]
Godefroid P, Klarlund N, Sen K. DART: directed automated random testing. In ACM Conference on Programming Language Design and Implementation PLDI, Chicago, Illinois, USA, 2005; pp.213-223.
[3]
Motwani M, Raghavan P. Randomized Algorithms. Cambridge University Press: Cambridge, 1995.
[4]
Arcuri A, Iqbal MZ, Briand L. Random testing: theoretical results and practical implications. IEEE Transactions on Software Engineering TSE 2012; Volume 38 Issue 2: pp.258-277.
[5]
Duran JW, Ntafos SC. An evaluation of random testing. IEEE Transactions on Software Engineering TSE 1984; Volume 10 Issue 4: pp.438-444.
[6]
Arcuri A, Briand L. A practical guide for using statistical tests to assess randomized algorithms in software engineering. In ACM/IEEE International Conference on Software Engineering ICSE, Honolulu, Hawaii, USA, 2011; pp.1-10.
[7]
Harman M, Mansouri SA, Zhang Y. 2009 .Search based software engineering: a comprehensive analysis and review of trends techniques and applications. Technical Report TR-09-03, King's College.
[8]
McMinn P. Search-based software test data generation: a survey. Software Testing, Verification and Reliability 2004; Volume 14 Issue 2: pp.105-156.
[9]
Bagnall AJ, Rayward-Smith VJ, Whittley IM. The next release problem. Information and Software Technology 2001; Volume 43 Issue 14: pp.883-890.
[10]
Aguilar-Ruiz J, Ramos I, Riquelme JC, Toro M. An evolutionary approach to estimating software development projects. Information and Software Technology 2001; Volume 43: pp.875-882.
[11]
Arcuri A, Yao X. A novel co-evolutionary approach to automatic software bug fixing. In IEEE Congress on Evolutionary Computation CEC, Hong Kong, China, 2008; pp.162-168.
[12]
Mitchell BS, Mancoridis S. On the automatic modularization of software systems using the bunch tool. IEEE Transactions on Software Engineering TSE 2006; Volume 32 Issue 3: pp.193-208.
[13]
Canfora G, Penta MD, Esposito R, Villani ML. An approach for QoS-aware service composition based on genetic algorithms. In Genetic and Evolutionary Computation Conference GECCO, Washington, USA, 2005; pp.1069-1075.
[14]
Cooper KD, Schielke PJ, Subramanian D. Optimizing for reduced code space using genetic algorithms. In Proceedings of the ACM SIGPLAN Workshop on Languages, Compilers, and Tools for Embedded Systems, Atlanta, Georgia, USA, 1999; pp.1-9.
[15]
Khoshgoftaar T, Yi L, Seliya N. A multiobjective module-order model for software quality enhancement. IEEE Transactions on Evolutionary Computation TEC 2004; Volume 8 Issue 6: pp.593-608.
[16]
Cohen J. Statistical power analysis for the behavioral sciences, 1988.
[17]
Grissom R, Kim J. Effect Sizes for Research: A Broad Practical Approach. Lawrence Erlbaum: London, 2005.
[18]
Klein J, Moeschberger M. Survival Analysis: Techniques for Censored and Truncated Data. Springer Verlag: Berlin, 2003.
[19]
Rice JA. Mathematical Statistics and Data Analysis, 2nded.Duxbury Press: Forest Lodge Road Pacific Grove, CA, 1994.
[20]
Wilcox R. Fundamentals of Modern Statistical Methods: Substantially Improving Power and Accuracy. Springer Verlag: Berlin, 2001.
[21]
Dybå T, Kampenes V, Sjøberg D. A systematic review of statistical power in software engineering experiments. Information and Software Technology IST 2006; Volume 48 Issue 8: pp.745-755.
[22]
Kampenes V, Dybå T, Hannay J, Sjøberg D. A systematic review of effect size in software engineering experiments. Information and Software Technology IST 2007; Volume 49 Issue 11-12: pp.1073-1086.
[23]
Nakagawa S, Cuthill I. Effect size, confidence interval and statistical significance: a practical guide for biologists. Biological Reviews 2007; Volume 82 Issue 4: pp.591-605.
[24]
Katz M. Multivariable Analysis: A Practical Guide for Clinicians. Cambridge University Press: Cambridge, 2006.
[25]
Wohlin C. Experimentation in Software Engineering: An Introduction. Vol.Volume 6, Springer Netherlands: Berlin, 2000.
[26]
Poulding S, Clark J. Efficient software verification: statistical testing using automated search. IEEE Transactions on Software Engineering TSE Volume 36 Issue 6: pp.763-777.
[27]
Ali S, Briand L, Hemmati H, Panesar-Walawege R. A systematic review of the application and empirical investigation of search-based test-case generation. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 6: pp.742-762.
[28]
Feller W. An Introduction to Probability Theory and Its Applications. 3rded. Vol.Volume 1, Wiley: Hoboken, 1968.
[29]
Khan K, Kunz R, Kleijnen J, Antes G. Systematic Reviews to Support Evidence-Based Medicine: How to Review and Apply Findings of Healthcare Research. RSM Press: London, 2004.
[30]
Kitchenham B, Pearl Brereton O, Budgen D, Turner M, Bailey J, Linkman S. Systematic literature reviews in software engineering-a systematic literature review. Information and Software Technology IST 2009; Volume 51 Issue 1: pp.7-15.
[31]
Hsu H, Orso A. MINTS: a general framework and tool for supporting test-suite minimization. In ACM/IEEE International Conference on Software Engineering ICSE, Vancouver, Canada, 2009; pp.419-429.
[32]
Thum T, Batory D, Kastner C. Reasoning about edits to feature models. In ACM/IEEE International Conference on Software Engineering ICSE, Vancouver, Canada, 2009; pp.254-264.
[33]
Mitchell T. Machine Learning. McGraw Hill: New York City, 1997.
[34]
Ganesh V, Leek T, Rinard M. Taint-based directed whitebox fuzzing. In ACM/IEEE International Conference on Software Engineering ICSE, Vancouver, Canada, 2009; pp.474-484.
[35]
Abraham R, Erwig M. Mutation operators for spreadsheets. IEEE Transactions on Software Engineering TSE 2009; Volume 35 Issue 1: pp.94-108.
[36]
Masood A, Bhatti R, Ghafoor A, Mathur A. Scalable and effective test generation for role-based access control systems. IEEE Transactions on Software Engineering TSE 2009; Volume 35 Issue 5: pp.654-668.
[37]
Ngo-The A, Ruhe G. Optimized resource allocation for software release planning. IEEE Transactions on Software Engineering TSE 2009; Volume 35 Issue 1: pp.109-123.
[38]
Menzies T, Williams S, Boehm B, Hihn J. How to avoid drastic software process change using stochastic stability. In ACM/IEEE International Conference on Software Engineering ICSE, Vancouver, Canada, 2009; pp.540-550.
[39]
Weimer W, Nguyen T, Goues CL, Forrest S. Automatically finding patches using genetic programming. In ACM/IEEE International Conference on Software Engineering ICSE, Vancouver, Canada, 2009; pp.364-374.
[40]
Kieyzun A, Guo P, Jayaraman K, Ernst M. Automatic creation of SQL injection and cross-site scripting attacks. In ACM/IEEE International Conference on Software Engineering ICSE, Vancouver, Canada, 2009; pp.199-209.
[41]
Arcuri A. Full theoretical runtime analysis of alternating variable method on the triangle classification problem. In International Symposium on Search Based Software Engineering SSBSE, Windsor, UK, 2009; pp.113-121.
[42]
Ghani K, Clark J, Heslington Y. Widening the goal posts: program stretching to aid search based software testing. In International Symposium on Search Based Software Engineering SSBSE, Windsor, UK, 2009; pp.122-131.
[43]
Durillo J, Zhang Y, Alba E, Nebro A. A study of the multi-objective next release problem. In International Symposium on Search Based Software Engineering SSBSE, Windsor, UK, 2009; pp.49-58.
[44]
Garvin B, Cohen M, Dwyer M. An improved meta-heuristic search for constrained interaction testing. In International Symposium on Search Based Software Engineering SSBSE, Windsor, UK, 2009; pp.13-22.
[45]
Kpodjedo S, Ricca F, Antoniol G, Galinier P. Evolution and search based metrics to improve defects prediction. In International Symposium on Search Based Software Engineering SSBSE, Windsor, UK, 2009; pp.23-32.
[46]
Khan U, Bate I. WCET analysis of modern processors using multi-criteria optimisation. In International Symposium on Search Based Software Engineering SSBSE, Windsor, UK, 2009; pp.103-112.
[47]
Marchetto A, Tonella P. Search-based testing of Ajax web applications. In International Symposium on Search Based Software Engineering SSBSE, Windsor, UK, 2009; pp.3-12.
[48]
Kim D, Park S. Dynamic architectural selection: a genetic algorithm based approach. In International Symposium on Search Based Software Engineering SSBSE, Windsor, UK, 2009; pp.59-68.
[49]
Shevertalov M, Kothari J, Stehle E, Mancoridis S. On the use of discretized source code metrics for author identification. In International Symposium on Search Based Software Engineering SSBSE, Windsor, UK, 2009; pp.69-78.
[50]
Bryce R, Colbourn C. A density-based greedy algorithm for higher strength covering arrays. Software Testing, Verification and Reliability STVR 2009; Volume 19 Issue 1: pp.37-53.
[51]
Polo M, Piattini M, García-Rodríguez I. Decreasing the cost of mutation testing with second-order mutants. Software Testing, Verification and Reliability STVR 2009; Volume 19 Issue 2: pp.111-131.
[52]
Schneidewind N. Integrating testing with reliability. Software Testing, Verification and Reliability STVR 2009; Volume 19 Issue 3: pp.175-198.
[53]
Huo J, Petrenko A. Transition covering tests for systems with queues. Software Testing, Verification and Reliability STVR 2009; Volume 19 Issue 1: pp.55-83.
[54]
White J, Doughtery B, Schmidt D. ASCENT: an algorithmic technique for designing hardware and software in tandem. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 6.
[55]
Garousi V. A genetic algorithm-based stress test requirements generator tool and its empirical evaluation. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 6: pp.778-797.
[56]
Yuan X, Memon AM. Generating event sequence-based test cases using GUI runtime state feedback. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 1: pp.81-95.
[57]
Do H, Mirarab S, Tahvildari L, Rothermel G. The effects of time constraints on test case prioritization: a series of controlled experiments. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 5: pp.593-617.
[58]
Simons CL, Parmee IC, Gwynllyw R. Interactive, evolutionary search in upstream object-oriented class design. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 6: pp.798-816.
[59]
Bowman M, Briand LC, Labiche Y. Solving the class responsibility assignment problem in object-oriented analysis with multi-objective genetic algorithms. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 6: pp.817-837.
[60]
Emberson P, Bate I. Stressing search with scenarios for flexible solutions to real-time task allocation problems. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 5: pp.704-718.
[61]
Antunes J, Neves N, Correia M, Verissimo P, Neves R. Vulnerability discovery with attack injection. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 3: pp.357-370.
[62]
Artzi S, Kiezun A, Dolby J, Tip F, Dig D, Paradkar A, Ernst MD. Finding bugs in web applications using dynamic test generation and explicit-state model checking. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 4: pp.474-494.
[63]
Beckman NE, Nori AV, Rajamani SK, Simmons RJ, Tetali SD, Thakur AV. Proofs from tests. IEEE Transactions on Software Engineering TSE 2010; Volume 36 Issue 4: pp.495-508.
[64]
Lai Z, Cheung S, Chan W. Detecting atomic-set serializability violations in multithreaded programs through active randomized testing. In ACM/IEEE International Conference on Software Engineering ICSE, Cape Town, South Africa, 2010; pp.235-244.
[65]
Zhang L, Hou S, Hu J, Xie T, Mei H. Is operator-based mutant selection superior to random mutant selection? In ACM/IEEE International Conference on Software Engineering ICSE, Cape Town, South Africa, 2010; pp.435-444.
[66]
Gligoric M, Gvero T, Jagannath V, Khurshid S, Kuncak V, Marinov D. Test generation through programming in UDITA. In ACM/IEEE International Conference on Software Engineering ICSE, Cape Town, South Africa, 2010; pp.225-234.
[67]
Nainar PA, Liblit B. Adaptive bug isolation. In ACM/IEEE International Conference on Software Engineering ICSE, Cape Town, South Africa, 2010; pp.255-264.
[68]
Gabel M, Su Z. Online inference and enforcement of temporal properties. In ACM/IEEE International Conference on Software Engineering ICSE, Cape Town, South Africa, 2010; pp.15-24.
[69]
Gu Z, Barr ET, Hamilton DJ, Su Z. Has the bug really been fixed? In ACM/IEEE International Conference on Software Engineering ICSE, Cape Town, South Africa, 2010; pp.55-64.
[70]
Jha S, Gulwani S, Seshia SA, Tiwari A. Oracle-guided component-based program synthesis. In ACM/IEEE International Conference on Software Engineering ICSE, Cape Town, South Africa, 2010; pp.215-224.
[71]
Yang Q, Li M. A cut-off approach for bounded verification of parameterized systems. In ACM/IEEE International Conference on Software Engineering ICSE, Cape Town, South Africa, 2010; pp.345-354.
[72]
Nori A, Rajamani SK. An empirical study of optimizations in yogi. In ACM/IEEE International Conference on Software Engineering ICSE, Cape Town, South Africa, 2010; pp.355-364.
[73]
Schaefer CA, Pankratius V, Tichy WF. Engineering parallel applications with tunable architectures. In ACM/IEEE International Conference on Software Engineering ICSE, Cape Town, South Africa, 2010; pp.405-414.
[74]
<familyNamePrefix>de</familyNamePrefix>Souza JT, Maia CL, <familyNamePrefix>de</familyNamePrefix>Freitas FG, Coutinho DP. The human competitiveness of search based software engineering. In International Symposium on Search Based Software Engineering SSBSE, Benevento, Italy, 2010; pp.143-152.
[75]
<familyNamePrefix>del</familyNamePrefix>Sagrado J, <familyNamePrefix>del</familyNamePrefix>Aguila IM, Orellana FJ. Ant colony optimization for the next release problem. In International Symposium on Search Based Software Engineering SSBSE, Benevento, Italy, 2010; pp.67-76.
[76]
Lu G, Bahsoon R, Yao X. Applying elementary landscape analysis to search-based software engineering. In International Symposium on Search Based Software Engineering SSBSE, Benevento, Italy, 2010; pp.3-8.
[77]
McMinn P. How does program structure impact the effectiveness of the crossover operator in evolutionary testing? In International Symposium on Search Based Software Engineering SSBSE, Benevento, Italy, 2010; pp.9-18.
[78]
Xiao J, Afzal W. Search-based resource scheduling for bug fixing tasks. In International Symposium on Search Based Software Engineering SSBSE, Benevento, Italy, 2010; pp.133-142.
[79]
Yoo S. A novel mask-coding representation for set cover problems with applications in test suite minimisation. In International Symposium on Search Based Software Engineering SSBSE, Benevento, Italy, 2010; pp.19-28.
[80]
Lakhotia K, Harman M, Gross H. AUSTIN: a tool for search based software testing for the C language and its evaluation on deployed automotive systems. In International Symposium on Search Based Software Engineering SSBSE, Benevento, Italy, 2010; pp.101-110.
[81]
Tonella P, Susi A, Palma F. Using interactive GA for requirements prioritization. In International Symposium on Search Based Software Engineering SSBSE, Benevento, Italy, 2010; pp.57-66.
[82]
Asadi F, Antoniol G, Gueheneuc Y. Concept location with genetic algorithms: a comparison of four distributed architectures. In International Symposium on Search Based Software Engineering SSBSE, Benevento, Italy, 2010; pp.153-162.
[83]
Lindlar F, Windisch A. A search-based approach to functional hardware-in-the-loop testing. In International Symposium on Search Based Software Engineering SSBSE, Benevento, Italy, 2010; pp.111-119.
[84]
Zhang Y, Harman M. Search based optimization of requirements interaction management. In International Symposium on Search Based Software Engineering SSBSE, Benevento, Italy, 2010; pp.47-56.
[85]
Zhao R, Lyu M, Min Y. Automatic string test data generation for detecting domain errors. Software Testing, Verification and Reliability STVR 2010; Volume 20 Issue 3: pp.209-236.
[86]
Griesmayer A, Bloem RP, Byron C. Repair of Boolean programs with an application to C. Computer Aided Verification 2006; pp.358-371.
[87]
Staber S, Jobstmann B, Bloem R. Finding and fixing faults. In Conference on Correct Hardware Design and Verification Methods CHARME, Saarbrücken, Germany, 2005; pp.35-49.
[88]
Stumptner M, Wotawa F. A model based approach to software debugging. In International Workshop on Principles of Diagnosis, Val Morin, Canada, 1996.
[89]
Cowles M, Davis C. On the origins of the .05 level of statistical significance. American Psychologist 1982; Volume 37 Issue 5: pp.553-558.
[90]
Goodman S. P values, hypothesis tests, and likelihood: implications for epidemiology of a neglected historical debate. American Journal of Epidemiology 1993; Volume 137 Issue 5: pp.485-496.
[91]
Goodman S. Toward evidence-based medical statistics. 1: the P value fallacy. Annals of Internal Medicine 1999; Volume 130 Issue 12: pp.995-1004.
[92]
Arcuri A, Briand L. Formal analysis of the probability of interaction fault detection using random testing. IEEE Transactions on Software Engineering TSE 2012; Volume 38 Issue 5: pp.1088-1099.
[93]
Sharma R, Gligoric M, Arcuri A, Fraser G, Marinov D. Testing container classes: random or systematic? In Fundamental Approaches to Software Engineering FASE, Saarbrücken, Germany, 2011; pp.262-277.
[94]
Fraser G, Arcuri A. Evolutionary generation of whole test suites. In International Conference on Quality Software QSIC, Madrid, Spain, 2011; pp.31-40.
[95]
Siegmund D. Sequential Analysis: Tests and Confidence Intervals. Springer: Berlin, 1985.
[96]
Fay M, Proschan M. Wilcoxon-Mann-Whitney or t-test? On assumptions for hypothesis tests and multiple interpretations of decision rules. Statistics Surveys 2010; Volume 4: pp.1-39.
[97]
Sawilowsky S, Blair R. A more realistic look at the robustness and type II error properties of the t test to departures from population normality. Psychological Bulletin 1992; Volume 111 Issue 2: pp.352-360.
[98]
Glass G, Peckham P, Sanders J. Consequences of failure to meet assumptions underlying the fixed effects analyses of variance and covariance. Review of Educational Research 1972; Volume 42 Issue 3: pp.237-288.
[99]
Nijssen S, Back T. An analysis of the behavior of simplified evolutionary algorithms on trap functions. IEEE Transactions on Evolutionary Computation TEC 2003; Volume 7 Issue 1: pp.11-22.
[100]
Rudolph G. Convergence analysis of canonical genetic algorithms. IEEE Transactions on Neural Networks 1994; Volume 5 Issue 1: pp.96-101.
[101]
Tonella P. Evolutionary testing of classes. In ACM International Symposium on Software Testing and Analysis ISSTA, Boston, Massachusetts USA, 2004; pp.119-128.
[102]
Fraser G, Arcuri A. Whole test suite generation. IEEE Transactions on Software Engineering TSE 2012.
[103]
Leech N, Onwuegbuzie A. 2002 .A call for greater use of nonparametric statistics. Technical Report, US Dept. Education.
[104]
Ruxton G. The unequal variance t-test is an underused alternative to student's t-test and the Mann-Whitney U test. Behavioral Ecology 2006; Volume 17 Issue 4: pp.688-690.
[105]
Freitag G, Lange S, Munk A. Non-parametric assessment of non-inferiority with censored data. Statistics in Medicine 2006; Volume 25 Issue 7: pp.1201-1217.
[106]
Arcuri A. Theoretical analysis of local search in software testing. In Symposium on Stochastic Algorithms, Foundations and Applications SAGA, Sapporo, Japan, 2009; pp.156-168.
[107]
Vargha A, Delaney HD. A critique and improvement of the CL common language effect size statistics of McGraw and Wong. Journal of Educational and Behavioral Statistics 2000; Volume 25 Issue 2: pp.101-132.
[108]
Chernick M. Bootstrap Methods: A Practitioner's Guide Wiley Series in Probability and Statistics, 1999.
[109]
Arcuri A, Iqbal MZ, Briand L. Black-box system testing of real-time embedded systems using random and search-based testing. In IFIP International Conference on Testing Software and Systems ICTSS, Natal, Brazil, 2010; pp.95-110.
[110]
Arcuri A, Fraser G. On parameter tuning in search based software engineering. In SSBSE, Szeged, Hungary, 2011; pp.33-47.
[111]
Kruskal W, Wallis W. Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association 1952; Volume 47 Issue 260: pp.583-621.
[112]
Nakagawa S. A farewell to Bonferroni: the problems of low statistical power and publication bias. Behavioral Ecology 2004; Volume 15 Issue 6: pp.1044-1045.
[113]
Perneger T. What's wrong with Bonferroni adjustments. British Medical Journal 1998; Volume 316: pp.1236-1238.
[114]
García L. Escaping the Bonferroni iron claw in ecological studies. Oikos 2004; Volume 105 Issue 3: pp.657-663.
[115]
Carrano EG, Wanner EF, Takahashi RHC. A multicriteria statistical based comparison methodology for evaluating evolutionary algorithms. IEEE Transactions on Evolutionary Computation TEC 2011; Volume 15 Issue 6: pp.848-870.
[116]
Fraser G, Arcuri A. It is not the length that matters, it is how you control it. In IEEE International Conference on Software Testing, Verification and Validation ICST, Berlin, Germany, 2011; pp.150-159.
[117]
Tillmann N, <familyNamePrefix>de</familyNamePrefix>Halleux NJ. Pex-white box test generation for .NET. In International Conference on Tests And Proofs TAP, Prato, Italy, 2008; pp.134-253.
[118]
Pacheco C, Lahiri SK, Ernst MD, Ball T. Feedback-directed random test generation. In ACM/IEEE International Conference on Software Engineering ICSE, Minneapolis, USA, 2007; pp.75-84.
[119]
Andrews JH, Menzies T, Li FC. Genetic algorithms for randomized unit testing. IEEE Transactions on Software Engineering TSE 2011; Volume 37 Issue 1: pp.80-94.
[120]
Arcuri A, Yao X. Search based software testing of object-oriented containers. Information Sciences 2008; Volume 178 Issue 15: pp.3075-3095.
[121]
Ribeiro JCB, Zenha-Rela MA, <familyNamePrefix>de</familyNamePrefix>Vega FF. Test case evaluation and input domain reduction strategies for the evolutionary testing of object-oriented software. Information and Software Technology 2009; Volume 51 Issue 11: pp.1534-1548.
[122]
Shousha M, Briand L, Labiche Y. A UML/MARTE model analysis method for uncovering scenarios leading to starvation and deadlocks in concurrent systems. IEEE Transactions on Software Engineering TSE 2012; Volume 38 Issue 2: pp.354-374.
[123]
Fraser G, Arcuri A. Sound empirical evidence in software testing. In ACM/IEEE International Conference on Software Engineering ICSE, Zurich, Switzerland, 2012; pp.178-188.
[124]
Wolpert DH, Macready WG. No free lunch theorems for optimization. IEEE Transactions on Evolutionary Computation 1997; Volume 1 Issue 1: pp.67-82.
[125]
Alshraideh M, Bottaci L. Search-based software test data generation for string data using program-specific search operators. Software Testing, Verification and Reliability STVR 2006; Volume 16 Issue 3: pp.175-203.
[126]
Hemmati H, Arcuri A, Briand L. Empirical investigation of the effects of test suite properties on similarity-based test case selection. In IEEE International Conference on Software Testing, Verification and Validation ICST, Berlin, Germany, 2011; pp.327-336.
[127]
R Development Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing: Vienna, Austria, 2008. ISBN 3-900051-07-0.
[128]
Gelman A, Carlin J, Stern H, Rubin D. Bayesian Data Analysis. Chapman & Hall/CRC: London, 2003.

Cited By

View all
  • (2024)Enhancing GUI Exploration Coverage of Android Apps with Deep Link-Integrated MonkeyACM Transactions on Software Engineering and Methodology10.1145/366481033:6(1-31)Online publication date: 27-Jun-2024
  • (2024)Focused Test Generation for Autonomous Driving SystemsACM Transactions on Software Engineering and Methodology10.1145/366460533:6(1-32)Online publication date: 27-Jun-2024
  • (2024)Graphuzz: Data-driven Seed Scheduling for Coverage-guided Greybox FuzzingACM Transactions on Software Engineering and Methodology10.1145/366460333:7(1-36)Online publication date: 26-Aug-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Software Testing, Verification & Reliability
Software Testing, Verification & Reliability  Volume 24, Issue 3
May 2014
70 pages

Publisher

John Wiley and Sons Ltd.

United Kingdom

Publication History

Published: 01 May 2014

Author Tags

  1. Bonferroni adjustment
  2. confidence interval
  3. effect size
  4. nonparametric test
  5. parametric test
  6. statistical difference
  7. survey
  8. systematic review

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Enhancing GUI Exploration Coverage of Android Apps with Deep Link-Integrated MonkeyACM Transactions on Software Engineering and Methodology10.1145/366481033:6(1-31)Online publication date: 27-Jun-2024
  • (2024)Focused Test Generation for Autonomous Driving SystemsACM Transactions on Software Engineering and Methodology10.1145/366460533:6(1-32)Online publication date: 27-Jun-2024
  • (2024)Graphuzz: Data-driven Seed Scheduling for Coverage-guided Greybox FuzzingACM Transactions on Software Engineering and Methodology10.1145/366460333:7(1-36)Online publication date: 26-Aug-2024
  • (2024)Predicting Fairness of ML Software ConfigurationsProceedings of the 20th International Conference on Predictive Models and Data Analytics in Software Engineering10.1145/3663533.3664040(56-65)Online publication date: 10-Jul-2024
  • (2024)Mining Action Rules for Defect Reduction PlanningProceedings of the ACM on Software Engineering10.1145/36608091:FSE(2309-2331)Online publication date: 12-Jul-2024
  • (2024)Advanced White-Box Heuristics for Search-Based Fuzzing of REST APIsACM Transactions on Software Engineering and Methodology10.1145/365215733:6(1-36)Online publication date: 27-Jun-2024
  • (2024)Leveraging Phylogenetics in Software Product Families: The Case of Latent Content Generation in Video GamesProceedings of the 28th ACM International Systems and Software Product Line Conference10.1145/3646548.3672596(113-124)Online publication date: 2-Sep-2024
  • (2024)On the Generalizability of Deep Learning-based Code Completion Across Programming Language VersionsProceedings of the 32nd IEEE/ACM International Conference on Program Comprehension10.1145/3643916.3644411(99-111)Online publication date: 15-Apr-2024
  • (2024)Formal Synthesis of Uncertainty Reduction ControllersProceedings of the 19th International Symposium on Software Engineering for Adaptive and Self-Managing Systems10.1145/3643915.3644095(2-13)Online publication date: 15-Apr-2024
  • (2024)Evaluating Directed Fuzzers: Are We Heading in the Right Direction?Proceedings of the ACM on Software Engineering10.1145/36437411:FSE(316-337)Online publication date: 12-Jul-2024
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media