Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1868328.1868335acmotherconferencesArticle/Chapter ViewAbstractPublication PagespromiseConference Proceedingsconference-collections
research-article

How effective is Tabu search to configure support vector regression for effort estimation?

Published: 12 September 2010 Publication History
  • Get Citation Alerts
  • Abstract

    Background. Recent studies have shown that Support Vector Regression (SVR) has an interesting potential in the field of effort estimation. However applying SVR requires to carefully set some parameters that heavily affect the prediction accuracy. No general guidelines are available to select these parameters, whose choice also depends on the characteristics of the data set used. This motivates the work described in this paper. Aims. We have investigated the use of an optimization technique in combination with SVR to select a suitable subset of parameters to be used for effort estimation. This technique is named Tabu Search (TS), which is a meta-heuristic approach used to address several optimization problems. Method. We employed SVR with linear and RBF kernels, and used variables' preprocessing strategies (i.e., logarithmic). As for the data set, we employed the Tukutuku cross-company database, which is widely adopted in Web effort estimation studies, and performed a hold-out validation using two different splits of the data set. As benchmark, results are compared to those obtained with Manual StepWise Regression, Case-Based Reasoning, and Bayesian Networks. Results. Our results show that TS provides a good choice of parameters, so that the combination of TS and SVR outperforms any other technique applied on this data set. Conclusions. The use of the meta-heuristic Tabu Search allowed us to obtain (I) an automatic choice of the parameters required to run SVR, and (II) a significant improvement on prediction accuracy for SVR. While we are not guaranteed that this is the global optimum, the results we are presenting are the best performance ever obtained on the problem at the hand, up to now. Of course, the experimental results here presented should be assessed on further data. However, they are surely interesting enough to suggest the use of SVR among the techniques that are suitable for effort estimation, especially when using a cross-company database.

    References

    [1]
    }}J. W. Bailey, V. R. Basili "A meta model for software development resource expenditure", Procs. International Conference on Software Engineering, San Diego, California, USA, 1981, pp. 107--116.
    [2]
    }}P. L. Braga, A. L. I. Oliveira, S. R. L. Meira "Software Effort Estimation using Machine Learning Techniques with Robust Confidence Intervals", HIS 2007: 352--357.
    [3]
    }}L. Briand, T. Langley, I. Wiekzorek, "A Replicated Assessment and Comparison of Common Software Cost Modeling Techniques", Procs. International Conference on Software Engineering, IEEE press, 2000, pp. 377--386.
    [4]
    }}K. Chen, C. Wang, "Support vector regression with genetic algorithms in forecasting tourism demand", Tourism Management 28 (2007), 215--226
    [5]
    }}N.-H. Chiu, S-. Huang, "The adjusted analogy-based software effort estimation based on similarity distances", Journal of Systems and Software 80(4) (2007), pp. 628--640.
    [6]
    }}S. Chulani, B. Boehm, B. Steece "Bayesian Analysis of Empirical Software Engineering Cost Models", IEEE TSE 25 (1999) 573--583.
    [7]
    }}S. D. Conte, H. E. Dunsmore, V. Y. Shen, "Software Engineering Metrics and Models", Benjamin-Cummins, 1986.
    [8]
    }}A. Corazza, S. Di Martino, F. Ferrucci, C. Gravino, E. Mendes, "Applying support vector regression for web effort estimation using a cross-company data set", Procs. Empirical Software Engineering and Measurement, IEEE press, 2009, pp: 191--202
    [9]
    }}A. Corazza, S. Di Martino, F. Ferrucci, C. Gravino, E. Mendes, "Investigating the use of Support Vector Regression for Web Effort Estimation", accepted for publication in Empirical Software Engineering Journal.
    [10]
    }}C. Cortes and V. Vapnik, "Support-Vector Networks", Machine Learning, 20, 1995
    [11]
    }}G. Costagliola, S. Di Martino, F. Ferrucci, C. Gravino, G. Tortora, G. Vitiello, "Effort estimation modeling techniques: a case study for web applications", Procs. Intl. Conference on Web Engineering (ICWE'06), 2006, 9--16.
    [12]
    }}J. M. Desharnais, Analyse statistique de la productivitie des projets in 834 formatique a partie de la technique des point des fonction, Ph.D. thesis, 835 Unpublished Masters Thesis, University of Montreal (1989).
    [13]
    }}S. Di Martino, F. Ferrucci, C. Gravino, E. Mendes "Comparing Size Measures for Predicting Web Application Development Effort: A Case Study", Procs. Empirical Software Engineering and Measurement, IEEE press, 2007, pp. 324--333.
    [14]
    }}F. Ferrucci, C. Gravino, R. Oliveto, F. Sarro, "Using Tabu Search to Estimate Software Development Effort". Procs. International Conferences on Software Process and Product Measurement. LNCS 5891. Springer-Verlag, 2009, pp. 307--320.
    [15]
    }}F. Glover, M. Laguna, Tabu Search, Kluwer Academic Publishers, Boston, 1997.
    [16]
    }}T. Hofmann, B. Scholkopf, A. Smola, "Kernel methods in machine learning" Annals of Statistics, 36(3), 2008, 1171--1220.
    [17]
    }}S. Keerthi, "Efficient tuning of SVM hyper-parameters using radius/margin bound and iterative algorithms". IEEE Transaction on Neural Networks, 13(5), (2002), 1225--1229.
    [18]
    }}B. Kitchenham, L. M. Pickard, S. G. MacDonell, M. J. Shepperd "What accuracy statistics really measure", IEE Proceedings Software 148 (3) (2001) 81--85.
    [19]
    }}B. A. Kitchenham, E. Mendes, "A Comparison of Cross-company and Single-company Effort Estimation Models for Web Applications", Procs. EASE 2004, 2004, pp. 47--55.
    [20]
    }}E. Mendes "The Use of Bayesian Networks for Web Effort Estimation: Further Investigation", Procs. International Conference on Web Engineering (2008).
    [21]
    }}E. Mendes, S. Counsell, "Web Development Effort Estimation using Analogy", Procs. Australian Software Engineering Conference, pp. 203--212, 2000.
    [22]
    }}E. Mendes, N. Mosley, S. Counsell, "Investigating Web Size Metrics for Early Web Cost Estimation", Journal of Systems and Software, 77 (2), 157--172, 2005.
    [23]
    }}E. Mendes, N. Mosley "Bayesian Network Models for Web Effort Prediction: A Comparative Study", IEEE TSE 34 (6) (2008) 723--737.
    [24]
    }}E. Mendes, N. Mosley, S. Counsell, "Early Web Size Measures and Effort Prediction for Web Costimation", Procs. IEEE Metrics Symposium, pp. 18--29, 2003.
    [25]
    }}E. Mendes, N. Mosley, S. Counsell, "Comparison of Length, complexity and functionality as size measures for predicting Web design and authoring effort", IEE Procs. Software 149 (3) 86--92, 2002.
    [26]
    }}E. Mendes, S. D. Martino, F. Ferrucci, C. Gravino "Cross-company vs. single-company web effort models using the Tukutuku database: An extended study", Journal of System & Software 81 (5) (2008) 673--690.
    [27]
    }}A. L. I. Oliveira, "Estimation of software project effort with support vector regression", Neurocomputing, 69(13--15):1749--1753, 2006.
    [28]
    }}B. Scholkopf, A. Smola, "Learning with Kernels". 2002, MIT Press
    [29]
    }}K. K. Shukla, "Neuro-genetic prediction of software development effort", Information and Software Technology 42 (10) (2000), pp. 701--713.
    [30]
    }}A. J. Smola, B. Schölkopf, "A tutorial on support vector regression", Statistics and Computing, 14 (3) 2004, 199--222.
    [31]
    }}V. Vapnik, A. Lerner, "Pattern recognition using generalized portrait method", Automation and Remote Control 24, 1963, 774--780.
    [32]
    }}V. Vapnik, A. Chervonenkis, "A note on one class of perceptrons", Automatics and Remote Control 1964, 25.
    [33]
    }}V. Vapnik," The nature of Statistical Learning Theory", Springer-Verlag, 1995

    Cited By

    View all
    • (2024)IBA-VNS: A Logic-Based Machine Learning Algorithm and Its Application in SurgeryMathematics10.3390/math1207095012:7(950)Online publication date: 23-Mar-2024
    • (2023)An Optimization Framework for Crash Count Data Models2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC)10.1109/ITSC57777.2023.10422299(2529-2534)Online publication date: 24-Sep-2023
    • (2022)Gradient boosting and bayesian network machine learning models predict aflatoxin and fumonisin contamination of maize in Illinois – First USA case studyFrontiers in Microbiology10.3389/fmicb.2022.103994713Online publication date: 10-Nov-2022
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    PROMISE '10: Proceedings of the 6th International Conference on Predictive Models in Software Engineering
    September 2010
    195 pages
    ISBN:9781450304047
    DOI:10.1145/1868328
    • General Chair:
    • Tim Menzies,
    • Program Chair:
    • Gunes Koru
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 September 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Tabu search
    2. development effort estimation
    3. empirical studies
    4. support vector machines
    5. support vector regression

    Qualifiers

    • Research-article

    Conference

    Promise '10

    Acceptance Rates

    PROMISE '10 Paper Acceptance Rate 19 of 53 submissions, 36%;
    Overall Acceptance Rate 98 of 213 submissions, 46%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 10 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)IBA-VNS: A Logic-Based Machine Learning Algorithm and Its Application in SurgeryMathematics10.3390/math1207095012:7(950)Online publication date: 23-Mar-2024
    • (2023)An Optimization Framework for Crash Count Data Models2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC)10.1109/ITSC57777.2023.10422299(2529-2534)Online publication date: 24-Sep-2023
    • (2022)Gradient boosting and bayesian network machine learning models predict aflatoxin and fumonisin contamination of maize in Illinois – First USA case studyFrontiers in Microbiology10.3389/fmicb.2022.103994713Online publication date: 10-Nov-2022
    • (2022)Multi-Objective Software Effort Estimation: A Replication StudyIEEE Transactions on Software Engineering10.1109/TSE.2021.308336048:8(3185-3205)Online publication date: 1-Aug-2022
    • (2022)Learning From Mistakes: Machine Learning Enhanced Human Expert Effort EstimatesIEEE Transactions on Software Engineering10.1109/TSE.2020.304079348:6(1868-1882)Online publication date: 1-Jun-2022
    • (2022)The Impact of Parameters Optimization in Software Prediction Models2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)10.1109/SEAA56994.2022.00041(217-224)Online publication date: Aug-2022
    • (2022)Holistic Parameter Optimization for Software Defect PredictionIEEE Access10.1109/ACCESS.2022.321197810(106781-106797)Online publication date: 2022
    • (2022)Comparative Analysis of Ensemble Models for Software Effort EstimationAdvances in Data and Information Sciences10.1007/978-981-19-5292-0_5(47-57)Online publication date: 25-Nov-2022
    • (2021)The nature of build changesEmpirical Software Engineering10.1007/s10664-020-09926-426:3Online publication date: 1-May-2021
    • (2021)Hyper-Parameter Tuning of Classification and Regression Trees for Software Effort EstimationTrends and Applications in Information Systems and Technologies10.1007/978-3-030-72660-7_56(589-598)Online publication date: 29-Mar-2021
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media