Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3320326.3320371acmotherconferencesArticle/Chapter ViewAbstractPublication PagesnissConference Proceedingsconference-collections
research-article

Prediction Demand for Classified Ads Using Machine Learning: an Experiment Study

Published: 27 March 2019 Publication History

Abstract

Classified ads prediction is a very interesting activity for organizations in order to increase the purchase quantity of a product and thereafter the possibility of sale. Used goods predicting can be done by calculating the probability of sale for each selected product. In this paper, we conduct an empirical analysis on classified ads prediction of Avito dataset in order to develop prediction models using three individual machine-learning techniques and five ensemble learners. We compare and evaluate the performance of the proposed models using Root Mean Square Error (RMSE) measure. The stacked generalization method was also used to combine the best-performed models to select the best one. The results show that the Extreme Gradient Boosting Machine algorithm (XGBoost) is the most accurate model with an RMSE value of 0.2253.

References

[1]
Meffert, J. Morawiak, D. Schumacher, T. 2015. Online classified ads: Digital, dynamic, and still evolving, Telecommunications, Media, and Technology.
[2]
Image.{Online}.Available:<https://storage.googleapis.com/ka ggle-media/c>ompetitions/Avito/product_description_qualities_2.png> (accessed April 2018).
[3]
Gauba, H. Kumar, P. Roy, P.P. Singh, P. Dogra, D.P. Raman, B., 2017, Prediction of advertisement preference by fusing eeg response and sentiment analysis, J. Neural Networks, 92, (August 2017), 77--88.
[4]
Islek, I. Oguducu, S.G. 2017, A Decision Support System for Demand Forecasting based on Classifier Ensemble, Federated Conference on Computer Science and Information Systems.
[5]
Boselli, R. Cesarini, M. Mercorio, F. Mezzanzanica, M. 2018. Classifying online job advertisements through machine learning. Future Generation Computer Systems, 86, 319--328.
[6]
Tugay, R. Oguducu, S.G. 2018. Demand Prediction Using Machine Learning Methods and Stacked Generalization, International Conference on Data Science, Technology and Applications.
[7]
Guidolin, M. Pedio, M. 1999. Essentials of Time Series for Financial Applications, Elsevier Inc.
[8]
Haykins, S. A., 1999, Comprehensive Foundation on Neural Networks, Prentice Hall.
[9]
Breiman, L. Friedman, J. Olshen, R. and Stone, R. 1984. Classification and Regression Trees. Chapman and Hall, Wadsworth, New York.
[10]
Breiman, L. 2001. Random Forests. Machine learning, 45(1), 5--32.
[11]
Ho, T.K. 1995. Random Decision Forests, International Conference on Document Analysis and Recognition. pp. 278--282.
[12]
Witten, I.H. Frank, E. Hall, M.A. 2011. Data Mining: Practical Machine Learning Tools and Techniques, ed. Burlington, MA: Morgan Kaufmann.
[13]
Freund, Y. Schapire, R.E. Abe, N. 1999. A short introduction to boosting, Journal of Japanese Society for Artificial Intelligence, 14(5) 771--780.
[14]
Geurts, P., Ernst, D., Wehenkel, L., 2006. Extremely randomized trees, Machine Learning, 63(1), 3--42.
[15]
Chen T., Guestrin C. 2016., XGBoost: a scalable tree boosting system. ACM SIGKDD International Conference On Knowledge Discovery And Data Mining. 785--94.
[16]
Fan, J. Wang, X. Wu, L. Zhou, H. Zhang, F. Yu, X. Xiang, Y. 2018. Comparison of Support Vector Machine and Extreme Gradient Boosting for predicting daily global solar radiation using temperature and precipitation in humid subtropical climates: A case study in China. Energy Conversion and Management, 164, 102--111.
[17]
Ma, X. Sha, J. Wang, D. Yu, Y. Yang, Q. Niu, X. 2018. Study on a prediction of P2P network loan default based on the machine learning lightGBM and XGboost algorithms according to different high dimensional data cleaning. Electron Commer Res Appl.
[18]
Breiman, L., 1996. Bagging predictors, Machine Learning, 24(2), 123--140.
[19]
Mishra, A., and Vishwakarma, S., 2015. Analysis of TF-IDF Model and its Variant for Document Retrieval. International Conference on Computational Intelligence and Communication Networks (CICN).
[20]
Pedregosa et al., 2011, Scikit-learn: Machine Learning in Python, Journal of Machine Learning Research (JMLR), Vol.12, pp. 2825--2830.
[21]
Tarwani, S. Chug, A. 2016. Predicting Maintainability of Open Source Software using Gene Expression Programming and Bad Smells, In: International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions).
[22]
{Online}. Available: <http://karlrosaen.com/ml/learning-log/2016-06-20/k-fold-diagram.png> (accessed July 2018).
[23]
Wolpert, D. H. 1992. Stacked Generalization, Neural Networks, 5(2), 241--259.
[24]
Barnston, A. 1992. Correspondence among the Correlation RMSE and Heidke Verification Measures; Refinement of the Heidke Score. Weather Forecast. 7:699--709.
[25]
Veerasamy, R. Rajak, H. Jain, A. Sivadasan, S. Varghese, C.O. Agrawal, R.K. 2011, Validation of QSAR Models - Strategies and Importance, International Journal of Drug Design and Discovery 2(3), 511--5.

Cited By

View all
  • (2024)Real-Time Torque-Distribution for Dual-Motor Off-Road Vehicle Using Machine Learning ApproachIEEE Transactions on Vehicular Technology10.1109/TVT.2024.335518673:4(4567-4577)Online publication date: Apr-2024
  • (2023)Efficient Management of Energy Consumption of Electric Vehicles Using Machine Learning—A Systematic and Comprehensive SurveyEnergies10.3390/en1613489716:13(4897)Online publication date: 23-Jun-2023
  • (2023)Torque Distribution Prediction for Dual-Motor Electric Vehicle Using Ensemble Learning Algorithms2023 IEEE Vehicle Power and Propulsion Conference (VPPC)10.1109/VPPC60535.2023.10403179(1-6)Online publication date: 24-Oct-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
NISS '19: Proceedings of the 2nd International Conference on Networking, Information Systems & Security
March 2019
512 pages
ISBN:9781450366458
DOI:10.1145/3320326
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 March 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Classified ads
  2. Ensemble techniques
  3. Machine Learning
  4. Prediction
  5. Stacked generalization

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

NISS19

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Real-Time Torque-Distribution for Dual-Motor Off-Road Vehicle Using Machine Learning ApproachIEEE Transactions on Vehicular Technology10.1109/TVT.2024.335518673:4(4567-4577)Online publication date: Apr-2024
  • (2023)Efficient Management of Energy Consumption of Electric Vehicles Using Machine Learning—A Systematic and Comprehensive SurveyEnergies10.3390/en1613489716:13(4897)Online publication date: 23-Jun-2023
  • (2023)Torque Distribution Prediction for Dual-Motor Electric Vehicle Using Ensemble Learning Algorithms2023 IEEE Vehicle Power and Propulsion Conference (VPPC)10.1109/VPPC60535.2023.10403179(1-6)Online publication date: 24-Oct-2023
  • (2022)Detecting fake reviews through topic modellingJournal of Business Research10.1016/j.jbusres.2022.05.081149(884-900)Online publication date: Oct-2022
  • (2022)The derived demand for advertising expenses and implications on sustainability: a comparative study using deep learning and traditional machine learning methodsAnnals of Operations Research10.1007/s10479-021-04429-x339:1-2(131-161)Online publication date: 7-Jan-2022
  • (2021)An effective fake news detection method using WOA-xgbTree algorithm and content-based featuresApplied Soft Computing10.1016/j.asoc.2021.107559109:COnline publication date: 1-Sep-2021

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media