Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Intelligent Data Analysis using Optimized Support Vector Machine Based Data Mining Approach for Tourism Industry

Published: 09 March 2022 Publication History

Abstract

Data analysis involves the deployment of sophisticated approaches from data mining methods, information theory, and artificial intelligence in various fields like tourism, hospitality, and so on for the extraction of knowledge from the gathered and preprocessed data. In tourism, pattern analysis or data analysis using classification is significant for finding the patterns that represent new and potentially useful information or knowledge about the destination and other data. Several data mining techniques are introduced for the classification of data or patterns. However, overfitting, less accuracy, local minima, sensitive to noise are the drawbacks in some existing data mining classification methods. To overcome these challenges, Support vector machine with Red deer optimization (SVM-RDO) based data mining strategy is proposed in this article. Extended Kalman filter (EKF) is utilized in the first phase, i.e., data cleaning to remove the noise and missing values from the input data. Mantaray foraging algorithm (MaFA) is used in the data selection phase, in which the significant data are selected for the further process to reduce the computational complexity. The final phase is the classification, in which SVM-RDO is proposed to access the useful pattern from the selected data. PYTHON is the implementation tool used for the experiment of the proposed model. The experimental analysis is done to show the efficacy of the proposed work. From the experimental results, the proposed SVM-RDO achieved better accuracy, precision, recall, and F1 score than the existing methods for the tourism dataset. Thus, it is showed the effectiveness of the proposed SVM-RDO for pattern analysis.

References

[1]
Ainhoa Serna and Slaven Gasparovic. 2018. Transport analysis approach based on big data and text mining analysis from social media. Transportation Research Procedia 33 (2018), 291–298.
[2]
Adam Weaver. 2021. Tourism, big data, and a crisis of analysis. Annals of Tourism Research 88 (2021), 103158.
[3]
Krittipat Pitchayadejanant and Parinya Nakpathom. 2018. Data mining approach for arranging and clustering the agro-tourism activities in orchard. Kasetsart Journal of Social Sciences 39, 3 (2018), 407–413.
[4]
Hsueh-Yuan Lin and Sheng-Yuan Yang. 2019. A cloud-based energy data mining information agent system based on big data analysis technology. Microelectronics Reliability 97 (2019), 66–78.
[5]
Jingjing Li, Lizhi Xu, Ling Tang, Shouyang Wang, and Ling Li. 2018. Big data in tourism research: A literature review. Tourism Management 68 (2018), 301–323.
[6]
Sunmin Lee, Yunjung Hyun, and Moung-Jin Lee. 2019. Groundwater potential mapping using data mining models of big data analysis in Goyang-si, south korea. Sustainability 11, 6 (2019), 1678.
[7]
Yue Guo, Stuart J. Barnes, and Qiong Jia. 2017. Mining meaning from online ratings and reviews: Tourist satisfaction analysis using latent dirichlet allocation. Tourism Management 59 (2017), 467–483.
[8]
Christopher Reichstein and Ralf-Christian Härting. 2018. Potentials of changing customer needs in a digital world–a conceptual model and recommendations for action in tourism. Procedia Computer Science 126 (2018), 1484–1494.
[9]
Sangwon Park, Yang Xu, Liu Jiang, Zhelin Chen, and Shuyi Huang. 2020. Spatial structures of tourism destinations: A trajectory data mining approach leveraging mobile big data. Annals of Tourism Research 84 (2020), 102973.
[10]
Daming Li, Lianbing Deng, and Zhiming Cai. 2020. Statistical analysis of tourist flow in tourist spots based on big data platform and DA-HKRVM algorithms. Personal and Ubiquitous Computing 24, 1 (2020), 87–101.
[11]
Promila Sharma, Uma Meena, and Girish Kumar Sharma. 2021. Application of data mining algorithms for tourism industry. Intelligent Computing and Applications (2021), 481–495.
[12]
Evripides Christodoulou, Andreas Gregoriades, Maria Pampaka, and Herodotos Herodotou. 2020. Combination of topic modelling and decision tree classification for tourist destination marketing. In Proceedings of the Advanced Information Systems Engineering Workshops 382 (2020), 95.
[13]
Meng Na and Li Yan. 2020. Analysis of rural tourism based on C4. 5. In Proceedings of the International Conference on Application of Intelligent Systems in Multi-Modal Information Analytics. Springer, Cham (2020), 783–787.
[14]
Juanjo Montaño, Jaume Rossello, and Andreu Sanso. 2019. A new method for estimating tourists’ length of stay. Tourism Management 75 (2019), 112–120.
[15]
Fei Hu, Zhenlong Li, Chaowei Yang, and Yongyao Jiang. 2019. A graph-based approach to detecting tourist movement patterns using social media data. Cartography and Geographic Information Science 46, 4 (2019), 368–382.
[16]
Tosporn Arreeras, Mikiharu Arimura, Takumi Asada, and Saharat Arreeras. 2019. Association rule mining tourist-attractive destinations for the sustainable development of a large tourism area in Hokkaido using Wi-Fi tracking data. Sustainability 11, 14 (2019), 3967.
[17]
Susan Sixue Jia. 2020. Motivation and satisfaction of chinese and US tourists in restaurants: A cross-cultural text mining of online reviews. Tourism Management 78 (2020), 104071.
[18]
Sandra Maria Correia Loureiro, Joao Guerreiro, and Faizan Ali. 2020. 20 years of research on virtual reality and augmented reality in tourism context: A text-mining approach. Tourism Management 77 (2020), 104028.
[19]
Xiaofeng Shi. 2020. Tourism culture and demand forecasting based on BP neural network mining algorithms. Personal and Ubiquitous Computing 24, 2 (2020), 299--308.
[20]
Qin Li, Shaobo Li, Sen Zhang, Jie Hu, and Jianjun Hu. 2019. A review of text corpus-based tourism big data mining. Applied Sciences 9, 16 (2019), 3300.
[21]
Babak Sohrabi, Iman Raeesi Vanani, Narges Nasiri and Armin Ghassemi Rudd. 2020. A predictive model of tourist destinations based on tourists' comments and interests using text analytics. Tourism Management Perspectives 35 (2020), 100710.
[22]
Ali Jahani, Hamid Goshtasb, and Maryam Saffariha. 2020. Tourism impact assessment modeling of vegetation density for protected areas using data mining techniques. Land Degradation & Development 31, 12 (2020), 1502–1519.
[23]
Jing Yang, Bing Zheng, and Zhenghua Chen. 2020. Optimization of tourism information analysis system based on big data algorithm. Complexity 2020, (2020).
[24]
Valeria Baranova, Olena Orlenko, Alla Vitiuk, Natalia Yakimenko-Tereschenko, and Vyacheslav Lyashenko. 2020. Information system for decision support in the field of tourism based on the use of spatio-temporal data analysis. International Journal of Advanced Trends in Computer Science and Engineering 9, 4 (2020), 6356--6361.
[25]
Mingchen Feng, Jiangbin Zheng, Jinchang Ren, Amir Hussain, Xiuxiu Li, Yue Xi, and Qiaoyuan Liu. 2019. Big data analytics and mining for effective visualization and trends forecasting of crime data. IEEE Access 7 (2019), 106111–106123.
[26]
Aytuğ Onan, Serdar Korukoğlu, and Hasan Bulut. 2016. A multiobjective weighted voting ensemble classifier based on differential evolution algorithm for text sentiment classification. Expert Systems with Applications 62 (2016), 1–16.
[27]
Aytuğ Onan and Serdar Korukoğlu. 2017. A feature selection model based on genetic rank aggregation for text sentiment classification. Journal of Information Science 43, 1 (2017), 25–38.
[28]
Aytuğ Onan. 2021. Sentiment analysis on product reviews based on weighted word embeddings and deep neural networks. Concurrency and Computation: Practice and Experience 33, 23 (2021), p.e5909.
[29]
Aytug Onan and Mansur Alp Toçoğlu. 2021. A term weighted neural language model and stacked bidirectional LSTM based framework for sarcasm identification. IEEE Access 9 (2021), 7701–7722.
[30]
Aytuğ Onan. 2019. Topic-enriched word embeddings for sarcasm identification. In Proceedings of the Computer Science On-Line Conference. Springer, Cham (2019), 293–304.
[31]
Aytuğ Onan. 2021. Sentiment analysis on massive open online course evaluations: A text mining and deep learning approach. Computer Applications in Engineering Education 29, 3 (2021), 572–589.
[32]
Aytuğ Onan and Serdar Korukoğlu. 2016. Exploring performance of instance selection methods in text sentiment classification. In Proceedings of the Artificial Intelligence Perspectives in Intelligent Systems. Springer, Cham (2016), 167–179.
[33]
Aytuğ Onan, Serdar Korukoğlu, and Hasan Bulut. 2016. Ensemble of keyword extraction methods and classifiers in text classification. Expert Systems with Applications 57 (2016), 232–247.
[34]
Aytuğ Onan. 2018. An ensemble scheme based on language function analysis and feature engineering for text genre classification. Journal of Information Science 44, 1 (2018), 28–47.
[35]
Aytuğ Onan and Mansur Alp Toçoğlu. 2020. Satire identification in turkish news articles based on ensemble of classifiers. Turkish Journal of Electrical Engineering & Computer Sciences 28, 2 (2020), 1086–1106.
[36]
Aytug Onan. 2017. Hybrid supervised clustering based ensemble scheme for text classification. Kybernetes 46, 2 (2017).
[37]
Aytuğ Onan, Serdar Korukoğlu, and Hasan Bulut. 2017. A hybrid ensemble pruning approach based on consensus clustering and multi-objective evolutionary algorithm for sentiment classification. Information Processing & Management 53, 4 (2017), 814–833.
[38]
Mansur Alp Toçoğlu and Aytuğ Onan. 2020. Sentiment analysis on students’ evaluation of higher educational institutions. In Proceedings of the International Conference on Intelligent and Fuzzy Systems. Springer, Cham, (2020), 1693–1700.
[39]
Yu-Dong Zhang, Zhang-Jing Yang, Hui-Min Lu, Xing-Xing Zhou, Preetha Phillips, Qing-Ming Liu, and Shui-Hua Wang. 2016. Facial emotion recognition based on biorthogonal wavelet entropy, fuzzy support vector machine, and stratified cross validation. IEEE Access 4 (2016), 8375–8385.
[40]
Shuihua Wang, Yang Li, Ying Shao, Carlo Cattani, Yudong Zhang, and Sidan Du. 2017. Detection of dendritic spines using wavelet packet entropy and fuzzy support vector machine. CNS & Neurological Disorders-Drug Targets (Formerly Current Drug Targets-CNS & Neurological Disorders) 16, 2 (2017), 116–121.
[41]
He-Qing Mu, Sin-Chi Kuok, and Ka-Veng Yuen. 2017. Stable robust extended kalman filter. Journal of Aerospace Engineering 30, 2 (2017), B4016010.
[42]
Weiguo Zhao, Zhenxing Zhang, and Liying Wang. 2020. Manta ray foraging optimization: An effective bio-inspired optimizer for engineering applications. Engineering Applications of Artificial Intelligence 87 (2020), 103300.
[43]
Amir Mohammad Fathollahi-Fard, Mostafa Hajiaghaei-Keshteli, and Reza Tavakkoli-Moghaddam. 2020. Red deer algorithm (RDA): A new nature-inspired meta-heuristic. Soft Computing 24, 19 (2020), 14637–14665.
[44]
Md Mehedi Hasnat and Samiul Hasan. 2018. Identifying tourists and analyzing spatial patterns of their destinations from location-based social media data. Transportation Research Part C: Emerging Technologies 96 (2018), 38–54.

Cited By

View all
  • (2023)Framework for Building Smart Tourism Big Data Mining Model for Sustainable DevelopmentSustainability10.3390/su1506516215:6(5162)Online publication date: 14-Mar-2023
  • (2023)Construction of an Intelligent Tourism Public Service Platform Featuring Communication Big data and Blockchain2023 International Conference on Integrated Intelligence and Communication Systems (ICIICS)10.1109/ICIICS59993.2023.10420928(1-7)Online publication date: 24-Nov-2023
  • (2023)The Impact of Artificial Intelligence on Tourism Sustainability: A Systematic Mapping Review2023 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE)10.1109/ICCIKE58312.2023.10131818(119-125)Online publication date: 9-Mar-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data
ACM Transactions on Knowledge Discovery from Data  Volume 16, Issue 5
October 2022
532 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3514187
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 March 2022
Accepted: 01 October 2021
Revised: 01 October 2021
Received: 01 July 2021
Published in TKDD Volume 16, Issue 5

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Intelligent data
  2. tourism industry

Qualifiers

  • Research-article
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)223
  • Downloads (Last 6 weeks)11
Reflects downloads up to 01 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Framework for Building Smart Tourism Big Data Mining Model for Sustainable DevelopmentSustainability10.3390/su1506516215:6(5162)Online publication date: 14-Mar-2023
  • (2023)Construction of an Intelligent Tourism Public Service Platform Featuring Communication Big data and Blockchain2023 International Conference on Integrated Intelligence and Communication Systems (ICIICS)10.1109/ICIICS59993.2023.10420928(1-7)Online publication date: 24-Nov-2023
  • (2023)The Impact of Artificial Intelligence on Tourism Sustainability: A Systematic Mapping Review2023 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE)10.1109/ICCIKE58312.2023.10131818(119-125)Online publication date: 9-Mar-2023
  • (2023)Who is a tourist? Classifying international urban tourists using machine learningTourism Management10.1016/j.tourman.2022.10468995(104689)Online publication date: Apr-2023
  • (2023)Early prediction of lithium-ion battery cycle life based on voltage-capacity discharge curvesJournal of Energy Storage10.1016/j.est.2023.10679062(106790)Online publication date: Jun-2023
  • (2022)Intelligent Mining and Visualization of Massive Financial Digital Archives Based on Attribute Grouping2022 International Conference on Knowledge Engineering and Communication Systems (ICKES)10.1109/ICKECS56523.2022.10059688(1-5)Online publication date: 28-Dec-2022

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media