Restaurants Rating Prediction Using Machine Learning Algorithms
Restaurants Rating Prediction Using Machine Learning Algorithms
----------------------------------------------------------------------------------------------------------------------------- --------------------------------
Abstract: Restaurant Rating has become the most commonly used parameter for judging a restaurant for any individual. A lot of research has
been done on different restaurants and the quality of food it serves. Rating of a restaurant depends on factors like reviews, area situated, average
cost for two people, votes, cuisines and the type of restaurant.
The main goal of this is to get insights on restaurants which people like visit and to identify the rating of the restaurant. With this article we
study different predictive models like Support Vector Machine (SVM),Random forest and Linear Regression, XGBoost, Decision Tree and have
achieved a score of 83% with ADA Boost.
Key Words: Pre-processing, EDA, SVM Regressor, Linear Regression, XGBoost Regressor, Boosting.
-------------------------------------------------------------------------------------------------------------------------------------------------------------
1. INTRODUCTION
[4] Rrubaa Panchendrarajan, Nazick Ahamed, Prakhash Sivakumar,
Zomato is the most reputed company in the field of food reviews.
Brunthavan Murugaiah, Surangika Ranathunga and Akila Pemasiri
Founded in 2008, this company started in India and now is in 24
wrote a paper on ‘Eatery, a multi-aspect restaurant rating system’
different countries. Its is so big that the people now use it as a verb.
that identifies rating values for different aspects of a restaurant by
“Did you know about this restaurant? Zomato it”. The rating is the
means of aspect-level sentiment analysis. This research introduced a
most important feature of any restaurant as it is the first parameter
new taxonomy to the restaurant domain that captures the hierarchical
that people look into while searching for a place to eat. It portrays the
relationships among entities and aspects.
quality, hygiene and the environment of the place. Higher ratings
lead to higher profit margins. Notations of the ratings usually are
[5] Neha Joshi wrote a paper in 2012 on A Study on Customer
stars or numbers scaling between 1 and 5.
Preference and Satisfaction towards Restaurant in Dehradun City
Zomato has changed the way people browse through restaurants. It which aims to contribute to the limited research in this area and
has helped customers find good places with respect to their dining provide
budget. insight into the consumer decision making process specifically for
the India foodservice industry. She did hypothesis testing using chi-
Different machine learning algorithms like SVM, Linear regression, square test.
Decision Tree, Random Forest can be used to predict the ratings of
the restaurants. [6] Bidisha Das Baksi, Harrsha P, Medha, Mohinishree Asthana, Dr.
Anitha C wrote a paper that studies various attributes of existing
2. RELATED WORK restaurants and analyses them to predict an appropriate location for
Various researches and students have published related work in higher success rate of the new restaurant. The study of existing
national and international research papers, thesis to understand the restaurants in a particular location and the growth rate of that
objective, types of algorithm they have used and various techniques location is important prior to selection of the optimal location. The
for pre-processing and feature selection. aim is to the create a web application that determines the location
suitable to establish a new restaurant unit, using machine learning
[1] Shina, Sharma S. and Singha A. have used Random forest and and data mining techniques.
decision tree to classifying restaurants into several classes based on
their service parameters. Their results say that the Decision Tree
Classifier is more effective with 63.5% of accuracy than Random
Forest whose accuracy is merely 56%.
3. DATA SET DESCRIPTION
This is a kaggle dataset.
(https://www.kaggle.com/himanshupoddar/zomato-bangalore-
[2] Chirath Kumarasiri’s and Cassim Faroo’s focuses on a Part-of- restaurants).
Speech (POS) Tagger based NLP technique for aspect identification
from reviews. Then a Naïve Bayes (NB) Classifier is used to classify It Represents information of Restaurants in the City of Bangalore.
identified aspects into meaningful categories.
It contains 17Columns and 51,000 Rows
[3] I. K. C. U. Perera and H.A. Caldera have used data mining
techniques like Opinion mining and Sentiment analysis to automate
the analysis and extraction of opinions in restaurant reviews.
375
International Journal of Computer Applications Technology and Research
Volume 8–Issue 09, 375-378, 2019, ISSN:-2319–8656
Listed_in(city) This box plot helps us look into the outliers. We can also see that
online ordering service also affects the rating. Restaurants with
online ordering service have a rating from 3.5 to 4.
376
International Journal of Computer Applications Technology and Research
Volume 8–Issue 09, 375-378, 2019, ISSN:-2319–8656
377
International Journal of Computer Applications Technology and Research
Volume 8–Issue 09, 375-378, 2019, ISSN:-2319–8656
5. RESULTS
[5] Neha Joshi. A Study on Customer Preference and Satisfaction
towards Restaurant in Dehradun City.
Algorithms Accuracy Global Journal of Management and Business Research(2012)
Link:
Linear Regression 30% https://pdfs.semanticscholar.org/fef5/88622c39ef76dd773fcad8bb5d
233420a270.pdf
KNN 44%
[6] Bidisha Das Baksi, Harrsha P, Medha, Mohinishree Asthana, Dr.
Support Vector Machine 43% Anitha C.(2018) Restaurant Market Analysis.
International Research Journal of Engineering and Technology
Decision Tree 69% (IRJET)
Link: https://www.irjet.net/archives/V5/i5/IRJET-V5I5489.pdf
Random Forest 81%
XGBoost 72.26%
6. CONCLUSIONS
This paper studies a number of features about existing restaurants of
different areas in a city and analyses them to predict rating of the
restaurant. This makes it an important aspect to be considered, before
making a dining decision. Such analysis is essential part of planning
before establishing a venture like that of a restaurant.
Lot of researches have been made on factors which affect sales and
market in restaurant industry. Various dine-scape factors have been
analysed to improve customer satisfaction levels.
If the data for other citirs is also collected, such predictions could be
made for accurate.
7. REFERENCES
[1] Chirath Kumarasiri, Cassim Faroo,”User Centric Mobile Based
Decision-Making System Using Natural Language Processing (NLP)
and Aspect Based Opinion Mining (ABOM) Techniques for
Restaurant Selection”. Springer 2018. DOI: 10.1007/978-3-030-
01174-1_4
[2] Shina, Sharma, S. & Singha ,A. (2018). A study of tree based
machine learning Machine Learning Techniques for Restaurant
review. 2018 4th International Conference on Computing
Communication and Automation (ICCCA)
DOI:/10.1109/CCAA.2018.8777649