Sample_Report_5_with_Code_Implementation
Sample_Report_5_with_Code_Implementation
Yashvardhan Mahecha
Date: 12-08-2021
Abstract:
In travelling cost of a vehicle is a determinant of the time it takes to reach a certain destination
along with the factor that whether the route is busy or empty which itself is a determinant of the
environmental factors.
Predictive analytics is the use of data, statistical algorithms and machine learning techniques to identify
the likelihood of future outcomes based on historical data. The goal is to go beyond knowing what has
happened to providing a best assessment of what will happen in the future.
Though predictive analytics has been around for decades, it's a technology whose time has come. More
and more organizations are turning to predictive analytics to increase their bottom line and competitive
advantage. Why now?
1. Growing volumes and types of data, and more interest in using data to produce valuable
insights.
2. Faster, cheaper computers.
3. Easier-to-use software.
4. Tougher economic conditions and a need for competitive differentiation.
In this report we have devised an optimal way to predict the price of fare so as to get a glance about the
prices and manage finance accordingly.
1. Problem Statement
Providing a methodological approach to analyze the fare market in depth to get insights
about the determining factors for the increase in fare of a ride along with using the concept of
machine learning to train the model accordingly and predict whenever required.
Getting the insights about price for travelling can help predicting how far an individual can
travel within a certain budget .
7. Applicable Regulations
The patents mentioned above might claim the technology used if the algorithms are not
developed and optimised individually and for our requirements. Using a pre-existing model is off
the table if it incurs a patent claim.
1. Must provide access to the 3rd party websites to audit and monitor the authenticity and
behavior of the service.
2. Enabling open-source, academic and research community to audit the Algorithms and
research on the efficacy of the product.
3. Laws controlling data collection : Some websites might have a policy against collecting
customer data in form of reviews and ratings.
4. Must be responsible with the scraped data : It is quintessential to protect the privacy and
intention with which the data was extracted.
8. Applicable Constraints:
1. The use of cloud platforms to store the data gathered over the net.
2. Using the spark service to clean and transform data .
3. For Evaluation of the model which is done with the help of tableau and PowerBI.
4. For modelling using Timeseries(SARIMAX) and linear regression is applied.
9. Business Opportunity
9.1 Substantial amount of tourism opportunities can be seen if the stats can be used by
local tourism firm and government support (At least in India) presents us with a promising future
in product development and comparison. If Someone doesn’t have a lot of capital to spend on
market research and feedback on prototype products. Our service offers just that. We provide a
one stop price comparison, feedback, and comparative analysis all using Machine learning and
automation. This will complement their market research team gain better insights on product and
pricing design .
9.2 Transportation sector can utilize this segmentation report to charge for a particular
customer service that they desire.
10. Concept Generation
This product requires the tool of machine learning models to be written from scratch in
order to suit our needs. . Tweaking these models for our use is less daunting than coding it up
from scratch. A well trained model can either be repurposed or built. But building a model with
the resources and data we have is dilatory but possible. The customer might want to spend the
least amount of time giving input data. . This accuracy will take a little effort to nail, because it’s
imprudent to rely purely on Classic Machine Learning algorithm .
Front End
1. Different user interface: The user must be given many options to choose form in terms of
parameters. This can only be optimized after a lot of testing and analysis all the edge
cases.
2. Interactive visualization the data extracted from the trained models will return raw and
inscrutable data. This must be present in an aesthetic and an “easy to read” style.
3. Feedback system: A valuable feedback system must be developed to understand the
customer’s needs that have not been met. This will help us train the models constantly.
https://www.kaggle.com/ravi72munde/uber-lyft-cab-prices
https://blog.dataiku.com/predicting-taxi-fares-in-new-york-using-machine-learning-in-real-time
http://ijiird.com/wp-content/uploads/050144.pdf