Java Report 1
Java Report 1
University of Mumbai
Academic Year 2022-2023
Certificate
Date:
Place: Kalyan
Mini Project Approval
This Mini Project entiteled " “Weather forcasting Web”"
by Ajaykumar Prajapati, Mangesh Nemade & Sashikant
Gupta is approved for the degree of Bachelor of
Engineering in Information Technology.
Examiners
1……………………………
(Internal Examiner Name &
Sign)
2……………………………
(External Examiner Name &
Sign)
Date:
Place: Kalyan
ACKNOWLEDGEMENT
I would like to take the opportunity to express my heartfelt gratitude to the
people whose help and co-ordination has made this project a success.I thank
I owe project success to my guide and convey my thanks to her. I would like to
numerous books, magazines made available for handy reference and use of
internet facility.
Lastly, I am also indebted to all those who have indirectly contributed in making
1 Abstract 1
2 Introduction 2
3 Proposed Syestem 3
5 Graphs 5
6 Requirements 6
7 Explanation 7
8 Conclusion 8
9 Reference 9
Abstract
Data Mining
Data mining is looking for hidden, valid, and potentially useful patterns in huge
data sets. Data Mining is all about discovering unsuspected/ previously unknown
relationships amongst the data.It is a multi-disciplinary skill that uses machine
learning, statistics, AI and database technology.
Introduction
Rainfall Prediction is the application of science and technology to predict the
amount of rainfall over a region. It is important to exactly determine the rainfall
for effective use of water resources, crop productivity and pre-planning of water
structures.
There are several reasons why weather forecasts are important. They would
certainly be missed if they were not there.
User will enter current temperature; humidity and wind, System will take this
parameter and will predict weather from previous data in database.
The role of the admin is to add previous weather data in database, so that
system will calculate weather based on these data. Weather forecasting system
takes parameters such as temperature, humidity, and wind and will forecast
weather based on previous record therefore this prediction will prove reliable.
The following is a list of various reasons why weather forecasts are important:
1. Helps people prepare for how to dress (i.e. warm weather, cold weather,
windy weather, rainy weather)
2. Helps businesses and people plan for power production and how much
power to use (i.e. power companies, where to set thermostat) 3. Helps
people prepare if they need to take extra gear to prepare for the weather
(i.e. umbrella, rain coat, sun screen)
4. Helps people plan outdoor activities (i.e. to see if rain/storms/cold
weather will impact outdoor event)
5. Helps curious people to know what sort of weather can be expected
(i.e. a snow on the way, severe storms)
6. Helps businesses plan for transportation hazards that can result
from the weather (i.e. fog, snow, ice, storms, clouds as it relates to driving
and flying for example)
7. Helps people with health related issues to plan the day (i.e.
allergies, asthma, heat stress)
8. Helps businesses and people plan for severe weather and other
weather hazards (lightning, hail, tornadoes, hurricanes, ice storms) 9.
Helps farmers and gardeners plan for crop irrigation and protection
(irrigation scheduling, freeze protection)
Problem And Existing Technology
Algorithm:
Linear regression performs the task to predict a dependent variable value (y)
based on a given independent variable (x). So, this regression technique finds
out a linear relationship between x (input) and y(output). Hence, the name is
Linear Regression.
In the figure above, X (input) is the work experience and Y (output) is the salary
of a person. The regression line is the best fit line for our model.
Where
coefficients.
Data Set
The dataset is a public weather dataset from Austin, Texas available on Kaggle.
austin_weather.csv
Columns:
Date-
The date of the collection (YYYY-MM-DD)
TempHighF-
High temperature, in degrees Fahrenheit
TempAvgF-
Average temperature, in degrees Fahrenheit
TempLowF-
Low temperature, in degrees Fahrenheit
DewPointHighF-
High dew point, in degrees Fahrenheit
DewPointAvgF-
Average dew point, in degrees Fahrenheit
DewPointLowF-
Low dew point, in degrees Fahrenheit
HumidityHighPercent-
High humidity, as a percentage
HumidityAvgPercent-
Average humidity, as a percentage
HumidityLowPercent-
Low humidity, as a percentage
SeaLevelPressureHighInches-
High sea level pressure, in inches of mercury
SeaLevelPressureAvgInches-
Average sea level pressure, in inches of mercury
SeaLevelPressureLowInches-
Low sea level pressure, in inches of mercury
VisibilityHighMiles- High
visibility, in miles
VisibilityAvgMiles-
Average visibility, in miles
VisibilityLowMiles- Low
visibility, in miles
WindHighMPH-
High wind speed, in miles per hour
WindAvgMPH-
Average wind speed, in miles per hour
WindGustMPH-
Highest wind speed gust, in miles per hour
PrecipitationSumInches-
Total precipitation, in inches ('T' if trace)
Events-
Adverse weather events (' ' if None)
1) Histogram for Temp
1) In this module we first gather the data(dataset) for our prediction model.Data
comes in all forms, most of it being very messy and unstructured. They rarely
come ready to use. Datasets, large and small, come with a variety of issues-
invalid fields, missing and additional values, and values that are in forms
different from the one we require. In order to bring it to workable or structured
form, we need to “clean” our data, and make it ready to use. Some common
cleaning includes parsing, converting to one-hot, removing unnecessary data,
etc.
In our case, our data has some days where some factors weren’t recorded. And
the rainfall in cm was marked as T if there was trace precipitation. Our algorithm
requires numbers, so we can’t work with alphabets popping up in our data. so
we need to clean the data before applying it on our model.
2)Once the data is cleaned, In this module that cleaned data can be used as an
input to our Linear regression model. Linear regression is a linear approach to
form a relationship between a dependent variable and many independent
explanatory variables. This is done by plotting a line that fits our scatter plot the
best, ie, with the least errors. This gives value predictions, ie, how much, by
substituting the independent values in the line equation.
We will use Scikit-learn’s linear regression model to train our dataset. Once the
model is trained, we can give our own inputs for the various columns such as
temperature, dew point, pressure, etc. to predict the weather based on these
attributes
Module Outcomes:
1) By the end of the first module the fully cleaned and useful data is available
for the apply the algorithm for the prediction.
1) By the end of the second module the actual prediction will be happen the
outcome is the amount of rainfall in inches based upon the users input.
CONCLUSION
We successfully predicted the rainfall using the linear regression but here this is
not very accurate only some times any way it depends upon the climate changes
to season to season.Here we are taking only summer season weather data set it
only useful to predict rainfall in summer season.
REFERENCES
Textbooks:-