This document contains code to analyze a weather dataset using machine learning techniques. It loads and prepares the data, identifies target and feature variables, performs one-hot encoding on categorical features and imputes missing values. It then standardizes the data and splits it into training and test sets. Two classifiers, SVM and random forest, are trained on the data and evaluated on the test set. Performance scores from each classifier are saved to text files.

#!/usr/bin/env python
# coding: utf-8

# **Run the Cell to import the packages**

# In[1]:

import pandas as pd
import numpy as np

# **Data Loading**
# **Fill in the Command to load your CSV dataset "weather.csv" with pandas**

# In[2]:

weather = pd.read_csv('weather.csv', sep=',')

# **Data Analysis**
#
# - Get the shape of the dataset and print it.
#
# - Get the column names as a list and print it.
#
# - Describe the dataset to understand its basic statistics.
#
# - Print the first three rows of the dataset.

# In[5]:

data_shape = weather.shape

print(data_shape)

weather_col_names = weather.columns.tolist()

print(weather_col_names)

print(weather.describe())

print(weather.iloc[:3])

# **Target Identification**
#
# Execute the cell below to identify the target variable. A value of Yes means
# it will rain tomorrow; otherwise it will not.

# In[6]:

weather_target=weather['RainTomorrow']

print(weather_target)
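
# A quick optional check (not part of the original steps): the target is a
# Yes/No label, so its class balance is worth a look before modelling.
print(weather_target.value_counts())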

# **Feature Identification**
#
# In our case, by analyzing the dataset, we can see that columns such as
# **Date** are likely irrelevant, since they carry no direct predictive signal
# for whether it will rain.
#
# Since **RainTomorrow** is our target variable, we will also remove it from
# the feature set.
#
# - Perform the appropriate operation to drop the columns **Date** and
#   **RainTomorrow**

# In[10]:

cols_to_drop = ['Date','RainTomorrow']

weather_feature = weather.drop(columns=cols_to_drop)

print(weather_feature.head(5))
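
# Optional check: the dropped columns should no longer be present in the
# feature set (this just re-uses cols_to_drop from above).
print([c for c in cols_to_drop if c in weather_feature.columns])  # expect []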

# **Categorical Data**
#
# To identify the categorical variables in the data, use the following command
# in the cell below.

# In[11]:

weather_categorical = weather.select_dtypes(include=[object])

print(weather_categorical.head(15))

# **Convert to boolean**
#
# Assign the column **RainToday** to the variable **yes_no_cols** and run the
# cell below to print the first 5 rows of **weather_feature**.
#

# In[14]:

yes_no_cols = ["RainToday"]

weather_feature[yes_no_cols] = weather_feature[yes_no_cols] == 'Yes'

print(weather_feature.head(5))

# **One Hot Encoding**


#

# Execute the below cells to perform **One Hot Encoding**

# In[15]:

weather_dumm = pd.get_dummies(
    weather_feature,
    columns=["Location", "WindGustDir", "WindDir9am", "WindDir3pm"],
    prefix=["Location", "WindGustDir", "WindDir9am", "WindDir3pm"])

weather_matrix = weather_dumm.values.astype(np.float64)
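
# Optional check: get_dummies should replace each of the four categorical
# columns with one indicator column per category level, so the encoded frame
# is much wider than weather_feature. Printing both shapes makes that visible.
print(weather_feature.shape, weather_dumm.shape)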

# **Imputing Missing Values**
#
# Impute the missing values using the following parameters
#
# - missing_values=np.nan
# - strategy=mean
# - fill_value=None
# - verbose=0
# - copy=True
#

# In[16]:

from sklearn.impute import SimpleImputer

imp = SimpleImputer(missing_values=np.nan, strategy='mean', fill_value=None,
                    verbose=0, copy=True)

weather_matrix=imp.fit_transform(weather_matrix)
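
# Optional check: after mean imputation no NaN values should remain in the
# matrix.
print(np.isnan(weather_matrix).any())  # expect False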

# **Standardization**
#
# Run the below cell to perform standardization

# In[17]:

from sklearn.preprocessing import StandardScaler

# Standardize the data by removing the mean and scaling to unit variance
scaler = StandardScaler()

# Fit to data, then transform it.
weather_matrix = scaler.fit_transform(weather_matrix)
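
# Optional check: after standardization each non-constant column should have
# mean close to 0 and standard deviation close to 1 (up to floating-point
# error).
print(weather_matrix.mean(axis=0).round(3))
print(weather_matrix.std(axis=0).round(3))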

# **Train and Test Data**
#
# Splitting the data for training and testing (90% train, 10% test)
#
# - Perform a train-test split on **weather_matrix** and **weather_target**
#   with 90% as train data and 10% as test data, and set random_state to seed.

# In[20]:

from sklearn.model_selection import train_test_split

seed = 5000
train_data, test_data, train_label, test_label = train_test_split(
    weather_matrix, weather_target, train_size=0.9, test_size=0.1,
    random_state=seed)
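
# Optional check: with a 90/10 split, train_data should hold roughly nine
# times as many rows as test_data.
print(train_data.shape, test_data.shape)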

# **SVM Classification**


#
# - Initialize **SVM** classifier with following parameters
# - kernel = linear
# - C= 0.025
# - random_state=seed
#
# - Train the model with train_data and train_label
#
# - Now predict the output with test_data

#
# - Evaluate the classifier with score from test_data and test_label
#
# - Print the predicted score
#
#

# In[24]:

from sklearn.svm import SVC

classifier = SVC(kernel='linear', C=0.025, random_state=seed)

classifier = classifier.fit(train_data, train_label)

predicted_target = classifier.predict(test_data)

score = classifier.score(test_data, test_label)

print('SVM Classifier : ', score)

with open('output.txt', 'w') as file:
    file.write(str(np.mean(score)))
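
# Optional, beyond the original exercise: accuracy alone can be misleading if
# RainTomorrow is imbalanced, so a classification report on the predictions of
# the fitted SVM gives per-class precision and recall as well.
from sklearn.metrics import classification_report
print(classification_report(test_label, classifier.predict(test_data)))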

# **Random Forest Classifier**
#
# - Fit a **Random Forest** classifier on the dataset using the following
#   parameters:
#   - max_depth=5
#   - n_estimators=10
#   - max_features=10
#   - random_state=seed
#
# - Train the model with train_data and train_label.
#
# - Now predict the output with test_data.
#
# - Evaluate the classifier with score from test_data and test_label.

# In[26]:

from sklearn.ensemble import RandomForestClassifier

classifier = RandomForestClassifier(max_depth=5, n_estimators=10,
                                    max_features=10, random_state=seed)

classifier = classifier.fit(train_data, train_label)

predicted_target = classifier.predict(test_data)

score = classifier.score(test_data, test_label)

print('Random Forest Classifier : ', score)

with open('output1.txt', 'w') as file:
    file.write(str(np.mean(score)))
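
# Optional, beyond the original exercise: a fitted random forest exposes
# feature_importances_; pairing them with the dummy-encoded column names shows
# which inputs the model relies on most. This assumes no column was entirely
# missing (SimpleImputer drops all-NaN columns), so the column order of
# weather_dumm still matches weather_matrix.
importances = pd.Series(classifier.feature_importances_,
                        index=weather_dumm.columns)
print(importances.sort_values(ascending=False).head(10))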

# In[ ]:
