A Nonlinear Regression Application Via Machine Learning Techniques For Geomagnetic Data Reconstruction Processing

The document discusses using machine learning algorithms like support vector regression, random forest regression, gradient boosting regression, and LSTM for geomagnetic data reconstruction to predict missing values. Geomagnetic data is collected from sensors at regular time intervals but sometimes values are missing. The algorithms are trained on existing data and can then predict the target values for missing data points. Based on the RMSE errors, the LSTM algorithm provided the best predictions with the least error compared to the other algorithms.

Uploaded by

Sindhu Pranathi

A Nonlinear Regression Application via Machine Learning Techniques for Geomagnetic Data Reconstruction Processing

Geomagnetic data records the Earth's magnetic field as measured by sensors.
Scientists use this data to monitor the state of the Earth, for example to
anticipate eruptions in volcanic areas. Sensors are configured to read the
magnetic field at fixed time intervals, such as every second, minute or hour.
Sometimes a sensor fails to report a value, and the missing data can cause
serious problems, such as missing information about a volcanic eruption.
Various techniques have been introduced to overcome this, but they require
heavy manpower and are time consuming.

To overcome this problem, the author suggests using machine learning
algorithms to predict the target value for records where it is missing. The
paper evaluates the performance of several machine learning algorithms:
Support Vector Regression, Random Forest Regression, Gradient Boosting
Regression and the deep learning LSTM algorithm. Of these, LSTM gives the
lowest prediction error.

In this project we use a geomagnetic dataset obtained from sensors,
downloaded from the website link below.

https://www.intermagnet.org/data-donnee/download-eng.php#view

Dataset values

DATE, TIME, sensor id, HYBX, HYBY, HYBZ, HYBF

2020-01-14, 00:00:00.000, 014, 46.13, 4.16, 71.96, 43615.62
2020-01-14, 00:01:00.000, 014, 46.25, 4.13, 71.95, 43615.72
2020-01-14, 00:02:00.000, 014, 46.30, 4.07, 71.95, 43615.78
2020-01-14, 00:03:00.000, 014, 46.38, 3.99, 71.89, 43615.81

The dataset above was obtained from the HYDERABAD area sensor, so its column
names start with HYB. The source describes HYBX as the latitude and HYBY as
the longitude; in INTERMAGNET's IAGA-2002 format, however, these names
normally denote the X, Y and Z components of the magnetic field and the total
field F, and the third column (014 here) is the day of the year rather than a
sensor ID. Each record also carries the date and time of the reading. The
sensor is configured to record a value every minute, so a value at a
half-minute mark is missing. For example, in the above dataset:

First record: time = 00:00:00, target value = 43615.62
Second record: time = 00:01:00, target value = 43615.72

The value at 00:00:30, midway between these two records, is missing, and we
can obtain it by applying regression algorithms. A regression algorithm is
trained on past values and can then predict the target value for the missing
data.
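
As a minimal sketch of how such records could be prepared for a regression algorithm, the snippet below parses the four sample rows into input features and a target. It assumes that HYBX, HYBY and HYBZ are the inputs and HYBF is the target, which matches the target values quoted in the example; the function name `load_records` is hypothetical and not taken from the project.

```python
import csv
import io

# The four sample records from the dataset listing, copied verbatim.
SAMPLE = """DATE, TIME, sensor id, HYBX, HYBY, HYBZ, HYBF
2020-01-14, 00:00:00.000, 014, 46.13, 4.16, 71.96, 43615.62
2020-01-14, 00:01:00.000, 014, 46.25, 4.13, 71.95, 43615.72
2020-01-14, 00:02:00.000, 014, 46.30, 4.07, 71.95, 43615.78
2020-01-14, 00:03:00.000, 014, 46.38, 3.99, 71.89, 43615.81
"""

def load_records(text):
    """Return (features, targets); assumes HYBX/HYBY/HYBZ in, HYBF out."""
    reader = csv.reader(io.StringIO(text), skipinitialspace=True)
    header = [name.strip() for name in next(reader)]
    column = {name: i for i, name in enumerate(header)}
    features, targets = [], []
    for row in reader:
        if not row:  # skip blank lines
            continue
        features.append([float(row[column[c]]) for c in ("HYBX", "HYBY", "HYBZ")])
        targets.append(float(row[column["HYBF"]]))
    return features, targets

features, targets = load_records(SAMPLE)
print(features[0], targets[0])
```

Date, time and the third column are dropped here, in line with the test file described later, which contains only the three input values.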

Support Vector Regression Algorithm

Support Vector Regression (SVR) is quite different from other regression
models. It adapts the Support Vector Machine (SVM), originally a
classification algorithm, to predict a continuous variable. While ordinary
linear regression models try to minimize the error between the predicted and
the actual value, SVR tries to fit the best line within a predefined error
threshold. The threshold defines an error boundary, the space between two
parallel lines on either side of the fitted line. A prediction whose
difference from the actual value stays within this boundary is not penalized;
one that exceeds the threshold is. The training points that lie on or outside
the boundary are the support vectors, and they determine the fitted line that
is then used to predict unknown or missing values.
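
The error threshold described above corresponds to what is usually called the epsilon-insensitive loss. The sketch below illustrates only that loss, not a full SVR solver; the predicted values and the threshold `epsilon = 0.05` are made-up numbers for illustration, while the actual values are the HYBF readings from the sample dataset.

```python
def epsilon_insensitive_loss(actual, predicted, epsilon):
    """Zero inside the error boundary, growing linearly outside it."""
    return max(0.0, abs(actual - predicted) - epsilon)

def outside_boundary(actuals, predictions, epsilon):
    """Indices of points whose error exceeds the threshold."""
    return [
        i
        for i, (a, p) in enumerate(zip(actuals, predictions))
        if abs(a - p) > epsilon
    ]

# Actual HYBF values from the sample dataset.
actuals = [43615.62, 43615.72, 43615.78, 43615.81]
# Hypothetical predictions, chosen only to illustrate the loss.
predictions = [43615.60, 43615.70, 43615.90, 43615.80]
epsilon = 0.05  # assumed threshold

losses = [epsilon_insensitive_loss(a, p, epsilon) for a, p in zip(actuals, predictions)]
violations = outside_boundary(actuals, predictions, epsilon)
print(losses, violations)
```

Only the third prediction misses by more than the threshold, so it is the only one that contributes to the loss.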

Random Forest Regression Algorithm

Random forest is a bagging technique, not a boosting technique. The trees in
a random forest are built in parallel, with no interaction between them
during training. The algorithm constructs a multitude of decision trees at
training time and outputs the mode of the classes predicted by the individual
trees (classification) or the mean of their predictions (regression). A
random forest is a meta-estimator (i.e. it combines the results of multiple
predictions) which aggregates many decision trees, with some helpful
modifications:

The number of features that can be split on at each node is limited to some
percentage of the total (this percentage is a hyperparameter). This ensures
that the ensemble model does not rely too heavily on any individual feature,
and makes fair use of all potentially predictive features.

Each tree draws a random sample from the original data set when generating
its splits, adding a further element of randomness that helps prevent
overfitting.
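
The two modifications above can be sketched in a few lines of plain Python. This is a toy illustration, not the project's implementation: each "tree" is a single-split regression stump, so restricting the features per tree is the same as restricting them per node, and the names `fit_forest` and `predict_forest` and the settings `n_trees=25`, `max_features=2` are assumptions. The training rows are the four sample records from the dataset listing.

```python
import random
from statistics import mean

def fit_stump(rows, targets, feature_ids):
    """Fit a one-split regression tree using only the allowed features."""
    best = None
    for f in feature_ids:
        for threshold in sorted({r[f] for r in rows}):
            left = [t for r, t in zip(rows, targets) if r[f] <= threshold]
            right = [t for r, t in zip(rows, targets) if r[f] > threshold]
            if not left or not right:
                continue
            lm, rm = mean(left), mean(right)
            sse = sum((t - lm) ** 2 for t in left) + sum((t - rm) ** 2 for t in right)
            if best is None or sse < best[0]:
                best = (sse, f, threshold, lm, rm)
    if best is None:  # no valid split: predict the sample mean everywhere
        m = mean(targets)
        return (0, float("inf"), m, m)
    return best[1:]

def predict_stump(stump, row):
    f, threshold, left_mean, right_mean = stump
    return left_mean if row[f] <= threshold else right_mean

def fit_forest(rows, targets, n_trees=25, max_features=2, seed=0):
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        # Bagging: draw a bootstrap sample (with replacement) for this tree.
        picks = [rng.randrange(len(rows)) for _ in rows]
        sample_rows = [rows[i] for i in picks]
        sample_targets = [targets[i] for i in picks]
        # Limit the features this tree may split on.
        feature_ids = rng.sample(range(len(rows[0])), max_features)
        forest.append(fit_stump(sample_rows, sample_targets, feature_ids))
    return forest

def predict_forest(forest, row):
    """Regression output: the mean prediction of the individual trees."""
    return mean(predict_stump(s, row) for s in forest)

rows = [[46.13, 4.16, 71.96], [46.25, 4.13, 71.95],
        [46.30, 4.07, 71.95], [46.38, 3.99, 71.89]]
targets = [43615.62, 43615.72, 43615.78, 43615.81]
forest = fit_forest(rows, targets)
prediction = predict_forest(forest, [46.28, 4.10, 71.95])
print(prediction)
```

Because every tree predicts an average of training targets, the forest's output always stays within the range of the targets it was trained on.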

Deep Learning LSTM (Long Short Term Memory)

In this Long Short Term Memory (LSTM) neural network algorithm, we build and
train a model to predict the target geomagnetic value for unseen or missing
records.

Screenshots

To run this project, double click on the ‘run.bat’ file to get the screen below.

In the above screen, click on the ‘Upload Geomagnetic Dataset’ button and
upload the dataset.

In the above screen I am uploading the ‘dataset.txt’ file. After the dataset
is uploaded, we get the screen below.

In the above screen we can see that the dataset is loaded. Now click on ‘Run
Support Vector Regression’ to train the SVR model on the loaded dataset.

In the above screen we can see the SVR RMSE. RMSE (root mean square error)
measures prediction error: the lower it is, the more accurately the algorithm
predicts the target value of a missing record. The screen also shows the
total dataset size and how many records the algorithm used for training and
testing. Each algorithm in this application uses 80% of the dataset records
for training and 20% for testing to compute its RMSE. Now click on the ‘Run
Random Forest Algorithm’ button to get its RMSE.
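
For reference, the 80/20 split and the RMSE calculation can be sketched as follows. Whether the application shuffles records before splitting is not stated, so this sketch assumes a simple split in record order; the function names are hypothetical.

```python
import math

def train_test_split(records, train_fraction=0.8):
    """Split records in order: first 80% for training, the rest for testing."""
    cut = int(len(records) * train_fraction)
    return records[:cut], records[cut:]

def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))

train, test = train_test_split(list(range(10)))
print(len(train), len(test))

# Errors of -1 and +1 give a mean squared error of 1, so the RMSE is 1.
example_error = rmse([2.0, 4.0], [1.0, 5.0])
print(example_error)
```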

In the above screen we can see the Random Forest RMSE. Now click on the ‘Run
Gradient Boosting Regression’ button to get its RMSE.

In the above screen we can see the Gradient Boosting RMSE. Now click on the
‘Run LSTM Deep Learning Algorithm’ button to get the LSTM RMSE.

In the above screen we can see the LSTM RMSE. Now click on the ‘Upload Test
Value & Reconstruct Data For Missing Values’ button and upload the
‘test_dataset’ file. This file contains records whose target value is
missing, and the application predicts the target value for them. See the
records below from the test file:
HYBX, HYBY, HYBZ, HYBF
80.00, 23.74, 70.33
46.58, 3.92, 71.95

In the above test dataset each record has only three values and the fourth,
the target value, is missing. The date, time and sensor ID are not required,
so they are omitted.

In the above screen I am uploading the ‘test_dataset’ file, and below are the
result values.

In the above screen we get the missing fourth value, which we call the
reconstructed or predicted value. Similarly, you can add more interval values
to the test dataset file and get their missing target values. Now click on
the ‘RMSE Comparison Graph’ button to get the graph below.

In the above graph the x-axis represents the algorithm names and the y-axis
represents the RMSE. From the graph we can see that LSTM has the lowest RMSE
and therefore the best prediction rate for missing values.
