Prediction of Performance Efficiency For Wastewater Treatment Plant's Effluent Biochemical Oxygen Demand Using Artificial Neural Network
Prediction of Performance Efficiency For Wastewater Treatment Plant's Effluent Biochemical Oxygen Demand Using Artificial Neural Network
Prediction of Performance Efficiency For Wastewater Treatment Plant's Effluent Biochemical Oxygen Demand Using Artificial Neural Network
ISSN No:-2456-2165
Abstract:- This study investigated the application of an Wastewater treatment plants (WWTPs) are built to clean
artificial neural network (ANN) to predict the wastewater and convert it into eco-friendlier water which is
performance efficiency of the Abuja-based Wupa WWTP, released into the environment (Varkeshi et al., 2019).
Nigeria using effluent 5-day biochemical oxygen demand However, due to the wide fluctuation in the quality and
(BOD5) as a performance indicator. Daily data of influent quantity of untreated wastewater transported to the treatment
BOD5, pH, total dissolved solids, total suspended solids, plant, the operation of WWTPs can be difficult and
chemical oxygen demand, total coliform, Escherichia challenging (Szeląg et al. 2017).
coliform, and fecal coliform; and effluent BOD5 over a
period of five years (2013 to 2017) for the Wupa WWTP Moreover, many treatment plants are constructed
was utilized for the plant’s performance efficiency. The following the conventional activated sludge system which is
four most reliable multilayer perceptron ANN (MLP- allegedly riddled with inefficiencies associated with pollutant
ANN) algorithms namely, Levenberg-Marquardt (LM) removal (Ogwueleka and Samson, 2020). In addressing the
backpropagation resilient backpropagation, Quasi- challenges of the conventional treatment systems, several
Newton backpropagation, and Fletcher-Reeves conjugate alternative methods have been proposed, notable amongst
gradient backpropagation were adopted; and the most which are the advanced oxidation processes (AOPs) (Deng
appropriate model was selected following training, and Zhao, 2015); nanomaterials (Adeleye et al., 2016);
validation and testing by altering the number of neurons microalgae-activated sludge (MAAS) (Ogwueleka and
and activation functions in both the hidden and output Samson, 2020); microbial electrochemical system (Li et al.,
layers. The model efficiency was determined using mean 2021). However, many of these emerging methods are still
square error (MSE) and correlation coefficient (R2). The limited to pilot or laboratory scales and are yet to gain
ML algorithm with Logsig-Tansig activation pairing and widespread practical applications due to a number of reasons
architecture [8-1270-1] performed the best in terms of such as the initial cost of installation, uncertainties with
convergence time and prediction error, with MSE and R 2 operations, adaptation and installation of new technologies,
values of 1.522 and 0.922, respectively. Also, it revealed etc.; thus, there is still need to seek for means of attaining
that the selected ANN model predicted the effluent BOD 5 efficiency, even in the pre-existing installed treatment
with an overall correlation coefficient of 0.962; thus, systems; which can be achieved by attaining and maintaining
demonstrating the efficacy of ANN models for accurate optimal conditions in WWTPs.
prediction of the Wupa WWTP performance. The novelty
of this research is in evaluating the efficiency of the plant Attaining optimal operational conditions in WWTPs
over the periods and determining the most precise ANN even with conventional systems is possible and can be
model for Wupa WWTP, Abuja, Nigerians a study which achieved with the use of models to predict the WWTP
has never been carried out before now. performance based on previous measurements of major plant
parameters (Jami et al., 2012).
Keywords:- Artificial Neural Network (ANN); Wastewater
Treatment Plant (WWTP); 5-Day Biochemical Oxygen An important parameter commonly utilized to examine
Demand (BOD5); Wupa WWTP; Multilayer Perceptron the performance of WWTPs is the 5-day biochemical oxygen
(MLP) demand (BOD5) (Dogan et al. 2008; Araromi et al., 2018).
BOD5 is an approximation of the quantity of biochemically
I. INTRODUCTION degradable organic matter contained in a water sample,
defined as the amount of oxygen necessary for the aerobic
The treatment and management of wastewater in our bacteria present in a sample to oxidize the organic matter to a
environment have increasingly gained attractive attention in stable organic form (Dogan et al., 2008). It is, however,
the last decade, particularly in the face of the incessantly difficult to measure, and requires five days for its
increasing volume of wastewater owing to population growth; determination (Dogan et al., 2008; Alsulaili and Refaie, 2021).
rapid urbanization; increased agricultural; and industrial Therefore, the determination of the output BOD5 of a WWTP
activities (Abba and Elkiran 2017; Arismendy et al., 2020; as a performance index using predictive tools could achieve
Alsulaili and Refaie, 2021).
III. PREPARE YOUR PAPER BEFORE STYLING Dogan (2008) established an ANN model to predict BOD
in the inlet of WWTPs, and the results demonstrated that the
However, there are basically three phases of developing ANN may be used to accurately estimate daily BOD at the
an ANN model: training, validation, and prediction. During input of wastewater treatment facilities. Hamed et al. (2004)
this process, the data set is divided into three groups, usually developed two ANN models to forecast the effluent
for the purpose of this study and other development studies concentrations of BOD and suspended solids for a major
reported. the data set is divided into a ratio of 70:15:15 WWTP in Cairo; using a 10 months dataset. The study
presents.70% of the data set is usually assigned to the training reported that the prediction error fluctuated minimally and
phase to train the neuron while about 15% is used for gradually throughout the range of data sizes utilized in training
validation and the remaining 15% is used for the prediction of and testing, making ANN a reliable tool for prediction.
the ANN variables. At the training phase the weight attached
to each neuron are updated after each epoch with the help of Despite these studies having established the viability of
the training algorithm until the training is validated with high ANN for the prediction of BOD as a performance indicator in
testing precision. Criterion for stoppage is usually defined as WWTP, it is important to note; no study has been conducted
specified at the beginning of each training by using the to predict the performance of BOD5 as a performance indicator
numbers of iterations and the minimum mean square error as for the Wupa WWTP. Bearing in mind that each WWTP in the
well as validation checks. The normalization checks are world is dynamic and unique with sometimes high variation in
designed in such a way that each impute node contribute their contaminants as rightly noted by (Jami et al. 2012). It
immensely to the prediction of the output to minimize local becomes expedient to apply ANN for the prediction of BOD 5
minimum convergence [29,30] The normalization equation output in the Wupa WWTP. Therefore, this study applies ANN
can be expressed using Eq. (1) below to predict the output BOD5 for the Wupa WWTP using a 5
years’ dataset.
(𝑥 − 𝑥𝑚𝑖𝑛 )(𝑦𝑚𝑎𝑥 − 𝑦𝑚𝑖𝑛 )
𝑦 = 𝑦𝑚𝑖𝑛 + 1 The novelty of this research lies in the application of four
(𝑥𝑚𝑎𝑥 − 𝑥𝑚𝑖𝑛 )
reliable multilayer perceptron ANN algorithms including the
where and 𝑥𝑚𝑖𝑛 and 𝑥𝑚𝑖𝑛 are the maximum and Levenberg-Marquardt (LM) backpropagation, resilient
backpropagation (RP), Quasi-Newton backpropagation (BFG)
minimum value of the data set, 𝑦𝑚𝑎𝑥 and 𝑦𝑚𝑖𝑛 are the range and Fletcher-Reeves conjugate gradient backpropagation
for normalization, and y the normalized value of 𝑥 . basically, (CGF); for performance prediction of the Abuja-based Wupa
the range for normalization is either (0,1) or (-1,1). The output WWTP using effluent 5-day biochemical oxygen demand
of each neutron is function of the training neurons and its (BOD5) as a performance indicator.
eights assigned to it using Eq. (2)
III. MATERIALS AND METHODS
(𝑛(𝑗−1))
𝑎𝑖𝑗 = 𝑓𝑗 (∑𝑘=1 𝑎𝑘(𝑗−1)𝑤𝑘𝑖(𝑗−1)+𝑏 ) 2
𝑖𝑗
A. Study Area
The location for this study is Abuja, the Federal Capital
where 𝑎𝑖𝑗 and 𝑏𝑖𝑗 are the output and bias of the i-th City of Nigeria; using the Wupa WWTP Abuja, Nigeria. Wupa
neutron in the j-th layer, 𝑎𝑘(𝑗−1) and 𝑤𝑘𝑖(𝑗−1) are the output WWTP occupies an area of 297,900 square meters and lies
and the weight of neutron from the previous layer, between UTM coordinate N998183.603, E321889.651,
N998183.603, E322283.340 and 321889.65, E321889.651,
respectively, 𝑛(𝑗−1) is the number of neutrons in the (j-1) and
N997495.399. E322283.340. The Wupa WWTP is an
𝑓𝑗 is the activation by introducing non-linearity to the network. oxidation ditch plant; a type of activated sludge system
The commonly used activation functions are logistic sigmoid developed to a capacity of 131.3 million liters per day with
A. ANN Model Performance for BODeff. Output the best ANN model's regression plot between actual and
Tables 3 to 6 present the best results for each of the predicted data on the BODeff; and Figure 5 shows the BODeff
applied MLP-ANN algorithms (LM, BFG, RP and CGF) using output actual values against prediction values for the best
various combinations of activation pairs; while Figure 4 shows ANN model.
Table 3: MLP-ANN model performance statistics for training; validation; and testing of the LM method for BOD eff.
LM
Training Validation Testing
HLAF OLAF Designation MSE R2 MSE R2 MSE R2 IN Architecture
Logsig Purelin LP 1.512 0.929 1.520 0.904 1.523 0.925 11 [8-1003-1]
Logsig Tansig LT 1.492 0.929 1.576 0.912 1.534 0.918 10 [8-1270-1]
Tansig Purelin TP 1.49 0.929 1.605 0.906 1.527 0.925 16 [8-1200-1]
Tansig Logsig TL 5.307 0.872 5.432 0.861 5.576 0.845 15 [8-1275-1]
Purelin Logsig PL 5.306 0.874 5.483 0.845 5.513 0.857 15 [8-1201-1]
Logsig Logsig LL 5.305 0.872 5.432 0.861 5.571 0.846 2 [8-899-1]
Purelin Purelin PP 1.526 0.927 1.404 0.929 1.617 0.906 4 [8-1098-1]
Tansig Tansig TT 1.504 0.929 1.578 0.912 1.528 0.920 14 [8-1044-1]
Purelin Tansig PT 1.531 0.927 1.480 0.916 1.522 0.922 16 [8-1265-1]
Note: OLAF = output layer activation function; HLAF = hidden layer activation function; IN, iteration number
Table 5: MLP-ANN model performance statistics for training; validation; and testing of the BFG method for BOD eff.
BFG
Training Validation Testing
HLAF OLAF Designation MSE R2 MSE R2 MSE R2 IN Architecture
Logsig Purelin LP 1.506 0.945 1.607 0.908 1.530 0.937 34 [8-1203-1]
Logsig Tansig LT 1.521 0.931 1.566 0.906 1.524 0.939 31 [8-1270-1]
Tansig Purelin TP 1.505 0.929 1.608 0.904 1.538 0.925 31 [8-1001-1]
Tansig Logsig TL 10.347 0.213 10.591 0.191 10.978 0.244 1 [8-1245-1]
Purelin Logsig PL 6.321 0.089 6.179 0.108 6.346 0.084 4 [8-1121-1]
Logsig Logsig LL 6.299 0.000 6.282 0.000 6.346 0.000 1 [8-689-1]
Purelin Purelin PP 1.543 0.925 1.417 0.929 1.645 0.904 17 [8-1048-1]
Tansig Tansig TT 1.570 0.924 1.588 0.910 1.619 0.908 46 [8-1044-1]
Purelin Tansig PT 1.531 0.927 1.480 0.916 1.527 0.920 46 [8-1065-1]
Note: OLAF = output layer activation function; HLAF = hidden layer activation function; IN, iteration number
Table 6: MLP-ANN model performance statistics for training; validation; and testing of the CGF method for BOD eff.
CGF
Training Validation Testing
HLAF OLAF Designation MSE R2 MSE R2 MSE R2 IN Architecture
Logsig Purelin LP 1.523 0.937 1.624 0.814 1.545 0.935 52 [8-1024-1]
Logsig Tansig LT 1.576 0.910 1.649 0.867 1.546 0.927 57 [8-1220-1]
Tansig Purelin TP 1.546 0.924 1.633 0.819 1.583 0.850 37 [8-1230-1]
Tansig Logsig TL 5.306 0.767 5.432 0.714 5.573 0.676 41 [8-1275-1]
Purelin Logsig PL 5.308 0.696 5.484 0.778 5.515 0.677 46 [8-1005-1]
Logsig Logsig LL 5.306 0.794 5.432 0.745 5.573 0.785 27 [8-989-1]
Purelin Purelin PP 1.567 0.887 1.457 0.953 1.678 0.872 16 [8-1031-1]
Tansig Tansig TT 1.530 0.966 1.613 0.947 1.552 0.929 28 [8-1052-1]
Purelin Tansig PT 4.679 0.677 4.617 0.796 4.262 0.750 64 [8-1033-1]
Note: OLAF = output layer activation function; HLAF = hidden layer activation function; IN, iteration number
Fig 5: The best ANN model's regression plot between actual and predicted data on the BOD eff
Fig 6: BODeff output actual values against predicted values for the best ANN model