0709 2071 PDF
0709 2071 PDF
0709 2071 PDF
pollutant Removal
Abstract
In this paper we address a general parameter estimation methodology for an extended biokinetic
this case we focus on the case when only substrate degradation data are available and prove the
structural identifiability of the model. Further we consider the problem of practical identifiability and
propose experimental and related numerical methods for unambiguous parameter estimation based
on multiple substrate degradation curves with different initial concentrations. Finally by means of
simulated pseudo-experiments we have found convincing indications that the proposed algorithm is
Nomenclature
Abbreviation
X biomass (g COD l-1)
S substrate (g COD l-1)
growth rate (d-1)
max maximum growth rate (d-1)
Ks saturation constant (g COD l-1)
Y growth yield
b biomass decay rate (d-1)
t time (d)
Indices
mp micropollutant
0 Initial condition at t=0
1/1
Introduction
Quantitative mathematical modelling of wastewater treatment (as e.g. the well known
system state-variables (such as COD) and do not enter into further details concerning
required.
biokinetic model approaches, as e.g. ASM [2] and aims to describe the degradation
biodegradation alone.
problem is to indirectly estimate its growth and decay rates, as to our best knowledge
there are no easily available experimental methods for direct quantification of the
2/2
State of the art and motivation
reality can consist of many different species. This model contains three types of
parameters:
This theoretical model extension has practical relevance only in case if the additional
experiments. The first type of parameters can easily be identified following standard
Consider a biodegradation system where the full model described above is assumed
considered as closed (see Equation 2 in [1]) and reduces to Equation (1). This
3/3
the model equations we have changed the indices P3 (referring to persistent polar
dXmp
= mp Xmp bmp Xmp
dt (1)
dSmp mp
= Xmp
dt Ymp
Smp
mp = max,mp (2)
K s,mp + Smp
We assume as usual that the kinetic parameters are constant and do not have a
from an abstract point of view parameter estimation for biokinetic systems described
by Equation (1) has long been studied under the assumption that both biomass and
substrate concentrations (or other quantities which are functions of these variables)
propose is based therefore on the assumption that only Smp measurements are
measurements are not feasible due to the extremely low substrate concentrations
methodology proposed here is not restricted to the case of micropollutants and can
always be applied whenever Equation (1) is valid and only substrate concentration
4/4
There are two different questions to be answered [7]:
Structural Identifiability
In this section we omit the subscripts mp. This is done on the one hand for simplicity,
but also because our results have a general mathematical validity (not restricted to
(1, 2)).
From a mathematical point of view we have to solve the following problem: Given the
(closed) system of equations (1, 2) and provided that the function S(t) is exactly
known for all times (t 0), is it in principle possible to uniquely determine the set of
biokinetic parameters max, Ks, b, Y and the unknown initial condition (biomass at t=0)
X(0)=X0?
As a first step we note that by formal integration of the biomass equation one can
transform the system of differential equations (1,2) into the following equivalent
integro-differential equation.
dS X0 S t
S
= max expmax d bt (3)
dt Y Ks + S 0
Ks + S
This equation has one initial condition S(0)=S0 and the other initial condition X(0)=X0
appears as a new parameter. Already at this point it is clear that it is only the ratio
5/5
X0/Y that can be structurally identifiable as any independent values of X0 and Y
yielding the same ratio have an identical impact on equation (3). (This can be
independently seen from equation (23) below.) Therefore the question is whether the
set of parameters max, Ks, b and X0/Y of equation (3) can be uniquely identified
To proceed further we take the logarithm of both sides of equation (3) multiplied by
minus one.
dS X0 S t
S
ln = ln(max ) + ln
+ ln
+ max d bt
Ks + S Ks + S
dt Y (4)
0
Please note that this operation is well defined as both the right and the left hand side
Let us now consider a fixed solution of equation (3) S(t), t 0, S(0)=S0. Then
equation (4) is an algebraic relation between the set of four parameters max, Ks, b
and X0/Y. Let us further assume that there exists a different set of parameters
~ ,K ~ ~ ~ ~
max s , b , X 0 /Y which satisfies the following relation
~
dS X0 ~ t S
= ln(max ) + ln ~
~ + ln ~ S + ~
K + S max K
ln d b t (5)
dt ~
Y s 0 s +S
Subtracting the right hand sides of relations (4) and (5) we obtain the function (t)
which has to be identically equal to zero for all times t 0 due to the fact that the left
X
0 ~
S( ) S( ) K s + S(t)
max Y
( )
t t
~ ~
+ bb (6)
(t) = ln ~ + ln ~ + max K + S() d max ~ d + ln
t 0, t 0
max X 0 0 s 0 K s + S( ) K s + S(t)
~
Y
6/6
From equation (6) it immediately follows that
~ ~
X 0 max X 0 K s + S 0
(0) = 0 ~ =~ (7)
Y max Y K s + S 0
~ ~
Thus we have expressed the parameter X 0 /Y as a function of the remaining ones.
Taking into account that the first derivative of equation (6) has also to be identically
equal to zero for all times t 0 and taking the limit t we find also that
) ( )
d S(t) ~ S(t) X(t) S(t) Ks Ks ~
+ b b 0, t 0
= max
dt K s + S(t)
max ~
K s + S(t)
max
Y K s + S(t) (K s + S(t)) K s + S(t)
~
(
(8)
d ~ ~
limt =bb b = b
dt
From the last result, the first derivative can be rewritten as follows:
d
(max ~max )S(t )3 + (max (K s + K~ s ) 2K s~max )S(t )2 + K s (maxK~ s ~maxK s )S(t ) max (K s K~ s )X(t )S(t ) (t )
= Y = 0
dt (K s + S(t ))2 (K~ s + S(t )) (t )
(9)
where (t ) = (max
~ )S(t )3 + K + K
max max s( (
~ ~ ) 2
) ~ ~
(
s 2K s max S(t ) + K s maxK s maxK s S(t )
max
Y
) ~
(
K s K s X(t )S(t ) )
2 ~
(
and (t ) = (K s + S(t )) K s + S(t ) )
The function (t) is positive at all times and therefore the function (t) has to be
identically equal to zero. However, this function is a polynomial in the variables S(t)
and X(t) (the span of which is a compact segment of the set of real positive numbers)
and it can be identically equal to zero, if and only if the coefficients of the different
powers are also identically equal to zero. This requirement automatically leads to
~ ~
KS = Ks and max = max (10)
These equalities are simultaneously necessary and sufficient conditions for the
function (t) to be identically equal to zero. Further from equations (10) and (7) it
follows that:
~
X0 X0
~ = (11)
Y Y
7/7
Equations (8), (10) and (11) demonstrate the unique structural identifiability of the
Practical Identifiability
data, is however a major challenge. In spite of the fact that it is possible to uniquely
demonstrated above, this can hardly be done from experimental measurement data,
the problems being mathematically ultimately related to the well known high
[10, 11]. Therefore the potentially feasible methods to determine the biokinetic
parameters would have to rely on additional information. One way to this end is to try
to employ the information contained not in one but in series of degradation curves,
for which it is on the one hand guaranteed that all relevant parameters Ks,mp, max,mp,
the ratio X0,mp/Ymp and bmp are identical while the initial values of the substrate
In this approach to solve the problem of practical identifiability there are two implicit
tasks:
To find a modus of application of this method to the case when the available
The first of these two tasks can be solved explicitly. To this end we have developed
an experimental design and protocol which is feasible for the case of micropollutants
aiming at the estimation of the basic biokinetic parameters Ks,mp, max,mp, the ratio
X0,mp/Ymp and the decay rate bmp of the specialised biomass. We have designed two
8/8
basic types of short-term batch experiments to determine these quantities. These
assumed that they deliver sufficient data which are entirely free of noise. This
which the noisy and scarce experimental data are to be used to obtain
The endogenous decay rate bmp is obtained by monitoring the decay of active cell
mass in the absence of growth substrate. The major challenge in this case is finding
a suitable method for evaluating the quantity of active biomass. A method that can be
applied for mixed cultures, based on previous results by [12] can be described as
follows:
substance occurs. Initially a sludge sample is left to decay without substrate. From
this decaying culture sub-samples are periodically withdrawn at time instances ti.
target substrate and the initial slope of each degradation curve is estimated. The
9/9
The initial slope of each of the resulting degradation curves ki (for t=ti) is
dSmp (S )
= mp mp Xmp (ti ) = k i (12)
dt Ymp
The specialised (active) biomass fraction Xmp(ti)/Xmp(t0) remaining at time ti for all i
can be computed by dividing the initial slopes ki for each sub-sample by the initial
b t
ki Xmp (t i ) X 0, mp e mp i b t
= = = e mp i (13)
k 0 Xmp (t 0 ) X 0, mp
(=mp=0).
The semilog plot of the specialised biomass fraction vs. time yields a straight line
Figure 1 shows the resulting idealised curve for the determination of the decay
rate bmp.
Time
10/10
2. Degradation experiments for determination of the biokinetic constants Ks,mp,
target micropollutant substrate is spiked, whereby the concentrations for the separate
samples are different and form a monotonous series. The basic procedure includes
degradation curves are recorded for m+1 different time instances (0, t1, , tm).
For n different initial substrate concentrations S1,mp (0 ),K, S n,mp (0 ) these curves
For t=0 these differential equations can be seen as a set of algebraic equations of
the form:
11/11
Note that due to the experiment design X1,mp(0)=.= Xn,mp(0). In what follows we
denote this parameter with X0,mp. Equations (16) are linear with respect to the two
dS1,mp
(0) S1,mp(0)
dS
- S1,mp(0) 1,mp (0)
dt dt
K K x1 = Ks,mp K
A xv = yr, A
= K v r
(17)
K , x = x = X 0,mp , y = K
Ymp
2 max,
mp
K K K
dSn,mp dSn,mp
(0) Sn,mp(0) - Sn,mp(0) (0)
dt dt
As it is well known [13], provided that the columns of are linearly independent
then exists
~v + r
x = A y, where (
A + = A T A )
1
A T is the pseudoinve rse matrix of A . (18)
equation (17).
Indeed, as
dS1,mp S1,mp(0)
(0) S1,mp(0) S1,mp(0)
dt Ks,mp + S1,mp(0)
K K K K
= K X
A K = max.mp (19)
0,mp
K K
Ymp
K K K K
dSn,mp
(0) Sn,mp(0)
Sn,mp(0)
Sn,mp(0)
dt K + S (0)
s,mp n,mp
it is only in the regime when Ks,mp >> Si,mp(0) (i=1,,n) that has effectively rank
12/12
The determination of max,mp plus independent estimations of max,mp*X0,mp/Ymp and
bmp is based on equation (4) applied to all experimental data for all degradation
zv = vv
B (20)
t1
S1,mp dS1,mp S1,mp (t1 )
1 t1
K d ln
(t1 ) ln
+ S dt K s,mp + S1, mp ( t 1 )
L L
0 s,mp 1,mp
L
L X0,mp
t1
Sn,mp z = ln dS
ln n,mp (t ) ln S ( t ) (21)
1 t1 max,mp Y
0 K s,mp + Sn,mp v
n,mp 1
d 1
s,mp + Sn,mp (t1 )
mp
dt
1 K
B = , vv =
, z = z 2 = bmp
S1,mp dS1,mp S1,mp (t 2 )
t2
1 t
2 0 K s,mp + S1,mp d z3 = max,mp
ln
dt
(t 2 ) ln
K + S (t )
L L s,mp 1,mp 2
L L
Sn,mp dSn,mp Sn,mp (t m )
tm
1 t
0 K s,mp + Sn,mp d ln (t m ) ln
K s,mp + Sn,mp (t m )
m
dt
~v + v 1
z = B v, where B + = B T B B T ( ) is the pseudoinve rse matrix of B . (22)
given, but this time in the regime when Ks,mp << Si,mp(0) (i=1,,n) B has
impossible.
X0,mp/Ymp and bmp from ideal decay and multiple degradation experiments. Two
different ways. It should be stressed however that the second of these quantities, i.e.
experimental data. In contrast the decay rate is determined from two independent
13/13
Parameter estimation from realistic experimental data
To go to realistic (noisy) experiments, one has to be able to obtain good curve fits for
each of the degradation curves and at least the initial slope of the decay curves. A
Unfortunately for few experimental points and some degree of noise the result can be
This allows the initial determination of the biokinetic parameters: first of all bmp,
then Ks,mp and the product max,mp*X0,mp/Ymp and finally max,mp plus an
Now one can attempt a renewed approximation of the experimental data, this time
by numerical solution of the system of differential equations (1) and (2) by using
the biokinetic parameters estimated in the previous step and least square fitting.
dX mp dx mp
= mp X mp b mp X mp = mp x mp b mp x mp
dt dt
dS mp mp dS mp
= X mp = mp x mp (23)
dt Ymp dt
X 0,mp
S mp (0) = S 0,mp , X mp (0) = X 0,mp S mp (0) = S 0,mp , x mp (0) =
Ymp
This allows to find solutions for Smp although we cannot determine Ymp separately.
Finally one can once again determine the biokinetic parameters making use of
these new approximations of the degradation curves and iterate all algorithms
14/14
The above discussion does not yet allow estimating the confidence region of the
points. While a full statistical analysis of the proposed method is still missing, below
Noise simulation
The pseudo-experimental data are obtained by taking calculated ideal values and
analysed samples is usually available. This is due to a number of reasons: For low-
conditions is prerequisite for sound parameter estimation. While this is still possible
for a small number of samples, it becomes increasingly difficult with larger sample
series.
We simulate equation (23) for the following parameter values: max,mp =1 d-1,
Ks,mp= 22 g/l, X0,mp/Ymp=330 g/l, bmp=0,3 d-1 for four fixed initial values of the
0.02, 0.05, 0.1) for each pseudo-experimental value. Adding noise technically
mean value of 1 (the length of the array being equal to the length of the pseudo-
15/15
For each value of standard deviation of the noise we produce the data for a
7 14
Simulated ideal degradation curve Simulated ideal degradation curve
Pseudo-experimental points Pseudo-experimental points
6 12
Degradation curve with added noise stabw=0.1 Degradation curve with added noise stabw=0.1
5 10
Substrate [g COD/l]
Substrate [g COD/l]
4 8
3 6
2 4
1 2
0 0
0 0.05 0.1 0.15 0.2 0.25 0 0.05 0.1 0.15 0.2 0.25
Time [days] Time [days]
30 60
Simulated ideal degradation curve Simulated ideal degradation curve
Pseudo-experimental points Pseudo-experimental points
25 Degradation curve with added noise stabw=0.1 50 Degradation curve with added noise stabw=0.1
Substrate [g COD/l]
Substrate [g COD/l]
20 40
15 30
10 20
5 10
0 0
0 0.05 0.1 0.15 0.2 0.25 0 0.05 0.1 0.15 0.2 0.25
Time [days] Time [days]
Figure 2: Plot of a set of simulated ideal degradation curves, these degradation curves with added
One should note that these final pseudo-experimental series are really unfavourable
for parameter estimation. In addition to the mentioned scarcity of data points, the
This needs not actually be the case in a real measurement error as the distribution
experimental value. With the approach we have chosen we particularly increase the
general the error in the initial part of every degradation curve. This affects
particularly these parts of our estimation algorithm which are based on initial slopes
16/16
For each of the pseudo-experimental series we apply the algorithmic approach for
parameter estimation as described above using 100 iteration steps. The results are
example of such type of results for standard deviation 0.02 is presented in Figure 3.
3,5 30
3
25
2,5
20
Ks,mp [g COD/l]
2
max,mp [d ]
-1
15
1,5
10
1
0,5 5
0 0
1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 65 69 73 77 81 85 89 93 97 101 1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 65 69 73 77 81 85 89 93 97 101
Number of iterations Number of iterations
350 1,2
300 1
max,mpX0,mp/Ymp [g l-1 d-1]
250
0,8
bmp [d-1]
200
0,6
150
0,4
100
50 0,2
0 0
1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 65 69 73 77 81 85 89 93 97 101 1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 65 69 73 77 81 85 89 93 97 101
Number of iterations Number of iterations
Figure 3: Calculated values for max,mp, Ks,mp, max,mp *X0,mp/Ymp and bmp using noise standard deviation
stabw=0.02 for 22 sets of pseudo-experimental data using 100 iterations in each parameter
estimation
given in Figure 4, whereby the straight line corresponds to the values of the
parameters used in the simulation before adding noise. (Note that these values
coincide with the algorithm output in the case of no noise standard deviation of
noise equal to zero). For each value of the input noise standard deviation a mean
value of all algorithm estimates and the corresponding output standard deviation
are given.
17/17
Calculation of m ax,m p for different standard deviations Calculation of Ks,m p for different standard deviations
(means and standard deviation range) (means and standard deviation range)
3,5
30
3
25
2,5
max,mp [d ]
-1
20
Ks,mp [g/l]
2
1,5 15
1 10
0,5 5
0 0
0 0,02 0,04 0,06 0,08 0,1 0 0,02 0,04 0,06 0,08 0,1
Noise (stabw) Noise (stabw)
Calculation of m ax,m pX0,m p/Ym p for different standard deviations Calculation of bm p for different standard deviations
(means and standard deviation range) (means and standard deviation range)
400 0,8
350 0,7
max,mp X0,mp /Ymp [g l d ]
-1
0,6
300
-1
0,5
250
bmp [d ]
-1
0,4
200
0,3
150
0,2
100
0,1
50
0
0
0 0,02 0,04 0,06 0,08 0,1
0 0,02 0,04 0,06 0,08 0,1
Noise (stabw)
Noise (stabw)
Figure 4: Diagramme for max, mp, Ks,mp, max,mp *X0,mp/Ymp and bmp for different standard deviations
(mean plus confidence range) for 22 sets of pseudo-experimental data using 100 iterations in each
parameter estimation
This approach gives satisfactory indications of the standard deviation range of the
Simultaneously it should be noted that it cannot account for any modelling errors
i.e. possible deviations of the experimental data to the form of the model assumed.
Conclusions
equations (1) and (2) in the case when only substrate measurement data are
identifiability on the basis of multiple substrate degradation curves with different initial
The approach described above is the basis for parameter estimation for a number of
degradation parameters for NDSA and BTSA for which the presented methodology
was additionally tested and verified. The experimental results and the corresponding
Acknowledgement
The presented work was carried out within the framework of the project PTHREE
References
[3] B. Petersen (2000). Calibration, identifiability and optimal experimental design of activated
19/19
[4] B. Petersen, K. Gernaey, M. Henze and P.A. Vanrolleghem (2003). Calibration of
activated sludge models: A critical review of experimental designs. In: Biotechnology for
the Environment: Wastewater Treatment and Modeling, Waste Gas Handling. Eds.
Agathos S.N. and Reineke W., Kluwer Academic Publishers, Dordrecht, The
Netherlands. 101-186.
activated sludge models. In: Proceedings 76th Annual WEF Conference and Exposition.
carbon source degradation processes using titrimetric data and combined respirometric-
[8] Dochain, D., Vanrolleghem, P.A., Van Daele, M. (1995). Structural identifiability of
MathSciNet.
[12] C.S. Criddle (1993). The kinetics of co-metabolism. Biotechnology and Bioengineering