Mathematical_Modelling_COPASI_tutorial
Mathematical_Modelling_COPASI_tutorial
24 November 2016
Building and Simulating Models using COPASI 2
Licence
This tutorial is © 2016, Nicolas Le Novère, Viji Chelliah, Bhupinder Virk.
This manual is distributed under the creative commons Attribution-Share Alike 4.0 licence. This means
that you are free to:
Attribution — You must give appropriate credit, provide a link to the license, and indicate if
changes were made. You may do so in any reasonable manner, but not in any way that
suggests the licensor endorses you or your use.
ShareAlike — If you remix, transform, or build upon the material, you must distribute your
contributions under the same license as the original.
For any reuse or distribution, you must make clear to others the licence terms of this work.
Any of these conditions can be waived if you get permission from the copyright holder.
Nothing in this license impairs or restricts the author's moral rights.
https://creativecommons.org/licenses/by-sa/4.0/legalcode
Building and Simulating Models using COPASI 3
Introduction
This tutorial is not meant to be an advanced course. However, it aims at being practical and after
completing it you should be able to create your own simple models and run a few simulations and
analyses. There are many modelling approaches, and many model analysis procedures. We will only
concentrate on one type of model: Chemical kinetics in well mixed solutions. Such models have been
around since the the mid-XXth and are still the most used type in systems biology.
We will use a mix of models that were actually published as part of research projects, and toy models
specially created for the sake of this tutorial. Their size and complexity have been chosen to fit t he
duration of the tutorial.
Many concepts will be introduced and not developed. For more general background on the modelling
of biochemical networks, see:
Le Novère. Quantitative and logic modelling of gene and molecular networks. Nature Reviews
Genetics (2015) 16: 146–158
Finally, the following book presents different approaches used to model dynamic phenomena in life
sciences, expanding further than the previous ones. It comes with plenty of exercises in MatLab and
R.
Ellner and Guckenheimer. Dynamic Models in Biology. Princeton University Press (2006)
Building and Simulating Models using COPASI 4
Modelling software
This tutorial will use the modelling and simulation tool COPASI (standing for “Complex Pathway
Simulator”). COPASI is an open-source software widely used in computational systems biology
because of its versatility and ease of use. It comes with a command-line version that can be used for
instance on clusters, and a user friendly graphical interface - which we will use today. The tool is used
for teaching but also for research (the main paper, Hoops et al 2006 has been cited > 1400 times).
Plenty of information including manuals, videos and user forums is available at:
http://www.copasi.org
Models
Some of the example models used for this tutorial are taken from the BioModels databas e
(https://www.ebi.ac.uk/biomodels). BioModels contains thousands of mathematical models covering a
large diversity of biological processes. Many of them have been carefully verified and annotated, and
are a great starting point for further modelling endeavours.
BIOMD0000000010 - Kholodenko2000
Kholodenko BN. Negative feedback and ultrasensitivity can bring about oscillations in the
mitogen-activated protein kinase cascades. Eur J Biochem. 2000 Mar;267(6):1583-8.
(http://www.ebi.ac.uk/biomodels-main/BIOMD0000000010)
The Mitogen Activated Protein Kinase cascade is a prototypic example of a behaviour emerging from a
system. As we will see, the response of a MAP kinase to an upstream signal is shaped by the whole
cascade of successive phosphorylation. The prediction that such a cascade would engender
ultrasensitivity is one of the great successes of mathematical modelling in biochemistry, made in 1981
by Albert Goldbeter and Daniel Koshland, a decade before the discovery of MAP kinase cascades.
More can be found at:
Model:
BIOMD0000000009 – Huang 1996
Huang CY, Ferrell JE Jr. Ultrasensitivity in the mitogen-activated protein kinase cascade. Proc Natl
Acad Sci U S A. 1996 Sep 17;93(19):10078 -83.
(http://www.ebi.ac.uk/biomodels-main/BIOMD0000000009)
This was the first detailed model of MAPK signalling showing the effect of cascading phosphorylation
steps on MAPK signalling (although as mentioned above, the theory was there already). All intermediat e
complexes were modelled explicitly, the reactions using Mass Action kinetics (compare this approac h
with the one used in the next exercise). Parameters and concentrations were estimated from
experimental results. The rate equations of the cascade were solved numerically.
Predicted results were similar to experimental results obtained with Xenopus laevis oocytes. In
particular, the dose response curves for MAPK, MAPKK, MAPKKK were predicted to show i ncreasing
ultrasensitivity (or supra-linearity), with the MAPK curve predicted to be the steepest.
In this exercise, we will import a model already written, using the format SBML (Systems Biology Markup
Language, http://sbml.org).
Building and Simulating Models using COPASI 6
A time course simulation shows the evolution of a system over time. It computes the values of the
variables (here the concentrations) after a small amount of time based on their values now. Time
courses are particularly important to understand signalling events and their consequences.
Explore the model and try to understand what each reaction does. Use the map on the previous
page for context. Look at the “Annotation” tab of “Model”, a selected “Species” and a “Reaction”.
Prepare a Time Course (Under Tasks) with duration = 100 seconds, Interval Size = 0.1
seconds, Method = Deterministic (LSODA).
Use the Output Assistant to create a “Concentrations, Volumes and Global Quantity Values”
plot, and then run the time course.
Hide all of the curves except for the normalized active forms of MAPK (“K_PP_norm”), MAPKK
(“KK_PP_norm”) and MAPKKK (“KKK_P_norm”).
Save your model (e.g myHuang1996-TimeCourse.cps)
The plot below shows that MAPKKK (e.g. Raf) is activated first, then MAPKK (e.g. MEK) and finally
MAPK (e.g. ERK). However, MAPK reaches maximal activation in 30 seconds, well before MAPKK (40
secs) and MAPKKK (70 secs). Why does MAPKKK shows a maximum at 40 secs, then decreases to
reach a lower plateau?
Building and Simulating Models using COPASI 7
Dose response studies help us understand any change caused by varying a dosage on the dynamics
of the system. These are used for instance to determine the efficient, “safe” and “hazardous” dosages
of drugs, and also to optimize any perturbation protocol. The numerical procedure is called paramet er
scan, or parameter sweep. As the name indicates, they can also be used to study the effect of varying
any parameter and therefore study the sensitivity of the system towards this parameter (there are more
efficient way to do this, which are beyond the scope of this course).
Here, we will study the stimulus/response curves for three components of the MAPK cascade, the input
stimulus being the MAPKKK_activator (INPUT(E1) on the map above, for instance Ras). We will
compute the steady-state of all concentrations (when their values do not vary anymore) for increasing
values of the MAPKKK_activator.
Select Parameter Scan (Under Tasks), and change the Task type to “Steady State”.
Set new scan item to Scan and click create.
Select the parameter to be scanned as Initial Concentration of the [MAPKKK activator](t=0)
In the scan item you have just created set: Intervals = 100, min = 1e-6, max=0.1, and select
logarithmic
Use the Output Assistant to create a “Scan of Concentrations. Volumes and Global Quantity
Values” plot. Use a log scale for the x-axis (MAPKKK_activator). Run the scan to get the dose
response curves.
What is happening? The very weird curves come the fact that several unstable steady states
have been plotted in addition to the stable steady states. To view only the steady states in the
plot, we need to change the settings for Steady-State (Under Tasks) – Change the Use Newton
parameter from 1 to 0 and run the plot again.
Hide all of the curves except for the normalized active form of MAPK (“K_PP_norm”), MAPKK
(“KK_PP_norm”) and MAPKKK (“KKK_P_norm”).
Save your model (e.g myHuang1996-parameterScan.cps)
Building and Simulating Models using COPASI 8
This plot teaches us two different, very important, characteristic of the MAP kinase cascade.
1. The half activation of ERK happens at lower RAS concentration (4.4 µM) than that of MEK (13
µM) or RAF (260 µM).
2. The slope of the curve for ERK is steeper than for MEK, itself steeper than for RAF
Model:
BIOMD0000000010 – Kholodenko 2000
Kholodenko BN. Negative feedback and ultrasensitivity can bring about oscillations in the mitogen-
activated protein kinase cascades. Eur J Biochem. 2000 Mar;267(6):1583-8.
(http://www.ebi.ac.uk/biomodels-main/BIOMD0000000010)
Inhibitory phosphorylation of SOS by p42/p44 MAPK (ERK) provides a mechanism for switching off Ras
signalling. This inhibition creates a negative-feedback in the MAPK cascade (below). Indeed, while
tyrosine phosphorylated Raf brings ERK activation, ERK mediated inhibition of Raf stimulation by SOS
decreases ERK phosphorylation. The paper by Kholodenko attempted to show that a combination of
negative feedback with ultrasensitivity can lead to sustained biochemical oscillations.
Here, we will see the effect of negative feedback (MAPK on its activator MAPKKK) on the
dynamics of MAPK signalling cascade.
The topology of the reaction network of this model is identical to Huang and Ferrell's model (t he model
we used in the previous example), apart from the negative feedback. However, for this model, all
reactions were modelled as simple Michaelis-Menten kinetics, unlike in Huang and Ferrell model where
the reactions are modelled using mass-action kinetics.
Schematic view of MAPK cascade with negative feedback loop. (Kholodenko 2000)
Open a new COPASI file, and set the name of the model to “Kholodenko2000”
Set the units: Time = s, Volume = l, Quantity = nmol
Create a new compartment (Model>Biochemical>Compartments), name it “cell” and set the
initial volume to 1
Building and Simulating Models using COPASI 10
For the purpose of this tutorial we need the form of the Henri-Michaelis-Menten law that uses k cat and
the concentration of enzyme. COPASI only comes with the classic “Michaelis-Menten” law using Vmax.
Create a time course (as in Exercise 1), with a Duration = 400 secs and Interval Size = 0.1
secs.
Create a “Concentrations, Volumes and Global Quantity Values” plot and run the time course.
Show only the curves for the normalized active form of MAPK (MAPK-PP), MAPKK (MAPKK-
PP) and MAPKKK (MAPKKK-P).
Do we retrieve the results obtained in 1.a? What is different in the curves representing RAF (MAPKKK)?
What causes the difference?
Building and Simulating Models using COPASI 13
Change the rate law of MAPKKK activation (reaction v1) to the new function “HMM with
inhibition”, and set the parameter values as follows (first, set the “mapping” to “--local--”)
Parameter Value
Modifier MAPK-PP
V 2.5
Ki 9
K 10
n 1
What happens? What causes the dynamical behaviour you observe? Would negative feedback always
produce such a result?
Plotting the values of state variables (here concentrations ) versus time is not the only way to depict the
dynamical behaviour of a system. Other graphical representations can reveal regulatory structures. A
plot frequently used to display dynamical behaviours is the phase portrait, where we plot the value of a
variable versus the value of another over time.
Building and Simulating Models using COPASI 14
Create a plot that reports MAPKKK-P as a function of MAPK-PP. Rerun the simulation.
What do you observe? \Can you predict what such a plot would look like for a steady state? A damped
oscillation? A bi-stable switch?
Building and Simulating Models using COPASI 15
Select Parameter Scan, and change the Task type to “Time Course”, then click “Create”.
Vary Ki (Reactions>Reaction Parameters>v1(MAPKKK activation)>Ki) from 1 to 51 in a linear
fashion (start with one interval). This causes a decrease of the inhibitory feedback (Ki indicates
the concentration of the inhibitor needed for half-maximal inhibition).
Run the scan and view the plot (you don’t need to create new plots - they were already
generated in the previous section).
Note the changes in the time course, compared to that of the plot obtained in the previous
section.
If we make 2 intervals, then the point where the oscillation begins to damp can be observed.
Try a few more intervals.
The point at which the behaviour switches from reaching steady -state to stable oscillations is called a
bifurcation. We can try to pinpoint the respective values of Ki.
Create a plot “bifurcation”, that will display MAPK-PP as a function of reaction v1’s paramet er
Ki. Select a log X-axis, “Symbols” for type and data capture during the tasks.
Modify the time course to suppress output before 2000 s (I should still stay on a duration of
9000 s. Interval size of 1 are fine.
Create a time course parameter scan, exploring logarithmically the values of v1’s parameter Ki
from 0.01 to 100 (1000 intervals)
Run it.
Can you pinpoint the value of Ki for which we start to see big stable oscillations? What happens below?
For which value of Ki do the oscillations disappear? What happens above? [look at what happens to
the time course. Art and Science …]
Building and Simulating Models using COPASI 16
Examples of oscillatory behaviours in MAP kinase cascades have since then be shown experimentally .
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2602854/
https://www.ncbi.nlm.nih.gov/pubmed/18384751
Building and Simulating Models using COPASI 17
3) Stochastic simulations
So far we have simulated the models with a “deterministic algorithm”. The numerical solver computed
the changes in the concentrations of molecular species based on ordinary differential equations
reconstructed from all the reactions involving them. This approach presents two problems. Firstly, each
time we simulate the model, we will get the same result. There is no “noise” taken into account.
Secondly, those solvers use continuous variables. So in small volumes, we can get half a molecule
reacting with an eighth of another one to form a third of a complex. A side effect is that if there is a
discrete decision to be made, the solver won’t be able to choose, but will “cut the pear in half”. Therefore,
if small amounts are involved, it is often useful to use a “stochastic algorithm”. There are several classes
of such algorithms. The most used in systems biology is based on the independent simulation of all
reactions, random numbers being used to decide which reaction to fire and when.
To demonstrate the usefulness of such a simulation framework we will use a “toy model”, representing
a system able to exist in two different states. We have two genes, whose activity will be modelled using
the concentrations of their messenger RNAs and the corresponding proteins. Each protein repress es
the expression of the other gene.
Change p1 back to 0. Run a 10000 s simulation with the Gibson-Bruck algorithm? What
happens? Run the simulation several times. Is the result always the same?
Change p1 to 100. Run the simulation. Again. Again. What are the differences with the previous
situation?
So, this is a switch, but it is unstable. If we start with a symmetrical situation, we get oscillations, going
from p1 high to p2 high. One way of getting a more robust decision system is to incorporate cooperativit y
in the repression. The most iconic example of such a system is the lysis-lysogeny mechanism of E coli
in response to lambda phage infection (https://en.wikipedia.org/wiki/Lambda_phage ).
Change p1 back to 0. Change the value of the global variables n1 and n2 from 1 to 1.5.
Run the simulation. Again. Again. What happens? Is it always the same protein that wins? Do
we have oscillations?
Cooperative inhibition
As we saw in exercise 1.b, we can run many simulations at once with COPASI. Let’s do that, but instead
of a parameter scan, where we change the value of a parameter, we will just repeat the same simulation.
Building and Simulating Models using COPASI 19
Go to parameter scan. For “New scan item”, select “repeat”. Set the number of iterations to 20,
and the Task to Time Course.
Launch the scan. Be patient, it will take a while!
How many times does p1 win? To count more easily, select the rising phase with the mouse (this should
be 0-2000 seconds) to zoom in.
A very important feature of the modelling and simulation procedure in systems biology is the estimation
of parameter values based on experimental measurements. Sometimes accurate values can be found
in the literature or in databases. However, most often they are not available or need to be adapted to
the context. Therefore, we need to estimate the parameters, that is to find the values that provide the
best reproduction of observed quantities. (Note that those quantities do not necessarily need to be
explicitly in the model, but they need to be related to variables of the model). To do that, we set
parameter values, we run simulations and we compute the difference between the results and the
observations. Below are two surfaces showing this difference for a 2 parameter system.
On the left, the system presents only one “dip” showing the set of parameters that give the optimal
solution (Notice something odd though?) On the right, the systems present two dips, representing a
local and an absolute minimum.
Explore the model. This is a simple model representing the accumulation of Ulk1 complex during the
formation of an autophagosome and its disappearance after budding of the vesicle. In the initial phase,
the Ulk1 complex aggregates with a constant flux (this assumes an unlimited supply of the complex
from the cytosol). The removal rate is 0. The budding is represented by a discrete event.
After the budding, the creation rate is turned to zero, and the removal rate is set up, so that a
unimolecular removal takes place, that is a reaction in which velocity is proportional to the concentration
of the complex. We have quantitative recordings of live fluorescent imaging of ATG13, a protein part of
the Ulk1 complex. We will use these recordings to determine the values for the creation of Ulk1 complex
before budding and removal after budding.
\\
The scoring function decreases while running a Experimental data, fitting kinetics and errors.
parameter estimation procedure.
Depending on the structure of the multidimensional objective function, some methods will be better than
others. Some are faster, but more sensitive to local minima. Some methods are better at avoiding local
minima but are not very precise etc. In addition, a good parameter estimation protocol woul d run such
parameter estimation many times (hundreds or thousands) and study the distribution of the resulting
fitting scores in function of the estimated parameter values. All that is beyond the scope of the present
course.