Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

An Extended Kalman Filtering Approach to Modeling Nonlinear Dynamic Gene Regulatory Networks via Short Gene Expression Time Series

Published: 01 July 2009 Publication History

Abstract

In this paper, the extended Kalman filter (EKF) algorithm is applied to model the gene regulatory network from gene time series data. The gene regulatory network is considered as a nonlinear dynamic stochastic model that consists of the gene measurement equation and the gene regulation equation. After specifying the model structure, we apply the EKF algorithm for identifying both the model parameters and the actual value of gene expression levels. It is shown that the EKF algorithm is an online estimation algorithm that can identify a large number of parameters (including parameters of nonlinear functions) through iterative procedure by using a small number of observations. Four real-world gene expression data sets are employed to demonstrate the effectiveness of the EKF algorithm, and the obtained models are evaluated from the viewpoint of bioinformatics.

References

[1]
T. Akutsu, S. Miyano, and S. Kuhara, "Identification of Genetic Networks from a Small Number of Gene Expression Patterns under The Boolean Network Model," Proc. Pacific Symp. Biocomputing , vol. 4, pp. 17-28, 1999.
[2]
L.R. Baugh, A.A. Hill, J.M. Claggett, K. Hill-Harfe, J.C. Wen, D.K. Slonim, E.L. Brown, and C.P. Hunter, "The Homeodomain Protein PAL-1 Specifies a Lineage-Specific Regulatory Network in the C. Elegans Embryo," Development, vol. 132, pp. 1843-1854, 2005.
[3]
M.J. Beal, F. Falciani, Z. Ghahramani, C. Rangel, and D.L. Wild, "A Bayesian Approach to Reconstructing Genetic Regulatory Networks with Hidden Factors," Bioinformatics, Oct. 2004.
[4]
Z. Bozdech, M. Llinas, B.L. Pulliam, E.D. Wong, and J. Zhu, "The Transcriptome of the Intraerythrocytic Developmental Cycle of Plasmodium Falciparum," PLoS Biology, vol. 1, no. 1, pp. 85-100, 2003.
[5]
T. Chen, H.L. He, and G.M. Church, "Modeling Gene Expression with Differential Equations," Proc. Pacific Symp. Biocomputing, vol. 4, pp. 29-40, 1999.
[6]
L. Chen and K. Aihara, "Chaos and Asymptotical Stability in Discrete-Time Neural Networks," Physica D: Nonlinear Phenomena, vol. 104, pp. 286-325, 1997.
[7]
D.L. Cook, A.N. Gerber, and S.J. Tapscott, "Modeling Stochastic Gene Expression: Implications for Haploinsufficiency," Proc. Nat'l Academy of Science USA, vol. 95, pp. 15641-15646, 1998.
[8]
A. Corigliano and S. Mariani, "Parameter Identification in Explicit Structural Dynamics: Performance of the Extended Kalman Filter," Computer Methods in Applied Mechanics and Eng., vol. 193, pp. 3807-3835, 2004.
[9]
M.J. de Hoon, S. Imoto, K. Kobayashi, N. Ogasawara, and S. Miyano, "Inferring Gene Regulatory Networks from Time-Ordered Gene Expression Data of Bacillus Subtilis Using Differential Equations," Proc. Pacific Symp. Biocomputing, pp. 17- 28, 2003.
[10]
H. de Jong, "Modeling and Simulation of Genetic Regulatory Systems: A Literature Review," J. Computational Biology, vol. 9, no. 1, pp. 67-103, 2002.
[11]
P. D'haeseleer, X. Wen, S. Fuhrman, and R. Somogyi, "Linear Modeling of mRNA Expression Levels during CNS Development and Injury," Proc. Pacific Symp. Biocomputing, pp. 41-52, 1999.
[12]
M.B. Eisen, P.T. Spellman, P.O. Brown, and D. Botstein, "Cluster Analysis and Display of Genome-Wide Expression Patterns," Proc. Nat'l Academy of Science USA, vol. 95, pp. 14863-14868, 1998.
[13]
H. Gao, X. Meng, and T. Chen, "A Parameter-Dependent Approach to Robust H ¿ Filtering for Time-Delay Systems," IEEE Trans. Automatic Control, vol. 53, no. 10, pp. 2420-2425, 2008.
[14]
H. Gao, X. Meng, and T. Chen, "A New Design of Robust H2 Filters for Uncertain Systems," Systems and Control Letters, vol. 57, no. 7, pp. 585-593, 2008.
[15]
Z. Gao, H. Wang, and T. Chai, "A Robust Fault Detection Filtering for Stochastic Distribution Systems via Descriptor Estimator and Parametric Gain Design," IET Control Theory Applications, vol. 1, no. 5, pp. 1286-1293.
[16]
Z. Gao and S.X. Ding, "State and Disturbance Estimator for Time-Delay Systems with Application to Fault Estimation and Signal Compensation," IEEE Trans. Signal Processing, vol. 55, no. 12, pp. 5541-5551, 2007.
[17]
Z. Gao and D.W.C. Ho, "State/noise Estimator for Descriptor Systems with Application to Sensor Fault Diagnosis," IEEE Trans. Signal Processing, vol. 54, no. 4, pp. 1316-1326, 2006.
[18]
Z. Gao, X. Dai, T. Breikin, and H. Wang, "High-Gain Observer-Based Parameter Identification with Application in a Gas Turbine," Proc. IFAC Control Congress, pp. 1408-1413, July 2008.
[19]
Z. Ghahramani, "Learning Dynamic Bayesian Networks," Adaptive Processing of Sequences and Data Structures, C.L. Giles and M. Gori, eds., pp. 168-197, Springer-Verlag, 1998.
[20]
N.S. Holter, A. Maritan, M. Cieplak, N.V. Fedoroff, and J.R. Banavar, "Dynamic Modeling of Gene Expression Data," Proc. Nat'l Academy of Science USA, vol. 98, pp. 1693-1698, 2001.
[21]
S. Huang, "Gene Expression Profiling, Genetic Networks, and Cellular States: An Integrating Concept for Tumorigenesis and Drug Discovery," J. Molecular Medicine, vol. 77, pp. 469-480, 1999.
[22]
S. Huang and D. Dissanayake, "Convergence and Consistency Analysis for Extended Kalman Filter Based SLAM," IEEE Trans. Robotics, vol. 23, no. 5, pp. 1036-1049, 2007.
[23]
A.K. Jain, M.N. Murty, and P.J. Flynn, "Data Clustering: A Review," ACM Computing Surveys, vol. 31, no. 3, pp. 264-323, 1999.
[24]
P. Kellam, X. Liu, N. Martin, C. Orengo, S. Swift, and A. Tucker, "A Framework for Modeling Virus Gene Expression Data," Intelligent Data Analysis, vol. 6, pp. 265-279, 2002.
[25]
T.B. Kepler and T.C. Elston, "Stochasticity in Transcriptional Regulation: Origins, Consequences, and Mathematical Representations," Biophysics J., vol. 81, no. 6, pp. 3116-3136, 2001.
[26]
A.J. Krener, "The Convergence of the Extended Kalman Filter," Directions in Mathematical Systems Theory and Optimization, pp. 173- 182, 2003.
[27]
S. Liang, S. Fuhrman, and R. Somogyi, "REVEAL: A General Reverse Engineering Algorithm for Inference of Genetic Network Architectures," Proc. Pacific Symp. Biocomputing, vol. 3, pp. 18-29, 1998.
[28]
T. Liu, W. Sung, and A. Mittal, "Model Gene Network by Semi-Fixed Bayesian Network," Expert Systems with Applications, vol. 30, no. 1, pp. 42-49, 2006.
[29]
H.M. McAdams and A. Arkin, "Stochastic Mechanisms in Gene Expression," Proc. Nat'l Academy of Science USA, vol. 94, pp. 814- 819, 1997.
[30]
M.F. Maduro and J.H. Rothman, "Making Worm Guts: The Gene Regulatory Network of the Caenorhabditis Elegans Endoderm," Developmental Biology, vol. 246, pp. 68-85, 2002.
[31]
K. Murphy and S. Mian, "Modeling Gene Expression Data Using Dynamic Bayesian Networks," technical report, Univ. of California, 1999.
[32]
M. Quach, N. Brunel, and F. d'Alché-Buc, "Estimating Parameters and Hidden Variables in Non-Linear State-Space Models Based on ODEs for Biological Networks Inference," Bioinformatics, vol. 23, no. 23, pp. 3209-3216, 2007.
[33]
M.F. Ramoni, P. Sebastiani, and I.S. Kohane, "Cluster Analysis of Gene Expression Dynamics," Proc. Nat'l Academy of Science USA, vol. 99, pp. 9121-9126, 2002.
[34]
C. Rangel, J. Angus, Z. Ghahramani, M. Lioumi, E.A. Sotheran, A. Gaiba, D.L. Wild, and F. Falciani, "Modeling T-Cell Activation Using Gene Expression Profiling and State Space Models," Bioinformatics, vol. 20, no. 9, pp. 1361-1372, 2004.
[35]
P. Smolen, D.A. Baxter, and J.H. Byrne, "Mathematical Modeling of Gene Networks Review," Neuron, vol. 26, no. 3, pp. 567-580, 2000.
[36]
R. Somogyi and C.A. Sniegoski, "Modeling the Complexity of Genetic Networks: Understanding Multigenic and Pleiotropic Regulation," Complexity, vol. 1, no. 6, pp. 45-63, 1996.
[37]
S. Swift and X. Liu, "Predicting Glaucomatous Visual Field Deterioration through Short Multivariate Time Series Modeling," Artificial Intelligence in Medicine, vol. 24, pp. 5-24, 2002.
[38]
P. Tamayo, D. Slonim, J. Mesirov, Q. Zhu, S. Kitareewan, E. Dmitrovsky, E.S. Lander, and T.R. Golub, "Interpreting Patterns of Gene Expression with Self-Organizing Maps: Methods and Application to Hematopoietic Differentiation," Proc. Nat'l Academy of Science USA, vol. 96, pp. 2907-2912, 1999.
[39]
S. Tavazoie, J.D. Hughes, M.J. Campbell, R.J. Cho, and G.M. Church, "Systematic Determination of Genetic Network Architecture," Nature Genetics, vol. 22, no. 3, pp. 281-285, 1999.
[40]
T. Thattai and A. van Oudenaarden, "Stochastic Gene Expression in Fluctuating Environments," Proc. Genetics Soc. Am., pp. 523-530, 2004.
[41]
T. Tian and K. Burrage, "Stochastic Neural Network Models for Gene Regulatory Networks," Proc. 2003 IEEE Congress Evolutionary Computation, pp. 162-169, 2003.
[42]
Z. Wang, F. Yang, D.W.C. Ho, S. Swift, A. Tucker, and X. Liu, "Stochastic Dynamic Modeling of Short Gene Expression Time Series Data," IEEE Trans. NanoBioscience, vol. 7, no. 1, pp. 44-55, 2008.
[43]
Z. Wang, H. Gao, J. Cao, and X. Liu, "On Delayed Genetic Regulatory Networks with Polytopic Uncertainties: Robust Stability Analysis," IEEE Trans. NanoBioscience, vol. 7, no. 2, pp. 154- 163, 2008.
[44]
F. Wu, W. Zhang, and A.J. Kusalik, "Modeling Gene Expression from Microarray Expression Data with State-Space Equations," Proc. Pacific Symp. Biocomputing, pp. 581-592, 2004.
[45]
F. Lewis, L. Xie, and D. Popa, Optimal and Robust Estimation: With an Introduction to Stochastic Control Theory, CRC Press, 2007.
[46]
K.Y. Yeung, C. Fraley, A. Murua, A.E. Raftery, and W.L. Ruzzo, "Model-Based Clustering and Data Transformations for Gene Expression Data," Bioinformatics, vol. 17, no. 10, pp. 977-987, 2001.

Cited By

View all
  • (2022)Experimental Design in Dynamical System Identification: A Bandit-Based Active Learning ApproachMachine Learning and Knowledge Discovery in Databases10.1007/978-3-662-44851-9_20(306-321)Online publication date: 10-Mar-2022
  • (2018)The p53Mdm2 regulation relationship under different radiation doses based on the continuousdiscrete extended Kalman filter algorithmNeurocomputing10.1016/j.neucom.2017.08.016273:C(230-236)Online publication date: 17-Jan-2018
  • (2017)Event-based recursive filtering for time-delayed stochastic nonlinear systems with missing measurementsSignal Processing10.1016/j.sigpro.2016.12.004134:C(158-165)Online publication date: 1-May-2017
  • Show More Cited By

Recommendations

Reviews

Jason H Moore

Systems biology is an emerging discipline aimed at understanding how biomolecules interact to influence biological processes such as development, gene regulation, and metabolism. Other disciplines, such as genomics and bioinformatics, have made it possible to study the relationships between biological components of a system rather than the individual components themselves. A primary focus of systems biology is on the perturbation of a system as a means to understand the interactions that drive the system. A common experimental approach is to treat a culture of cells or an animal model, such as a mouse, with a drug or a chemical, and then measure how the biomolecules of the system respond to the stimulus. The before and after treatment effects can then be modeled using mathematical or computational network analysis methods that, in turn, can be used to infer functional relationships. While these studies have become quite common, they are limited in that many of them do not consider how the system changes as a function of time. The lack of the time component is primarily due to the expense and effort that are required to measure hundreds or thousands of biomolecules on a second-to-second, minute-to-minute, or hour-to-hour timescale. As the cost and inconvenience of measurement approaches decrease, it is important to have available systems analysis methods that are able to directly address the change of a system over time, in addition to its biomolecular interactions. In this paper, Wang et al. present an extended Kalman filter (EKF) approach to modeling the nonlinear dynamics of gene regulatory networks. Kalman filters are used in time-series analysis to estimate the state of discrete systems, using linear stochastic difference equations. Their EKF approach can handle nonlinear time series, which is critical for most complex biological systems. The EKF approach is based on a series of time-varying linearizations that approximate the nonlinear system. The authors apply this approach to four different gene expression time series, ranging from 98 to 530 genes, measured over eight to 123 time steps. They demonstrate that the EKF approach is able to identify the parameters for each of these gene expression systems. The adaptation of time-series modeling approaches such as EKF to the study of complex biological systems will play an important role in advancing the field of systems biology. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Information & Contributors

Information

Published In

cover image IEEE/ACM Transactions on Computational Biology and Bioinformatics
IEEE/ACM Transactions on Computational Biology and Bioinformatics  Volume 6, Issue 3
July 2009
159 pages

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 01 July 2009
Published in TCBB Volume 6, Issue 3

Author Tags

  1. DNA microarray technology
  2. Modeling
  3. clustering
  4. extended Kalman filtering
  5. gene expression
  6. time series data.

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)1
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Experimental Design in Dynamical System Identification: A Bandit-Based Active Learning ApproachMachine Learning and Knowledge Discovery in Databases10.1007/978-3-662-44851-9_20(306-321)Online publication date: 10-Mar-2022
  • (2018)The p53Mdm2 regulation relationship under different radiation doses based on the continuousdiscrete extended Kalman filter algorithmNeurocomputing10.1016/j.neucom.2017.08.016273:C(230-236)Online publication date: 17-Jan-2018
  • (2017)Event-based recursive filtering for time-delayed stochastic nonlinear systems with missing measurementsSignal Processing10.1016/j.sigpro.2016.12.004134:C(158-165)Online publication date: 1-May-2017
  • (2017)Finite-time stability of genetic regulatory networks with impulsive effectsNeurocomputing10.1016/j.neucom.2016.09.017219:C(9-14)Online publication date: 5-Jan-2017
  • (2016)Kriging-Based Parameter Estimation Algorithm for Metabolic Networks Combined with Single-Dimensional Optimization and Dynamic Coordinate PerturbationIEEE/ACM Transactions on Computational Biology and Bioinformatics10.1109/TCBB.2015.250529113:6(1142-1154)Online publication date: 1-Nov-2016
  • (2016)Linear optimal filtering for time-delay networked systems subject to missing measurements with individual occurrence probabilityNeurocomputing10.1016/j.neucom.2016.07.008214:C(767-774)Online publication date: 19-Nov-2016
  • (2015)A maximum a posteriori probability and time-varying approach for inferring gene regulatory networks from time course gene microarray dataIEEE/ACM Transactions on Computational Biology and Bioinformatics10.1109/TCBB.2014.234395112:1(123-135)Online publication date: 1-Jan-2015
  • (2015)Mode-mismatched estimator design for Markov jump genetic regulatory networks with random time delaysNeurocomputing10.1016/j.neucom.2015.05.011168:C(1121-1131)Online publication date: 30-Nov-2015
  • (2015)Finite-time stochastic synchronization of genetic regulatory networksNeurocomputing10.1016/j.neucom.2015.04.064167:C(314-321)Online publication date: 1-Nov-2015
  • (2014)Structure identification for gene regulatory networks via linearization and robust state estimationAutomatica (Journal of IFAC)10.1016/j.automatica.2014.08.00350:11(2765-2776)Online publication date: 1-Nov-2014
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media