Host–Pathogen Systems Biology

Anderson Borges Fideles

DDT • Volume 11, Number 5/6 • March 2006 REVIEWS Reviews • INFORMATICS Host–pathogen systems biology Christian V. Forst Bioscience Division, Los Alamos National Laboratory, Mailstop M888, P.O. Box 1663, Los Alamos, NM 87545, USA Unlike traditional biological research that focuses on a small set of components, systems biology studies the complex interactions between a large number of genes, proteins and other elements of biological networks and systems. Host–pathogen systems biology examines the interactions between the components of two distinct organisms, either a microbial or viral pathogen and its animal host or two different microbial species in a community. With the availability of complete genomic sequences of various hosts and pathogens, together with breakthroughs in proteomics, metabolomics and other experimental areas, the investigation of host–pathogen systems on a multitude of levels of detail has come within reach. Systems biology is a novel approach to studying, analyzing and – ultimately – controlling biological systems. Unlike traditional research that typically focuses on single genes, systems biology studies complex interactions of all levels of biological information. In light of emerging biological threats, such as the anthrax scare in Florida, USA, in 2001, the re-engineering of smallpox from old sequence data, the emergence of multi-drug resistant pathogens (e.g. Mycobacterium tuberculosis and strains of Staphylococcus) or the lurking danger of a new influenza pandemic of the avian H5N1 strain, one wants to take a step further and expand biological systems to include two organisms – a pathogen and a host. The research of host–pathogen interactions in its broadest definition is a very mature field. It is closely linked to the understanding of the immune system and immune responses [1] (Box 1). Host–pathogen interactions can be interpreted as the battle of two systems. For example, pathogens can hijack host cells and use host cell capabilities to the pathogens’ advantage [2], or they can evolve so rapidly that their sheer diversity overwhelms the immune system, as in the case of HIV infections [3]. The detailed mechanistic analysis of host–pathogen systems, encompassing all aspects of such a multilevel problem – from molecular interactions to organism responses – is still in its infancy. Components and subproblems have been addressed by theoretical and experimental approaches, often focusing on either host-response Corresponding author: Forst C.V. (chris@lanl.gov). 220 www.drugdiscoverytoday.com or pathogen interference by mimicking the missing ‘partner’. Host–pathogen systems biology is the name given to the paradigm of integrating these different types of models into a host–pathogen system over a range of detail of descriptions. The ultimate goal for host–pathogen systems biology is not only the discovery and comprehension of underlying biology, but also the establishment of a robust framework for more efficient drug development and therapeutic intervention. Examples, approaches and perspectives of host–pathogen systems biology are given in this review. Systems biology in drug discovery As recent reviews indicate [4,5], systems biology approaches offer novel strategies to shorten the cumbersome path from identified target to an approved drug. Systems biology provides in silico models for cost-effective decision making during multimillion-dollar drug development programs [6]. The term ‘systems biology’ encompasses many different techniques and models for probing and understanding biological complexity, spanning multiple levels of spatial and temporal scales (Figure 1). Because biological complexity is an exponential function of the number of systems components and the interactions between them, such efforts are currently limited to simple organisms or to specific pathways in higher organisms. Limiting systems biology studies to specific functional subsystems is even more pronounced in host–pathogen system biology, which focuses on more than one organism. Where systems biology is applied to drug discovery, three 1359-6446/06/$ – see front matter ©2006 Elsevier Ltd. All rights reserved. PII: S1359-6446(05)03735-9 DDT • Volume 11, Number 5/6 • March 2006 An overview of the immune system Immune response is an essential defense mechanism against pathogens that is available to most multicellular organisms. Even unicellular organisms, such as the well-known mould Penicillium chrysogenum, produce chemical components to kill pathogens. Chemical agents are, among other defense mechanisms, part of an innate immune response.The following list details the defense mechanisms repertoire of the innate and adaptive immune system: • Innate immune response (plants, animals); • Barriers to pathogen entry; • Mechanical responses to eliminate antigens; • Chemical agents; • Phagocytes; • Fever – elevated temperature inhibits growth of microbes; • Inflammatory responses to attract white blood cells (leukocytes) to the infection site; • Natural killer (NK) cells to kill pathogen-infected and cancer cells; • Adaptive immune response (higher animals); • Synthesis of antibodies to bind antigens and promote their elimination; • T-cell killing of virus-infected cells; • Activation of macrophages to destroy phagocytosed pathogens. The innate and adaptive immune responses are complementary components of multicellular host defense.The innate immune response provides the initial defense against infections with responses occurring within hours after infection. By contrast, the adaptive immune response requires several days to develop after infection. Innate immunity relies on germline-encoded receptors and is limited to some extent in its diversity, although some diversification is achieved by heterodimerization of TLRs or the semi-invariant NKT cells. NKT cells – T cells with the properties of NK cells – blur the distinction between innate and adaptive immunity by using the complex machinery of somatic recombination to produce receptors recognizing a narrow range of antigenic diversity. Conversely, the receptors of the adaptive response that are also produced by somatic recombination of gene segments experience a tremendous diversity.The adaptive immune system also produces memory cells to store receptor information for particular responses. The innate immune response and toll-like receptor pathways The innate immune system is essential for host defense and is responsible for early detection and containment of pathogens.The inflammatory response to pathogens is activated when the phagocyte recognizes the foreign invader using a battery of pattern recognition receptors (PRR), including toll-like receptors (TLRs) [53], members of the C-type lectin receptor family [54], scavenger receptors [55], complement receptors [56] and integrins. Conserved pathogen-specific chemical motifs recognized by these receptors include carbohydrates, glycolipids, glycoproteins, nucleic acids (DNA and double-stranded RNA), proteolipids and proteins. Stimulation of PRRs results in activation of a broad spectrum of interacting signaling pathways, revealing a system of extraordinary complexity. Additional receptors, such as cytokine, chemokine or growth factor receptors, add to the specificity of the immune response. principal approaches can be identified [6]: (i) bioinformatic integration of ‘omics’ data (a bottom-up approach); (ii) integrative mathematical cell models (an intermediate approach); and (iii) computer models of disease or organ system physiology from cell and organ response information available in the literature (a topdown approach to target selection, clinical indication and clinical trial design). These complementary approaches must ultimately be integrated in the pursuit for a hierarchical molecule-to-systemslevel understanding of host–pathogen interactions. Computational systems biology models, methods and tools Scales and models The goal of host–pathogen systems biology is to understand physiology and infectious disease at the level of molecules, cellular networks (e.g. metabolic, regulatory and signaling networks), cells (host cells as well as various viruses and bacterial pathogens), tissues, organs and ultimately whole organisms. A comprehensive systems model can span approximately ten orders of magnitudes in scale, and even more in time (Figure 1). Two distinct strategies for modeling along many levels of description can be recognized, a bottom-up and a top-down approach which are integrated in a third, hybrid strategy. 1. Bottom-up approach It could be argued that a full understanding of a host–pathogen system requires knowledge of all of its components. A bottom-up approach focuses on the measurement and description of complex systems using the building blocks – their interactions and dynamic properties, such as kinetic parameters. In molecular biology, bottom-up modeling started after the genomic revolution with a plethora of ‘omic’ information available. The bottom-up approach can be used to investigate which genes, proteins or phosphorylation states of proteins are expressed or upregulated in an infection process, leading to testable hypotheses that the regulated species are important to disease induction or progression. By integrating of genomic, proteomic and metabolomic data, models have been developed that mechanistically describe intra- and inter-cellular processes (e.g. during drug response or disease progression). 2. Top-down approach The top-down modeling approach attempts to develop integrative and predictive multiscale models of biological processes. A longterm goal would be an in silico model of human–pathogen physiology and infection. However, with the current technology, such modeling focuses on relatively specific problems at particular scales, for example, at the pathway, immune cell system or organ level. 3. Hybrid models Bottom-up models serve as scaffolds for top-down models by providing information of possible and potential interactions and subprocesses, how these subprocesses respond to drugs and infection and how matter and information is passed between subprocesses and through different scales. Such hybrid approaches benefit from bottom-up molecular measurements and knowledge as well as topdown predictive modeling. A ‘postgenomic physiology’ could span many different levels of biology, from molecules to whole organisms, moving away from ‘naïve reductionism’ towards a discipline that fosters integration and synthesis, as Strange [7] envisioned in a recent review. Methods Complementary to the biological hierarchy of host–pathogen systems, methodological descriptions and simulations of such systems have been performed on different level of detail. Interaction networks and network models of biological systems have been studied at the level of: (i) topological connections; (ii) qualitative connections; (iii) quantitative connections; and (iv) higher order interactions, as reviewed by Bower and Bolouri [8]. 1. Topological connections Networks are assembled from interaction data, physical measurements or computational predictions of protein–protein, protein–DNA, www.drugdiscoverytoday.com 221 Reviews • INFORMATICS BOX 1 REVIEWS DDT • Volume 11, Number 5/6 • March 2006 REVIEWS Top-down Approach Bottom-up Approach Bottoms-up Reviews • INFORMATICS Pathways Molecules me Meters Seconds 10-9 10-8 10-6 10-7 Cells 10-6 10-4 10-5 102 Tissues 104 10-3 10-2 Humans 10-1 105 1 108 FIGURE 1 Multiscale approaches in biological systems modeling, from molecules, pathways, cells and cell-cell interactions and tissues to the whole organism. Life takes place on many different temporal and spatial scales.The spatial scale ranges in orders of magnitudes from nanometers, for chemicals and proteins, to meters for the whole body.The temporal scale covers fast biochemical reactions happening in microseconds to the lifespan of an organism in years.The bottom-up approach (‘omics’) focuses on large scale identification of molecular components.The top-down approach (modeling) attempts to form integrative (multilevel) models of human physiology and pathogen infection, which typically focus on relatively specific questions at particular scales, due to the limitations of current technologies. Adapted, with permission, from [6]. protein–small-chemical or other identified interactions; or by inference of correlations between cellular components. Undirected edges indicate interactions between components. Networks of this kind are often referred to as interaction maps or networks. Network biology [9,10] is a method that studies inter- and intracellular networks and their genomic, proteomic and metabolomic foundations. Network biology forms the basis of systems biology by providing information on biological components, their interactions and their functional interplay in biological networks. One particular aspect of network biology focuses on the graph structure of the underlying interaction map by providing quantifiable measures, such as node-degree distribution, mean path length and clustering coefficients, as well as by identifying architectural features, such as the existence of motifs and modules and their hierarchical structure [10]. Such measures are particularly interesting when linked to phenotypic properties of the biological system, such as system survival. Jeong et al. [11] have shown that proteins essential for survival are highly connected in a yeast protein-interaction network. 2. Qualitative connections At the level of qualitative connections, directionality and causality indicate how input nodes affect the output nodes Directional edges indicate causal relationship as well as qualitative interactions (e.g. activating or inhibitory interactions). Qualitative models include metabolic flux models that assume steady state conditions [12,13], models that consider gene regulation [14] or Boolean network models [15,16]. 3. Quantitative connections At the level of quantitative connections, quantitative functions are assigned sets of interactions that describe the dynamic co-dependence of the dynamic behavior of an output, depending on the dynamic behavior of inputs. Methods of choice include power law models, such as S-systems [17,18], reaction kinetics modeled by 222 www.drugdiscoverytoday.com ordinary differential equations [19,20] or stochastic simulations [21,22]. 4. Higher order interactions At the level of higher order interactions and reaction rules, higher level nodes and connections represent abstract concepts that can be expanded into hierarchical sublevel subgraphs based on reaction rules. Examples include signaling networks and metabolic reactions, with context dependent or rule-based interactions and different types of nodes [23]. Other methods Response networks Response networks were first mentioned in connection with biomolecular networks by Magasanik [24]. Groundwork for a systematic theoretical analysis of response networks has been laid by Zien et al. [25] and has been further developed independently by Ideker et al. [26]. The idea behind response network analysis is the analysis of experimental data, such as expression profiles, in the context of biological networks. By a superposition of experimental data with network information, networks are identified that best represent the system response according to the experimental conditions tested. Comparative network analysis In principle, graph comparison is an NP (non-deterministic polynomial-time) hard problem, which typically can only be addressed by exhaustive enumeration techniques. However, methods for comparative network analysis of biological systems have been developed in the past. Such methods have proven to be powerful in several applications including metabolic [27–30] and protein interaction networks [31], as well as correlation of protein interaction networks with gene expression [32]. Recently, a method has been developed to correlate and compare response networks for identification of common and specific responses [33]. DDT • Volume 11, Number 5/6 • March 2006 A plethora of tools and software for biological systems modeling have been developed and are available for download, often free for academic users. Since its development by Hucka and co-workers [34], many modeling tools use the systems biology markup language (SBML; http://sbml.org) for portable model description. Owing to the number of tools available, the potential user is referred to the systems biology website by ‘Kitano’s Symbiotic Systems Project’ (www.systems-biology.org) that lists model editors for graphic-assisted model construction, simulation tools for deterministic and stochastic simulations, analysis tools and utilities. Physiology modeling software is not yet well-integrated with molecular and cellular modeling tools. A list of physiological modeling groups and tools can be found at the Federation of American Scientists website (www.fas.org). Network Models Host–pathogen system models that fall in the ‘omics’ category comprise interaction maps or interaction networks that show components of a network and their interactions for further analysis . Genomic foundation of host–pathogen interactions: a Chlamydia psittaci–Human metabolic interaction network One example of a true host-pathogen network model is the tryptophan (trp) biosynthesis network of a class of obligate intracellular pathogens, Chlamydiae (Figure 1a) [9]. Chlamydia primarily infects mucosal epithelial cells with consecutive infection of subepithelial tissue [35]. Chlamydia infections progress although a lifecycle of three distinct stages. The host is invaded by elementary bodies (EBs), which represent the extracellular infectious stage. After infection, EBs develop into intracellular reticular bodies (RBs), which replicate and further mature into EBs, which then lyse the host cell and initiate another round of pathogen infection. The cycle between EBs, RBs and lysis of host cells characterizes the acute disease state. A third state of development, the persistence state, describes the chronic disease progress. In tissue culture, the persistence state of Chlamydia can be induced by various factors, specifically by introducing interferon-γ (IFN-γ), nutrient limitations or other environmental stress. For example, it is well documented that tryptophan levels in host cells decrease as an effect of IFN-γ [36]. It has also been recognized that tryptophan depletion might have a role in the development of chronic disease conditions [37]. Investigating the tryptophan biosynthesis pathway in a particular Chlamydia species, Chlamydia psittaci, is interesting because it shows the interdependence and connectivity between pathogen and host and thus helps to explain the development of the chronic disease. This pathway assembles an almost complete biosynthetic unit lacking the first step (the conversion of chorismate into anthranilate). Interestingly, genes encoding the enzymatic subunits trpAα and trpAβ that are typically present in tryptophan operons and that are responsible for catalyzing the conversion of chorismate to anthranilate are absent in the C. psittaci tryptophan operon and are not encoded elsewhere in the genome. Instead, the C. psittaci tryptophan operon includes two genes kynU and kprS, both of which are atypical components of the classic tryptophan operon (Figure 2b). This can only be understood through systematic metabolic network analysis. KynU encodes kynureninase, an enzyme that converts kynurenine into anthranilate. KprS codes for 5-phospho-D-ribosyl-1-pyrophosphate (PRPP) synthetase, a component needed in the first steps of tryptophan biosynthesis (Figure 2a). The complete tryptophan network, including the tryptophan-salvage pathway of the host, is shown in Figure 2a. For tryptophan biosynthesis, C. psittaci obtains an alternative source of anthranilate by hijacking the host’s tryptophan depletion pathway by intercepting the byproduct kynurenine. At first, the tryptophan depletion pathway of the host is activated by inducing indoleamine-2,3dioxygenase using IFN-γ (reaction with EC number* 1.13.11.11 in Figure 2a). Then, C. psittaci uses host kynurenine through kynU to produce its own tryptophan, enabling intracellular growth and causing chronic infections. Knowledge of such a metabolic host–pathogen system enables accelerated drug development of successful antibiotics against chronic Chlamydia infection. Dynamic models and simulations Immune–receptor signaling On the host side, mathematical and computational network models have been used to study the process of signaling through receptors of the immune system [38]. Mathematical dynamic models are essentially used on two distinct levels of description: ‘simple models’ and ‘detailed models’ (see earlier section titled ‘Computational systems biology models, methods and tools’). Simple models attempt to capture some features of the signal transduction system but make no attempt at mechanistically describing the signaling cascades that are activated. In these models, the actual signaling cascade is replaced by one or more arbitrary transitions. These models are simple in the sense that they have few components and a simple mathematical description. This implies that such models are also intrinsically more abstract than detailed models – their components and parameters often do not correspond directly to well-defined physical quantities, such as measured binding constants or chemical reaction rates. Nonetheless, simple models can provide insights into the behavior of a system and drive experimental and detailed modeling efforts by suggesting further more-detailed experiments or models. Simple models capture two basic aspects of immune-receptor signaling: serial engagement and serial triggering [39,40], and kinetic proofreading [41]. Serial engagement [42] and kinetic proofreading models [41] associate the measured properties of ligand–receptor interactions with the amplitude of signaling responses, but these models do not describe molecular interactions beyond ligand–receptor binding. Building detailed models of cell signaling cascades involves the selection of a limited set of protein components that participate in signaling. Goldstein et al. [38] propose a three-part protocol for defining such a mathematical model: 1. Selection of a set of components and identification of their interactions based on what is known about the system. 2. Selection of parameters that quantify the cellular concentrations of the components and the strength of the interactions between components (known as rate constants). 3. Selection of a mathematical formulation of a method of simulation. * The classification system for enzymes and biochemical reactions by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB, www.chem.qmul.ac.uk/iubmb/enzyme). www.drugdiscoverytoday.com 223 Reviews • INFORMATICS Tools REVIEWS DDT • Volume 11, Number 5/6 • March 2006 REVIEWS (a) 1–(2–carboxyphenylamino)– 1–deoxy–D–ribulose 5–phosphate TrpC 5.3.1.24 KprS D–ribose 5–phosphate 5–phospho-α− D–ribose 1–diphosphate 2.7.6.1 Reviews • INFORMATICS Pyruvate L–glutamine TrpD 4.1.1.48 N–(5–phospho–D–ribosyl)– Anthranilate 2.4.2.18 L–serine 1–(3–indolyl)glycerol– 3–phosphate 4.2.1.20 TrpB TrpEa/b Intracellular pathogen network in host cell Chorismate 4.1.3.27 Anthranilate L–tryptophan TrpA L–glutamate 3.7.1.3 KynU L–alanine 1.13.11.11 + Interferonγ L–formylkynurenine L–kynurenine 3.5.1.9 1.14.13.9 3–hydroxy–L–kynurenine Host network Formate Key: Tryptophan anabolism in pathogen Tryptophan catabolism/kynurenine anabolism Feeder reactions in pathogen Degradation in host Nonexistant reactions in pathogen 56 (b) Operon organization trpEb 231 trpR –9 –10 –3 –7 12 trpB trpD trpC trpEb trpEa –3 kynU kprS FIGURE 2 Chlamydia psittaci–host trp-metabolic network. (a) A shared trp metabolic network between the pathogen Chlamydia psittaci and a human host.The subnetwork in the oval denotes the pathogen part of the network.Thin red arrows indicate anabolic trp reactions in the pathogen, green arrows are catabolic reactions, blue arrows refer to feeder reaction that provide required compounds [L-alanine (ala), ser and 5-phospho-D-ribosyl-1-pyrophosphate (PRPP)] for trp biosynthesis, and solid black arrows denote degradation reactions in the host.The dashed black arrows are reactions not found in the pathogen.The bold red arrow describes the activation of indoleamine-2,3-dioxygenase (EC 1.13.11.11) by IFN-γ. Reproduced, with permission, from [9]. (b) The operon organization of the trp operon in C. psittaci is shown.The presence of kprS and kynU is unusual in typical trp operons. Numbers above the genes refer to intergenic distances. Reaction-network models are based on the assumption that each species is uniformly distributed throughout the cell. Reactiondiffusion models allow for the variation of species concentrations in different cellular compartments. Using the FcεRI receptor (the high-affinity IgE receptor) as an example, Faeder et al. [23] have developed a detailed signaling model that takes into account downstream components affecting the signaling cascade (Figure 3). Figure 3a shows the four components of the receptor, the ligand (IgE dimer), the receptor (FcεRI) and the two kinases Lyn and Syk. The nine basic interactions are shown in Figure 3b, which include association and dissociation, transphosphorylation (i.e. catalysis of phosphorylation) and dephosphorylation. A surprising aspect of this model is that, because of combinatorial complexity, four components and nine interactions expand to a signaling network with 354 species and 3680 reactions (one particular reaction species is depicted in Figure 3c). The simulation results of the FcεRI signaling model show complex behavior of phosphorylation profiles as a function of the ligand–receptor 224 www.drugdiscoverytoday.com off-rate, because the balance between kinetic proofreading and serial engagement changes, moving down the signaling cascade. Because serial engagement increases with the off-rate, an increase in phosphorylation with off-rate indicates that serial engagement is the dominant effect, whereas a decrease indicates that kinetic proofreading is dominant. Depending on the timely occurrence of a particular phosphorylation event, either kinetic proofreading or serial engagement is dominant. For example, the phosphorylation profile of γ immunoreceptor tyrosine-based activation motif (γ-ITAM) passes through a maximum, which indicates a transition between control by kinetic proofreading and control by serial engagement. Thus, the detailed signaling model shows that kinetic proofreading and serial engagement are emergent properties and the interplay of these mechanisms gives rise to an optimal off rate at which the highest response is achieved. The ultimate goal of immune-receptor signaling models is to understand how the components of a signaling cascade work together to direct cellular responses to changes in the extracellular DDT • Volume 11, Number 5/6 • March 2006 (a) Components (b) Interactions Ligand binding IgE dimer α FcεRI β ITAMs γ2 REVIEWS Association with receptor Transphosphorylation P P P Lyn P P Dephosphorylation Syk Aggregation (c) Reviews • INFORMATICS P Reaction network 354 states and 3680 reactions Four initial concentrations 21 Rate constants = 10 nM P = 4x10 per cell 5 ∗ PLγ P P P State 85 P P P P 4 = 3x10 per cell = 4x105 per cell P P P P P ∗ PLγ P P P P , , or P P P P P ...and 3 more states P P P P ∗ PLγ = 30 s–1 Two reactions with the same rate constant FIGURE 3 A detailed model of early events in FcεεRI signaling. (a) The four components in the model are the IgE dimer, the receptor (FcεRI) and the kinases Lyn and Syk. The two cytosolic domains of the receptor each contain an immunoreceptor tyrosine-based activation motif (ITAM). (b) There are nine basic interactions, five for association–dissociation between signaling components, three transphosphorylation reactions, and one for spontaneous dephosphorylation of phosphorylated sites. (c) All possible combinations between components and basic interactions yields 354 complexes and phosphorylation states, each of which is tracked as separate species.The species are connected by 3680 reactions assembling a large biological network that is defined by a small number of parameters (the initial conditions of 4 proteins and 21 rate constants). One typical species is illustrated along with nine different reactions, of which six are explicitly shown. Reactions seven to nine are generated by using different phosphorylation states of Syk (gray square) to form additional states from the complex in the center to the bottomleft complex (indicated by ‘…and 3 more states’).The states are connected by a large biochemical reaction network (composed of 3680 reactions). A small number of parameters ( the initial concentrations of the 4 proteins and 21 rate constants) define this network because the same rate constant can be used for many similar reactions.The figure shows two of the 24 reaction in which Lyn transphosphorylates γ-ITAM.The p*Lγ indicate the reaction rate of these two reactions. Reproduced, with permission, from [38] environment. Simple and detailed mathematical models have contributed to the understanding of essential host–pathogen signaling events through immune receptors by identifying the fundamental mechanisms that are involved in determining, regulating and therapeutically modifying immune responses. Immune system modeling Complementing the molecular biology modeling approach described earlier, the following models capture the dynamics of the immune response to infectious pathogens at the cellular level in host–pathogen systems biology. A comprehensive review on mathematic modeling techniques of such systems has been presented by Perelson and Weisbuch [43]. The basic idea of viral infection models is simple and lead to the development of viral dynamics as a research field [44]. These infection models consist of three types of cells, target cells (T), infected cells (I), and virus particles (virions, V). Infected cells produce new virus particles at a constant rate p and die at rate δ. Virions are cleared by the immune system at rate c. The rate at which a target cell is infected is k. dT dt = λ − dT − kVT [Eqn 1] dT dt = kVT − δ I [Eqn 2] dT dt = pI − cV [Eqn 3] These equations (Equations 1, 2 and 3), developed by Perelson et al. [45], describe the basic model of viral dynamics and have been used to study primary HIV infection. More-complex models include specific components of the immune systems, such as the cytotoxic T lymphocytes [46], other nontoxic lymphocytes and cytokines–chemokines [47]. These models essentially include specific expressions for resting, active, memory and cytotoxic T cells. Mathematical models of HIV infections have also proved to be useful in exploring the response of HIV to antiviral therapy. Specifically, the evolution and evasion by HIV under selection pressure from the immune system and drug treatments have been extensively studied. A review by Frost [3] discusses the benefit of evolutionary dynamic HIV models for the understanding of HIV response to highly active antiretroviral therapy. Frost specifically discusses the role of such models in the design and analysis of structured treatment-interruption studies in reducing drug toxicity, boosting HIV-specific immune responses and to allowing reversion of drug-resistance mutation in highly drug-experienced patients. Building on such pharmacokinetic models of HIV evolution in human hosts, Dixit and Perelson [48] developed a hybrid model of HIV dynamics under antiretroviral therapy that combines www.drugdiscoverytoday.com 225 DDT • Volume 11, Number 5/6 • March 2006 REVIEWS Reviews • INFORMATICS pharmacokinetics and intracellular delay (the time required for an infected cell to replicate a virus). This model helps to accurately determine the pharmacological delay and the time-dependent efficacy of drug action. predicts a decrease in airway eosinophils without much therapeutic improvement in airway conduction after reduction in interleukin (IL)-5 protein, as observed in clinical trials of an anti-IL-5 antibody in asthmatics. Models of host–pathogen physiology Conclusion A feasible approach to address the computational issues of integrating molecular, cellular and organ levels in a top-down approach is to put in place an organ-level framework and add increasing complexity in a modular format. Regarding the immune response, for example, one could begin with models of host–pathogen interactions that examine cell–cell communications through bacterial secretion systems as well as cytokine networks, and then start replacing the cells in the model, which are initially regarded as ‘black boxes’, with simulation of cell behavior modeled from intracellular network modules (e.g. models of cytoskeleton motility, proliferation or cytokine response and release), ultimately replacing black-box modules with bottom up-approaches. The Physiome Project (www.pysiome.org) [49], conceived by the Commission on Bioengineering in Physiology and presented at the International Union of Physiological Sciences (IUPS) council at their 32nd world congress in Glasgow, UK, in 1993, is a worldwide effort to define the physiome through the development of databases and models that will facilitate the understanding of the integrative function of cells, organs and organisms. The project is focused on compiling and providing a central repository of databases, linking experimental information and computational models from many laboratories into a single self-consistent framework. In the vision of the physiome project, such a framework enables and promotes multiscale modeling of physiological processes. Entelos have developed complex simulations of disease physiology using a framework, PhysioLab (www.entelos.com/science/physiolabtech.html), for determining differential equations based on empirical data in humans [50]. In these models, cells, or even tissues, are represented as black boxes without explicit internal network models that respond to inputs by providing specified dynamic outputs. Using such an organ level framework of disease physiology, Stokes et al. [51] have developed a computational model of chronic asthma that includes interactions between cells and their response to each other and their environment. Different steady states of this disease, such as chronic asthma including chronic eosinophilic inflammation, chronic airway obstruction, airway hyper-responsiveness and elevated IgE levels, can be induced in the model. These in silico asthmatic models respond as expected to various drugs, such as β2-agonists, glucocorticoids and leukotriene antagonists [51]. Furthermore, this model accurately Host–pathogen systems biology is still in its infancy. Although many aspects of host–pathogen systems have been addressed by experimental discoveries and mathematical and computational models, a comprehensive analysis of all aspects of host–pathogen interaction (including pathogen interference, host response, and pathogen response) is a far-fetched goal. Current model systems either focus on host response to pathogen infections or pathogen biology in a simulated host environment, and the size of these model systems depend essentially on the detail of description. They can be large interaction maps (bottom-up models) with thousands of components and interactions, conceptual top-down cellular or organ models consisting of interacting black boxes without detailed knowledge of internal processes within each individual black box, or small mechanistic dynamic models describing a few steps of a much larger system. Efforts are currently underway to combine pathogen action and host response in a comprehensive, multiscale (hybrid) model, merging top-down approaches with ‘omic’ bottom-up approaches [52]. Combined with sophisticated experimental techniques, such as quantitative protein expression, tags by quantum dots for localization and nanobiotechnological measurements on single cells, new insights into the complexity of host–pathogen systems are within reach. Potential applications of host–pathogen systems biology range from biological target identification and drug discovery to bio-threat assessment and personalized health care. As with any modeling approach, theoretical models raise the challenge of experimental validation and the iterative cycle of improvement inherent to the modeling effort. Concerning drug discovery, success stories are still anecdotal – until a given model shows a track record of successful predictions it will be risky to rely on for drug development decisions. For the foreseeable future, modeling predictions will most likely be only one of many inputs into the decision-making process in the pharmaceutical industry. A long-term goal for host–pathogen systems biology, although still in the realm of science fiction, would include a full scale in silico model of an individualized human fighting against pathogenic infections. Acknowledgements Fruitful discussions with, and critical reading of the manuscript by, James Faeder are gratefully acknowledged. References 1 Goldsby, R.A. et al. (1999) Kuby Immunology, 4th edition, W. H. Freeman & Co. 2 Kahn, R. et al. (2002) Cellular hijacking: a common strategy for microbial infection. Trends Biochem. Sci. 27, 308–314 3 Frost, S.D.W. (2002) Dynamics and evoluion of HIV-1 during structired treatment interruptions. AIDS Rev. 4, 119–127 4 Bugrim, A. et al. (2004) Early prediction of drug metabolism and toxicity: systems biology approach and modeling. Drug Discov. Today 9, 127–135 5 Butcher, E.C. (2005) Can cell systems biology rescure drug discovery? Nat. Rev. Drug Discov. 4, 461–467 6 Butcher, E.C. et al. (2004) Systems biology in drug discovery. Nat. Biotechnol. 22, 1253–1259 7 Strange, K. (2005) The end of “naïve reductionism”: rise of systems biology or 226 www.drugdiscoverytoday.com renaissance of physiology? Am. J. Physiol. Cell Physiol. 288, C968–C974 8 Bower, J.M. and Bolouri, H., eds (2001) Computational modeling of genetic and biochemical networks, MIT Press. 9 Forst, C.V. (2002) Network genomics – a novel approach for the analysis of biological systems in the post-genomic era. Mol. Biol. Rep. 29, 265–280 10 Barabási, A-L. and Oltvai, Z.N. (2004) Network biology: understanding the cell’s functional organization. Nat. Rev. Genet. 5, 101–113 11 Jeong, H. et al. (2001) Lethality and centrality in protein networks. Nature 411, 41–42 12 Schilling, C.H. (1999) Toward metabolic phenomics: analysis of genomic data using flux balances. Biotechnol. Prog. 15, 288–295 13 Schuster, S. (2000) A general definition of metabolic pathways useful for 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 systematic organization and analysis of complex metabolic networks. Nat. Biotechnol. 18, 326–332 Covert, M.W. and Palsson, B.O. (2002) Transcriptional regulation in constraintbased metabolic models of Escherichia coli. J. Biol. Chem. 277, 28058–28064 Kauffman, S.A. (1969) Metabolic stability and epigenesis in randomly constructed genetic nets. J. Theor. Biol. 22, 437–467 Kauffman, S.A. (1993) The origins of order: self-organization and selection in evolution, Oxford University Press. Savageau, M.A. (1969) Biological Systems Analysis, II. The steady-state solutions for an n-pool system using a power-law approximation. J. Theor. Biol. 25, 370–379 Savageau, M.A. (1970) Biological Systems Analysis, III. Dynamic solutions using a power-law approximation. J. Theor. Biol. 26, 215–226 Ackers, G.K. et al. (1982) Quantitative model for gene regulation by lambda repressor. Proc. Natl. Acad. Sci. U. S. A. 79, 1129–1133 Novak, B. and Tyson, J.J. (1993) Modeling the cell division cycle: M-phase trigger, oscillations, and size control. J. Theor. Biol. 165, 101–134 Gillespie, D. (1977) Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81, 2340–2361 McAdams, H.H. and Arkin, A. (1998) Simulation of prokaryotic genetic circuits. Annu. Rev. Biophys. Biomol. Struct. 27, 199–224 Faeder, J.R. et al. (2003) Investigation of early events in FcεRI-mediated signaling using a detailed mathematical model. J. Immunol. 170, 3769–3781 Magasanik, B. (1995) Nitrogen response networks of yeast and bacteria. J. Cell Biol. (suppl. 19A), 326–326 Zien, A. et al. (2000) Analysis of gene expression data with pathway scores. In Proceedings of ISMB’00 (Bourne, P. et al., eds.), pp. 407–417, AAAI Press. Ideker, T. et al. (2002) Discovering regulatory and signalling circuits in molecular interaction networks. Bioinformatics 18(Suppl. 1), S233–S240 Dandekar, T. et al. (1999) Pathway alignment: application to the comparative analysis of glycolytic enzymes. Biochem. J. 343, 115–124 Forst, C.V. and Schulten, K. (1999) Evolution of metabolism: a new method for the comparison of metabolic pathways using genomic information. J. Comput. Biol. 6, 343–360 Forst, C.V. and Schulten, K. (2001) Phylogenetic Analysis of Metabolic Pathways. J. Mol. Biol. 52, 471–489 Ogata, H. et al. (2000) A heuristic graph comparison algorithm and its application to detect functionally related enzyme clusters. Nucleic Acids Res. 28, 4021–4028 Kelley, B.P. et al. (2003) Conserved pathways within bacteria and yeast as revealed by global protein network alignment. Proc. Natl. Acad. Sci. U. S. A. 100, 11394–11399 Nakaya, A. et al. (2001) Extraction of Correlated Gene Clusters by Multiple Graph Comparison. Genome Inform. Ser. Workshop Genome Inform. 12, 44–53 Cabusora, L. et al. (2005) Differential network expression during drug and stress response. Bioinformatics 21, 2898–2905 Hucka, M. et al. (2003) The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics 19, 524–531 REVIEWS 35 Campbell, L.A. et al. (1993) Detection of Chlamydia trachomatis deoxyribonucleic acid in women with tubal infertility. Fertil. Steril. 59, 45–50 36 Byrne, G.I. et al. (1986) Induction of tryptophan catabolism is the mechanism for gamma-interferon-mediated inhibition of intracellular Chlamydia psittaci replication in T24 cells. Infect. Immun. 53, 347–351 37 Beatty, W.L. et al. (1994) Persistent Chlamydiae: from cell culture to a paradigm for chlamydial pathogenesis. Microbiol. Rev. 58, 686–699 38 Goldstein, B. et al. (2004) Mathematical and computational models of immunereceptor signaling. Nat. Rev. Immunol. 4, 445–456 39 Valitutti, S. et al. (1995) Serial triggering of many T-cell receptors by a few peptide-MHC complexes. Nature 375, 148–151 40 San José, E. et al. (2000) Triggering the TCR complex causes the downregulation of nonengaged receptors by a signal transduction-dependent mechanism. Immunity 12, 161–170 41 McKeithan, T.W. (1995) Kinetic proofreading in T-cell receptor signal-transduction. Proc. Natl. Acad. Sci. U. S. A. 92, 5042–5046 42 Wofsy, C. et al. (2001) Calculations show substantial serial engagement of T cell receptors. Biophys. J. 80, 606–612 43 Perelson, A. and Weisbuch, G. (1997) Immunology for physicists. Rev. Mod. Phys. 69, 1219–1267 44 Nowak, M.A. and May, R.M. (2000) Virus dynamics: mathematical principles of immunology and virology, Oxford Univ. Press. 45 Perelson, A.S. et al. (1996) HIV-1 dynamics in vivo: virion clearance rate, infected cell life-span, and viral generation time. Science 271, 1582–1586 46 DeBoer, R.J. et al. (2001) Recruitment times, proliferation, and apoptosis rates during the CD8+ T cell response to LCMV. J. Virol. 75, 10663–10669 47 Wodarz, D. et al. (2000) A new theory of cytotoxic T-lymphocyte memory: implications for HIV treatment. Philos. Trans. R. Soc. Lond. B Biol. Sci. 355, 329–343 48 Dixit, N.M. and Perelson, A.S. (2004) Complex patterns of viral load decay under antiretroviral therapy: influence of pharmacokinetics and intracellular delay. J. Theor. Biol. 226, 95–109 49 Hunter, P.J. and Borg, T.K. (2003) Integration from proteins to organs: the Physiome Project. Nat. Rev. Mol. Cell Biol. 4, 237–243 50 Musante, C.J. et al. (2002) Small- and large-scale biosimulation applied to drug discovery and development. Drug Discov. Today 7, S192–S196 51 Stokes, C.L. et al. (2001) A computer model of chronic asthma with applications to clinical studies: example of treatment of exercise-induced asthma. J. Allergy Clin. Immunol. 107, 933 52 Kirschner, D. and Marino, S. (2005) Mycobacterium tuberculosis as viewed through a computer. Trends Microbiol. 13, 206–211 53 Akira, S. and Takeda, K. (2004) Toll-like receptor signaling. Nat. Rev. Immunol. 4, 499–511 54 Gordon, S. (2002) Pattern recognition receptors: doubling up for the innate immune response. Cell 111, 927–930 55 Platt, N. et al. (2002) The many roles of the class A macrophage scavenger receptor. Int. Rev. Cytol. 212, 1–40 56 Ernst, J.D. (1998) Macrophage receptors for Mycobacterium tuberculosis. Infect. Immun. 66, 1277–1281 www.drugdiscoverytoday.com 227 Reviews • INFORMATICS DDT • Volume 11, Number 5/6 • March 2006

RELATED PAPERS

RELATED TOPICS

Log In

Host–Pathogen Systems Biology

Host–Pathogen Systems Biology

Related Papers

RELATED PAPERS

RELATED TOPICS