-
LLMs can generate robotic scripts from goal-oriented instructions in biological laboratory automation
Authors:
Takashi Inagaki,
Akari Kato,
Koichi Takahashi,
Haruka Ozaki,
Genki N. Kanda
Abstract:
The use of laboratory automation by all researchers may substantially accelerate scientific activities by humans, including those in the life sciences. However, computer programs to operate robots should be written to implement laboratory automation, which requires technical knowledge and skills that may not be part of a researcher's training or expertise. In the last few years, there has been remarkable development in large language models (LLMs) such as GPT-4, which can generate computer codes based on natural language instructions. In this study, we used LLMs, including GPT-4, to generate scripts for robot operations in biological experiments based on ambiguous instructions. GPT-4 successfully generates scripts for OT-2, an automated liquid-handling robot, from simple instructions in natural language without specifying the robotic actions. Conventionally, translating the nuances of biological experiments into low-level robot actions requires researchers to understand both biology and robotics, imagine robot actions, and write robotic scripts. Our results showed that GPT-4 can connect the context of biological experiments with robot operation through simple prompts with expert-level contextual understanding and inherent knowledge. Replacing robot script programming, which is a tedious task for biological researchers, with natural-language LLM instructions that do not consider robot behavior significantly increases the number of researchers who can benefit from automating biological experiments.
Submitted 18 April, 2023;
originally announced April 2023.
-
Experimentally testable whole brain manifolds that recapitulate behavior
Authors:
Gerald M Pao,
Cameron Smith,
Joseph Park,
Keichi Takahashi,
Wassapon Watanakeesuntorn,
Hiroaki Natsukawa,
Sreekanth H Chalasani,
Tom Lorimer,
Ryousei Takano,
Nuttida Rungratsameetaweemana,
George Sugihara
Abstract:
We propose an algorithm grounded in dynamical systems theory that generalizes manifold learning from a global state representation to a network of local interacting manifolds termed a Generative Manifold Network (GMN). Manifolds are discovered using the convergent cross mapping (CCM) causal inference algorithm and are then compressed into a reduced redundancy network. The representation is a network of manifolds embedded from observational data where each orthogonal axis of a local manifold is an embedding of an individually identifiable neuron or brain area that has exact correspondence in the real world. As such these can be experimentally manipulated to test hypotheses derived from theory and data analysis. Here we demonstrate that this representation preserves the essential features of the brain of flies, larval zebrafish and humans. In addition to accurate near-term prediction, the GMN model can be used to synthesize realistic time series of whole brain neuronal activity and locomotion viewed over the long term. Thus, as a final validation of how well GMN captures essential dynamic information, we show that the artificially generated time series can be used as a training set to predict out-of-sample observed fly locomotion, as well as brain activity in out-of-sample withheld data not used in model building. Remarkably, the artificially generated time series show realistic novel behaviors that do not exist in the training data, but that do exist in the out-of-sample observational data. This suggests that GMN captures inherently emergent properties of the network. We suggest our approach may be a generic recipe for mapping time series observations of any complex nonlinear network into a model that is able to generate naturalistic system behaviors and that identifies variables that have real world correspondence and can be experimentally manipulated.
Submitted 20 June, 2021;
originally announced June 2021.
-
Massively Parallel Causal Inference of Whole Brain Dynamics at Single Neuron Resolution
Authors:
Wassapon Watanakeesuntorn,
Keichi Takahashi,
Kohei Ichikawa,
Joseph Park,
George Sugihara,
Ryousei Takano,
Jason Haga,
Gerald M. Pao
Abstract:
Empirical Dynamic Modeling (EDM) is a nonlinear time series causal inference framework. The latest implementation of EDM, cppEDM, has only been used for small datasets due to computational cost. With the growth of data collection capabilities, there is a great need to identify causal relationships in large datasets. We present mpEDM, a parallel distributed implementation of EDM optimized for modern GPU-centric supercomputers. We improve the original algorithm to reduce redundant computation and optimize the implementation to fully utilize hardware resources such as GPUs and SIMD units. As a use case, we run mpEDM on AI Bridging Cloud Infrastructure (ABCI) using datasets of an entire animal brain sampled at single neuron resolution to identify dynamical causation patterns across the brain. mpEDM is 1,530× faster than cppEDM, and a dataset containing 101,729 neurons was analyzed in 199 seconds on 512 nodes. This is the largest EDM causal inference achieved to date.
Submitted 22 November, 2020;
originally announced November 2020.
-
Signaling activations through G-protein-coupled-receptor aggregations
Authors:
Masaki Watabe,
Hideaki Yoshimura,
Satya N. V. Arjunan,
Kazunari Kaizu,
Koichi Takahashi
Abstract:
Eukaryotic cells transmit extracellular signal information to cellular interiors through the formation of a ternary complex made up of a ligand (or agonist), G-protein, and G-protein coupled receptor (GPCR). Previously formalized theories of ternary complex formation have mainly assumed that observable states of receptors can only take the form of monomers. Here, we propose a multiary complex model of GPCR signaling activations via the vector representation of various unobserved aggregated receptor states. Our results from model simulations imply that receptor aggregation processes can govern cooperative effects in a regime inaccessible by previous theories. In particular, we show how the affinity of ligand-receptor binding can be largely varied by various oligomer formations in the low concentration range of G-protein stimulus.
Submitted 22 September, 2020; v1 submitted 15 April, 2020;
originally announced April 2020.
-
Cooperativity transitions driven by higher-order oligomer formations in ligand-induced receptor dimerization
Authors:
Masaki Watabe,
Satya N. V. Arjunan,
Wei Xiang Chew,
Kazunari Kaizu,
Koichi Takahashi
Abstract:
While cooperativity in ligand-induced receptor dimerization has been linked with receptor-receptor couplings via minimal representations of physical observables, effects arising from higher-order oligomer (e.g., trimer and tetramer) formations of unobserved receptors have received less attention. Here, we propose a dimerization model of ligand-induced receptors in multivalent form representing physical observables under basis vectors of various aggregated receptor-states. Our simulations of multivalent models not only reject Wofsy-Goldstein parameter conditions for cooperativity, but show higher-order oligomer formations can shift cooperativity from positive to negative.
Submitted 13 December, 2019; v1 submitted 27 May, 2019;
originally announced May 2019.
-
Surface reaction-diffusion kinetics on lattice at the microscopic scale
Authors:
Wei-Xiang Chew,
Kazunari Kaizu,
Masaki Watabe,
Sithi V. Muniandy,
Koichi Takahashi,
Satya N. V. Arjunan
Abstract:
Microscopic models of reaction-diffusion processes on the cell membrane can link local spatiotemporal effects to macroscopic self-organized patterns often observed on the membrane. Simulation schemes based on the microscopic lattice method (MLM) can model these processes at the microscopic scale by tracking individual molecules, represented as hard-spheres, on fine lattice voxels. Although MLM is simple to implement and is generally less computationally demanding than off-lattice approaches, its accuracy and consistency in modeling surface reactions have not been fully verified. Using the Spatiocyte scheme, we study the accuracy of MLM in diffusion-influenced surface reactions. We derive the lattice-based bimolecular association rates for two-dimensional surface-surface reaction and one-dimensional volume-surface adsorption according to the Smoluchowski-Collins-Kimball model and random walk theory. We match the time-dependent rates on lattice with off-lattice counterparts to obtain the correct expressions for MLM parameters in terms of physical constants. The expressions indicate that the voxel size needs to be at least 0.6% larger than the molecule to accurately simulate surface reactions on a triangular lattice. On a square lattice, the minimum voxel size should be even larger, at 5%. We also demonstrate the ability of MLM-based schemes such as Spatiocyte to simulate a reaction-diffusion model that involves all dimensions: three-dimensional diffusion in the cytoplasm, two-dimensional diffusion on the cell membrane and one-dimensional cytoplasm-membrane adsorption. With the model, we examine the contribution of the 2D reaction pathway to the overall reaction rate at different reactant diffusivity, reactivity and concentrations.
Submitted 17 November, 2018;
originally announced November 2018.
-
Reaction-diffusion kinetics on lattice at the microscopic scale
Authors:
Wei-Xiang Chew,
Kazunari Kaizu,
Masaki Watabe,
Sithi V. Muniandy,
Koichi Takahashi,
Satya N. V. Arjunan
Abstract:
Lattice-based stochastic simulators are commonly used to study biological reaction-diffusion processes. Some of these schemes that are based on the reaction-diffusion master equation (RDME) can simulate for extended spatial and temporal scales but cannot directly account for the microscopic effects in the cell such as volume exclusion and diffusion-influenced reactions. Nonetheless, schemes based on the high-resolution microscopic lattice method (MLM) can directly simulate these effects by representing each finite-sized molecule explicitly as a random walker on fine lattice voxels. The theory and consistency of MLM in simulating diffusion-influenced reactions have not been clarified in detail. Here, we examine MLM in solving diffusion-influenced reactions in 3D space by employing the Spatiocyte simulation scheme. Applying the random walk theory, we construct the general theoretical framework underlying the method and obtain analytical expressions for the total rebinding probability and the effective reaction rate. By matching Collins-Kimball and lattice-based rate constants, we obtained the exact expressions to determine the reaction acceptance probability and voxel size. We found that the voxel size should be about 2% larger than the molecule. MLM is validated by numerical simulations, showing good agreement with the off-lattice particle-based method, eGFRD. MLM run time is more than an order of magnitude faster than eGFRD when diffusing macromolecules with typical concentrations in the cell. MLM also showed good agreement with eGFRD and mean-field models in case studies of two basic motifs of intracellular signaling, the protein production-degradation process and the dual phosphorylation cycle. Moreover, when a reaction compartment is populated with volume-excluding obstacles, MLM captures the non-classical reaction kinetics caused by anomalous diffusion of reacting molecules.
Submitted 31 August, 2018; v1 submitted 30 May, 2018;
originally announced May 2018.
-
Simulation of live-cell imaging system reveals hidden uncertainties in cooperative binding measurements
Authors:
Masaki Watabe,
Satya N. V. Arjunan,
Wei Xiang Chew,
Kazunari Kaizu,
Koichi Takahashi
Abstract:
We propose a computational method to quantitatively evaluate the systematic uncertainties that arise from undetectable sources in biological measurements using live-cell imaging techniques. We then demonstrate this method in measuring biological cooperativity of molecular binding networks: in particular, ligand molecules binding to cell surface receptor proteins. Our results show how the non-statistical uncertainties lead to invalid identification of the measured cooperativity. Through this computational scheme, the biological interpretation can be more objectively evaluated and understood under a specific experimental configuration of interest.
Submitted 3 July, 2019; v1 submitted 26 February, 2018;
originally announced February 2018.
-
eGFRD in all dimensions
Authors:
Thomas R. Sokolowski,
Joris Paijmans,
Laurens Bossen,
Martijn Wehrens,
Thomas Miedema,
Nils B. Becker,
Kazunari Kaizu,
Koichi Takahashi,
Marileen Dogterom,
Pieter Rein ten Wolde
Abstract:
Biochemical reactions typically occur at low copy numbers, but at once in crowded and diverse environments. Space and stochasticity therefore play an essential role in biochemical networks. Spatial-stochastic simulations have become a prominent tool for understanding how stochasticity at the microscopic level influences the macroscopic behavior of such systems. However, while particle-based models guarantee the level of detail necessary to accurately describe the microscopic dynamics at very low copy numbers, the algorithms used to simulate them oftentimes imply trade-offs between computational efficiency and accuracy. eGFRD (enhanced Green's Function Reaction Dynamics) is an exact algorithm that evades such trade-offs by partitioning the N-particle system into M<N analytically tractable one- and two-particle systems; the analytical solutions (Green's functions) then are used to implement an event-driven particle-based scheme that allows particles to make large jumps in time and space while retaining access to their state variables at any moment. Here we present "eGFRD2", a new eGFRD version that implements the principle of eGFRD in all dimensions, enabling efficient simulation of biochemical reaction-diffusion processes in the 3D cytoplasm, on 2D planes representing membranes, and on 1D elongated cylinders representative of, e.g., cytoskeletal tracks or DNA; in 1D, it also incorporates convective motion used to model active transport. We find that, for low particle densities, eGFRD2 is up to 3 orders of magnitude faster than optimized Brownian Dynamics. We exemplify the capabilities of eGFRD2 by simulating an idealized model of Pom1 gradient formation, which involves 3D diffusion, active transport on microtubules, and autophosphorylation on the membrane, confirming recent results on this system and demonstrating that it can efficiently operate under genuinely stochastic conditions.
Submitted 30 August, 2017;
originally announced August 2017.
-
pSpatiocyte: A Parallel Stochastic Method for Particle Reaction-Diffusion Systems
Authors:
Atsushi Miyauchi,
Kazunari Iwamoto,
Satya Nanda Vel Arjunan,
Koichi Takahashi
Abstract:
Computational systems biology has provided plenty of insights into cell biology. Early on, the focus was on reaction networks between molecular species. Spatial distribution only began to be considered mostly within the last decade. However, calculations were restricted to small systems because of tremendously high computational workloads. To date, application to the cell of typical size with molecular resolution is still far from realization. In this article, we present a new parallel stochastic method for particle reaction-diffusion systems. The program, called pSpatiocyte, was created bearing in mind reaction networks in biological cells operating in crowded intracellular environments as the primary simulation target. pSpatiocyte employs unique discretization and parallelization algorithms based on a hexagonal close-packed lattice for efficient execution particularly on large distributed memory parallel computers. For two-level parallelization, we introduced isolated subdomain and tri-stage lockstep communication for process-level, and voxel-locking techniques for thread-level. We performed a series of parallel runs on RIKEN's K computer. For a fine lattice that had relatively low occupancy, pSpatiocyte achieved 7686 times speedup with 663552 cores relative to 64 cores from the viewpoint of strong scaling and exhibited 74% parallel efficiency. As for weak scaling, efficiencies of at least 60% were observed up to 663552 cores. In addition to computational performance, diffusion and reaction rates were validated by theory and another well-validated program and had good agreement. Lastly, as a preliminary example of real-world applications, we present a calculation of the MAPK model, a typical reaction network motif in cell signaling pathways.
Submitted 12 May, 2016;
originally announced May 2016.
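The strong-scaling figures quoted in the pSpatiocyte abstract above can be cross-checked with a few lines of arithmetic. The sketch below assumes the usual definition of parallel efficiency (measured speedup divided by the ideal speedup, i.e. the ratio of core counts); the abstract does not state which definition the authors used, and the function name `parallel_efficiency` is purely illustrative.

```python
def parallel_efficiency(speedup: float, cores: int, baseline_cores: int) -> float:
    """Strong-scaling efficiency: measured speedup over ideal speedup.

    Assumption: ideal speedup is the ratio of core counts (cores / baseline_cores).
    """
    ideal_speedup = cores / baseline_cores
    return speedup / ideal_speedup


# Values taken from the abstract: 7686x speedup on 663552 cores relative to 64 cores.
eff = parallel_efficiency(speedup=7686, cores=663552, baseline_cores=64)
print(f"{eff:.1%}")  # prints 74.1%
```

Under that assumed definition the result matches the 74% efficiency reported in the abstract.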
-
A computational framework for bioimaging simulation
Authors:
Masaki Watabe,
Satya N. V. Arjunan,
Seiya Fukushima,
Kazunari Iwamoto,
Jun Kozuka,
Satomi Matsuoka,
Yuki Shindo,
Masahiro Ueda,
Koichi Takahashi
Abstract:
Using bioimaging technology, biologists have attempted to identify and document analytical interpretations that underlie biological phenomena in biological cells. Theoretical biology aims at distilling those interpretations into knowledge in the mathematical form of biochemical reaction networks and understanding how higher level functions emerge from the combined action of biomolecules. However, there still remain formidable challenges in bridging the gap between bioimaging and mathematical modeling. Generally, measurements using fluorescence microscopy systems are influenced by systematic effects that arise from the stochastic nature of biological cells, the imaging apparatus, and optical physics. Such systematic effects are always present in all bioimaging systems and hinder quantitative comparison between the cell model and bioimages. Computational tools for such a comparison are still unavailable. Thus, in this work, we present a computational framework for handling the parameters of the cell models and the optical physics governing bioimaging systems. Simulation using this framework can generate digital images of cell simulation results after accounting for the systematic effects. We then demonstrate that such a framework enables comparison at the level of photon-counting units.
Submitted 7 July, 2015; v1 submitted 5 November, 2014;
originally announced November 2014.
-
Membrane clustering and the role of rebinding in biochemical signaling
Authors:
Andrew Mugler,
Aimee Gotway Bailey,
Koichi Takahashi,
Pieter Rein ten Wolde
Abstract:
In many cellular signaling pathways, key components form clusters at the cell membrane. Although much work has focused on the mechanisms behind such cluster formation, the implications for downstream signaling remain poorly understood. Here, motivated by recent experiments, we study via particle-based simulation a covalent modification network in which the activating component is either clustered or randomly distributed on the membrane. We find that while clustering reduces the response of a single-modification network, clustering can enhance the response of a double-modification network. The reduction is a bulk effect: a cluster presents a smaller effective target to a substrate molecule in the bulk. The enhancement, on the other hand, is a local effect: a cluster promotes the rapid rebinding and second activation of singly active substrate molecules. As such, the enhancement relies upon frequent collisions on a short timescale, which leads to a diffusion coefficient at which the enhancement is optimal. We complement simulation with analytic results at both the mean-field and first-passage distribution levels. Our results emphasize the importance of spatially resolved models, showing that significant effects of spatial correlations persist even in spatially averaged quantities such as response curves.
Submitted 25 October, 2011;
originally announced October 2011.
-
Spatio-temporal correlations can drastically change the response of a MAPK pathway
Authors:
Koichi Takahashi,
Sorin Tanase-Nicola,
Pieter Rein ten Wolde
Abstract:
Multisite covalent modification of proteins is omnipresent in eukaryotic cells. A well-known example is the mitogen-activated protein kinase (MAPK) cascade, where in each layer of the cascade a protein is phosphorylated at two sites. It has long been known that the response of a MAPK pathway strongly depends on whether the enzymes that modify the protein act processively or distributively: a distributive mechanism, in which the enzyme molecules have to release the substrate molecules in between the modification of the two sites, can generate an ultrasensitive response and lead to hysteresis and bistability. We study by Green's Function Reaction Dynamics, a stochastic scheme that makes it possible to simulate biochemical networks at the particle level and in time and space, a dual phosphorylation cycle in which the enzymes act according to a distributive mechanism. We find that the response of this network can differ dramatically from that predicted by a mean-field analysis based on the chemical rate equations. In particular, rapid rebindings of the enzyme molecules to the substrate molecules after modification of the first site can markedly speed up the response, and lead to loss of ultrasensitivity and bistability. In essence, rapid enzyme-substrate rebindings can turn a distributive mechanism into a processive mechanism. We argue that slow ADP release by the enzymes can protect the system against these rapid rebindings, thus enabling ultrasensitivity and bistability.
Submitted 3 July, 2009;
originally announced July 2009.
-
Self-organization of feedforward structure and entrainment in excitatory neural networks with spike-timing-dependent plasticity
Authors:
Yuko K. Takahashi,
Hiroshi Kori,
Naoki Masuda
Abstract:
Spike-timing dependent plasticity (STDP) is an organizing principle of biological neural networks. While synchronous firing of neurons is considered to be an important functional block in the brain, how STDP shapes neural networks possibly toward synchrony is not entirely clear. We examine relations between STDP and synchronous firing in spontaneously firing neural populations. Using coupled heterogeneous phase oscillators placed on initial networks, we show numerically that STDP prunes some synapses and promotes formation of a feedforward network. Eventually a pacemaker, which is the neuron with the fastest inherent frequency in our numerical simulations, emerges at the root of the feedforward network. In each oscillatory cycle, a packet of neural activity is propagated from the pacemaker to downstream neurons along layers of the feedforward network. This event occurs above a clear-cut threshold value of the initial synaptic weight. Below the threshold, neurons are self-organized into separate clusters each of which is a feedforward network.
Submitted 20 May, 2009; v1 submitted 5 September, 2008;
originally announced September 2008.