-
CARLA: Adjusted common average referencing for cortico-cortical evoked potential data
Authors:
Harvey Huang,
Gabriela Ojeda Valencia,
Nicholas M. Gregg,
Gamaleldin M. Osman,
Morgan N. Montoya,
Gregory A. Worrell,
Kai J. Miller,
Dora Hermes
Abstract:
Human brain connectivity can be mapped by single pulse electrical stimulation during intracranial EEG measurements. The raw cortico-cortical evoked potentials (CCEP) are often contaminated by noise. Common average referencing (CAR) removes common noise and preserves response shapes but can introduce bias from responsive channels. We address this issue with an adjusted, adaptive CAR algorithm termed "CAR by Least Anticorrelation (CARLA)".
CARLA was tested on simulated CCEP data and real CCEP data collected from four human participants. In CARLA, the channels are ordered by increasing mean cross-trial covariance, and iteratively added to the common average until anticorrelation between any single channel and all re-referenced channels reaches a minimum, as a measure of shared noise.
We simulated CCEP data with true responses in 0 to 45 of 50 total channels. We quantified CARLA's error and found that it erroneously included 0 (median) truly responsive channels in the common average with less than or equal to 42 responsive channels, and erroneously excluded less than or equal to 2.5 (median) unresponsive channels at all responsiveness levels. On real CCEP data, signal quality was quantified with the mean R-squared between all pairs of channels, which represents inter-channel dependency and is low for well-referenced data. CARLA re-referencing produced significantly lower mean R-squared than standard CAR, CAR using a fixed bottom quartile of channels by covariance, and no re-referencing.
CARLA minimizes bias in re-referenced CCEP data by adaptively selecting the optimal subset of non-responsive channels. It showed high specificity and sensitivity on simulated CCEP data and lowered inter-channel dependency compared to CAR on real CCEP data.
Submitted 29 September, 2023;
originally announced October 2023.
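The fixed-subset baseline named in the abstract above (CAR using a bottom fraction of channels) can be illustrated with a small sketch. This is not CARLA itself: it has no iterative anticorrelation criterion, and per-channel variance over time stands in for the mean cross-trial covariance, an assumption made only to keep the example short. The function name `car_subset` and its `fraction` parameter are hypothetical.

```python
from statistics import fmean, pvariance


def car_subset(data, fraction=0.25):
    """Re-reference channels to the average of the lowest-variance subset.

    data: list of channels, each a list of samples of equal length.
    Returns (re-referenced data, indices of channels used as reference).
    Assumption: variance over time is a stand-in for responsiveness.
    """
    n_keep = max(1, int(len(data) * fraction))
    # Rank channels from least to most variable; ties keep original order.
    order = sorted(range(len(data)), key=lambda ch: pvariance(data[ch]))
    reference_channels = order[:n_keep]

    # Common average computed from the selected subset only.
    n_samples = len(data[0])
    common = [
        fmean(data[ch][t] for ch in reference_channels)
        for t in range(n_samples)
    ]

    # Subtract the same common signal from every channel.
    rereferenced = [[x - c for x, c in zip(channel, common)] for channel in data]
    return rereferenced, reference_channels


if __name__ == "__main__":
    noise = [1.0, -1.0, 1.0, -1.0]
    response = [0.0, 10.0, 0.0, -10.0]
    channels = [noise[:], noise[:], noise[:],
                [n + r for n, r in zip(noise, response)]]
    cleaned, used = car_subset(channels, fraction=0.5)
    print(used, cleaned[3])
```

Because the responsive channel is excluded from the average, its evoked response is not subtracted from the other channels, which is the bias that standard CAR can introduce.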
-
SpaceTx: A Roadmap for Benchmarking Spatial Transcriptomics Exploration of the Brain
Authors:
Brian Long,
Jeremy Miller,
The SpaceTx Consortium
Abstract:
Mapping spatial distributions of transcriptomic cell types is essential to understanding the brain, with its exceptional cellular heterogeneity and the functional significance of its spatial organization. Spatial transcriptomics techniques are hoped to accomplish these measurements, but each method uses different experimental and computational protocols, with different trade-offs and optimizations. In 2017, the SpaceTx Consortium was formed to compare these methods and determine their suitability for large-scale spatial transcriptomic atlases. SpaceTx work included progress in tissue processing, taxonomy development, gene selection, image processing and data standardization, cell segmentation, cell type assignments, and visualization. Although the landscape of experimental methods has changed dramatically since the beginning of SpaceTx, the need for quantitative and detailed benchmarking of spatial transcriptomics methods in the brain is still unmet. Here, we summarize the work of SpaceTx and highlight outstanding challenges as spatial transcriptomics grows into a mature field. We also discuss how our progress provides a roadmap for benchmarking spatial transcriptomics methods in the future. Data and analyses from this consortium, along with code and methods are publicly available at https://spacetx.github.io/.
Submitted 20 January, 2023;
originally announced January 2023.
-
Approximating quasi-stationary behaviour in network-based SIS dynamics
Authors:
Christopher E. Overton,
Robert R. Wilkinson,
Adedapo Loyinmi,
Joel C. Miller,
Kieran J. Sharkey
Abstract:
Deterministic approximations to stochastic Susceptible-Infectious-Susceptible models typically predict a stable endemic steady-state when above threshold. This can be hard to relate to the underlying stochastic dynamics, which has no endemic steady-state but can exhibit approximately stable behaviour. Here we relate the approximate models to the stochastic dynamics via the definition of the quasi-stationary distribution (QSD), which captures this approximately stable behaviour. We develop a system of ordinary differential equations that approximate the number of infected individuals in the QSD for arbitrary contact networks and parameter values. When the epidemic level is high, these QSD approximations coincide with the existing approximation methods. However, as we approach the epidemic threshold, the models deviate, with these models following the QSD and the existing methods approaching the all susceptible state. Through consistently approximating the QSD, the proposed methods provide a more robust link to the stochastic models.
Submitted 11 August, 2022;
originally announced August 2022.
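The "stable endemic steady-state" that deterministic SIS approximations predict above threshold can be seen in the simplest case. The sketch below integrates the textbook well-mixed mean-field equation dI/dt = βI(1 − I) − γI with forward Euler. It is not the network QSD system developed in the paper, and the function name and default parameters are hypothetical.

```python
def sis_mean_field(beta, gamma, i0=0.01, dt=0.01, t_max=200.0):
    """Integrate dI/dt = beta*I*(1-I) - gamma*I and return the final I.

    I is the infected fraction of a well-mixed population (an assumption;
    the paper above treats arbitrary contact networks).
    """
    i = i0
    for _ in range(int(t_max / dt)):
        # Forward Euler step: new infections minus recoveries.
        i += dt * (beta * i * (1.0 - i) - gamma * i)
    return i


if __name__ == "__main__":
    # Above threshold (beta > gamma) the model settles near 1 - gamma/beta.
    print(sis_mean_field(beta=2.0, gamma=1.0))
    # Below threshold the infected fraction decays towards zero.
    print(sis_mean_field(beta=0.5, gamma=1.0))
```

The stochastic model has no such fixed point, since extinction is eventually certain in a finite population. That gap is what the quasi-stationary distribution is meant to bridge.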
-
Beyond Low Earth Orbit: Biological Research, Artificial Intelligence, and Self-Driving Labs
Authors:
Lauren M. Sanders,
Jason H. Yang,
Ryan T. Scott,
Amina Ann Qutub,
Hector Garcia Martin,
Daniel C. Berrios,
Jaden J. A. Hastings,
Jon Rask,
Graham Mackintosh,
Adrienne L. Hoarfrost,
Stuart Chalk,
John Kalantari,
Kia Khezeli,
Erik L. Antonsen,
Joel Babdor,
Richard Barker,
Sergio E. Baranzini,
Afshin Beheshti,
Guillermo M. Delgado-Aparicio,
Benjamin S. Glicksberg,
Casey S. Greene,
Melissa Haendel,
Arif A. Hamid,
Philip Heller,
Daniel Jamieson
, et al. (31 additional authors not shown)
Abstract:
Space biology research aims to understand fundamental effects of spaceflight on organisms, develop foundational knowledge to support deep space exploration, and ultimately bioengineer spacecraft and habitats to stabilize the ecosystem of plants, crops, microbes, animals, and humans for sustained multi-planetary life. To advance these aims, the field leverages experiments, platforms, data, and model organisms from both spaceborne and ground-analog studies. As research is extended beyond low Earth orbit, experiments and platforms must be maximally autonomous, light, agile, and intelligent to expedite knowledge discovery. Here we present a summary of recommendations from a workshop organized by the National Aeronautics and Space Administration on artificial intelligence, machine learning, and modeling applications which offer key solutions toward these space biology challenges. In the next decade, the synthesis of artificial intelligence into the field of space biology will deepen the biological understanding of spaceflight effects, facilitate predictive modeling and analytics, support maximally autonomous and reproducible experiments, and efficiently manage spaceborne data and metadata, all with the goal to enable life to thrive in deep space.
Submitted 22 December, 2021;
originally announced December 2021.
-
Beyond Low Earth Orbit: Biomonitoring, Artificial Intelligence, and Precision Space Health
Authors:
Ryan T. Scott,
Erik L. Antonsen,
Lauren M. Sanders,
Jaden J. A. Hastings,
Seung-min Park,
Graham Mackintosh,
Robert J. Reynolds,
Adrienne L. Hoarfrost,
Aenor Sawyer,
Casey S. Greene,
Benjamin S. Glicksberg,
Corey A. Theriot,
Daniel C. Berrios,
Jack Miller,
Joel Babdor,
Richard Barker,
Sergio E. Baranzini,
Afshin Beheshti,
Stuart Chalk,
Guillermo M. Delgado-Aparicio,
Melissa Haendel,
Arif A. Hamid,
Philip Heller,
Daniel Jamieson,
Katelyn J. Jarvis
, et al. (31 additional authors not shown)
Abstract:
Human space exploration beyond low Earth orbit will involve missions of significant distance and duration. To effectively mitigate myriad space health hazards, paradigm shifts in data and space health systems are necessary to enable Earth-independence, rather than Earth-reliance. Promising developments in the fields of artificial intelligence and machine learning for biology and health can address these needs. We propose an appropriately autonomous and intelligent Precision Space Health system that will monitor, aggregate, and assess biomedical statuses; analyze and predict personalized adverse health outcomes; adapt and respond to newly accumulated data; and provide preventive, actionable, and timely insights to individual deep space crew members and iterative decision support to their crew medical officer. Here we present a summary of recommendations from a workshop organized by the National Aeronautics and Space Administration, on future applications of artificial intelligence in space biology and health. In the next decade, biomonitoring technology, biomarker science, spacecraft hardware, intelligent software, and streamlined data management must mature and be woven together into a Precision Space Health system to enable humanity to thrive in deep space.
Submitted 22 December, 2021;
originally announced December 2021.
-
Questioning the use of global estimates of reproduction numbers, with implications for policy
Authors:
Pratyush K. Kollepara,
Joel C. Miller
Abstract:
The basic reproduction number, $R_0$, is an important and widely used concept in the study of infectious diseases. We briefly review the recent trend of calculating the average of various $R_0$ estimates in systematic reviews aimed at estimating the basic reproduction number of SARS-CoV-2, and discuss the drawbacks and implications of using such averaging methods. Additionally, we argue that even a theoretically grounded approach such as the next generation matrix could have practical impediments in its use. More generally, the practice of associating an infectious disease with a single value of $R_0$ is problematic when the disease can, in fact, have different reproduction numbers in various populations.
Submitted 10 December, 2021;
originally announced December 2021.
-
Competition, Trait Variance Dynamics, and the Evolution of a Species' Range
Authors:
Farshad Shirani,
Judith R. Miller
Abstract:
Geographic ranges of communities of species evolve in response to environmental, ecological, and evolutionary forces. Understanding the effects of these forces on species' range dynamics is a major goal of spatial ecology. Previous mathematical models have jointly captured the dynamic changes in species' population distributions and the selective evolution of fitness-related phenotypic traits in the presence of an environmental gradient. These models inevitably include some unrealistic assumptions, and biologically reasonable ranges of values for their parameters are not easy to specify. As a result, simulations of the seminal models of this type can lead to markedly different conclusions about the behavior of such populations, including the possibility of maladaptation setting stable range boundaries. Here, we harmonize such results by developing and simulating a continuum model of range evolution in a community of species that interact competitively while diffusing over an environmental gradient. Our model extends existing models by incorporating both competition and freely changing intraspecific trait variance. Simulations of this model predict a spatial profile of species' trait variance that is consistent with experimental measurements available in the literature. Moreover, they reaffirm interspecific competition as an effective factor in limiting species' ranges, even when trait variance is not artificially constrained. These theoretical results can inform the design of, as yet rare, empirical studies to clarify the evolutionary causes of range stabilization.
Submitted 24 November, 2021; v1 submitted 12 April, 2021;
originally announced April 2021.
-
Serial Electron Diffraction Data Processing with diffractem and CrystFEL
Authors:
Robert Bücker,
Pascal Hogan-Lamarre,
R. J. Dwayne Miller
Abstract:
Serial electron diffraction (SerialED) is an emerging technique, which applies the snapshot data-collection mode of serial X-ray crystallography to three-dimensional electron diffraction (3D ED), forgoing the conventional rotation method. Similarly to serial X-ray crystallography, this approach leads to almost complete absence of radiation damage effects even for the most sensitive samples, and allows for a high level of automation. However, SerialED also necessitates new techniques of data processing, which combine existing pipelines for rotation electron diffraction and serial X-ray crystallography with some more particular solutions for challenges arising in SerialED specifically. Here, we introduce our analysis pipeline for SerialED data, and its implementation using the CrystFEL and diffractem program packages. Detailed examples are provided in extensive supplementary code.
Submitted 5 November, 2020;
originally announced November 2020.
-
Mediating Ribosomal Competition by Splitting Pools
Authors:
Jared Miller,
M. Ali Al-Radhawi,
Eduardo D. Sontag
Abstract:
Synthetic biology constructs often rely upon the introduction of "circuit" genes into host cells, in order to express novel proteins and thus endow the host with a desired behavior. The expression of these new genes "consumes" existing resources in the cell, such as ATP, RNA polymerase, amino acids, and ribosomes. Ribosomal competition among strands of mRNA may be described by a system of nonlinear ODEs called the Ribosomal Flow Model (RFM). The competition for resources between host and circuit genes can be ameliorated by splitting the ribosome pool by use of orthogonal ribosomes, where the circuit genes are exclusively translated by mutated ribosomes. In this work, the RFM system is extended to include orthogonal ribosome competition. This Orthogonal Ribosomal Flow Model (ORFM) is proven to be stable through the use of Robust Lyapunov Functions. The optimization problem of maximizing the weighted protein translation rate by adjusting allocation of ribosomal species is formulated and implemented.
Submitted 4 September, 2020; v1 submitted 1 September, 2020;
originally announced September 2020.
-
The impact of network properties and mixing on control measures and disease-induced herd immunity in epidemic models: a mean-field model perspective
Authors:
Francesco Di Lauro,
Luc Berthouze,
Matthew D. Dorey,
Joel C. Miller,
István Z. Kiss
Abstract:
The contact structure of a population plays an important role in transmission of infection. Many "structured models" capture aspects of the contact structure through an underlying network or a mixing matrix. An important observation in such models is that once a fraction $1-1/\mathcal{R}_0$ has been infected, the residual susceptible population can no longer sustain an epidemic. A recent observation of some structured models is that this threshold can be crossed with a smaller fraction of infected individuals, because the disease acts like a targeted vaccine, preferentially immunizing higher-risk individuals who play a greater role in transmission. Therefore, a limited "first wave" may leave behind a residual population that cannot support a second wave once interventions are lifted. In this paper, we systematically analyse a number of mean-field models for networks and other structured populations to address issues relevant to the Covid-19 pandemic. In particular, we consider herd-immunity under several scenarios. We confirm that, in networks with high degree heterogeneity, the first wave confers herd-immunity with significantly fewer infections than equivalent models with lower degree heterogeneity. However, if the intervention is modelled as a change in the contact network, this effect might become more subtle. Indeed, modifying the structure can shield highly connected nodes from becoming infected during the first wave and make the second wave more substantial. We confirm this finding by using an age-structured compartmental model parameterised with real data and comparing lockdown periods implemented either as a global scaling of the mixing matrix or age-specific structural changes. We find that results regarding herd immunity levels are strongly dependent on the model, the duration of lockdown and how lockdown is implemented.
Submitted 14 July, 2020;
originally announced July 2020.
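The classical threshold $1-1/\mathcal{R}_0$ quoted in the abstract above follows from requiring that the effective reproduction number, $\mathcal{R}_0$ times the susceptible fraction, drop below one. A minimal sketch of that arithmetic, assuming homogeneous mixing (the abstract's point is that structured populations can cross the threshold earlier); the function name is hypothetical:

```python
def herd_immunity_threshold(r0):
    """Fraction that must be immune so that r0 * susceptible_fraction < 1.

    Assumes a homogeneously mixing population. Returns 0.0 when r0 <= 1,
    since no epidemic can be sustained in that case.
    """
    if r0 <= 1.0:
        return 0.0
    return 1.0 - 1.0 / r0


if __name__ == "__main__":
    for r0 in (1.5, 2.0, 4.0):
        print(r0, herd_immunity_threshold(r0))
```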
-
Deep Reinforcement Learning and its Neuroscientific Implications
Authors:
Matthew Botvinick,
Jane X. Wang,
Will Dabney,
Kevin J. Miller,
Zeb Kurth-Nelson
Abstract:
The emergence of powerful artificial intelligence is defining new research directions in neuroscience. To date, this research has focused largely on deep neural networks trained using supervised learning, in tasks such as image classification. However, there is another area of recent AI work which has so far received less attention from neuroscientists, but which may have profound neuroscientific implications: deep reinforcement learning. Deep RL offers a comprehensive framework for studying the interplay among learning, representation and decision-making, offering to the brain sciences a new set of research tools and a wide range of novel hypotheses. In the present review, we provide a high-level introduction to deep RL, discuss some of its initial applications to neuroscience, and survey its wider implications for research on brain and behavior, concluding with a list of opportunities for next-stage research.
Submitted 7 July, 2020;
originally announced July 2020.
-
Key Questions for Modelling COVID-19 Exit Strategies
Authors:
Robin N Thompson,
T Deirdre Hollingsworth,
Valerie Isham,
Daniel Arribas-Bel,
Ben Ashby,
Tom Britton,
Peter Challoner,
Lauren H K Chappell,
Hannah Clapham,
Nik J Cunniffe,
A Philip Dawid,
Christl A Donnelly,
Rosalind Eggo,
Sebastian Funk,
Nigel Gilbert,
Julia R Gog,
Paul Glendinning,
William S Hart,
Hans Heesterbeek,
Thomas House,
Matt Keeling,
Istvan Z Kiss,
Mirjam Kretzschmar,
Alun L Lloyd,
Emma S McBryde
, et al. (18 additional authors not shown)
Abstract:
Combinations of intense non-pharmaceutical interventions ('lockdowns') were introduced in countries worldwide to reduce SARS-CoV-2 transmission. Many governments have begun to implement lockdown exit strategies that allow restrictions to be relaxed while attempting to control the risk of a surge in cases. Mathematical modelling has played a central role in guiding interventions, but the challenge of designing optimal exit strategies in the face of ongoing transmission is unprecedented. Here, we report discussions from the Isaac Newton Institute 'Models for an exit strategy' workshop (11-15 May 2020). A diverse community of modellers who are providing evidence to governments worldwide were asked to identify the main questions that, if answered, will allow for more accurate predictions of the effects of different exit strategies. Based on these questions, we propose a roadmap to facilitate the development of reliable models to guide exit strategies. The roadmap requires a global collaborative effort from the scientific community and policy-makers, and is made up of three parts: i) improve estimation of key epidemiological parameters; ii) understand sources of heterogeneity in populations; iii) focus on requirements for data collection, particularly in Low-to-Middle-Income countries. This will provide important information for planning exit strategies that balance socio-economic benefits with public health.
Submitted 21 July, 2020; v1 submitted 21 June, 2020;
originally announced June 2020.
-
Common Cell type Nomenclature for the mammalian brain: A systematic, extensible convention
Authors:
Jeremy A. Miller,
Nathan W. Gouwens,
Bosiljka Tasic,
Forrest Collman,
Cindy T. J. van Velthoven,
Trygve E. Bakken,
Michael J. Hawrylycz,
Hongkui Zeng,
Ed S. Lein,
Amy Bernard
Abstract:
The advancement of single cell RNA-sequencing technologies has led to an explosion of cell type definitions across multiple organs and organisms. While standards for data and metadata intake are arising, organization of cell types has largely been left to individual investigators, resulting in widely varying nomenclature and limited alignment between taxonomies. To facilitate cross-dataset comparison, the Allen Institute created the Common Cell type Nomenclature (CCN) for matching and tracking cell types across studies that is qualitatively similar to gene transcript management across different genome builds. The CCN can be readily applied to new or established taxonomies and was applied herein to diverse cell type datasets derived from multiple quantifiable modalities. The CCN facilitates assigning accurate yet flexible cell type names in the mammalian cortex as a step towards community-wide efforts to organize multi-source, data-driven information related to cell type taxonomies from any organism.
Submitted 13 November, 2020; v1 submitted 9 June, 2020;
originally announced June 2020.
-
A Data-Driven Network Model for the Emerging COVID-19 Epidemics in Wuhan, Toronto and Italy
Authors:
Ling Xue,
Shuanglin Jing,
Joel C. Miller,
Wei Sun,
Huafeng Li,
Jose Guillermo Estrada-Franco,
James M Hyman,
Huaiping Zhu
Abstract:
The ongoing Coronavirus Disease 2019 (COVID-19) pandemic threatens the health of humans and causes great economic losses. Predictive modelling and forecasting the epidemic trends are essential for developing countermeasures to mitigate this pandemic. We develop a network model, where each node represents an individual and the edges represent contacts between individuals where the infection can spread. The individuals are classified based on the number of contacts they have each day (their node degrees) and their infection status. The transmission network model was respectively fitted to the reported data for the COVID-19 epidemic in Wuhan (China), Toronto (Canada), and the Italian Republic using a Markov Chain Monte Carlo (MCMC) optimization algorithm. Our model fits all three regions well with narrow confidence intervals and could be adapted to simulate other megacities or regions. The model projections on the role of containment strategies can help inform public health authorities to plan control measures.
Submitted 28 May, 2020;
originally announced May 2020.
-
Stochasticity and heterogeneity in the transmission dynamics of SARS-CoV-2
Authors:
Benjamin M. Althouse,
Edward A. Wenger,
Joel C. Miller,
Samuel V. Scarpino,
Antoine Allard,
Laurent Hébert-Dufresne,
Hao Hu
Abstract:
SARS-CoV-2, which causes COVID-19 disease, has moved rapidly around the globe, infecting millions and killing hundreds of thousands. The basic reproduction number, which has been widely used and misused to characterize the transmissibility of the virus, hides the fact that transmission is stochastic, is dominated by a small number of individuals, and is driven by super-spreading events (SSEs). The distinct transmission features, such as high stochasticity under low prevalence, and the central role played by SSEs in transmission dynamics, should not be overlooked. Many explosive SSEs have occurred in indoor settings, such as long-term care facilities, prisons, meat-packing plants, fish factories, cruise ships, family gatherings, parties and night clubs, stoking the pandemic and shaping its spread. These SSEs demonstrate the urgent need to understand routes of transmission, while presenting an opportunity: outbreaks can be effectively contained with targeted interventions to eliminate SSEs. Here, we describe the potential types of SSEs, how they influence transmission, and give recommendations for control of SARS-CoV-2.
Submitted 27 May, 2020;
originally announced May 2020.
-
A Digital Ecosystem for Animal Movement Science: Making animal movement datasets, data-linkage techniques, methods, and environmental layers easier to find, interpret, and analyze
Authors:
Brendan Hoover,
Gil Bohrer,
Jerod Merkle,
Jennifer A. Miller
Abstract:
Movement is a fundamental aspect of animal life and plays a crucial role in determining the structure of population dynamics, communities, ecosystems, and diversity. In recent years, the recording of animal movements via GPS collars, camera traps, acoustic sensors, and citizen science, along with the abundance of environmental and other ancillary data used by researchers to contextualize those movements, has reached a level of volume, velocity, and variety that puts movement ecology research in the realm of big data science. That data growth has spawned increasingly complex methods for movement analysis. Consequently, animal ecologists need a greater understanding of technical skills such as statistics, geographic information systems (GIS), remote sensing, and coding. Therefore, collaboration has become increasingly crucial, as research requires both domain knowledge and technical expertise. Datasets of animal movement and environmental data are typically available in repositories run by government agencies, universities, and non-governmental organizations (NGOs) with methods described in scientific journals. However, there is little connectivity between these entities. The construction of a digital ecosystem for animal movement science is critically important right now. The digital ecosystem represents a setting where movement data, environmental layers, and analysis methods are discoverable and available for efficient storage, manipulation, and analysis. We argue that such a system will help mature the field of movement ecology by engendering collaboration, facilitating replication, expanding the spatiotemporal range of potential analyses, and limiting redundancy in method development. We describe the key components of the digital ecosystem, the critical challenges that would need addressing, as well as potential solutions to those challenges.
Submitted 27 May, 2020; v1 submitted 13 April, 2020;
originally announced April 2020.
-
EoN (Epidemics on Networks): a fast, flexible Python package for simulation, analytic approximation, and analysis of epidemics on networks
Authors:
Joel C. Miller,
Tony Ting
Abstract:
We provide a description of the Epidemics on Networks (EoN) Python package designed for studying disease spread in static networks. The package consists of over $100$ methods available for users to perform stochastic simulation of a range of different processes including SIS and SIR disease, and generic simple or complex contagions.
Submitted 18 January, 2020; v1 submitted 8 January, 2020;
originally announced January 2020.
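To illustrate the kind of stochastic simulation the EoN entry above refers to, the following is a minimal standard-library sketch of an event-driven (Gillespie-style) SIR epidemic on a static network. It is not EoN's implementation and does not use EoN's API; the function names `ring_lattice` and `gillespie_sir`, the adjacency-dictionary representation, and the parameter values are all assumptions made for illustration.

```python
import random


def ring_lattice(n, k):
    """Build a ring where each node links to its k nearest neighbors on each side.

    Hypothetical helper: returns {node: set(neighbors)}.
    """
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for step in range(1, k + 1):
            j = (i + step) % n
            adj[i].add(j)
            adj[j].add(i)
    return adj


def gillespie_sir(adj, tau, gamma, initial_infected, rng):
    """Event-driven SIR on a static network.

    tau is the per-edge transmission rate, gamma the per-node recovery rate.
    Returns event times and the S, I, R counts after each event.
    """
    status = {node: "S" for node in adj}
    for node in initial_infected:
        status[node] = "I"
    infected = set(initial_infected)
    t = 0.0
    times = [t]
    S, I, R = [len(adj) - len(infected)], [len(infected)], [0]
    while infected:
        # Every susceptible-infected edge transmits at rate tau.
        si_edges = [(u, v) for u in sorted(infected)
                    for v in sorted(adj[u]) if status[v] == "S"]
        recovery_rate = gamma * len(infected)
        total_rate = recovery_rate + tau * len(si_edges)
        # Waiting time to the next event is exponential in the total rate.
        t += rng.expovariate(total_rate)
        if rng.random() < recovery_rate / total_rate:
            node = rng.choice(sorted(infected))
            infected.remove(node)
            status[node] = "R"
            S.append(S[-1])
            I.append(I[-1] - 1)
            R.append(R[-1] + 1)
        else:
            _, node = rng.choice(si_edges)
            status[node] = "I"
            infected.add(node)
            S.append(S[-1] - 1)
            I.append(I[-1] + 1)
            R.append(R[-1])
        times.append(t)
    return times, S, I, R


if __name__ == "__main__":
    rng = random.Random(1)  # fixed seed so the run is repeatable
    times, S, I, R = gillespie_sir(ring_lattice(30, 2), 1.0, 1.0, [0], rng)
    print("final size:", R[-1], "at time", round(times[-1], 3))
```

The sketch rebuilds the list of susceptible-infected edges at every event, which is simple but slow; a package aimed at speed would presumably use a more efficient event queue.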
-
Open Source Software Sustainability Models: Initial White Paper from the Informatics Technology for Cancer Research Sustainability and Industry Partnership Work Group
Authors:
Y. Ye,
R. D. Boyce,
M. K. Davis,
K. Elliston,
C. Davatzikos,
A. Fedorov,
J. C. Fillion-Robin,
I. Foster,
J. Gilbertson,
M. Heiskanen,
J. Klemm,
A. Lasso,
J. V. Miller,
M. Morgan,
S. Pieper,
B. Raumann,
B. Sarachan,
G. Savova,
J. C. Silverstein,
D. Taylor,
J. Zelnis,
G. Q. Zhang,
M. J. Becich
Abstract:
The Sustainability and Industry Partnership Work Group (SIP-WG) is a part of the National Cancer Institute Informatics Technology for Cancer Research (ITCR) program. The charter of the SIP-WG is to investigate options of long-term sustainability of open source software (OSS) developed by the ITCR, in part by developing a collection of business model archetypes that can serve as sustainability plans for ITCR OSS development initiatives. The workgroup assembled models from the ITCR program, from other studies, and via engagement of its extensive network of relationships with other organizations (e.g., Chan Zuckerberg Initiative, Open Source Initiative and Software Sustainability Institute). This article reviews existing sustainability models and describes ten OSS use cases disseminated by the SIP-WG and others, and highlights five essential attributes (alignment with unmet scientific needs, dedicated development team, vibrant user community, feasible licensing model, and sustainable financial model) to assist academic software developers in achieving best practice in software sustainability.
Submitted 1 January, 2020; v1 submitted 27 December, 2019;
originally announced December 2019.
-
Distribution of outbreak sizes for SIR disease in finite populations
Authors:
Joel C Miller
Abstract:
We consider the spread of a Susceptible-Infected-Recovered (SIR) disease through finite populations and derive an expression for the final size distribution. Our derivation allows arbitrary distributions of the number of transmissions caused by an infected individual. We show how this calculation can be used to infer parameters of the infectious disease through observations in multiple small populations. The inference suffers from some identifiability difficulties, and it requires many observations to distinguish between parameter combinations that correspond to the same reproductive number.
Submitted 11 July, 2019;
originally announced July 2019.
-
Fast variables determine the epidemic threshold in the pairwise model with an improved closure
Authors:
István Z. Kiss,
Joel C. Miller,
Péter L. Simon
Abstract:
Pairwise models are used widely to model epidemic spread on networks. These include the modelling of susceptible-infected-removed (SIR) epidemics on regular networks and extensions to SIS dynamics and contact tracing on more exotic networks exhibiting degree heterogeneity, directed and/or weighted links and clustering. However, extra features of the disease dynamics or of the network lead to an increase in system size and analytical tractability becomes problematic. Various `closures' can be used to keep the system tractable. Focusing on SIR epidemics on regular but clustered networks, we show that even for the most complex closure we can determine the epidemic threshold as an asymptotic expansion in terms of the clustering coefficient. We do this by exploiting the presence of a system of fast variables, specified by the correlation structure of the epidemic, whose steady state determines the epidemic threshold. While we do not find the steady state analytically, we create an elegant asymptotic expansion of it. We validate this new threshold by comparing it to the numerical solution of the full system and find excellent agreement over a wide range of values of the clustering coefficient, transmission rate and average degree of the network. The technique carries over to pairwise models with other closures [1] and we note that the epidemic threshold will be model dependent. This emphasises the importance of model choice when dealing with realistic outbreaks.
Submitted 11 September, 2018;
originally announced September 2018.
-
A primer on the use of probability generating functions in infectious disease modeling
Authors:
Joel C. Miller
Abstract:
We explore the application of probability generating functions (PGFs) to invasive processes, focusing on infectious disease introduced into large populations. Our goal is to acquaint the reader with applications of PGFs, more so than to derive new results. PGFs help predict a number of properties about early outbreak behavior while the population is still effectively infinite, including the probability of an epidemic, the size distribution after some number of generations, and the cumulative size distribution of non-epidemic outbreaks. We show how PGFs can be used in both discrete-time and continuous-time settings, and discuss how to use these results to infer disease parameters from observed outbreaks. In the large population limit for susceptible-infected-recovered (SIR) epidemics PGFs lead to survival-function based models that are equivalent to the usual mass-action SIR models but with fewer ODEs. We use these to explore properties such as the final size of epidemics or even the dynamics once stochastic effects are negligible. We target this tutorial to biologists and public health researchers who want to learn how to apply PGFs to invasive diseases, but it could also be used in an introductory mathematics course on PGFs. We include many exercises to help demonstrate concepts and to give practice applying the results. We summarize our main results in a few tables. Additionally we provide a small Python package which performs many of the relevant calculations.
Submitted 9 August, 2018; v1 submitted 14 March, 2018;
originally announced March 2018.
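As a small worked example of the PGF idea in the entry above, the probability of an epidemic can be sketched through the standard branching-process result: the extinction probability is the smallest non-negative solution of q = f(q), where f is the PGF of the number of transmissions per infected individual. The code below is not taken from the authors' package; the function names and the choice of a Poisson offspring distribution with mean `r0` are assumptions made for illustration.

```python
import math


def poisson_pgf(x, r0):
    """PGF of a Poisson offspring distribution with mean r0 (assumed here)."""
    return math.exp(r0 * (x - 1.0))


def epidemic_probability(pgf, iterations=200):
    """Return 1 - q, where q is the smallest non-negative root of q = pgf(q).

    Iterating from q = 0 gives the probability of extinction within
    successive generations, which increases toward the smallest root.
    """
    q = 0.0
    for _ in range(iterations):
        q = pgf(q)
    return 1.0 - q


if __name__ == "__main__":
    for r0 in (0.5, 2.0):
        p = epidemic_probability(lambda x: poisson_pgf(x, r0))
        print(f"R0 = {r0}: epidemic probability is about {p:.4f}")
```

For a Poisson offspring distribution the result coincides with the usual SIR final-size fraction, so a mean of 2 gives an epidemic probability of roughly 0.797, and any mean below 1 gives zero.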
-
Edge-based compartmental modelling of an SIR epidemic on a dual-layer static-dynamic multiplex network with tunable clustering
Authors:
Rosanna C Barnard,
Istvan Z Kiss,
Luc Berthouze,
Joel C Miller
Abstract:
The duration, type and structure of connections between individuals in real-world populations play a crucial role in how diseases invade and spread. Here, we incorporate the aforementioned heterogeneities into a model by considering a dual-layer static-dynamic multiplex network. The static network layer affords tunable clustering and describes an individual's permanent community structure. The dynamic network layer describes the transient connections an individual makes with members of the wider population by imposing constant edge rewiring. We follow the edge-based compartmental modelling approach to derive equations describing the evolution of a susceptible - infected - recovered (SIR) epidemic spreading through this multiplex network of individuals. We derive the basic reproduction number, measuring the expected number of new infectious cases caused by a single infectious individual in an otherwise susceptible population. We validate model equations by showing convergence to pre-existing edge-based compartmental model equations in limiting cases and by comparison with stochastically simulated epidemics. We explore the effects of altering model parameters and multiplex network attributes on resultant epidemic dynamics. We validate the basic reproduction number by plotting its value against associated final epidemic sizes measured from simulation and predicted by model equations for a number of setups. Further, we explore the effect of varying individual model parameters on the basic reproduction number. We conclude with a discussion of the significance and interpretation of the model and its relation to existing research literature. We highlight intrinsic limitations and potential extensions of the present model and outline future research considerations, both experimental and theoretical.
Submitted 4 April, 2018; v1 submitted 4 January, 2018;
originally announced January 2018.
-
A monotonic relationship between the variability of the infectious period and final size in pairwise epidemic modelling
Authors:
Zsolt Vizi,
István Z. Kiss,
Joel C. Miller,
Gergely Röst
Abstract:
For a recently derived pairwise model of network epidemics with non-Markovian recovery, we prove that under some mild technical conditions on the distribution of the infectious periods, smaller variance in the recovery time leads to higher reproduction number, and consequently to a larger epidemic outbreak, when the mean infectious period is fixed. We discuss how this result is related to various stochastic orderings of the distributions of infectious periods. The results are illustrated by a number of explicit stochastic simulations, suggesting that their validity goes beyond regular networks.
Submitted 24 December, 2018; v1 submitted 16 December, 2017;
originally announced December 2017.
-
An elementary proof of the total progeny size of a birth-death process, with application to network component sizes
Authors:
Joel C. Miller
Abstract:
We revisit the size distribution of finite components in infinite Configuration Model networks. We provide an elementary combinatorial proof about the sizes of birth-death trees which is more intuitive than previous proofs. We use this to rederive the component size distribution for Configuration Model networks. Our derivation provides a more intuitive interpretation of the formula as contrasted with the previous derivation based on contour integrations. We demonstrate that the formula performs well, even on networks with heavy tails which violate assumptions of the derivation. We explain why the result should remain robust for these networks.
Submitted 25 October, 2017; v1 submitted 16 October, 2017;
originally announced October 2017.
-
Random Spatial Networks: Small Worlds without Clustering, Traveling Waves, and Hop-and-Spread Disease Dynamics
Authors:
John Lang,
Hans De Sterck,
Jamieson L. Kaiser,
Joel C. Miller
Abstract:
Random network models play a prominent role in modeling, analyzing and understanding complex phenomena on real-life networks. However, a key property of networks is often neglected: many real-world networks exhibit spatial structure, the tendency of a node to select neighbors with a probability depending on physical distance. Here, we introduce a class of random spatial networks (RSNs) which generalizes many existing random network models but adds spatial structure. In these networks, nodes are placed randomly in space and joined in edges with a probability depending on their distance and their individual expected degrees, in a manner that crucially remains analytically tractable. We use this network class to propose a new generalization of small-world networks, where the average shortest path lengths in the graph are small, as in classical Watts-Strogatz small-world networks, but with close spatial proximity of nodes that are neighbors in the network playing the role of large clustering. Small-world effects are demonstrated on these spatial small-world networks without clustering. We are able to derive partial integro-differential equations governing susceptible-infectious-recovered disease spreading through an RSN, and we demonstrate the existence of traveling wave solutions. If the distance kernel governing edge placement decays slower than exponential, the population-scale dynamics are dominated by long-range hops followed by local spread of traveling waves. This provides a theoretical modeling framework for recent observations of how epidemics like Ebola evolve in modern connected societies, with long-range connections seeding new focal points from which the epidemic locally spreads in a wavelike manner.
Submitted 4 February, 2017;
originally announced February 2017.
-
Paradoxes in Leaky Microbial Trade
Authors:
Yoav Kallus,
John H. Miller,
Eric Libby
Abstract:
Microbes produce metabolic resources that are important for cell growth yet leak across membranes into the extracellular environment. Other microbes in the same environment can use these resources and adjust their own metabolic production accordingly---causing other resources to leak into the environment. The combined effect of these processes is an economy in which organismal growth and metabolic production are coupled to others in the community. We propose a model for the co-evolving dynamics of metabolite concentrations, production regulation, and population frequencies for the case of two cell types, each requiring and capable of producing two metabolites. In this model, beneficial trade relations emerge without any coordination, via individual-level production decisions that maximize each cell's growth rate given its perceived environment. As we vary production parameters of the model, we encounter three paradoxical behaviors, where a change that should intuitively benefit some cell type, actually harms it. (1) If a cell type is more efficient than its counterpart at producing a metabolite and becomes even more efficient, its frequency in the population can decrease. (2) If a cell type is less efficient than its counterpart at producing a metabolite but becomes less inefficient, the growth rate of the population can decrease. (3) Finally, if a cell type controls its counterpart's production decisions so as to maximize its own growth rate, the ultimate growth rate it achieves can be lower than if the two cell types each maximized their own growth. These three paradoxes highlight the complex and counter-intuitive dynamics that emerge in simple microbial economies.
Submitted 9 December, 2016;
originally announced December 2016.
-
Saturation Effects and the Concurrency Hypothesis: Insights from an Analytic Model
Authors:
Joel C. Miller,
Anja C. Slim
Abstract:
Sexual partnerships that overlap in time (concurrent relationships) may play a significant role in the HIV epidemic, but the precise effect is unclear. We derive edge-based compartmental models of disease spread in idealized dynamic populations with and without concurrency to allow for an investigation of its effects. Our models assume that partnerships change in time and individuals enter and leave the at-risk population. Infected individuals transmit at a constant per-partnership rate to their susceptible partners. In our idealized populations we find regions of parameter space where the existence of concurrent partnerships leads to substantially faster growth and higher equilibrium levels, but also regions in which the existence of concurrent partnerships has very little impact on the growth or the equilibrium. Additionally we find mixed regimes in which concurrency significantly increases the early growth, but has little effect on the ultimate equilibrium level. Guided by model predictions, we discuss general conditions under which concurrent relationships would be expected to have large or small effects in real-world settings. Our observation that the impact of concurrency saturates suggests that concurrency-reducing interventions may be most effective in populations with low to moderate concurrency.
Submitted 1 November, 2017; v1 submitted 15 November, 2016;
originally announced November 2016.
-
Mean-field models for non-Markovian epidemics on networks: from edge-based compartmental to pairwise models
Authors:
N. Sherborne,
J. C. Miller,
K. B. Blyuss,
I. Z. Kiss
Abstract:
This paper presents a novel extension of the edge-based compartmental model for epidemics with arbitrary distributions of transmission and recovery times. Using the message passing approach we also derive a new pairwise-like model for epidemics with Markovian transmission and an arbitrary recovery period. The new pairwise-like model allows one to formally prove that the message passing and edge-based compartmental models are equivalent in the case of Markovian transmission and arbitrary recovery processes. The edge-based and message passing models are conjectured to also be equivalent for arbitrary transmission processes; we show the first step of a full proof of this. The new pairwise-like model encompasses many existing well-known models that can be obtained by appropriate reductions. It is also amenable to a relatively straightforward numerical implementation. We test the theoretical results by comparing the numerical solutions of the various pairwise-like models to results based on explicit stochastic network simulations.
Submitted 12 November, 2016;
originally announced November 2016.
-
Mathematical models of SIR disease spread with combined non-sexual and sexual transmission routes
Authors:
Joel C. Miller
Abstract:
The emergence of diseases such as Zika and Ebola has highlighted the need to understand the role of sexual transmission in the spread of diseases with a primarily non-sexual transmission route. In this paper we develop a number of low-dimensional models which are appropriate for a range of assumptions for how a disease will spread if it has sexual transmission through a sexual contact network combined with some other transmission mechanism, such as direct contact or vectors. The equations derived provide exact predictions for the dynamics of the corresponding simulations in the large population limit.
Submitted 1 November, 2016; v1 submitted 26 September, 2016;
originally announced September 2016.
-
Estimating Propensity Parameters using Google PageRank and Genetic Algorithms
Authors:
David Murrugarra,
Jacob Miller,
Alex Mueller
Abstract:
Stochastic Boolean networks, or more generally, stochastic discrete networks, are an important class of computational models for molecular interaction networks. The stochasticity stems from the updating schedule. Standard updating schedules include the synchronous update, where all the nodes are updated at the same time, and the asynchronous update where a random node is updated at each time step. The former produces deterministic dynamics while the latter produces stochastic dynamics. A more general stochastic setting considers propensity parameters for updating each node. Stochastic Discrete Dynamical Systems (SDDS) is a modeling framework that considers two propensity parameters for updating each node and uses one when the update has a positive impact on the variable, that is, when the update causes the variable to increase its value, and uses the other when the update has a negative impact, that is, when the update causes it to decrease its value. This framework offers additional features for simulations but also adds complexity to parameter estimation of the propensities. This paper presents a method for estimating the propensity parameters for SDDS. The method is based on adding noise to the system using the Google PageRank approach to make the system ergodic and thus guaranteeing the existence of a stationary distribution. Then with the use of a genetic algorithm, the propensity parameters are estimated. Approximation techniques that make the search algorithms efficient are also presented and Matlab/Octave code to test the algorithms is available at http://www.ms.uky.edu/~dmu228/GeneticAlg/Code.html.
Submitted 17 October, 2016; v1 submitted 1 August, 2016;
originally announced August 2016.
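The PageRank-style noise step mentioned in the entry above can be sketched as follows. This is an illustration in Python rather than the authors' Matlab/Octave code, and it assumes the standard PageRank damping form, in which a row-stochastic transition matrix P is mixed with a uniform jump of weight g. The abstract does not give the exact construction, so the function names and the value of g are hypothetical.

```python
def add_pagerank_noise(P, g):
    """Mix a row-stochastic matrix P with a uniform jump of weight g.

    With 0 < g <= 1 every entry becomes positive, so the resulting chain is
    ergodic and has a unique stationary distribution.
    """
    n = len(P)
    return [[(1.0 - g) * P[i][j] + g / n for j in range(n)] for i in range(n)]


def stationary_distribution(P, iterations=1000):
    """Approximate the stationary distribution by power iteration (pi <- pi P)."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iterations):
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    return pi


if __name__ == "__main__":
    # States 0 and 2 are absorbing, so this chain alone is not ergodic.
    P = [[1.0, 0.0, 0.0],
         [0.5, 0.0, 0.5],
         [0.0, 0.0, 1.0]]
    G = add_pagerank_noise(P, 0.1)
    print([round(p, 4) for p in stationary_distribution(G)])
```

In the method described above, a genetic algorithm would then search over propensity parameters using quantities derived from such a stationary distribution; that search is not shown here.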
-
Malaria elimination campaigns in the Lake Kariba region of Zambia: a spatial dynamical model
Authors:
Milen Nikolov,
Caitlin A. Bever,
Alexander Upfill-Brown,
Busiku Hamainza,
John M. Miller,
Philip A. Eckhoff,
Edward A. Wenger,
Jaline Gerardin
Abstract:
Background As more regions approach malaria elimination, understanding how different interventions interact to reduce transmission becomes critical. The Lake Kariba area of Southern Province, Zambia, is part of a multi-country elimination effort and presents a particular challenge as it is an interconnected region of variable transmission intensities.
Methods In 2012-13, six rounds of mass-screen-and-treat drug campaigns were carried out in the Lake Kariba region. A spatial dynamical model of malaria transmission in the Lake Kariba area, with transmission and climate modeled at the village scale, was calibrated to the 2012-13 prevalence survey data, with case management rates, insecticide-treated net usage, and drug campaign coverage informed by surveillance. The model was used to simulate the effect of various interventions implemented in 2014-22 on reducing regional transmission, achieving elimination by 2022, and maintaining elimination through 2028.
Findings The model captured the spatio-temporal trends of decline and rebound in malaria prevalence in 2012-13 at the village scale. Simulations predicted that elimination required repeated mass drug administrations coupled with simultaneous increase in net usage. Drug campaigns targeted only at high-burden areas were as successful as campaigns covering the entire region.
Interpretation Elimination in the Lake Kariba region is possible through coordinating mass drug campaigns with high-coverage vector control. Targeting regional hotspots is a viable alternative to global campaigns when human migration within an interconnected area is responsible for maintaining transmission in low-burden areas.
Submitted 15 March, 2016;
originally announced March 2016.
-
GELATO and SAGE: An Integrated Framework for MS Annotation
Authors:
Khalifeh AlJadda,
Rene Ranzinger,
Melody Porterfield,
Brent Weatherly,
Mohammed Korayem,
John A. Miller,
Khaled Rasheed,
Krys J. Kochut,
William S. York
Abstract:
Several algorithms and tools have been developed to (semi) automate the process of glycan identification by interpreting Mass Spectrometric data. However, each has limitations when annotating MSn data with thousands of MS spectra using uncurated public databases. Moreover, the existing tools are not designed to manage MSn data where n > 2. We propose a novel software package to automate the annotation of tandem MS data. This software consists of two major components. The first is a free, semi-automated MSn data interpreter called the Glycomic Elucidation and Annotation Tool (GELATO). This tool extends and automates the functionality of existing open source projects, namely, GlycoWorkbench (GWB) and GlycomeDB. The second is a machine learning model called Smart Annotation Enhancement Graph (SAGE), which learns the behavior of glycoanalysts to select annotations generated by GELATO that emulate human interpretation of the spectra.
Submitted 8 January, 2016; v1 submitted 28 December, 2015;
originally announced December 2015.
-
Orthologs from maxmer sequence context
Authors:
Kun Gao,
Jonathan Miller
Abstract:
Context-dependent identification of orthologs customarily relies on conserved gene order or whole-genome sequence alignment. It is shown here that short-range context--as short as single maximal matches--also provides an effective means to identify orthologs within whole genomes. On pristine (un-repeatmasked) mammalian whole-genome assemblies we perform a genome "intersection" that in general consumes less than one thirtieth of the computation time required by commonly used methods for whole-genome alignment, and we extract "non-embedded maximal matches," maximal matches that are not embedded into other maximal matches, as potential orthologs. An ortholog identified via non-embedded maximal matches is analogous to a "positional ortholog" or a "primary ortholog" as defined in previous literature; such orthologs constitute homologs derived from the same direct ancestor whose ancestral positions in the genome are conserved. At the nucleotide level, non-embedded maximal matches recapitulate most exact matches identified by a Lastz net alignment. At the gene level, reciprocal best hits of genes containing non-embedded maximal matches recover one-to-one orthologs annotated by Ensembl Compara with high selectivity and high sensitivity; these reciprocal best hits additionally include putatively novel orthologs not found in Ensembl (e.g. over two thousand for human/chimpanzee). The method is especially suitable for genome-wide identification of orthologs.
Submitted 22 November, 2015; v1 submitted 15 September, 2015;
originally announced September 2015.
-
Inadequate experimental methods and erroneous epilepsy diagnostic criteria result in confounding acquired focal epilepsy with genetic absence epilepsy
Authors:
Raimondo D'Ambrosio,
Clifford L. Eastman,
John W. Miller
Abstract:
Here we provide a thorough discussion of the study conducted by Rodgers et al. (J Neurosci. 2015; 35(24):9194-204. doi: 10.1523/JNEUROSCI.0919-15.2015) to investigate focal seizures and acquired epileptogenesis induced by head injury in the rat. This manuscript serves as a supplementary document for our letter to the Editor to appear in the Journal of Neuroscience. We find that the subject article suffers from poor experimental design, very selective consideration of antecedent literature, and application of inappropriate epilepsy diagnostic criteria which, together, lead to unwarranted conclusions.
Submitted 3 September, 2015;
originally announced September 2015.
-
Optimal population-level infection detection strategies for malaria control and elimination in a spatial model of malaria transmission
Authors:
Jaline Gerardin,
Caitlin A. Bever,
Busiku Hamainza,
John M. Miller,
Philip A. Eckhoff,
Edward A. Wenger
Abstract:
Mass campaigns with antimalarial drugs are potentially a powerful tool for local elimination of malaria, yet current diagnostic technologies are insufficiently sensitive to identify all individuals who harbor infections. At the same time, overtreatment of uninfected individuals increases the risk of accelerating emergence of drug resistance and losing community acceptance. Local heterogeneity in transmission intensity may allow campaign strategies that respond to index cases to successfully target subpatent infections while simultaneously limiting overtreatment. While selective targeting of hotspots of transmission has been proposed as a strategy for malaria control, such targeting has not been tested in the context of malaria elimination. Using household locations, demographics, and prevalence data from a survey of four health facility catchment areas in southern Zambia and an agent-based model of malaria transmission and immunity acquisition, a transmission intensity was fit to each household based on neighborhood age-dependent malaria prevalence. A set of individual infection trajectories was constructed for every household in each catchment area, accounting for heterogeneous exposure and immunity. Various campaign strategies (mass drug administration, mass screen and treat, focal mass drug administration, snowball reactive case detection, pooled sampling, and a hypothetical serological diagnostic) were simulated and evaluated for performance at finding infections, minimizing overtreatment, reducing clinical case counts, and interrupting transmission. For malaria control, presumptive treatment leads to substantial overtreatment without additional morbidity reduction under all but the highest transmission conditions. Selective targeting of hotspots with drug campaigns is an ineffective tool for elimination due to limited sensitivity of available field diagnostics.
Submitted 2 September, 2015;
originally announced September 2015.
-
Equivalence of several generalized percolation models on networks
Authors:
Joel C. Miller
Abstract:
In recent years, many variants of percolation have been used to study network structure and the behavior of processes spreading on networks. These include bond percolation, site percolation, $k$-core percolation, bootstrap percolation, the generalized epidemic process, and the Watts Threshold Model (WTM). We show that --- except for bond percolation --- each of these processes arises as a special case of the WTM and bond percolation arises from a small modification. In fact "heterogeneous $k$-core percolation", a corresponding "heterogeneous bootstrap percolation" model, and the generalized epidemic process are completely equivalent to one another and the WTM. We further show that a natural generalization of the WTM in which individuals "transmit" or "send a message" to their neighbors with some probability less than $1$ can be reformulated in terms of the WTM, and so this apparent generalization is in fact not more general. Finally, we show that in bond percolation, finding the set of nodes in the component containing a given node is equivalent to finding the set of nodes activated if that node is initially activated and the node thresholds are chosen from the appropriate distribution. A consequence of these results is that mathematical techniques developed for the WTM apply to these other models as well, and techniques that were developed for some particular case may in fact apply much more generally.
Submitted 21 August, 2016; v1 submitted 30 April, 2015;
originally announced May 2015.
-
Complex Contagions and hybrid phase transitions in unclustered and clustered random networks
Authors:
Joel C. Miller
Abstract:
A complex contagion is an infectious process in which individuals may require multiple transmissions before changing state. These are used to model behaviors if an individual only adopts a particular behavior after perceiving a consensus among others. We may think of individuals as beginning inactive and becoming active after contact with a sufficient number of active partners. These have been studied in a number of cases, but analytic models for the dynamic spread of complex contagions are typically complex. Here we study the dynamics of the Watts Threshold Model (WTM) assuming transmission occurs in continuous time as a Poisson process, or in discrete time where individuals transmit to all partners in the time step following their activation. We adapt techniques developed for infectious disease modeling to develop and analyze analytic models for the dynamics of the WTM in Configuration Model networks and a class of random clustered (triangle-based) networks. The resulting model is relatively simple and compact. We use it to gain insights into the contagion dynamics. In the infinite population limit, we derive conditions under which cascades happen with an arbitrarily small initial proportion active, confirming a hypothesis of Watts for this case. We also observe hybrid phase transitions when cascades are not possible for small initial conditions, but occur for large enough initial conditions. We derive sufficient conditions for this hybrid phase transition to occur. We show that in many cases, if the hybrid phase transition occurs, all individuals eventually become active. Finally, we discuss the role clustering plays in facilitating or impeding the spread and find that the hypothesis of Watts that was confirmed in Configuration Model networks does not hold in general. This approach allows us to unify many existing disparate observations and derive new results.
Submitted 25 June, 2015; v1 submitted 7 January, 2015;
originally announced January 2015.
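The discrete-time version of the Watts Threshold Model described in this abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the analytic model of the paper: thresholds are taken to be absolute counts of active partners, the network is passed as a plain adjacency dictionary, and the function name `watts_threshold_cascade` is hypothetical.

```python
from collections import defaultdict


def watts_threshold_cascade(adjacency, thresholds, seeds):
    """Synchronous discrete-time threshold cascade (illustrative sketch).

    adjacency:  dict mapping node -> iterable of neighbouring nodes
    thresholds: dict mapping node -> number of transmissions needed to activate
                (assumption: absolute counts, not fractions of the degree)
    seeds:      iterable of initially active nodes
    Returns the final active set and the number of active nodes after each step.
    """
    active = set(seeds)
    newly_active = set(seeds)
    received = defaultdict(int)  # transmissions received so far by each inactive node
    history = [len(active)]

    while newly_active:
        # Each node transmits to all partners in the step after its activation.
        for node in newly_active:
            for neighbour in adjacency[node]:
                if neighbour not in active:
                    received[neighbour] += 1
        # Inactive nodes that have now reached their threshold become active.
        newly_active = {
            node
            for node, count in received.items()
            if node not in active and count >= thresholds[node]
        }
        active |= newly_active
        history.append(len(active))

    return active, history


# Usage: a path 0-1-2-3 where every node needs a single transmission.
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
final_active, sizes = watts_threshold_cascade(path, {n: 1 for n in path}, [0])
```

Raising the threshold of an interior node to 2 on the same path blocks the cascade at that node, which is the "multiple transmissions" behaviour that distinguishes a complex contagion from a simple one.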
-
Molecular insights into Neotropical bird-tick ecological associations and the role of birds in tick-borne disease ecology
Authors:
Matthew J. Miller,
Helen J. Esser,
Jose R. Loaiza,
Edward A. Herre,
Celestino Aguilar,
Diomedes Quintero,
Erick Alvarez,
Eldredge Bermingham
Abstract:
Ticks are important vectors of emerging zoonotic diseases. While adults of many tick species parasitize mammals, immature ticks are often found on wild birds. In the tropics, difficulties in species-level identification of immature ticks hinder studies of tick ecology and tick-borne disease transmission, including any potential role for birds. In Panama, we found immature ticks on 227 out of 3,498 birds representing 93 host species, about 1/8th of the entire Panamanian terrestrial avifauna. Tick parasitism rates did not vary with temperature or rainfall, but parasitism rates did vary with host ecological traits: non-migratory residents, forest dwelling birds, bark insectivores, terrestrial foragers and lowland species were most likely to be infested with ticks. Using a molecular library developed from adult ticks specifically for this study, we identified 130 immature ticks obtained from wild birds, corresponding to eleven tick species, indicating that a substantial portion of the Panamanian avifauna is parasitized by a variety of tick species. Furthermore, we found evidence that immature ticks show taxonomic or ecological specificity to avian hosts. Finally, our data indicate that Panamanian birds are not parasitized regularly by the tick species responsible for most known tick-borne diseases. However, they are frequent hosts of other tick species known to carry a variety of rickettsial parasites of unknown pathogenicity. Given the broad interaction between tick and avian biodiversity in the Neotropics, future work on emerging tropical tick-borne disease should explicitly consider wild birds as vertebrate hosts.
Submitted 24 November, 2014;
originally announced November 2014.
-
Optimizing Hybrid Spreading in Metapopulations
Authors:
Changwang Zhang,
Shi Zhou,
Joel C. Miller,
Ingemar J. Cox,
Benjamin M. Chain
Abstract:
Epidemic spreading phenomena are ubiquitous in nature and society. Examples include the spreading of diseases, information, and computer viruses. Epidemics can spread by local spreading, where infected nodes can only infect a limited set of direct target nodes, and by global spreading, where an infected node can infect every other node. In reality, many epidemics spread using a hybrid mixture of both types of spreading. In this study we develop a theoretical framework for studying hybrid epidemics, and examine the optimum balance between spreading mechanisms in terms of achieving the maximum outbreak size. We show the existence of critically hybrid epidemics where neither spreading mechanism alone can cause a noticeable spread but a combination of the two spreading mechanisms would produce an enormous outbreak. Our results provide new strategies for maximising beneficial epidemics and estimating the worst outcome of damaging hybrid epidemics.
Submitted 31 March, 2015; v1 submitted 25 September, 2014;
originally announced September 2014.
-
Human-chimpanzee alignment: Ortholog Exponentials and Paralog Power Laws
Authors:
Kun Gao,
Jonathan Miller
Abstract:
Genomic subsequences conserved between closely related species such as human and chimpanzee exhibit an exponential length distribution, in contrast to the algebraic length distribution observed for sequences shared between distantly related genomes. We find that the former exponential can be further decomposed into an exponential component primarily composed of orthologous sequences, and a truncated algebraic component primarily composed of paralogous sequences.
Submitted 27 August, 2014; v1 submitted 15 July, 2014;
originally announced July 2014.
-
Convex Analysis of Mixtures for Separating Non-negative Well-grounded Sources
Authors:
Yitan Zhu,
Niya Wang,
David J. Miller,
Yue Wang
Abstract:
Blind Source Separation (BSS) has proven to be a powerful tool for the analysis of composite patterns in engineering and science. We introduce Convex Analysis of Mixtures (CAM) for separating non-negative well-grounded sources, which learns the mixing matrix by identifying the lateral edges of the convex data scatter plot. We prove a sufficient and necessary condition for identifying the mixing matrix through edge detection, which also serves as the foundation for CAM to be applied not only to the exact-determined and over-determined cases, but also to the under-determined case. We show the optimality of the edge detection strategy, even for cases where source well-groundedness is not strictly satisfied. The CAM algorithm integrates plug-in noise filtering using sector-based clustering, an efficient geometric convex analysis scheme, and stability-based model order selection. We demonstrate the principle of CAM on simulated data and numerically mixed natural images. The superior performance of CAM against a panel of benchmark BSS techniques is demonstrated on numerically mixed gene expression data. We then apply CAM to dissect dynamic contrast-enhanced magnetic resonance imaging data taken from breast tumors and time-course microarray gene expression data derived from in-vivo muscle regeneration in mice, both producing biologically plausible decomposition results.
Submitted 10 December, 2015; v1 submitted 27 June, 2014;
originally announced June 2014.
-
Epidemic spread in networks: Existing methods and current challenges
Authors:
Joel C Miller,
Istvan Z Kiss
Abstract:
We consider the spread of infectious disease through contact networks of Configuration Model type. We assume that the disease spreads through contacts and infected individuals recover into an immune state. We discuss a number of existing mathematical models used to investigate this system, and show relations between the underlying assumptions of the models. In the process we offer simplifications of some of the existing models. The distinctions between the underlying assumptions are subtle, and in many if not most cases this subtlety is irrelevant. Indeed, under appropriate conditions the models are equivalent. We compare the benefits and disadvantages of the different models, and discuss their application to other populations (e.g., clustered networks). Finally we discuss ongoing challenges for network-based epidemic modeling.
Submitted 8 March, 2014;
originally announced March 2014.
-
Space use by foragers consuming renewable resources
Authors:
Guillermo Abramson,
Marcelo N Kuperman,
Juan M Morales,
Joel C Miller
Abstract:
We study a simple model of a forager as a walk that modifies a relaxing substrate. Within its simplicity, this provides insight into a number of relevant and non-intuitive facts. Even without memory of the good places to feed and with no explicit cost of moving, we observe the emergence of a finite home range. We characterize the walks and the use of resources in several statistical ways, involving the behavior of the average used fraction of the system, the length of the cycles followed by the walkers, and the frequency of visits to plants. Preliminary results on population effects are explored by means of a system of two animals that do not interact directly. Properties of the overlap of home ranges show the existence of a set of parameters that provides the best utilization of the shared resource.
Submitted 14 January, 2014;
originally announced January 2014.
-
Fragmentation dynamics of DNA sequence duplications
Authors:
M. V. Koroteev,
J. Miller
Abstract:
Motivated by empirical observations of algebraic duplicated sequence length distributions in a broad range of natural genomes, we analytically formulate and solve a class of simple discrete duplication/substitution models that generate steady-states sharing this property. Continuum equations are derived for arbitrary time-independent duplication length source distribution, a limit that we show can be mapped directly onto certain fragmentation models that have been intensively studied by physicists in recent years. Quantitative agreement with simulation is demonstrated. These models account for the algebraic form and exponent of naturally occurring duplication length distributions without the need for fine-tuning of parameters.
Submitted 28 October, 2013; v1 submitted 4 April, 2013;
originally announced April 2013.
-
Co-circulation of infectious diseases on networks
Authors:
Joel C. Miller
Abstract:
We consider multiple diseases spreading in a static Configuration Model network. We make standard assumptions that infection transmits from neighbor to neighbor at a disease-specific rate and infected individuals recover at a disease-specific rate. Infection by one disease confers immediate and permanent immunity to infection by any disease. Under these assumptions, we find a simple, low-dimensional ordinary differential equations model which captures the global dynamics of the infection. The dynamics depend strongly on initial conditions. Although we motivate this article with infectious disease, the model may be adapted to the spread of other infectious agents such as competing political beliefs, rumors, or adoption of new technologies if these are influenced by contacts. As an example, we demonstrate how to model an infectious disease which can be prevented by a behavior change.
Submitted 16 October, 2012;
originally announced October 2012.
-
Epidemics on networks with large initial conditions or changing structure
Authors:
Joel C. Miller
Abstract:
Background: Recently developed techniques to study the spread of infectious diseases through networks make assumptions that the initial proportion infected is infinitesimal and the population behavior is static throughout the epidemic. The models do not apply if the initial proportion is large (and fail whenever R_0<1), and cannot measure the impact of an intervention.
Methods: In this paper we adapt "edge-based compartmental models" to situations having finite-sized initial conditions.
Results: The resulting models remain simple and accurately capture the effect of the initial conditions. It is possible to generalize the model to networks whose partnerships change in time.
Conclusions: The resulting models can be applied to a range of important contexts. The models can be used to choose between different interventions that affect the disease or the population structure.
Submitted 16 August, 2012;
originally announced August 2012.
-
Edge-Based Compartmental Modeling for Infectious Disease Spread Part III: Disease and Population Structure
Authors:
Joel C. Miller,
Erik M. Volz
Abstract:
We consider the edge-based compartmental models for infectious disease spread introduced in Part I. These models allow us to consider standard SIR diseases spreading in random populations. In this paper we show how to handle deviations of the disease or population from the simplistic assumptions of Part I. We allow the population to have structure due to effects such as demographic detail or multiple types of risk behavior, and the disease to have a more complicated natural history. We introduce these modifications in the static network context, though it is straightforward to incorporate them into dynamic networks. We also consider serosorting, which requires using the dynamic network models. The basic methods we use to derive these generalizations are widely applicable, and so it is straightforward to introduce many other generalizations not considered here.
Submitted 30 June, 2011;
originally announced June 2011.
-
Edge-Based Compartmental Modeling for Infectious Disease Spread Part I: An Overview
Authors:
Joel C. Miller,
Anja C. Slim,
Erik M. Volz
Abstract:
The primary tool for predicting infectious disease spread and intervention effectiveness is the mass action Susceptible-Infected-Recovered model of Kermack and McKendrick. Its usefulness derives largely from its conceptual and mathematical simplicity; however, it incorrectly assumes all individuals have the same contact rate and contacts are fleeting. This paper is the first of three investigating edge-based compartmental modeling, a technique eliminating these assumptions. In this paper, we derive simple ordinary differential equation models capturing social heterogeneity (heterogeneous contact rates) while explicitly considering the impact of contact duration. We introduce a graphical interpretation allowing for easy derivation and communication of the model. This paper focuses on the technique and how to apply it in different contexts. The companion papers investigate choosing the appropriate level of complexity for a model and how to apply edge-based compartmental modeling to populations with various sub-structures.
Submitted 30 June, 2011;
originally announced June 2011.
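For reference, the mass action Susceptible-Infected-Recovered model of Kermack and McKendrick that this abstract takes as its baseline can be integrated in a few lines. The sketch below covers only that baseline, not the edge-based compartmental models of the paper. It assumes the usual normalised form dS/dt = -beta*S*I, dI/dt = beta*S*I - gamma*I, dR/dt = gamma*I with population fractions, and the function names and parameter values are illustrative choices.

```python
def sir_rhs(state, beta, gamma):
    """Right-hand side of the mass action SIR equations for fractions (S, I, R)."""
    s, i, _ = state
    new_infections = beta * s * i
    recoveries = gamma * i
    return (-new_infections, new_infections - recoveries, recoveries)


def simulate_sir(s0, i0, r0, beta, gamma, t_end, dt=0.1):
    """Integrate the SIR equations with a fixed-step fourth-order Runge-Kutta scheme."""
    state = (s0, i0, r0)
    for _ in range(int(round(t_end / dt))):
        k1 = sir_rhs(state, beta, gamma)
        k2 = sir_rhs(tuple(x + 0.5 * dt * k for x, k in zip(state, k1)), beta, gamma)
        k3 = sir_rhs(tuple(x + 0.5 * dt * k for x, k in zip(state, k2)), beta, gamma)
        k4 = sir_rhs(tuple(x + dt * k for x, k in zip(state, k3)), beta, gamma)
        state = tuple(
            x + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
            for x, a, b, c, d in zip(state, k1, k2, k3, k4)
        )
    return state


# Usage: illustrative parameters with beta / gamma = 2 and 0.1% initially infected.
s_final, i_final, r_final = simulate_sir(0.999, 0.001, 0.0, beta=0.5, gamma=0.25, t_end=200.0)
```

Every individual in this model shares the single transmission rate `beta`, which is the uniform contact rate assumption that the abstract identifies as incorrect.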
-
Edge-based compartmental modeling for epidemic spread Part II: Model Selection and Hierarchies
Authors:
Joel C. Miller,
Erik M. Volz
Abstract:
We consider the edge-based compartmental models for epidemic spread developed in Part I. We show conditions under which simpler models may be substituted for more detailed models, and in so doing we define a hierarchy of epidemic models. In particular we provide conditions under which it is appropriate to use the standard mass action SIR model, and we show what happens when these conditions fail. Using our hierarchy, we provide a procedure leading to the choice of the appropriate model for a given population. Our result about the convergence of models to the Mass Action model gives clear, rigorous conditions under which the Mass Action model is accurate.
Submitted 30 June, 2011;
originally announced June 2011.
-
An MRI-Derived Definition of MCI-to-AD Conversion for Long-Term, Automatic Prognosis of MCI Patients
Authors:
Yaman Aksu,
David J. Miller,
George Kesidis,
Don C. Bigler,
Qing X. Yang
Abstract:
Alzheimer's disease (AD) and mild cognitive impairment (MCI) continue to be widely studied. While there is no consensus on whether MCIs actually "convert" to AD, the more important question is not whether MCIs convert, but what is the best such definition. We focus on automatic prognostication, nominally using only a baseline image brain scan, of whether an MCI individual will convert to AD within a multi-year period following the initial clinical visit. This is in fact not a traditional supervised learning problem since, in ADNI, there are no definitive labeled examples of MCI conversion. Prior works have defined MCI subclasses based on whether or not clinical/cognitive scores such as CDR significantly change from baseline. There are concerns with these definitions, however, since e.g. most MCIs (and ADs) do not change from a baseline CDR=0.5, even while physiological changes may be occurring. These works ignore rich phenotypical information in an MCI patient's brain scan and labeled AD and Control examples, in defining conversion. We propose an innovative conversion definition, wherein an MCI patient is declared to be a converter if any of the patient's brain scans (at follow-up visits) are classified "AD" by an (accurately-designed) Control-AD classifier. This novel definition bootstraps the design of a second classifier, specifically trained to predict whether or not MCIs will convert. This second classifier thus predicts whether an AD-Control classifier will predict that a patient has AD. Our results demonstrate this new definition leads not only to much higher prognostic accuracy than by-CDR conversion, but also to subpopulations much more consistent with known AD brain region biomarkers. We also identify key prognostic region biomarkers, essential for accurately discriminating the converter and nonconverter groups.
Submitted 28 April, 2011;
originally announced April 2011.
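The labeling rule stated in this abstract (an MCI patient counts as a converter if any follow-up scan is classified "AD" by a Control-AD classifier) can be written down compactly. The sketch below is an illustration of that rule only. The function name `label_converters`, the data layout, and the toy threshold classifier in the usage lines are all hypothetical and stand in for whatever classifier and scan features the authors actually used.

```python
from typing import Callable, Dict, Sequence


def label_converters(
    followup_scans: Dict[str, Sequence[object]],
    classify_scan: Callable[[object], str],
) -> Dict[str, bool]:
    """Label each MCI patient as a converter (True) or nonconverter (False).

    followup_scans: patient id -> scans acquired at follow-up visits
    classify_scan:  a previously trained Control-AD classifier returning
                    "AD" or "Control" for one scan (hypothetical interface)
    """
    return {
        patient_id: any(classify_scan(scan) == "AD" for scan in scans)
        for patient_id, scans in followup_scans.items()
    }


# Usage with a toy stand-in classifier: each "scan" is a single made-up score.
def toy_classifier(scan_score):
    return "AD" if scan_score > 0.5 else "Control"


labels = label_converters(
    {"patient_a": [0.2, 0.4, 0.7], "patient_b": [0.1, 0.3]},
    toy_classifier,
)
```

The resulting labels would then serve as training targets for the second classifier mentioned in the abstract, which predicts conversion from the baseline scan.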