-
The Modified Combo i3+3 Design for Novel-Novel Combination Dose-Finding Trials in Oncology
Authors:
Jiaxin Liu,
Shijie Yuan,
Qiqi Deng,
Yuan Ji
Abstract:
We consider a modified Ci3+3 (MCi3+3) design for dual-agent dose-finding trials in which both agents are tested at multiple doses. This usually happens when the agents are novel therapies. The MCi3+3 design offers a two-stage or three-stage version, depending on the practical need. The first stage begins with single-agent dose escalation, the second stage launches a model-free combination dose finding for both agents, and optionally, the third stage follows with a model-based design. MCi3+3 aims to maintain a relatively simple framework to facilitate practical application, while also addressing challenges that are unique to novel-novel combination dose finding. Through simulations, we demonstrate that the MCi3+3 design adeptly manages various toxicity scenarios. It exhibits operating characteristics on par with other combination designs, while offering an enhanced safety profile. The design is motivated by and tested for a real-life clinical trial.
Submitted 4 September, 2024; v1 submitted 18 June, 2024;
originally announced June 2024.
-
A Bayesian Estimator of Sample Size
Authors:
Dehua Bi,
Yuan Ji
Abstract:
We consider a Bayesian estimator of sample size (BESS) and an application to oncology dose optimization clinical trials. BESS is built upon three pillars: Sample size, Evidence from observed data, and Confidence in posterior inference. It uses a simple logic of "given the evidence from data, a specific sample size can achieve a degree of confidence in the posterior inference." The key distinction between BESS and standard sample size estimation (SSE) is that SSE, typically based on Frequentist inference, specifies the true parameter values in its calculation, while BESS assumes possible outcomes from the observed data. As a result, the calibration of the sample size is not based on type I or type II error rates, but on posterior probabilities. We demonstrate that BESS leads to a more interpretable statement for investigators, and can easily accommodate prior information as well as sample size re-estimation. We explore its performance in comparison to the standard SSE and demonstrate its usage through a case study of an oncology optimization trial. BESS can be applied to general hypothesis tests. An R tool is available at https://ccte.uchicago.edu/BESS.
Submitted 20 April, 2024; v1 submitted 11 April, 2024;
originally announced April 2024.
-
A Semiparametric Gaussian Mixture Model for Chest CT-based 3D Blood Vessel Reconstruction
Authors:
Qianhan Zeng,
Jing Zhou,
Ying Ji,
Hansheng Wang
Abstract:
Computed tomography (CT) has been a powerful diagnostic tool since its emergence in the 1970s. Using CT data, three-dimensional (3D) structures of human internal organs and tissues, such as blood vessels, can be reconstructed using professional software. This 3D reconstruction is crucial for surgical operations and can serve as a vivid medical teaching example. However, traditional 3D reconstruction heavily relies on manual operations, which are time-consuming, subjective, and require substantial experience. To address this problem, we develop a novel semiparametric Gaussian mixture model tailored for the 3D reconstruction of blood vessels. This model extends the classical Gaussian mixture model by enabling nonparametric variations in the component-wise parameters of interest according to voxel positions. We develop a kernel-based expectation-maximization algorithm for estimating the model parameters, accompanied by a supporting asymptotic theory. Furthermore, we propose a novel regression method for optimal bandwidth selection. The regression method outperforms the conventional cross-validation-based (CV) method in terms of computational and statistical efficiency. In application, this methodology facilitates the fully automated reconstruction of 3D blood vessel structures with remarkable accuracy.
Submitted 28 March, 2024;
originally announced March 2024.
-
BayesFLo: Bayesian fault localization of complex software systems
Authors:
Yi Ji,
Simon Mak,
Ryan Lekivetz,
Joseph Morgan
Abstract:
Software testing is essential for the reliable development of complex software systems. A key step in software testing is fault localization, which uses test data to pinpoint failure-inducing combinations for further diagnosis. Existing fault localization methods, however, are largely deterministic, and thus do not provide a principled approach for assessing probabilistic risk of potential root causes, or for integrating domain and/or structural knowledge from test engineers. To address this, we propose a novel Bayesian fault localization framework called BayesFLo, which leverages a flexible Bayesian model on potential root cause combinations. A key feature of BayesFLo is its integration of the principles of combination hierarchy and heredity, which capture the structured nature of failure-inducing combinations. A critical challenge, however, is the sheer number of potential root cause scenarios to consider, which renders the computation of posterior root cause probabilities infeasible even for small software systems. We thus develop new algorithms for efficient computation of such probabilities, leveraging recent tools from integer programming and graph representations. We then demonstrate the effectiveness of BayesFLo over state-of-the-art fault localization methods, in a suite of numerical experiments and in two motivating case studies on the JMP XGBoost interface.
Submitted 12 March, 2024;
originally announced March 2024.
-
PAM-HC: A Bayesian Nonparametric Construction of Hybrid Control for Randomized Clinical Trials Using External Data
Authors:
Dehua Bi,
Tianjian Zhou,
Wei Zhong,
Yuan Ji
Abstract:
It is highly desirable to borrow information from external data to augment a control arm in a randomized clinical trial, especially in settings where the sample size for the control arm is limited. However, a main challenge in borrowing information from external data is to accommodate potentially heterogeneous subpopulations across the external and trial data. We apply a Bayesian nonparametric model called the Plaid Atoms Model (PAM) to identify overlapping and unique subpopulations across datasets, with which we restrict the information borrowing to the common subpopulations. This forms a hybrid control (HC) that leads to more precise estimation of treatment effects. Simulation studies demonstrate the robustness of the new method, and an application to an Atopic Dermatitis dataset shows improved treatment effect estimation.
Submitted 30 October, 2023;
originally announced October 2023.
-
Pharmacometrics-Enabled DOse OPtimization (PEDOOP) for Seamless Phase I-II Trials in Oncology
Authors:
Shijie Yuan,
Zhanbo Huang,
Jiaxin Liu,
Yuan Ji
Abstract:
We consider a dose-optimization design for first-in-human oncology trials that aims to identify a suitable dose for late-phase drug development. The proposed approach, called the Pharmacometrics-Enabled DOse OPtimization (PEDOOP) design, incorporates observed patient-level pharmacokinetics (PK) measurements and latent pharmacodynamics (PD) information for trial decision making and dose optimization. PEDOOP consists of two seamless phases. In phase I, patient-level time-course drug concentrations, derived PD effects, and the toxicity outcomes from patients are integrated into a statistical model to estimate the dose-toxicity response. A simple dose-finding design guides dose escalation in phase I. At the end of the phase I dose finding, a graduation rule is used to assess the safety and efficacy of all the doses and select those with promising efficacy and acceptable safety for a randomized comparison against a control arm in phase II. In phase II, patients are randomized to the selected doses based on a fixed or adaptive randomization ratio. At the end of phase II, an optimal biological dose (OBD) is selected for late-phase development. We conduct simulation studies to assess the PEDOOP design in comparison to an existing seamless design that also combines phases I and II in a single trial.
Submitted 22 February, 2024; v1 submitted 29 September, 2023;
originally announced September 2023.
-
A Class of Dependent Random Distributions Based on Atom Skipping
Authors:
Dehua Bi,
Yuan Ji
Abstract:
We propose the Plaid Atoms Model (PAM), a novel Bayesian nonparametric model for grouped data. Founded on an idea of "atom skipping", PAM is part of a well-established category of models that generate dependent random distributions and clusters across multiple groups. Atom skipping refers to stochastically assigning zero weights to atoms in an infinite mixture. Deploying atom skipping across groups, PAM produces a dependent clustering pattern with overlapping and non-overlapping clusters across groups. As a result, interpretable posterior inference is possible, such as reporting the posterior probability of a cluster being exclusive to a single group or shared among a subset of groups. We discuss the theoretical properties of the proposed and related models. Minor extensions of the proposed model for multivariate or count data are presented. Simulation studies and applications using real-world datasets illustrate the performance of the new models with comparison to existing models.
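As a rough illustration of the atom-skipping idea only, the Python sketch below draws truncated stick-breaking weights in which each stick proportion is set to zero with some probability, so the corresponding atom receives weight 0 in that group. The function name `skipped_stick_breaking`, the parameters `alpha` and `keep_prob`, and the truncation at `n_atoms` are assumptions made for this example; the abstract does not spell out PAM's actual construction, and this should not be read as the authors' model.

```python
import random


def skipped_stick_breaking(alpha, keep_prob, n_atoms, rng):
    """Draw truncated stick-breaking weights with randomly skipped atoms.

    Hypothetical illustration: each stick proportion is Beta(1, alpha) with
    probability keep_prob and exactly 0 otherwise, so a skipped atom gets
    weight 0. The last atom absorbs the leftover mass (truncation).
    """
    weights = []
    remaining = 1.0
    for _ in range(n_atoms - 1):
        if rng.random() < keep_prob:
            v = rng.betavariate(1.0, alpha)
        else:
            v = 0.0  # atom skipped in this group
        weights.append(remaining * v)
        remaining *= 1.0 - v
    weights.append(remaining)
    return weights


# Two groups share the same atom indices but skip atoms independently.
rng = random.Random(2023)
group_weights = {g: skipped_stick_breaking(1.0, 0.6, 8, rng) for g in ("A", "B")}
shared = [k for k in range(8) if all(w[k] > 0 for w in group_weights.values())]
print("atoms with positive weight in both groups:", shared)
```

Atoms with positive weight in both groups play the role of shared clusters, and atoms positive in only one group play the role of group-specific clusters.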
Submitted 30 December, 2023; v1 submitted 28 April, 2023;
originally announced April 2023.
-
A Multi-Arm Two-Stage (MATS) Design for Proof-of-Concept and Dose Optimization in Early-Phase Oncology Trials
Authors:
Zhenghao Jiang,
Gu Mi,
Ji Lin,
Christelle Lorenzato,
Yuan Ji
Abstract:
The Project Optimus initiative by the FDA's Oncology Center of Excellence is widely viewed as a groundbreaking effort to change the $\textit{status quo}$ of conventional dose-finding strategies in oncology. Unlike in other therapeutic areas where multiple doses are evaluated thoroughly in dose-ranging studies, early-phase oncology dose-finding studies are characterized by the practice of identifying a single dose, such as the maximum tolerated dose (MTD) or the recommended phase 2 dose (RP2D). Following the spirit of Project Optimus, we propose a Multi-Arm Two-Stage (MATS) design for proof-of-concept (PoC) and dose optimization that allows the evaluation of two selected doses from a dose-escalation trial. The design assesses the higher dose first across multiple indications in the first stage, and adaptively enters the second stage for an indication if the higher dose exhibits promising anti-tumor activities. In the second stage, a randomized comparison between the higher and lower doses is conducted to achieve PoC and dose optimization. A Bayesian hierarchical model governs the statistical inference and decision making by borrowing information across doses, indications, and stages. Our simulation studies show that the proposed MATS design yields desirable performance. An R Shiny application has been developed and made available at https://matsdesign.shinyapps.io/mats/.
Submitted 12 April, 2023;
originally announced April 2023.
-
The Backfill i3+3 Design for Dose-Finding Trials in Oncology
Authors:
Jiaxin Liu,
Shijie Yuan,
B. Nebiyou Bekele,
Yuan Ji
Abstract:
We consider a formal statistical design that allows simultaneous enrollment of a main cohort and a backfill cohort of patients in a dose-finding trial. The goal is to accumulate more information at various doses to facilitate dose optimization. The proposed design, called Bi3+3, combines the simple dose-escalation algorithm in the i3+3 design and a model-based inference under the framework of probability of decisions (POD), both previously published. As a result, Bi3+3 provides a simple algorithm for backfilling patients to lower doses in a dose-finding trial once these doses exhibit an acceptable safety profile in patients. The POD framework allows dosing decisions to be made when some backfill patients are still being followed with incomplete toxicity outcomes, thereby potentially expediting the clinical trial. At the end of the trial, Bi3+3 uses both toxicity and efficacy outcomes to estimate an optimal biological dose (OBD). The proposed inference is based on a dose-response model that takes into account either a monotone or plateau dose-efficacy relationship, both of which are frequently encountered in modern oncology drug development. Simulation studies show promising operating characteristics of the Bi3+3 design in comparison to existing designs.
Submitted 9 January, 2024; v1 submitted 28 March, 2023;
originally announced March 2023.
-
Stacking designs: designing multi-fidelity computer experiments with target predictive accuracy
Authors:
Chih-Li Sung,
Yi Ji,
Simon Mak,
Wenjia Wang,
Tao Tang
Abstract:
In an era where scientific experiments can be very costly, multi-fidelity emulators provide a useful tool for cost-efficient predictive scientific computing. For scientific applications, the experimenter is often limited by a tight computational budget, and thus wishes to (i) maximize predictive power of the multi-fidelity emulator via a careful design of experiments, and (ii) ensure this model achieves a desired error tolerance with some notion of confidence. Existing design methods, however, do not jointly tackle objectives (i) and (ii). We propose a novel stacking design approach that addresses both goals. A multi-level reproducing kernel Hilbert space (RKHS) interpolator is first introduced to build the emulator, under which our stacking design provides a sequential approach for designing multi-fidelity runs such that a desired prediction error of $ε > 0$ is met under regularity assumptions. We then prove a novel cost complexity theorem that, under this multi-level interpolator, establishes a bound on the computation cost (for training data simulation) needed to achieve a prediction bound of $ε$. This result provides novel insights on conditions under which the proposed multi-fidelity approach improves upon a conventional RKHS interpolator which relies on a single fidelity level. Finally, we demonstrate the effectiveness of stacking designs in a suite of simulation experiments and an application to finite element analysis.
Submitted 27 October, 2023; v1 submitted 1 November, 2022;
originally announced November 2022.
-
Conglomerate Multi-Fidelity Gaussian Process Modeling, with Application to Heavy-Ion Collisions
Authors:
Yi Ji,
Henry Shaowu Yuchi,
Derek Soeder,
J. -F. Paquet,
Steffen A. Bass,
V. Roshan Joseph,
C. F. Jeff Wu,
Simon Mak
Abstract:
In an era where scientific experimentation is often costly, multi-fidelity emulation provides a powerful tool for predictive scientific computing. While there has been notable work on multi-fidelity modeling, existing models do not incorporate an important "conglomerate" property of multi-fidelity simulators, where the accuracies of different simulator components are controlled by different fidelity parameters. Such conglomerate simulators are widely encountered in complex nuclear physics and astrophysics applications. We thus propose a new CONglomerate multi-FIdelity Gaussian process (CONFIG) model, which embeds this conglomerate structure within a novel non-stationary covariance function. We show that the proposed CONFIG model can capture prior knowledge on the numerical convergence of conglomerate simulators, which allows for cost-efficient emulation of multi-fidelity systems. We demonstrate the improved predictive performance of CONFIG over state-of-the-art models in a suite of numerical experiments and two applications, the first for emulation of cantilever beam deflection and the second for emulating the evolution of the quark-gluon plasma, which was theorized to have filled the Universe shortly after the Big Bang.
Submitted 28 September, 2023; v1 submitted 27 September, 2022;
originally announced September 2022.
-
Use of Non-concurrent Common Control in Master Protocols in Oncology Trials: Report of an American Statistical Association Biopharmaceutical Section Open Forum Discussion
Authors:
Rajeshwari Sridhara,
Olga Marchenko,
Qi Jiang,
Richard Pazdur,
Martin Posch,
Scott Berry,
Marc Theoret,
Yuan Li Shen,
Thomas Gwise,
Lorenzo Hess,
Andrew Raven,
Khadija Rantell,
Kit Roes,
Richard Simon,
Mary Redman,
Yuan Ji,
Cindy Lu
Abstract:
This article summarizes the discussions from the American Statistical Association (ASA) Biopharmaceutical (BIOP) Section Open Forum that took place on December 10, 2020 and was organized by the ASA BIOP Statistical Methods in Oncology Scientific Working Group, in coordination with the US FDA Oncology Center of Excellence. Diverse stakeholders including experts from international regulatory agencies, academicians, and representatives of the pharmaceutical industry engaged in a discussion on the use of non-concurrent control in Master Protocols for oncology trials. While the use of non-concurrent control with the concurrent control may increase the power of detecting the therapeutic difference between a treatment and the control, the panelists had diverse opinions on the statistical approaches for modeling non-concurrent and concurrent controls. Some were more concerned about the temporality of the non-concurrent control and bias introduced by different confounders related to time, e.g., changes in standard of care, changes in patient population, changes in recruiting strategies, and changes in assessment of endpoints. Nevertheless, in some situations, such as when recruitment is extremely challenging for a rare disease, the panelists concluded that the use of a non-concurrent control can be justified.
Submitted 19 September, 2022;
originally announced September 2022.
-
On Bayesian Sequential Clinical Trial Designs
Authors:
Tianjian Zhou,
Yuan Ji
Abstract:
Clinical trials usually involve sequential patient entry. When designing a clinical trial, it is often desirable to include a provision for interim analyses of accumulating data with the potential for stopping the trial early. We review Bayesian sequential clinical trial designs based on posterior probabilities, posterior predictive probabilities, and decision-theoretic frameworks. A pertinent question is whether Bayesian sequential designs need to be adjusted for the planning of interim analyses. We answer this question from three perspectives: a frequentist-oriented perspective, a calibrated Bayesian perspective, and a subjective Bayesian perspective. We also provide new insights into the likelihood principle, which is commonly tied to statistical inference and decision making in sequential clinical trials. Some theoretical results are derived, and numerical studies are conducted to illustrate and assess these designs.
Submitted 9 March, 2023; v1 submitted 17 December, 2021;
originally announced December 2021.
-
A Unified Decision Framework for Phase I Dose-Finding Designs
Authors:
Yunshan Duan,
Shijie Yuan,
Yuan Ji,
Peter Mueller
Abstract:
The purpose of a phase I dose-finding clinical trial is to investigate the toxicity profiles of various doses for a new drug and identify the maximum tolerated dose. Over the past three decades, various dose-finding designs have been proposed and discussed, including conventional model-based designs, new model-based designs using toxicity probability intervals, and rule-based designs. We present a simple decision framework that can generate several popular designs as special cases. We show that these designs share common elements under the framework, such as the same likelihood function, the use of loss functions, and the nature of the optimal decisions as Bayes rules. They differ mostly in the choice of the prior distributions. We present theoretical results on the decision framework and its link to specific and popular designs like mTPI, BOIN, and CRM. These results provide useful insights into the designs and their underlying assumptions, and convey information to help practitioners select an appropriate design.
Submitted 23 November, 2021;
originally announced November 2021.
-
A graphical multi-fidelity Gaussian process model, with application to emulation of heavy-ion collisions
Authors:
Yi Ji,
Simon Mak,
Derek Soeder,
J-F Paquet,
Steffen A. Bass
Abstract:
With advances in scientific computing and mathematical modeling, complex scientific phenomena such as galaxy formation and rocket propulsion can now be reliably simulated. Such simulations can, however, be very time-intensive, requiring millions of CPU hours to perform. One solution is multi-fidelity emulation, which uses data of different fidelities to train an efficient predictive model which emulates the expensive simulator. For complex scientific problems and with careful elicitation from scientists, such multi-fidelity data may often be linked by a directed acyclic graph (DAG) representing its scientific model dependencies. We thus propose a new Graphical Multi-fidelity Gaussian Process (GMGP) model, which embeds this DAG structure (capturing scientific dependencies) within a Gaussian process framework. We show that the GMGP has desirable modeling traits via two Markov properties, and admits a scalable algorithm for recursive computation of the posterior mean and variance at each depth level of the DAG. We also present a novel experimental design methodology over the DAG given an experimental budget, and propose a nonlinear extension of the GMGP via deep Gaussian processes. The advantages of the GMGP are then demonstrated via a suite of numerical experiments and an application to emulation of heavy-ion collisions, which can be used to study the conditions of matter in the Universe shortly after the Big Bang. The proposed model has broader uses in data fusion applications with graphical structure, which we further discuss.
Submitted 27 February, 2024; v1 submitted 31 July, 2021;
originally announced August 2021.
-
The Ci3+3 Design for Dual-Agent Combination Dose-Finding Clinical Trials
Authors:
Shijie Yuan,
Tianjian Zhou,
Yawen Lin,
Yuan Ji
Abstract:
We propose a rule-based statistical design for combination dose-finding trials with two agents. The Ci3+3 design is an extension of the i3+3 design with simple decision rules comparing the observed toxicity rates and equivalence intervals that define the maximum tolerated dose combination. Ci3+3 consists of two stages to allow fast and efficient exploration of the dose-combination space. Statistical inference is restricted to a beta-binomial model for dose evaluation, and the entire design is built upon a set of fixed rules. We show via simulation studies that the Ci3+3 design exhibits operating characteristics comparable to those of more complex designs utilizing model-based inferences. We believe that the Ci3+3 design may provide an alternative choice to help simplify the design and conduct of combination dose-finding trials in practice.
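To make the beta-binomial ingredient concrete, the Python sketch below shows a generic conjugate update for the toxicity probability at one dose combination and the posterior probability that it exceeds a cutoff. The uniform Beta(1, 1) prior, the cutoff value, and the function names are assumptions chosen for illustration; they are not taken from the Ci3+3 paper, and this is not the design's decision rule, which per the abstract compares observed toxicity rates with equivalence intervals.

```python
from math import comb


def beta_posterior(n_tox, n_patients, a=1, b=1):
    """Posterior Beta parameters after n_tox toxicities among n_patients."""
    return a + n_tox, b + n_patients - n_tox


def beta_cdf_int(p, a, b):
    """CDF of Beta(a, b) at p for positive integers a, b.

    Uses the identity P(Beta(a, b) <= p) = P(Binomial(a + b - 1, p) >= a).
    """
    n = a + b - 1
    return sum(comb(n, j) * p**j * (1 - p) ** (n - j) for j in range(a, n + 1))


def prob_tox_above(cutoff, n_tox, n_patients, a=1, b=1):
    """Posterior probability that the toxicity rate exceeds cutoff."""
    post_a, post_b = beta_posterior(n_tox, n_patients, a, b)
    return 1.0 - beta_cdf_int(cutoff, post_a, post_b)


# Hypothetical data: 2 toxicities among 6 patients, hypothetical cutoff 0.3.
print(beta_posterior(2, 6))
print(round(prob_tox_above(0.3, 2, 6), 3))
```

The closed-form CDF only works for integer Beta parameters, which is enough for an integer prior combined with patient counts; a general prior would need a numerical incomplete beta function instead.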
Submitted 16 September, 2021; v1 submitted 25 March, 2021;
originally announced March 2021.
-
Incorporating External Data into the Analysis of Clinical Trials via Bayesian Additive Regression Trees
Authors:
Tianjian Zhou,
Yuan Ji
Abstract:
Most clinical trials involve the comparison of a new treatment to a control arm (e.g., the standard of care) and the estimation of a treatment effect. External data, including historical clinical trial data and real-world observational data, are commonly available for the control arm. Borrowing information from external data holds the promise of improving the estimation of relevant parameters and increasing the power of detecting a treatment effect if it exists. In this paper, we propose to use Bayesian additive regression trees (BART) for incorporating external data into the analysis of clinical trials, with a specific goal of estimating the conditional or population average treatment effect. BART naturally adjusts for patient-level covariates and captures potentially heterogeneous treatment effects across different data sources, achieving flexible borrowing. Simulation studies demonstrate that BART compares favorably to a hierarchical linear model and a normal-normal hierarchical model. We illustrate the proposed method with an acupuncture trial.
Submitted 15 March, 2021;
originally announced March 2021.
-
BaySize: Bayesian Sample Size Planning for Phase I Dose-Finding Trials
Authors:
Xiaolei Lin,
Jiaying Lyu,
Shijie Yuan,
Sue-Jane Wang,
Yuan Ji
Abstract:
We propose BaySize, a sample size calculator for phase I clinical trials using Bayesian models. BaySize applies the concept of effect size in dose finding, assuming the MTD is defined based on an equivalence interval. Leveraging a decision framework that involves composite hypotheses, BaySize utilizes two prior distributions, the fitting prior (for model fitting) and sampling prior (for data generation), to conduct sample size calculation under desirable statistical power. Look-up tables are generated to facilitate practical applications. To our knowledge, BaySize is the first sample size tool that can be applied to a broad range of phase I trial designs.
Submitted 10 March, 2021;
originally announced March 2021.
-
PoD-BIN: A Probability of Decision Bayesian Interval Design for Time-to-Event Dose-Finding Trials with Multiple Toxicity Grades
Authors:
Meizi Liu,
Yuan Ji,
Ji Lin
Abstract:
We consider a Bayesian framework based on "probability of decision" for dose-finding trial designs. The proposed PoD-BIN design evaluates the posterior predictive probabilities of up-and-down decisions. In PoD-BIN, multiple grades of toxicity, categorized as mild toxicity (MT) and dose-limiting toxicity (DLT), are modeled simultaneously, and the primary outcome of interest is time-to-toxicity for both MT and DLT. This allows the possibility of enrolling new patients when previously enrolled patients are still being followed for toxicity, thus potentially shortening trial length. The Bayesian decision rules in PoD-BIN utilize the probability of decisions to balance the need to speed up the trial and the risk of exposing patients to overly toxic doses. We demonstrate via numerical examples the resulting balance of speed and safety of PoD-BIN and compare it with existing designs.
Submitted 10 March, 2021;
originally announced March 2021.
-
Lessons Learned from the Bayesian Design and Analysis for the BNT162b2 COVID-19 Vaccine Phase 3 Trial
Authors:
Yuan Ji,
Shijie Yuan
Abstract:
The phase III BNT162b2 mRNA COVID-19 vaccine trial is based on a Bayesian design and analysis, and the main evidence of vaccine efficacy is presented in Bayesian statistics. The presentation of the Bayesian results, however, introduces confusion and mistakes. Some key statistics, such as Bayesian credible intervals, are mislabeled and stated as confidence intervals. Posterior probabilities of the vaccine efficacy are not reported as the main results. We illustrate the main differences in the reporting of Bayesian analysis results for a clinical trial and provide four recommendations. We argue that statistical evidence from a Bayesian trial, when presented properly, is easier to interpret and directly addresses the main clinical questions, thereby better supporting regulatory decision making. We also recommend using the abbreviation "BI" to represent Bayesian credible intervals, to differentiate them from "CI", which stands for confidence interval.
Submitted 8 January, 2021;
originally announced March 2021.
-
Hi3+3: A Model-Assisted Dose-Finding Design Borrowing Historical Data
Authors:
Yunshan Duan,
Sue-Jane Wang,
Yuan Ji
Abstract:
Background -- In phase I clinical trials, historical data may be available through multi-regional programs, reformulation of the same drug, or previous trials for a drug under the same class. Statistical designs that borrow information from historical data can reduce cost, speed up drug development, and maintain safety. Purpose -- Based on a hybrid design that partly uses probability models and partly uses algorithmic rules for decision making, we aim to improve the efficiency of dose-finding trials in the presence of historical data, maintain safety for patients, and achieve a level of simplicity for practical applications. Methods -- We propose the Hi3+3 design, in which the letter "H" represents "historical data". We apply the idea of the power prior to borrow historical data and define the effective sample size (ESS) of the prior. Dose-finding decision rules follow the idea in the i3+3 design while incorporating the historical data via the power prior and ESS. The proposed Hi3+3 design pretabulates the dosing decisions before the trial starts, a desirable feature for ease of application in practice. Results -- The Hi3+3 design is superior to the i3+3 design due to the information borrowed from historical data. It is capable of maintaining a high level of safety for trial patients without sacrificing the ability to identify the correct MTD. Illustrations of this feature are found in the simulation results. Conclusion -- With the demonstrated safety, efficiency, and simplicity, the Hi3+3 design could be a desirable choice for dose-finding trials borrowing historical data.
Submitted 20 October, 2020;
originally announced October 2020.
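The power-prior borrowing mentioned in the Hi3+3 abstract can be illustrated with a minimal sketch. It assumes a conjugate beta-binomial model for the toxicity probability at a single dose; the function name, the Beta(1, 1) initial prior, and the discount parameter `alpha` are illustrative assumptions, not the paper's specification. Here the prior ESS is taken to be the sum of the two beta parameters, a common convention that may differ from the definition used in the paper.

```python
def power_prior_posterior(x, n, x0, n0, alpha, a=1.0, b=1.0):
    """Beta-binomial update with a power prior (illustrative sketch).

    x, n   : DLT count and number of patients in the current trial
    x0, n0 : DLT count and number of patients in the historical data
    alpha  : discount in [0, 1]; 0 ignores history, 1 pools it fully
    a, b   : parameters of the initial Beta(a, b) prior (assumed here)
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if not (0 <= x <= n and 0 <= x0 <= n0):
        raise ValueError("counts must satisfy 0 <= x <= n and 0 <= x0 <= n0")

    # Historical likelihood raised to the power alpha stays conjugate:
    # the prior becomes Beta(a + alpha*x0, b + alpha*(n0 - x0)).
    a_prior = a + alpha * x0
    b_prior = b + alpha * (n0 - x0)

    # Standard conjugate update with the current-trial data.
    a_post = a_prior + x
    b_post = b_prior + (n - x)

    return {
        "prior_ess": a_prior + b_prior,
        "posterior": (a_post, b_post),
        "posterior_mean": a_post / (a_post + b_post),
    }


if __name__ == "__main__":
    # Hypothetical numbers: 2 DLTs in 10 historical patients, half-weighted,
    # followed by 1 DLT in 3 current patients.
    print(power_prior_posterior(x=1, n=3, x0=2, n0=10, alpha=0.5))
```

Setting `alpha` closer to 0 shrinks the prior ESS toward that of the initial prior, which is one simple way to limit how much the historical data can dominate a small current cohort.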
-
Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems
Authors:
Xianfu Chen,
Celimuge Wu,
Tao Chen,
Zhi Liu,
Honggang Zhang,
Mehdi Bennis,
Hang Liu,
Yusheng Ji
Abstract:
This paper studies the problem of information freshness-aware task offloading in an air-ground integrated multi-access edge computing system, which is deployed by an infrastructure provider (InP). A third-party real-time application service provider provides computing services to the subscribed mobile users (MUs) with the limited communication and computation resources from the InP based on a long-term business agreement. Due to the dynamic characteristics, the interactions among the MUs are modelled by a non-cooperative stochastic game, in which the control policies are coupled and each MU aims to selfishly maximize its own expected long-term payoff. To address the Nash equilibrium solutions, we propose that each MU behaves in accordance with the local system states and conjectures, based on which the stochastic game is transformed into a single-agent Markov decision process. Moreover, we derive a novel online deep reinforcement learning (RL) scheme that adopts two separate double deep Q-networks for each MU to approximate the Q-factor and the post-decision Q-factor. Using the proposed deep RL scheme, each MU in the system is able to make decisions without a priori statistical knowledge of dynamics. Numerical experiments examine the potentials of the proposed scheme in balancing the age of information and the energy consumption.
Submitted 15 July, 2020;
originally announced July 2020.
-
SLAM using ICP and graph optimization considering physical properties of environment
Authors:
Ryuki Suzuki,
Ryosuke Kataoka,
Yonghoon Ji,
Hiromitsu Fujii,
Hitoshi Kono,
Kazunori Umeda
Abstract:
This paper describes a novel SLAM (simultaneous localization and mapping) scheme based on scan matching in an environment including various physical properties.
Submitted 1 July, 2020;
originally announced July 2020.
-
Statistical Frameworks for Oncology Dose-Finding Designs with Late-Onset Toxicities: A Review
Authors:
Tianjian Zhou,
Yuan Ji
Abstract:
In oncology dose-finding trials, due to staggered enrollment, it might be desirable to make dose-assignment decisions in real-time in the presence of pending toxicity outcomes, for example, when the dose-limiting toxicity is late-onset. Patients' time-to-event information may be utilized to facilitate such decisions. We review statistical frameworks for time-to-event modeling in dose-finding trials and summarize existing designs into two classes: TITE designs and POD designs. TITE designs are based on inference on toxicity probabilities, while POD designs are based on inference on dose-finding decisions. These two classes of designs contain existing individual designs as special cases and also give rise to new designs. We discuss and study the theoretical properties of these designs, including large-sample convergence properties, coherence principles, and the underlying decision rules. To facilitate the use of these designs in practice, we introduce efficient computational algorithms and review common practical considerations, such as safety rules and suspension rules. Finally, the operating characteristics of several designs are evaluated and compared through computer simulations.
Submitted 10 March, 2023; v1 submitted 20 June, 2020;
originally announced June 2020.
-
MUCE: Bayesian Hierarchical Modeling for the Design and Analysis of Phase 1b Multiple Expansion Cohort Trials
Authors:
Jiaying Lyu,
Tianjian Zhou,
Shijie Yuan,
Wentian Guo,
Yuan Ji
Abstract:
We propose a multiple cohort expansion (MUCE) approach as a design or analysis method for phase 1b multiple expansion cohort trials, which are novel first-in-human studies conducted following phase 1a dose escalation. The MUCE design is based on a class of Bayesian hierarchical models that adaptively borrow information across arms. Statistical inference is directly based on the posterior probability of each arm being efficacious, facilitating the decision making that decides which arm to select for further testing.
Submitted 17 June, 2020; v1 submitted 13 June, 2020;
originally announced June 2020.
-
Semiparametric Bayesian Inference for the Transmission Dynamics of COVID-19 with a State-Space Model
Authors:
Tianjian Zhou,
Yuan Ji
Abstract:
The outbreak of Coronavirus Disease 2019 (COVID-19) is an ongoing pandemic affecting over 200 countries and regions. Inference about the transmission dynamics of COVID-19 can provide important insights into the speed of disease spread and the effects of mitigation policies. We develop a novel Bayesian approach to such inference based on a probabilistic compartmental model using data of daily confirmed COVID-19 cases. In particular, we consider a probabilistic extension of the classical susceptible-infectious-recovered model, which takes into account undocumented infections and allows the epidemiological parameters to vary over time. We estimate the disease transmission rate via a Gaussian process prior, which captures nonlinear changes over time without the need of specific parametric assumptions. We utilize a parallel-tempering Markov chain Monte Carlo algorithm to efficiently sample from the highly correlated posterior space. Predictions for future observations are done by sampling from their posterior predictive distributions. Performance of the proposed approach is assessed using simulated datasets. Finally, our approach is applied to COVID-19 data from four states of the United States: Washington, New York, California, and Illinois. An R package BaySIR is made available at https://github.com/tianjianzhou/BaySIR for the public to conduct independent analysis or reproduce the results in this paper.
Submitted 2 July, 2020; v1 submitted 9 June, 2020;
originally announced June 2020.
-
Posterior Contraction Rate of Sparse Latent Feature Models with Application to Proteomics
Authors:
Tong Li,
Tianjian Zhou,
Kam-Wah Tsui,
Lin Wei,
Yuan Ji
Abstract:
The Indian buffet process (IBP) and phylogenetic Indian buffet process (pIBP) can be used as prior models to infer latent features in a data set. The theoretical properties of these models are under-explored, however, especially in high-dimensional settings. In this paper, we show that under a mild sparsity condition, the posterior distribution of the latent feature matrix, generated via IBP or pIBP priors, converges to the true latent feature matrix asymptotically. We derive the posterior convergence rate, referred to as the contraction rate. We show that the convergence holds even when the dimensionality of the latent feature matrix increases with the sample size, therefore making the posterior inference valid in high-dimensional settings. We demonstrate the theoretical results using computer simulation, in which the parallel-tempering Markov chain Monte Carlo method is applied to overcome computational hurdles. The practical utility of the derived properties is demonstrated by inferring the latent features in a reverse phase protein arrays (RPPA) dataset under the IBP prior model. Software and dataset reported in the manuscript are provided at http://www.compgenome.org/IBP.
Submitted 19 September, 2019;
originally announced September 2019.
-
Consensus Monte Carlo for Random Subsets using Shared Anchors
Authors:
Yang Ni,
Yuan Ji,
Peter Mueller
Abstract:
We present a consensus Monte Carlo algorithm that scales existing Bayesian nonparametric models for clustering and feature allocation to big data. The algorithm is valid for any prior on random subsets such as partitions and latent feature allocation, under essentially any sampling model. Motivated by three case studies, we focus on clustering induced by a Dirichlet process mixture sampling model, inference under an Indian buffet process prior with a binomial sampling model, and with a categorical sampling model. We assess the proposed algorithm with simulation studies and show results for inference with three datasets: an MNIST image dataset, a dataset of pancreatic cancer mutations, and a large set of electronic health records (EHR). Supplementary materials for this article are available online.
Submitted 25 February, 2020; v1 submitted 28 June, 2019;
originally announced June 2019.
-
PoD-TPI: Probability-of-Decision Toxicity Probability Interval Design to Accelerate Phase I Trials
Authors:
Tianjian Zhou,
Wentian Guo,
Yuan Ji
Abstract:
Cohort-based enrollment can slow down dose-finding trials since the outcomes of the previous cohort must be fully evaluated before the next cohort can be enrolled. This results in frequent suspension of patient enrollment. The issue is exacerbated in recent immune-oncology trials where toxicity outcomes can take a long time to observe. We propose a novel phase I design, the probability-of-decision toxicity probability interval (PoD-TPI) design, to accelerate phase I trials. PoD-TPI enables dose assignment in real-time in the presence of pending toxicity outcomes. With uncertain outcomes, the dose assignment decisions are treated as a random variable, and we calculate the posterior distribution of the decisions. The posterior distribution reflects the variability in the pending outcomes and allows a direct and intuitive evaluation of the confidence of all possible decisions. Optimal decisions are calculated based on 0-1 loss, and extra safety rules are constructed to enforce sufficient protection from exposing patients to risky doses. A new and useful feature of PoD-TPI is that it allows investigators and regulators to balance the trade-off between enrollment speed and making risky decisions by tuning a pair of intuitive design parameters. Through numerical studies, we evaluate the operating characteristics of PoD-TPI and demonstrate that PoD-TPI shortens trial duration and maintains trial safety and efficiency compared to existing time-to-event designs.
Submitted 29 December, 2019; v1 submitted 29 April, 2019;
originally announced April 2019.
-
Learning Models from Data with Measurement Error: Tackling Underreporting
Authors:
Roy Adams,
Yuelong Ji,
Xiaobin Wang,
Suchi Saria
Abstract:
Measurement error in observational datasets can lead to systematic bias in inferences based on these datasets. As studies based on observational data are increasingly used to inform decisions with real-world impact, it is critical that we develop a robust set of techniques for analyzing and adjusting for these biases. In this paper we present a method for estimating the distribution of an outcome given a binary exposure that is subject to underreporting. Our method is based on a missing data view of the measurement error problem, where the true exposure is treated as a latent variable that is marginalized out of a joint model. We prove three different conditions under which the outcome distribution can still be identified from data containing only error-prone observations of the exposure. We demonstrate this method on synthetic data and analyze its sensitivity to near violations of the identifiability conditions. Finally, we use this method to estimate the effects of maternal smoking and opioid use during pregnancy on childhood obesity, two important problems in public health. Using the proposed method, we estimate these effects using only subject-reported drug use data and substantially refine the range of estimates generated by a sensitivity analysis-based approach. Further, the estimates produced by our method are consistent with existing literature on both the effects of maternal smoking and the rate at which subjects underreport smoking.
Submitted 25 January, 2019;
originally announced January 2019.
-
The i3+3 Design for Phase I Clinical Trials
Authors:
Meizi Liu,
Sue-Jane Wang,
Yuan Ji
Abstract:
Purpose: The 3+3 design has been shown to be less likely to achieve the objectives of phase I dose-finding trials when compared with more advanced model-based designs. One major criticism of the 3+3 design is that it is based on simple rules, does not depend on statistical models for inference, and leads to unsafe and unreliable operating characteristics. On the other hand, being rule-based allows 3+3 to be easily understood and implemented in practice, making it the first choice among clinicians. Is it possible to have a rule-based design with great performance? Methods: We propose a new rule-based design called i3+3, where the letter "i" represents the word "interval". The i3+3 design is based on simple but more advanced rules that account for the variability in the observed data. We compare the operating characteristics of the proposed i3+3 design with those of other popular phase I designs by simulation. Results: The i3+3 design is far superior to the 3+3 design in trial safety and the ability to identify the true MTD. Compared with model-based phase I designs, i3+3 also demonstrates comparable performance. In other words, the i3+3 design possesses both the simplicity and transparency of the rule-based approaches and the superior operating characteristics seen in model-based approaches. An online R Shiny tool (https://i3design.shinyapps.io/i3plus3/) is provided to illustrate the i3+3 design, although in practice it requires no software to design or conduct a dose-finding trial. Conclusion: The i3+3 design could be a practice-altering method for the clinical community.
Submitted 26 April, 2019; v1 submitted 4 January, 2019;
originally announced January 2019.
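The interval-based rule behind the i3+3 design can be sketched in a few lines. The abstract does not spell out the rule, so the logic below follows the rule as generally described for the published design and should be read as an assumption: compare the observed DLT rate x/n with an equivalence interval around the target rate, and when x/n lies above the interval, also check the rate with one fewer DLT. The function name, the default target of 0.30, and the interval half-widths of 0.05 are illustrative choices, and the safety and early-stopping rules of a full design are omitted.

```python
def i3plus3_decision(x, n, p_target=0.30, eps1=0.05, eps2=0.05):
    """Return 'E' (escalate), 'S' (stay), or 'D' (de-escalate).

    x, n     : number of DLTs and number of patients at the current dose
    p_target : target toxicity probability (illustrative default)
    eps1/2   : lower/upper half-widths of the equivalence interval (EI)
    """
    if n <= 0 or not 0 <= x <= n:
        raise ValueError("need n > 0 and 0 <= x <= n")

    lower, upper = p_target - eps1, p_target + eps2
    rate = x / n

    if rate < lower:
        return "E"          # observed rate below the EI
    if rate <= upper:
        return "S"          # observed rate inside the EI
    # Observed rate above the EI: if removing a single DLT would push the
    # rate below the EI, the data are too sparse to justify de-escalation.
    if (x - 1) / n < lower:
        return "S"
    return "D"


if __name__ == "__main__":
    # Pretabulate decisions for a cohort of three patients.
    for dlts in range(4):
        print(dlts, "DLT(s) out of 3 ->", i3plus3_decision(dlts, 3))
```

Because the rule depends only on x, n, and the interval, the full decision table can be printed before the trial starts, which is consistent with the abstract's remark that no software is needed to conduct the trial.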
-
An Adaptive Oversampling Learning Method for Class-Imbalanced Fault Diagnostics and Prognostics
Authors:
Wenfang Lin,
Zhenyu Wu,
Yang Ji
Abstract:
Data-driven fault diagnostics and prognostics suffer from the class-imbalance problem in industrial systems, which challenges common machine learning algorithms because it becomes difficult to learn the features of the minority class samples. Synthetic oversampling methods are commonly used to tackle this problem by generating minority class samples to balance the distributions between the majority and minority classes. However, many oversampling methods are inappropriate in that they cannot generate effective and useful minority class samples according to different distributions of data, which further complicates the learning process. Thus, this paper proposes a novel adaptive oversampling technique, the EM-based Weighted Minority Oversampling TEchnique (EWMOTE), for industrial fault diagnostics and prognostics. The method comprises a weighted minority sampling strategy to identify hard-to-learn informative minority fault samples and an Expectation Maximization (EM) based imputation algorithm to generate fault samples. To validate the performance of the proposed method, experiments are conducted on two real datasets. The results show that the method achieves better performance than other oversampling-based baseline models on both binary and multi-class imbalance learning tasks at different imbalance ratios.
Submitted 19 November, 2018;
originally announced November 2018.
-
Data-Driven Load Modeling and Forecasting of Residential Appliances
Authors:
Yuting Ji,
Elizabeth Buechler,
Ram Rajagopal
Abstract:
The expansion of residential demand response programs and increased deployment of controllable loads will require accurate appliance-level load modeling and forecasting. This paper proposes a conditional hidden semi-Markov model to describe the probabilistic nature of residential appliance demand, and an algorithm for short-term load forecasting. Model parameters are estimated directly from power consumption data using scalable statistical learning methods. Case studies performed using sub-metered 1-minute power consumption data from several types of appliances demonstrate the effectiveness of the model for load forecasting and anomaly detection.
Submitted 8 October, 2018;
originally announced October 2018.
-
Bayesian Double Feature Allocation for Phenotyping with Electronic Health Records
Authors:
Yang Ni,
Peter Mueller,
Yuan Ji
Abstract:
We propose a categorical matrix factorization method to infer latent diseases from electronic health records (EHR) data in an unsupervised manner. A latent disease is defined as an unknown biological aberration that causes a set of common symptoms for a group of patients. The proposed approach is based on a novel double feature allocation model which simultaneously allocates features to the rows and the columns of a categorical matrix. Using a Bayesian approach, available prior information on known diseases greatly improves identifiability and interpretability of latent diseases. This includes known diagnoses for patients and known association of diseases with symptoms. We validate the proposed approach by simulation studies including mis-specified models and comparison with sparse latent factor models. In the application to Chinese EHR data, we find interesting results, some of which agree with related clinical and medical knowledge.
Submitted 13 February, 2019; v1 submitted 4 September, 2018;
originally announced September 2018.
-
Scalable Bayesian Nonparametric Clustering and Classification
Authors:
Yang Ni,
Peter Müller,
Maurice Diesendruck,
Sinead Williamson,
Yitan Zhu,
Yuan Ji
Abstract:
We develop a scalable multi-step Monte Carlo algorithm for inference under a large class of nonparametric Bayesian models for clustering and classification. Each step is "embarrassingly parallel" and can be implemented using the same Markov chain Monte Carlo sampler. The simplicity and generality of our approach make inference for a wide range of Bayesian nonparametric mixture models applicable to large datasets. Specifically, we apply the approach to inference under a product partition model with regression on covariates. We show results for inference with two motivating data sets: a large set of electronic health records (EHR) and a bank telemarketing dataset. We find interesting clusters and favorable classification performance relative to other widely used competing classifiers.
Submitted 7 June, 2018;
originally announced June 2018.
-
Optimized Computation Offloading Performance in Virtual Edge Computing Systems via Deep Reinforcement Learning
Authors:
Xianfu Chen,
Honggang Zhang,
Celimuge Wu,
Shiwen Mao,
Yusheng Ji,
Mehdi Bennis
Abstract:
To improve the quality of computation experience for mobile devices, mobile-edge computing (MEC) is a promising paradigm that provides computing capabilities in close proximity within a sliced radio access network (RAN), which supports both traditional communication and MEC services. Nevertheless, the design of computation offloading policies for a virtual MEC system remains challenging. Specifically, whether to execute a computation task at the mobile device or to offload it for MEC server execution should adapt to the time-varying network dynamics. In this paper, we consider MEC for a representative mobile user (MU) in an ultra-dense sliced RAN, where multiple base stations (BSs) are available to be selected for computation offloading. The problem of finding an optimal computation offloading policy is modelled as a Markov decision process, where our objective is to maximize the long-term utility performance, and an offloading decision is made based on the task queue state, the energy queue state, and the channel qualities between the MU and the BSs. To break the curse of high dimensionality in the state space, we first propose a double deep Q-network (DQN) based strategic computation offloading algorithm to learn the optimal policy without a priori knowledge of network dynamics. Then, motivated by the additive structure of the utility function, a Q-function decomposition technique is combined with the double DQN, which leads to a novel learning algorithm for solving stochastic computation offloading. Numerical experiments show that our proposed learning algorithms achieve a significant improvement in computation offloading performance compared with the baseline policies.
Submitted 16 May, 2018;
originally announced May 2018.
-
Where Classification Fails, Interpretation Rises
Authors:
Chanh Nguyen,
Georgi Georgiev,
Yujie Ji,
Ting Wang
Abstract:
An intriguing property of deep neural networks is their inherent vulnerability to adversarial inputs, which significantly hinders their application in security-critical domains. Most existing detection methods attempt to use carefully engineered patterns to distinguish adversarial inputs from their genuine counterparts, which, however, can often be circumvented by adaptive adversaries. In this work, we take a completely different route by leveraging the definition of adversarial inputs: while deceptive to deep neural networks, they are barely discernible to human vision. Building upon recent advances in interpretable models, we construct a new detection framework that contrasts an input's interpretation against its classification. We validate the efficacy of this framework through extensive experiments using benchmark datasets and attacks. We believe that this work opens a new direction for designing adversarial input detection methods.
Submitted 2 December, 2017;
originally announced December 2017.
-
Modular Learning Component Attacks: Today's Reality, Tomorrow's Challenge
Authors:
Xinyang Zhang,
Yujie Ji,
Ting Wang
Abstract:
Many of today's machine learning (ML) systems are not built from scratch, but are compositions of an array of modular learning components (MLCs). The increasing use of MLCs significantly simplifies the ML system development cycles. However, as most MLCs are contributed and maintained by third parties, their lack of standardization and regulation entails profound security implications.
In this paper, for the first time, we demonstrate that potentially harmful MLCs pose immense threats to the security of ML systems. We present a broad class of logic-bomb attacks in which maliciously crafted MLCs trigger host systems to malfunction in a predictable manner. By empirically studying two state-of-the-art ML systems in the healthcare domain, we explore the feasibility of such attacks. For example, we show that, without prior knowledge about the host ML system, by modifying only 3.3‰ of the MLC's parameters, each with distortion below $10^{-3}$, the adversary is able to force the misdiagnosis of target victims' skin cancers with 100% success rate. We provide analytical justification for the success of such attacks, which points to the fundamental characteristics of today's ML models: high dimensionality, non-linearity, and non-convexity. The issue thus seems fundamental to many ML systems. We further discuss potential countermeasures to mitigate MLC-based attacks and their potential technical challenges.
Submitted 25 August, 2017;
originally announced August 2017.
-
A Bayesian Mixture Model for Clustering on the Stiefel Manifold
Authors:
Subhajit Sengupta,
Subhadip Pal,
Riten Mitra,
Ying Guo,
Arunava Banerjee,
Yuan Ji
Abstract:
Analysis of a Bayesian mixture model for the Matrix Langevin distribution on the Stiefel manifold is presented. The model exploits a particular parametrization of the Matrix Langevin distribution, various aspects of which are elaborated on. A general and novel family of conjugate priors and an efficient Markov chain Monte Carlo (MCMC) sampling scheme for the corresponding posteriors are then developed for the mixture model. Theoretical properties of the prior and posterior distributions, including posterior consistency, are explored in detail. Extensive simulation experiments are presented to validate the efficacy of the framework. Real-world examples, including a large-scale neuroimaging dataset, are analyzed to demonstrate the computational tractability of the approach.
Submitted 23 August, 2017;
originally announced August 2017.
-
AAA: Triple-adaptive Bayesian designs for the identification of optimal dose combinations in dual-agent dose-finding trials
Authors:
Jiaying Lyu,
Yuan Ji,
Naiqing Zhao,
Daniel V. T. Catenacci
Abstract:
We propose a flexible design for the identification of optimal dose combinations in dual-agent dose-finding clinical trials. The design is called AAA, standing for three adaptations: adaptive model selection, adaptive dose insertion, and adaptive cohort division. The adaptations highlight the need and opportunity for innovation for dual-agent dose finding, and are supported by the numerical results presented in the proposed simulation studies. To our knowledge, this is the first design that allows for all three adaptations at the same time. We find that AAA improves the statistical inference, enhances the chance of finding the optimal dose combinations, and shortens the trial duration. A clinical trial is being planned to apply the AAA design.
Submitted 10 June, 2017;
originally announced June 2017.
-
On the Interval-Based Dose-Finding Designs
Authors:
Yuan Ji,
Shengjie Yang
Abstract:
The landscape of dose-finding designs for phase I clinical trials has been shifting rapidly in recent years, noticeably marked by the emergence of interval-based designs. We categorize them as the iDesigns and the IB-Designs. The iDesigns originated with the toxicity probability interval (TPI) design and its two modifications, the mTPI and mTPI-2 designs. The IB-Designs started as the cumulative cohort design (CCD) and were recently extended by the BOIN design. We discuss the differences and similarities between these two classes of interval-based designs, and compare their simulation performance with popular non-interval designs, such as the CRM and 3+3 designs. We also show that in addition to the population-level operating characteristics from simulated trials, investigators should also assess the dose-finding decision tables from the implemented designs to better understand the per-trial and per-patient behavior. This is particularly important for nonstatisticians to assess the designs with transparency. We provide, to our knowledge, the most comprehensive simulation-based comparative study on various interval-based dose-finding designs.
Submitted 14 June, 2017; v1 submitted 10 June, 2017;
originally announced June 2017.
-
TreeClone: Reconstruction of Tumor Subclone Phylogeny Based on Mutation Pairs using Next Generation Sequencing Data
Authors:
Tianjian Zhou,
Subhajit Sengupta,
Peter Mueller,
Yuan Ji
Abstract:
We present TreeClone, a latent feature allocation model to reconstruct tumor subclones subject to phylogenetic evolution that mimics tumor evolution. Similar to most current methods, we consider data from next-generation sequencing of tumor DNA. Unlike most methods that use information in short reads mapped to single nucleotide variants (SNVs), we consider subclone phylogeny reconstruction using pairs of two proximal SNVs that can be mapped by the same short reads. As part of the Bayesian inference model, we construct a phylogenetic tree prior. The use of the tree structure in the prior greatly strengthens inference. Only subclones that can be explained by a phylogenetic tree are assigned non-negligible probabilities. The proposed Bayesian framework implies posterior distributions on the number of subclones, their genotypes, cellular proportions, and the phylogenetic tree spanned by the inferred subclones. The proposed method is validated against different sets of simulated and real-world data using single and multiple tumor samples. An open source software package is available at http://www.compgenome.org/treeclone.
Submitted 25 October, 2017; v1 submitted 10 March, 2017;
originally announced March 2017.
-
PairClone: A Bayesian Subclone Caller Based on Mutation Pairs
Authors:
Tianjian Zhou,
Peter Mueller,
Subhajit Sengupta,
Yuan Ji
Abstract:
Tumor cell populations can be thought of as being composed of homogeneous cell subpopulations, with each subpopulation being characterized by overlapping sets of single nucleotide variants (SNVs). Such subpopulations are known as subclones and are an important target for precision medicine. Reconstructing such subclones from next-generation sequencing (NGS) data is one of the major challenges in precision medicine. We present PairClone as a new tool to implement this reconstruction. The main idea of PairClone is to model short reads mapped to pairs of proximal SNVs. In contrast, most existing methods use only marginal reads for unpaired SNVs. Using Bayesian nonparametric models, we estimate posterior probabilities of the number, genotypes, and population frequencies of subclones in one or more tumor samples. We use the categorical Indian buffet process (cIBP) as a prior probability model for subclones that are represented as vectors of categorical matrices that record the corresponding sets of mutation pairs. Performance of PairClone is assessed using simulated and real datasets. An open source software package can be obtained at http://www.compgenome.org/pairclone.
Submitted 24 February, 2017;
originally announced February 2017.
-
DyNet: The Dynamic Neural Network Toolkit
Authors:
Graham Neubig,
Chris Dyer,
Yoav Goldberg,
Austin Matthews,
Waleed Ammar,
Antonios Anastasopoulos,
Miguel Ballesteros,
David Chiang,
Daniel Clothiaux,
Trevor Cohn,
Kevin Duh,
Manaal Faruqui,
Cynthia Gan,
Dan Garrette,
Yangfeng Ji,
Lingpeng Kong,
Adhiguna Kuncoro,
Gaurav Kumar,
Chaitanya Malaviya,
Paul Michel,
Yusuke Oda,
Matthew Richardson,
Naomi Saphra,
Swabha Swayamdipta,
Pengcheng Yin
Abstract:
We describe DyNet, a toolkit for implementing neural network models based on dynamic declaration of network structure. In the static declaration strategy that is used in toolkits like Theano, CNTK, and TensorFlow, the user first defines a computation graph (a symbolic representation of the computation), and then examples are fed into an engine that executes this computation and computes its derivatives. In DyNet's dynamic declaration strategy, computation graph construction is mostly transparent, being implicitly constructed by executing procedural code that computes the network outputs, and the user is free to use different network structures for each input. Dynamic declaration thus facilitates the implementation of more complicated network architectures, and DyNet is specifically designed to allow users to implement their models in a way that is idiomatic in their preferred programming language (C++ or Python). One challenge with dynamic declaration is that because the symbolic computation graph is defined anew for every training example, its construction must have low overhead. To achieve this, DyNet has an optimized C++ backend and lightweight graph representation. Experiments show that DyNet's speeds are faster than or comparable with static declaration toolkits, and significantly faster than Chainer, another dynamic declaration toolkit. DyNet is released open-source under the Apache 2.0 license and available at http://github.com/clab/dynet.
Submitted 14 January, 2017;
originally announced January 2017.
-
Heterogeneous Reciprocal Graphical Models
Authors:
Yang Ni,
Peter Mueller,
Yitan Zhu,
Yuan Ji
Abstract:
We develop novel hierarchical reciprocal graphical models to infer gene networks from heterogeneous data. In the case of data that can be naturally divided into known groups, we propose to connect graphs by introducing a hierarchical prior across group-specific graphs, including a correlation on edge strengths across graphs. Thresholding priors are applied to induce sparsity of the estimated networks. In the case of unknown groups, we cluster subjects into subpopulations and jointly estimate cluster-specific gene networks, again using similar hierarchical priors across clusters. We illustrate the proposed approach by simulation studies and two applications with multiplatform genomic data for multiple cancers.
Submitted 21 January, 2018; v1 submitted 18 December, 2016;
originally announced December 2016.
-
Multi-Area Interchange Scheduling under Uncertainty
Authors:
Yuting Ji,
Lang Tong
Abstract:
The problem of multi-area interchange scheduling under system uncertainty is considered. A new scheduling technique is proposed for a multi-proxy bus system based on stochastic optimization that captures uncertainty in renewable generation and stochastic load. In particular, the proposed algorithm iteratively optimizes the interface flows using multidimensional demand and supply functions. Optimality and convergence are guaranteed for both synchronous and asynchronous scheduling under nominal assumptions.
Submitted 15 November, 2016;
originally announced November 2016.
-
A Bayesian Interval Dose-Finding Design Addressing Ockham's Razor: mTPI-2
Authors:
Wentian Guo,
Sue-Jane Wang,
Shengjie Yang,
Suiheng Lin,
Yuan Ji
Abstract:
There has been an increasing interest in using interval-based Bayesian designs for dose finding, one of which is the modified toxicity probability interval (mTPI) method. We show that the decision rules in mTPI correspond to an optimal rule under a formal Bayesian decision theoretic framework. However, the probability models in mTPI are overly sharpened by Ockham's razor, which, while in general helping with parsimonious statistical inference, leads to suboptimal decisions in small-sample inference such as dose finding. We propose a new framework that blunts Ockham's razor, and demonstrate the superior performance of the new method, called mTPI-2. An online web tool is provided with which users can generate the design, conduct clinical trials, and examine operating characteristics of the designs through big data and crowd sourcing.
Submitted 27 September, 2016;
originally announced September 2016.
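As background for the mTPI decision rules mentioned in the abstract above, here is a minimal sketch of the unit probability mass (UPM) rule as mTPI is commonly described. It is not the mTPI-2 method, and it leaves out the safety (dose exclusion) rules. The Beta(1, 1) prior, the target rate of 0.30, the interval half-widths of 0.05, and all function names are assumptions chosen for illustration.

```python
from math import comb


def beta_cdf_int(x, a, b):
    """CDF of Beta(a, b) at x for positive integers a and b.

    Uses the identity I_x(a, b) = P(Binomial(a + b - 1, x) >= a).
    """
    n = a + b - 1
    return sum(comb(n, j) * x**j * (1 - x) ** (n - j) for j in range(a, n + 1))


def mtpi_decision(n_tox, n_treated, p_target=0.30, eps1=0.05, eps2=0.05):
    """Return 'E' (escalate), 'S' (stay) or 'D' (de-escalate) for the current dose."""
    # Posterior under the assumed Beta(1, 1) prior with binomial toxicity data.
    a, b = 1 + n_tox, 1 + n_treated - n_tox
    lo, hi = p_target - eps1, p_target + eps2
    cdf_lo, cdf_hi = beta_cdf_int(lo, a, b), beta_cdf_int(hi, a, b)
    # UPM = posterior probability of an interval divided by its length.
    upm = {
        "E": cdf_lo / lo,                    # underdosing interval (0, lo)
        "S": (cdf_hi - cdf_lo) / (hi - lo),  # equivalence interval [lo, hi]
        "D": (1 - cdf_hi) / (1 - hi),        # overdosing interval (hi, 1)
    }
    return max(upm, key=upm.get)


# Decisions after a first cohort of three patients with 0, 1 or 2 toxicities.
decisions = [mtpi_decision(k, 3) for k in range(3)]
```

Because the equivalence interval is much shorter than the other two, dividing by interval length favors it; this is the sharpening effect the abstract attributes to Ockham's razor.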
-
Reciprocal Graphical Models for Integrative Gene Regulatory Network Analysis
Authors:
Yang Ni,
Yuan Ji,
Peter Mueller
Abstract:
Constructing gene regulatory networks is a fundamental task in systems biology. We introduce a Gaussian reciprocal graphical model for inference about gene regulatory relationships by integrating mRNA gene expression and DNA level information including copy number and methylation. Data integration allows for inference on the directionality of certain regulatory relationships, which would be otherwise indistinguishable due to Markov equivalence. Efficient inference is developed based on simultaneous equation models. Bayesian model selection techniques are adopted to estimate the graph structure. We illustrate our approach by simulations and two applications in ZODIAC pairwise gene interaction analysis and colon adenocarcinoma pathway analysis.
Submitted 22 July, 2016;
originally announced July 2016.
-
Probabilistic Forecasting and Simulation of Electricity Markets via Online Dictionary Learning
Authors:
Weisi Deng,
Yuting Ji,
Lang Tong
Abstract:
The problem of probabilistic forecasting and online simulation of real-time electricity market with stochastic generation and demand is considered. By exploiting the parametric structure of the direct current optimal power flow, a new technique based on online dictionary learning (ODL) is proposed. The ODL approach incorporates real-time measurements and historical traces to produce forecasts of joint and marginal probability distributions of future locational marginal prices, power flows, and dispatch levels, conditional on the system state at the time of forecasting. Compared with standard Monte Carlo simulation techniques, the ODL approach offers several orders of magnitude improvement in computation time, making it feasible for online forecasting of market operations. Numerical simulations on large and moderate size power systems illustrate its performance and complexity features and its potential as a tool for system operators.
Submitted 24 June, 2016;
originally announced June 2016.
-
An Ensemble EM Algorithm for Bayesian Variable Selection
Authors:
Jin Wang,
Feng Liang,
Yuan Ji
Abstract:
We study the Bayesian approach to variable selection in the context of linear regression. Motivated by recent work by Rockova and George (2014), we propose an EM algorithm that returns the MAP estimate of the set of relevant variables. Due to its particular updating scheme, our algorithm can be implemented efficiently without inverting a large matrix in each iteration and therefore can scale up with big data. We also show that the MAP estimate returned by our EM algorithm achieves variable selection consistency even when $p$ diverges with $n$. In practice, our algorithm could get stuck in local modes, a common problem with EM algorithms. To address this issue, we propose an ensemble EM algorithm, in which we repeatedly apply the EM algorithm on a subset of the samples with a subset of the covariates, and then aggregate the variable selection results across those bootstrap replicates. Empirical studies have demonstrated the superior performance of the ensemble EM algorithm.
Submitted 14 March, 2016;
originally announced March 2016.