Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A Bayesian Approach to Intervention-Based Clustering

Published: 30 January 2018 Publication History
  • Get Citation Alerts
  • Abstract

    An important task for intelligent healthcare systems is to predict the effect of a new intervention on individuals. This is especially true for medical treatments. For example, consider patients who do not respond well to a new drug or have adversary reactions. Predicting the likelihood of positive or negative response before trying the drug on the patient can potentially save his or her life. We are therefore interested in identifying distinctive subpopulations that respond differently to a given intervention. For this purpose, we have developed a novel technique, Intervention-based Clustering, based on a Bayesian mixture model. Compared to the baseline techniques, the novelty of our approach lies in its ability to model complex decision boundaries by using soft clustering, thus predicting the effect for individuals more accurately. It can also incorporate prior knowledge, making the method useful even for smaller datasets. We demonstrate how our method works by applying it to both simulated and real data. Results of our evaluation show that our model has strong predictive power and is capable of producing high-quality clusters compared to the baseline methods.

    References

    [1]
    Mohamed Alosh, Kathleen Fritsch, Mohammad Huque, Kooros Mahjoob, Gene Pennello, Mark Rothmann, Estelle Russek-Cohen, Fraser Smith, Stephen Wilson, and Lilly Yue. 2015. Statistical considerations on subgroup analysis in clinical trials. Stat. Biopharm. Res. 7, 4 (2015), 286--303.
    [2]
    Susan Athey and Guido Imbens. 2016. Recursive partitioning for heterogeneous causal effects. Proc. Natl. Acad. Sci. U.S.A. 113, 27 (2016), 7353--7360.
    [3]
    Daniel J. Bauer and Patrick J. Curran. 2003. Distributional assumptions of growth mixture models: implications for overextraction of latent trajectory classes. Psychol. Methods 8, 3 (2003), 338.
    [4]
    James O. Berger, Xiaojing Wang, and Lei Shen. 2014. A bayesian approach to subgroup identification. J. Biopharm. Stat. 24, 1 (2014), 110--129.
    [5]
    Christopher M. Bishop. 2006. Pattern Recognition and Machine Learning. Springer.
    [6]
    Leo Breiman. 2001. Random forests. Mach. Learn. 45, 1 (2001), 5--32.
    [7]
    Leo Breiman, Jerome Friedman, Charles J. Stone, and Richard A. Olshen. 1984. Classification and Regression Trees. CRC Press.
    [8]
    Frank Dellaert. 2002. The Expectation Maximization Algorithm. Technical Report. Georgia Institute of Technology.
    [9]
    Elise Dusseldorp, Claudio Conversano, and Bart Jan Van Os. 2010. Combining an additive and tree-based regression model simultaneously: STIMA. J. Comput. Graph. Stat. 19, 3 (2010), 514--530.
    [10]
    Elise Dusseldorp, Lisa Doove, and Iven Van Mechelen. 2016. Quint: An R package for the identification of subgroups of clients who differ in which treatment alternative is best for them. Behav. Res. Methods 48, 2 (2016), 650--663.
    [11]
    Elise Dusseldorp and Iven Van Mechelen. 2014. Qualitative interaction trees: A tool to identify qualitative treatment--subgroup interactions. Stat. Med. 33, 2 (2014), 219--237.
    [12]
    Brian P. Flaherty. 2002. Assessing reliability of categorical substance use measures with latent class analysis. Drug Alcohol Depend. 68 (2002), 7--20.
    [13]
    Jared C. Foster, Jeremy M. G. Taylor, and Stephen J. Ruberg. 2011. Subgroup identification from randomized clinical trial data. Stat. Med. 30, 24 (2011), 2867--2880.
    [14]
    Nicholas C. Henderson, Thomas A. Louis, Chenguang Wang, and Ravi Varadhan. 2016. Bayesian analysis of heterogeneous treatment effects for patient-centered outcomes research. Health Serv. Outcomes Res. Methodol. 16, 4 (2016), 213--233.
    [15]
    Kosuke Imai, Marc Ratkovic, and others. 2013. Estimating treatment effect heterogeneity in randomized program evaluation. Ann. Appl. Stat. 7, 1 (2013), 443--470.
    [16]
    Tony Jung and K. A. S. Wickrama. 2008. An introduction to latent class growth analysis and growth mixture modeling. Soc. Pers. Psychol. Compass 2, 1 (2008), 302--317.
    [17]
    Ilya Lipkovich, Alex Dmitrienko, Jonathan Denne, and Gregory Enas. 2011. Subgroup identification based on differential effect search—a recursive partitioning method for establishing response to treatment in patient subpopulations. Stat. Med. 30, 21 (2011), 2601--2621.
    [18]
    Wei-Yin Loh, Haoda Fu, Michael Man, Victoria Champion, and Menggang Yu. 2016. Identification of subgroups with differential treatment effects for longitudinal and multiresponse variables. Stat. Med. 35, 26 (2016), 4837--4855.
    [19]
    Wei-Yin Loh, Xu He, and Michael Man. 2015. A regression tree approach to identifying subgroups with differential treatment effects. Stat. Med. 34, 11 (2015), 1818--1833.
    [20]
    Kevin P. Murphy. 2012. Machine Learning: A Probabilistic Perspective. MIT Press.
    [21]
    Bengt Muthén. 2004. Latent variable analysis. The Sage Handbook of Quantitative Methodology for the Social Sciences. Sage Publications, 345--68.
    [22]
    Bengt Muthén, C. Hendricks Brown, Katherine Masyn, Booil Jo, Siek-Toon Khoo, Chih-Chien Yang, Chen-Pin Wang, Sheppard G. Kellam, John B. Carlin, and Jason Liao. 2002. General growth mixture modeling for randomized preventive interventions. Biostatistics 3, 4 (2002), 459--475.
    [23]
    Austin Nichols and others. 2007. Causal inference with observational data. Stata J. 7, 4 (2007), 507.
    [24]
    Alexander Peysakhovich and Akos Lada. 2016. Combining observational and experimental data to find heterogeneous treatment effects. arXiv:1611.02385 (2016).
    [25]
    Paul R. Rosenbaum and Donald B. Rubin. 1983. The central role of the propensity score in observational studies for causal effects. Biometrika 70, 1 (1983), 41--55.
    [26]
    Tapash Roy and Cathy E. Lloyd. 2012. Epidemiology of depression and diabetes: A systematic review. J. Affect. Disord. 142 (2012), S8--S21.
    [27]
    Donald B. Rubin. 1974. Estimating causal effects of treatments in randomized and nonrandomized studies.J. Educ. Psychol. 66, 5 (1974), 688.
    [28]
    Zach Shahn, David Madigan, and others. 2017. Latent class mixture models of treatment effect heterogeneity. Bayesian Analysis 12, 3 (2017), 831--854.
    [29]
    Bonnie Sibbald and Martin Roland. 1998. Understanding controlled trials. Why are randomised controlled trials important? Br. Med. J. 316, 7126 (1998), 201.
    [30]
    Xiaogang Su, Chih-Ling Tsai, Hansheng Wang, David M. Nickerson, and Bogong Li. 2009. Subgroup analysis via recursive partitioning. J. Mach. Learn. Res. 10 (2009), 141--158.
    [31]
    Lan Umek and Blaz Zupan. 2011. Subgroup discovery in datasets with multi-dimensional responses. Intell. Data Anal. 15, 4 (2011), 533--549.
    [32]
    Andrew J. Vickers. 2006. Whose data set is it anyway? Sharing raw data from randomized trials. Trials 7, 1 (2006), 15.
    [33]
    Andrew J. Vickers, Rebecca W. Rees, Catherine E. Zollman, Rob McCarney, Claire M. Smith, Nadia Ellis, Peter Fisher, and Robbert Van Haselen. 2004. Acupuncture for chronic headache in primary care: Large, pragmatic, randomised trial. Br. Med. J. 328, 7442 (2004), 744.
    [34]
    Stefan Wager and Susan Athey. 2017. Estimation and inference of heterogeneous treatment effects using random forests. J. Amer. Statist. Assoc.
    [35]
    John E. Ware Jr. 1999. SF-36 health survey.
    [36]
    Richard J. Willke, Zhiyuan Zheng, Prasun Subedi, Rikard Althin, and C. Daniel Mullins. 2012. From concepts, theory, and evidence of heterogeneity of treatment effects to methodological approaches: A primer. BMC Med. Res. Methodol. 12, 1 (2012), 185.
    [37]
    Achim Zeileis, Torsten Hothorn, and Kurt Hornik. 2008. Model-based recursive partitioning. J. Comput. Graph. Stat. 17, 2 (2008), 492--514.

    Cited By

    View all
    • (2023)Adaptive Integration of Categorical and Multi-relational Ontologies with EHR Data for Medical Concept EmbeddingACM Transactions on Intelligent Systems and Technology10.1145/362522414:6(1-20)Online publication date: 14-Nov-2023

    Index Terms

    1. A Bayesian Approach to Intervention-Based Clustering

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Intelligent Systems and Technology
      ACM Transactions on Intelligent Systems and Technology  Volume 9, Issue 4
      Research Survey and Regular Papers
      July 2018
      280 pages
      ISSN:2157-6904
      EISSN:2157-6912
      DOI:10.1145/3183892
      • Editor:
      • Yu Zheng
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 30 January 2018
      Accepted: 01 October 2017
      Revised: 01 September 2017
      Received: 01 May 2017
      Published in TIST Volume 9, Issue 4

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Bayesian analysis
      2. Clustering
      3. cross-validation
      4. heterogeneous treatment effects
      5. mixture model
      6. personalized medicine
      7. randomized controlled trial
      8. subgroup analysis

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)8
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 10 Aug 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Adaptive Integration of Categorical and Multi-relational Ontologies with EHR Data for Medical Concept EmbeddingACM Transactions on Intelligent Systems and Technology10.1145/362522414:6(1-20)Online publication date: 14-Nov-2023

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media