Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3014812.3014874acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesaus-cswConference Proceedingsconference-collections
research-article

Data-driven patient segmentation using K-means clustering: the case of hip fracture care in Ireland

Published: 31 January 2017 Publication History
  • Get Citation Alerts
  • Abstract

    Machine learning continues to forge the future of decision making in a broad diversity of domains including healthcare. Data-driven methods are increasingly geared towards leveraging evidence-based insights from large volumes of patient data. In this context, this paper embraces a mere data-driven approach for the segmentation of patients with application to hip fracture care in Ireland. Using K-Means clustering, elderly patients are grouped based on the similarity of age, length of stay (LOS) and elapsed time to surgery. We utilise a dataset retrieved from the Irish Hip fracture Database (IHFD) covering the period of two years (2013--2014). Our results suggest the presence of three coherent clusters of patients. Through cluster analysis, possible correlations are explored in relation to patient characteristics, care-related factors, and patient outcomes. For instance, the study inspects the potential impact of time to surgery on patient outcomes (e.g. LOS) within the discovered clusters of patients. Furthermore, the clusters are visually interpreted in a demographic context with respect to the structure of the healthcare system in Ireland. Broadly, the study is claimed to serve useful purposes for healthcare executives in Ireland for developing more patient-centred care strategies.

    References

    [1]
    Aldenderfer, M.S. and Blashfield, R.K., 1984. Cluster analysis. Sage University paper series on quantitative applications in the social sciences 07--044.
    [2]
    El-Darzi, E., Abbi, R., Vasilakis, C., Gorunescu, F., Gorunescu, M. and Millard, P., 2009. Length of stay-based clustering methods for patient grouping. In Intelligent Patient Management (pp. 39--56). Springer Berlin Heidelberg.
    [3]
    Ellanti, P., Cushen, B., Galbraith, A., Brent, L., Hurson, C. and Ahern, E., 2014. Improving Hip Fracture Care in Ireland: A Preliminary Report of the Irish Hip Fracture Database. Journal of osteoporosis, 2014.
    [4]
    M. K. Dodds, M. B. Codd, A. Looney, and K. J. Mulhall, "Incidence of hip fracture in the Republic of Ireland and future projections: a population-based study," Osteoporosis International,vol. 20, no. 12, pp. 2105--2110, 2009.
    [5]
    Laffoy, M., 2008. Strategy to prevent falls and fractures in Ireland's ageing population summary, conclusions and recommendations.
    [6]
    http://www.hse.ie/eng/services/publications/olderpeople/Executive_Summary_-_Strategy_to_Prevent_Falls_and_Fractures_in_Ireland%E2%80%99s_Ageing_Population.pdf
    [7]
    http://www.cso.ie/en/statistics/population/
    [8]
    Gröne, O. and Garcia-Barbero, M., 2001. Integrated care. International journal of integrated care, 1(2).
    [9]
    http://www.hse.ie/eng/services/publications/corporate/CHO_Chapter_1.pdf
    [10]
    Jain, A.K. and Dubes, R.C., 1988. Algorithms for clustering data. Prentice-Hall, Inc.
    [11]
    https://www.noca.ie/irish-hip-fracture-database.
    [12]
    http://www.hpo.ie/hipe/hipe_data_dictionary/HIPE_Data_Dictionary_2015_V7.0.pdf
    [13]
    Hodge, V.J. and Austin, J., 2004. A survey of outlier detection methodologies. Artificial Intelligence Review, 22(2), pp.85--126.
    [14]
    Sarsfield, S., 2009. The data governance imperative. IT Governance Publishing.
    [15]
    https://www.noca.ie/wp-content/uploads/2015/11/IHFD-National-Report-2014-Online-Version.pdf.
    [16]
    Yang, Q. and Wu, X., 2006. 10 challenging problems in data mining research. International Journal of Information Technology & Decision Making, 5(04), pp.597--604.
    [17]
    Galar, M., Fernandez, A., Barrenechea, E., Bustince, H. and Herrera, F., 2012. A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 42(4), pp.463--484.
    [18]
    Japkowicz, N. and Stephen, S., 2002. The class imbalance problem: A systematic study. Intelligent data analysis, 6(5), pp.429--449.
    [19]
    Sun, Y., Wong, A.K. and Kamel, M.S., 2009. Classification of imbalanced data: a review. International Journal of Pattern Recognition and Artificial Intelligence, 23(04), pp.687--719.
    [20]
    Japkowicz, N., 2000, July. Learning from imbalanced data sets: a comparison of various strategies. In AAAI workshop on learning from imbalanced data sets (Vol. 68, pp. 10--15).
    [21]
    Visalakshi, N.K. and Thangavel, K., 2009. Impact of normalization in distributed k-means clustering. International Journal of Soft Computing, 4(4), pp.168--172.
    [22]
    Patel, V.R. and Mehta, R.G., 2011. Impact of outlier removal and normalization approach in modified k-means clustering algorithm. IJCSI International Journal of Computer Science Issues, 8(5).
    [23]
    British Orthopaedic Association, 2007. The care of patients with fragility fracture. London: British Orthopaedic Association, pp.8--11.
    [24]
    Jain, A.K., 2010. Data clustering: 50 years beyond K-means. Pattern recognition letters, 31(8), pp.651--666.
    [25]
    Sammut, C. and Webb, G.I. eds., 2011. Encyclopedia of machine learning. Springer Science & Business Media.
    [26]
    Rokach, L. and Maimon, O., 2005. Clustering methods. In Data mining and knowledge discovery handbook (pp. 321--352). Springer US.
    [27]
    Huang, Z., 1998. Extensions to the k-means algorithm for clustering large data sets with categorical values. Data mining and knowledge discovery, 2(3), pp.283--304.
    [28]
    Wickham, H., 2009. ggplot2: elegant graphics for data analysis. Springer Science & Business Media.
    [29]
    Barga, R., Fontama, V., Tok, W.H. and Cabrera-Cordon, L., 2015. Predictive analytics with Microsoft Azure machine learning. Apress.
    [30]
    Millard, Peter H., and Sally I. McClean, eds. Modelling hospital resource use: a different approach to the planning and control of health care systems. Royal Society of Medicine Press, 1994.
    [31]
    Faddy, M. J., and S. I. McClean. "Analysing data on lengths of stay of hospital patients using phase-type distributions." Applied Stochastic Models in Business and Industry 15, no. 4 (1999): 311--317.
    [32]
    Marshall, A. H., and S. I. McClean. "Conditional phase-type distributions for modelling patient length of stay in hospital." International Transactions in Operational Research 10, no. 6 (2003): 565--576.
    [33]
    Fackrell, Mark. "Modelling healthcare systems with phase-type distributions." Health care management science 12, no. 1 (2009): 11--26.
    [34]
    Johansen, A., Wakeman, R., Boulton, C., Plant, F., Roberts, J. and Williams, A., 2013. National Hip Fracture Database: National Report 2013. Clinical Effectiveness and Evaluation Unit at the Royal College of Physicians.
    [35]
    Wichmann, M.W., Inthorn, D., Andress, H.J. and Schildberg, F.W., 2000. Incidence and mortality of severe sepsis in surgical intensive care patients: the influence of patient gender on disease process and outcome. Intensive care medicine, 26(2), pp.167--172.
    [36]
    Kaplan, V., ANGUS, D.C., GRIFFIN, M.F., CLERMONT, G., Scott Watson, R. and LINDE-ZWIRBLE, W.T., 2002. Hospitalized community-acquired pneumonia in the elderly: age-and sex-related patterns of care and outcome in the United States. American journal of respiratory and critical care medicine, 165(6), pp.766--772.
    [37]
    Oksuzyan, A., Juel, K., Vaupel, J.W. and Christensen, K., 2008. Men: good health and high mortality. Sex differences in health and aging. Aging clinical and experimental research, 20(2), pp.91--102.
    [38]
    Isken, M.W. and Rajagopalan, B., 2002. Data mining to support simulation modeling of patient flow in hospitals. Journal of medical systems, 26(2), pp.179--197.
    [39]
    Ceglowski, R., Churilov, L. and Wasserthiel, J., 2007. Combining data mining and discrete event simulation for a value-added view of a hospital emergency department. Journal of the Operational Research Society, 58(2), pp.246--54.
    [40]
    Newcomer, S.R., Steiner, J.F. and Bayliss, E.A., 2010. Identifying subgroups of complex patients with cluster analysis. The American journal of managed care, 17(8), pp.e324--32.
    [41]
    Codrington-Virtue, A., Chaussalet, T., Millard, P., Whittlestone, P. and Kelly, J., 2006, June. A system for patient management based discrete-event simulation and hierarchical clustering. In 19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06) (pp. 800--804). IEEE.

    Cited By

    View all
    • (2024)Augmenting K-Means Clustering With Qualitative Data to Discover the Engagement Patterns of Older Adults With Multimorbidity When Using Digital Health Technologies: Proof-of-Concept TrialJournal of Medical Internet Research10.2196/4628726(e46287)Online publication date: 28-Mar-2024
    • (2024)The hospital emigration to another region in the light of the environmental, social and governance model in Italy during the period 2004-2021BMC Public Health10.1186/s12889-024-19369-x24:1Online publication date: 15-Jul-2024
    • (2023)Representation Learning and Spectral Clustering for the Development and External Validation of Dynamic Sepsis Phenotypes: Observational Cohort StudyJournal of Medical Internet Research10.2196/4561425(e45614)Online publication date: 23-Jun-2023
    • Show More Cited By

    Index Terms

    1. Data-driven patient segmentation using K-means clustering: the case of hip fracture care in Ireland

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        ACSW '17: Proceedings of the Australasian Computer Science Week Multiconference
        January 2017
        615 pages
        ISBN:9781450347686
        DOI:10.1145/3014812
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 31 January 2017

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. K-means
        2. clustering
        3. elderly healthcare
        4. hip fracture care
        5. machine learning
        6. unsupervised learning

        Qualifiers

        • Research-article

        Conference

        ACSW 2017
        ACSW 2017: Australasian Computer Science Week 2017
        January 30 - February 3, 2017
        Geelong, Australia

        Acceptance Rates

        ACSW '17 Paper Acceptance Rate 78 of 156 submissions, 50%;
        Overall Acceptance Rate 204 of 424 submissions, 48%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)31
        • Downloads (Last 6 weeks)3

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Augmenting K-Means Clustering With Qualitative Data to Discover the Engagement Patterns of Older Adults With Multimorbidity When Using Digital Health Technologies: Proof-of-Concept TrialJournal of Medical Internet Research10.2196/4628726(e46287)Online publication date: 28-Mar-2024
        • (2024)The hospital emigration to another region in the light of the environmental, social and governance model in Italy during the period 2004-2021BMC Public Health10.1186/s12889-024-19369-x24:1Online publication date: 15-Jul-2024
        • (2023)Representation Learning and Spectral Clustering for the Development and External Validation of Dynamic Sepsis Phenotypes: Observational Cohort StudyJournal of Medical Internet Research10.2196/4561425(e45614)Online publication date: 23-Jun-2023
        • (2022)Forecast of Medical Costs in Health Companies Using Models Based on Advanced AnalyticsAlgorithms10.3390/a1504010615:4(106)Online publication date: 23-Mar-2022
        • (2022)Identification and Potential Use of Clusters of Patients With Colorectal Cancer and Patients With Prostate Cancer in Clinical Practice: Explorative Mixed Methods StudyJMIR Cancer10.2196/429088:4(e42908)Online publication date: 27-Dec-2022
        • (2021)Fusion of Unobtrusive Sensing Solutions for Sprained Ankle Rehabilitation Exercises Monitoring in Home EnvironmentsSensors10.3390/s2122756021:22(7560)Online publication date: 13-Nov-2021
        • (2021)A Cluster Analysis Methodology for the Categorization of Soil Samples for Forensic Sciences Based on Elemental FingerprintApplied Artificial Intelligence10.1080/08839514.2021.201094136:1Online publication date: 20-Dec-2021

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media