Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3107514.3107518acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmhiConference Proceedingsconference-collections
research-article

Usage Patterns and Data Quality: A Case Study of a National Type-1 Diabetes Study

Published: 20 May 2017 Publication History

Abstract

The Environmental Determinants of Islet Auto- immunity (ENDIA) project is Australia's largest study into the causes of Type-1 Diabetes (T1D). The ENDIA study is supported by a Cloud-based software platform including a clinical registry comprising extensive longitudinal information on families at risk of having a child that might go on to develop T1D. This registry includes both demographic and clinical information on families and children as well as the environmental factors that might influence the onset of T1D. A multitude of samples are obtained through the study and used to support a diverse portfolio of bioinformatics data analytics. The quality of the data in the registry is essential to the overall success of the project. This paper presents a Cloud-based log-analytics platform that supports the detailed analysis of patterns of usage of the registry by the clinical centres and collaborators involved. We explore the impact that the usage patterns have on the overall data quality. We also consider ways of improving data quality by mothers entering their own data through targeted mobile applications that have been developed for dietary data collection.

References

[1]
Agosti, M., Crivellari, F. and Di Nunzio, G. 2012. Web log analysis: a review of a decade of Studies about information acquisition, inspection and Interpretation of user interaction, Data Mining and Knowledge Discovery 24, 3(2012), 663--696, DOI= 10.1007/s10618-011-0228-8
[2]
Alberti, K. and Zimmet, P. 1998. Definition, diagnosis and classification of diabetes mellitus and its complications. Part 1: diagnosis and classification of diabetes mellitus. Provisional report of a WHO Consultation. Diabetic Medicine 15, 7(1998), 539--553.
[3]
Anderson, J., Lehnardt, J., Slater, N. 2010. CouchDB: The Definitive Guide. Sebastopol, Calif.: O'Reilly Media, Inc.
[4]
Andrews, J. H. 1998. Testing using log file analysis: tools, methods, and issues, In 13th IEEE International Conference on Automated Software Engineering, Proceedings, 157--166.
[5]
Brandt, C. A., Argraves, S., Money, R., Ananth, G., Trocky, N. and Nadkarni, P. M. 2006. Informatics tools to improve clinical research study implementation. Contemporary Clinical Trials 27, 2(2006), 112--122.
[6]
Buchan, I. and Bischop, J. W. C. 2009. A Unified Modeling Approach to Data-Intensive Healthcare. In T. Hey, S. Tansley, & K. Tolle (Eds.), The Fourth Paradigm, Redmond, Washington: Microsoft Research, 91--98.
[7]
Catanzariti, L., Faulks, K., Moon, L., Waters, A., Flack, J. and Craig, M. 2009. Australia's national trends in the incidence of Type 1 diabetes in 0-14-year-olds, 2000--2006. Diabetic Medicine 26, 6 (2009), 596--601.
[8]
Chen, K., Chen, H., Conway, N., Hellerstein, J. M. and Parikh, T. S. 2011. Usher: Improving Data Quality with Dynamic Forms. IEEE Transactions on Knowledge and Data Engineering 23, 8(2011), 1138--1153.
[9]
Collins, D. 2016. The Importance of Log Analysis, 2016. www.davetalks.com/articles/importance-of-log-analysis.htm.
[10]
Dechartres, A., Boutron, I., Trinquart, L., Charles, P. and Ravaud, P. 2011. Single-Center Trials Show Larger Treatment Effects Than Multicenter Trials: Evidence From a Meta-epidemiologic Study. Annals of Internal Medicine 155, 1 (2011), 39.
[11]
Deterding, S., Sicart, M., Nacke, L., O'Hara, K., Dixon, D. and Gamification, D. 2011. Using game-design elements in non-gaming contexts. In CHI'11 Extended Abstracts on Human Factors in Computing Systems, 2425--2428.
[12]
ECRIN. 2007. ECRIN Report - Survey on data management, tools and procedures within ECRIN.
[13]
ECRIN. 2010. Standard requirements for GCP-compliant data management in multinational clinical trials, (May. 2010), 1--24.
[14]
Gatta, G., Capocaccia, R., Trama, A. and Martinez, C. 2010. The Burden of Rare Cancers in Europe. In M. Posada de la Paz & S. C. Groft (Eds.), Rare Diseases Epidemiology 686, 285--303, Dordrecht: Springer.
[15]
Glöckner, S., Arlt, W., Bancos, I., Stell, A. and Sinnott, R.O. 2015. Improving Data Quality in Disease Registries and Clinical Trials: A Case Study from the ENSAT-CANCER Project. In A. Maeder & J. Warren (Eds.), 8th Australasian Workshop on Health Informatics and Knowledge Management (HIKM 2015) 164, 25--32, Sydney, Australia.
[16]
Guthrie, L. B., Oken, E., Sterne, J. A. C., Gillman, M., Patel, R., Vilchuck, K. and Martin, R. M. 2012. Ongoing monitoring of data clustering in multicenter studies. BMC Medical Research Methodology 12, 29.
[17]
Hamari, J., Koivisto, J. and Sarsa, H. 2014. Does gamification work? - A literature review of empirical studies on gamification. Proceedings of the Annual Hawaii International Conference on System Sciences, 3025--3034.
[18]
Hripcsak, G., Cimino, J. J. and Sengupta, S. 1999. WebCIS: large-scale deployment of a Web-based clinical information system, Proc AMIA Symp, 804--808.
[19]
Kimball, R., Ross, M. 2013. The data warehouse toolkit. Indianapolis, Wiley.
[20]
Kozelek, P. 2010. Audit Trail -- Tracing Data Changes in Databases, http://www.codeproject.com/Articles/105768/Audit-Trail-Tracing-Data-Changes-in-Database.
[21]
Miranskyy, A., Hamou-Lhadj, A., Cialini, E. and Larsson, E. A. 2016. Operational-Log Analysis for Big Data Systems: Challenges and Solutions. IEEE Software 33, 2(2016), 52--59.
[22]
Nahm, M., Pieper, C. F. and Cunningham, M. M. 2008. Quantifying data quality for clinical trials using electronic data capture. PloS One 3, 8(2008), e3049.
[23]
Nahm, M. 2012. Data Quality in Clinical Research. Clinical Research Informatics, 175--201, London: Springer. DOI= http://doi.org/10.1007/978-1-84882-448-5_10
[24]
Oliner, A., Ganapathi, A. and Xu, W. 2012. Advances and challenges in log analysis. Communications of the ACM 55, 2(2012), 55.
[25]
Paulsen, A., Overgaard, S. and Lauritsen, J. M. 2012. Quality of data entry using single entry, double entry and automated forms processing--an example based on a study of patient-reported outcomes. PloS One 7, 4(2012).
[26]
Penno, M., Couper, J., Craig, M., Colman, P., Rawlinson, W., Cotterill, A., Jones, T., Harrison, L., Baghurst, P., Barry, S., Cameron, F., Dodd, J., Duran, C., Forbes, J., Makrides, M., Morahan, G., Nelson, K., Nankervis, A., Sinnott, R.O. and Wentworth, J. 2013. Environmental determinants of islet autoimmunity (ENDIA): a pregnancy to early life cohort study in children at-risk of type 1 diabetes 13, BMC Pediatrics, 124(2013), DOI =10.1186/1471-2431-13-124.
[27]
Stead, W., Searle, J., Fessler, H., Smith, J. and Shortliffe, E. 2011. Biomedical Informatics: Changing What Physicians Need to Know and How They Learn. Academic Medicine: Journal of the Association of American Medical Colleges 86, 4 (2011), 429--434.
[28]
Taplin, C. E., Craig, M. E., Lloyd, M.,Silink, M., Howard, N. J., Taylor, C. and Crock, P. 2005. The rising incidence of childhood type 1 diabetes in New South Wales, 1990-2002 Med J Aust, 183(2005), 243--6.
[29]
Tenenbaum, J. D. 2015. Personalized Medicine. In P. R. O. Payne, P. J. Embi, I. N. Sarkar, N. Shah, J. D. Tenenbaum, & A. B. Wilcox (Eds.), Translational Informatics, 35--60.
[30]
Venet, D., Doffagne, E. and Burzykowski, T. et al. 2012. A statistical approach to central monitoring of data quality in clinical trials. Clinical Trials 9, 6 (2012), 705--713.
[31]
Walther, B., Hossin, S., Townend, J., Abernethy, N., Parker, D. and Jeffries, D. 2011. Comparison of electronic data capture (EDC) with the standard data capture method for clinical trial data. PloS One 6, 9(2011).
[32]
Wickramage, C., Sahama, T. and Fidge, C. 2016. Anatomy of log files: Implications for information accountability measures. In e-Health Networking, Applications and Services (Healthcom), 2016 IEEE 18th International Conference, 1--6. IEEE.

Index Terms

  1. Usage Patterns and Data Quality: A Case Study of a National Type-1 Diabetes Study

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICMHI '17: Proceedings of the 1st International Conference on Medical and Health Informatics 2017
    May 2017
    118 pages
    ISBN:9781450352246
    DOI:10.1145/3107514
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 May 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Cloud
    2. Type-1 diabetes
    3. auditing
    4. log analysis

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICMHI '17

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 115
      Total Downloads
    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media