Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3383972.3384061acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlcConference Proceedingsconference-collections
research-article

Statistic Solution for Machine Learning to Analyze Heart Disease Data

Published: 26 May 2020 Publication History

Abstract

Data crawling, collection and analysis have become a popular pillar for the business intelligence of big data analysis which is the latest hot-topic among the research association. Numerous tools and techniques to solve and analyze the structured and unstructured datasets are developing very quickly. The previous studies show the different approaches in the identification of the strengths and weaknesses of multiple machine learning algorithms. But, most of the approaches demand more expert knowledge base information to understand the concepts of given data. In this paper, we modernize the machine learning methods for the effective prediction of heart disease. This work deliberates the detailed process of implementation of our proposed system. The goal of this work is to find a strong and effective machine learning algorithm for disease prediction for the problem; how can doctors get fast and better results for their diagnosis of heart disease. We design a new system for disease prediction using machine learning prediction algorithms (LR, ANN and SVC) by utilizing an effective approach of ETL, OLAP and data mining. The results showed that the best machine learning algorithm is SVC with 92% accuracy for the risk prediction model. We found that subjects at 56-64 years old have a high risk of heart disease, as well as men, have more heart disease rate than women. This proposed study can be favorable for the medical practitioners in the field of healthcare, supportive practice and precautions to the heart disease patients.

References

[1]
Hal Varian Answers Your Questions, February 25, 2008 (http://www.freakonomics.com/2008/02/25/hal-varian-answers-your-questions/), accessed: 2018-05-20.
[2]
EMC education services (2014) "Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and presenting data."
[3]
Center for disease control and prevention, https://www.cdc.gov/heartdisease/facts.htm, accessed: 2018-05-23.
[4]
Surajit Chaudhuri, Umeshwar Dayal, Vivek Narasayya, Communications of the ACM, Vol. 54 No. 8, Pages 88-98, 10.1145/1978542.1978562.
[5]
Chen, H., Chiang, R. H., & Storey, V. C. (2012). Business intelligence and analytics: from big data to big impact. MIS quarterly, 1165--1188.
[6]
Soni, J., Ansari, U., Sharma, D., & Soni, S. (2011). Intelligent and effective heart disease prediction system using weighted associative classifiers. International Journal on Computer Science and Engineering, 3(6), 2385--2392.
[7]
Bordeleau, Fanny-È, "Business Intelligence in Industry 4.0: State of the art and research opportunities", The Digital Supply Chain of the Future: Technologies, Applications and Business Models.
[8]
IoannisKavakiotis, (2017), "Machine Learning and Data Mining Methods in Diabetes Research", Volume 15, 2017, Pages 104-116, https://doi.org/10.1016/j.csbj.2016.12.005.
[9]
T. P. Fowdur, 2017, "Big Data Analytics with Machine Learning Tools", Internet of Things and Big Data Analytics Toward Next-Generation Intelligence pp 49--97.
[10]
Fatima, M. and Pasha, M. (2017) Survey of Machine Learning Algorithms for Disease Diagnostic. Journal of Intelligent Learning Systems and Applications, 9, 1--16. https://doi.org/10.4236/jilsa.2017.91001.
[11]
Chaurasia, Vikas and Pal, Saurabh, A Novel Approach for Breast Cancer Detection Using Data Mining Techniques (June 29, 2017). International Journal of Innovative Research in Computer & Communication Engineering, Vol. 2, Issue 1.
[12]
Adam Felman (7 February 2018), Reviewed by Debra Sullivan, PhD, MSN, RN, CNE, COI. https://www.medicalnewstoday.com/articles/237191.php.
[13]
Min Chen; Yixue Hao, (2017), "Disease Prediction by Machine Learning Over Big Data From Healthcare Communities", IEEE Access (Volume: 5)
[14]
Mehrbakhsh Nilashi, 2017, "An analytical method for diseases prediction using machine learning techniques", Computers & Chemical Engineering, Volume 106, 2 November 2017, Pages 212-223, https://doi.org/10.1016/j.compchemeng.2017.06.011.
[15]
Shifei Ding, 2015, "Unsupervised extreme learning machine with representational features", "International Journal of Machine Learning and Cybernetics", April 2017, Volume 8, Issue 2, pp 587--595.
[16]
Adilah Sabtu, (2017), "The challenges of Extract, Transform and Loading (ETL) system implementation for near real-time environment", "2017 International Conference on Research and Innovation in Information Systems (ICRIIS)".
[17]
Arik Sofan Tohir, (2017), "On-Line Analytic Processing (OLAP) modeling for graduation data presentation", "2017 2nd International conferences on Information Technology, Information Systems and Electrical Engineering (ICITISEE)".

Cited By

View all
  • (2024)Enhanced machine learning models for predicting breast cancer: Healthcare systemITM Web of Conferences10.1051/itmconf/2024640102064(01020)Online publication date: 5-Jul-2024
  • (2023)Cardiovascular disease prediction with imputation techniques and recursive feature eliminationTHE 6TH INTERNATIONAL CONFERENCE ON ENERGY, ENVIRONMENT, EPIDEMIOLOGY AND INFORMATION SYSTEM (ICENIS) 2021: Topic of Energy, Environment, Epidemiology, and Information System10.1063/5.0124079(030016)Online publication date: 2023
  • (2022)Improved Machine Learning-Based Predictive Models for Breast Cancer DiagnosisInternational Journal of Environmental Research and Public Health10.3390/ijerph1906321119:6(3211)Online publication date: 9-Mar-2022
  • Show More Cited By

Index Terms

  1. Statistic Solution for Machine Learning to Analyze Heart Disease Data

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICMLC '20: Proceedings of the 2020 12th International Conference on Machine Learning and Computing
    February 2020
    607 pages
    ISBN:9781450376426
    DOI:10.1145/3383972
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • Shenzhen University: Shenzhen University

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 May 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Machine learning
    2. data mining
    3. heart disease analysis

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICMLC 2020

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)56
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 02 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Enhanced machine learning models for predicting breast cancer: Healthcare systemITM Web of Conferences10.1051/itmconf/2024640102064(01020)Online publication date: 5-Jul-2024
    • (2023)Cardiovascular disease prediction with imputation techniques and recursive feature eliminationTHE 6TH INTERNATIONAL CONFERENCE ON ENERGY, ENVIRONMENT, EPIDEMIOLOGY AND INFORMATION SYSTEM (ICENIS) 2021: Topic of Energy, Environment, Epidemiology, and Information System10.1063/5.0124079(030016)Online publication date: 2023
    • (2022)Improved Machine Learning-Based Predictive Models for Breast Cancer DiagnosisInternational Journal of Environmental Research and Public Health10.3390/ijerph1906321119:6(3211)Online publication date: 9-Mar-2022
    • (2022)An effective up-sampling approach for breast cancer prediction with imbalanced data: A machine learning model-based comparative analysisPLOS ONE10.1371/journal.pone.026913517:5(e0269135)Online publication date: 27-May-2022
    • (2020)Analysis and Detection of Lung Sounds Anomalies Based on NMA-RNN2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM49941.2020.9313197(2498-2504)Online publication date: 16-Dec-2020

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media