Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Efficient and secure medical big data management system using optimal map-reduce framework and deep learning

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Big Data (BD) and cloud computing (CC) are the two widely used technologies and focus of study in several industries. Amongst all, healthcare sources generate a tremendous amount of data daily. Traditional processing techniques cannot handle this data because they are huge. Furthermore, being large, this data is also dynamic and diverse. Large data sets are stored, processed, and analyzed under BD. However, as the volume of data increased, third parties could hack it easily. The data are regularly stored in the cloud in an encrypted form to protect the data from intruders. This paper proposes a secure and efficient medical BD management and classification scheme using optimal Map Reduce (MR) and deep learning framework in a cloud environment. The system comprises '4' phases: authentication of patients, BD management in the cloud, secure data transfer and BD classification. First, the patient who wants to upload the files to the cloud and access the resources from the cloud is registered with the TC. It generates hash value using the Whirlpool Hashing (WH) algorithm, and the user credentials are stored in Blockchain (BC) to protect the network from unauthorized access. Once the authentication is successful, the patient can access any data in the cloud. Before uploading the file to the cloud, the preprocessing is carried out using missing values imputation, numerical conversion, and normalization, which improves the data quality. It is fed into the MR framework using Kernelized K-Means (KKM) clustering and Enhanced Butterfly Optimization Algorithm (EBOA) for managing the data efficiently. Then the map-reduced data is encrypted using Whirlpool Hashing-based Enhanced Rivest Shamir Adelman (WHERSA) to upload into the cloud securely. Finally, the classification of BD is done using Enhanced Ant Colony Weight Optimization based Deep Belief Network (EACWODBN) for disease prediction. The experimental outcomes reveal that the proposed system outperformed state-of-art methods while offering effectual and secure data management.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The dataset used for the present work is the publicly available, https://www.kaggle.com/datasets/johnsmith88/heart-disease-dataset

References

  1. Gill SS, Arya RC, Wander GS, Buyya R (2019) Fog-based smart healthcare as a big data and cloud service for heart patients using IoT. In: International Conference on Intelligent Data Communication Technologies and Internet of Things (ICICI) 2018. Springer International Publishing, pp 1376–1383

    Chapter  Google Scholar 

  2. Rajabion L, Shaltooki AA, Taghikhah M, Ghasemi A, Badfar A (2019) Healthcare big data processing mechanisms: The role of cloud computing. Int J Inf Manag 49:271–289

    Article  Google Scholar 

  3. Ramírez PLG, Lloret J, Taha M, Tomás J (2018). Architecture to integrate IoT networks using artificial intelligence in the cloud. In 2018 International Conference on Computational Science and Computational Intelligence (CSCI) (pp. 996-1001). IEEE

  4. Khaleel MI, Boskany NW, Abdulla MT (2018) A novel cloud-based temperature monitoring service to datacenter environment. Kurdistan J Appl Res 3(2):20–26

    Article  Google Scholar 

  5. Kulkarni O, Jena S, Sanjay CH (2020) Fractional fuzzy clustering and particle whale optimization-based mapreduce framework for big data clustering. J Intell Syst 29(1):1496–1513

    Google Scholar 

  6. Dash S, Shakyawar SK, Sharma M, Kaushik S (2019) Big data in healthcare: management, analysis and future prospects. J Big Data 6(1):1–25

    Article  Google Scholar 

  7. Saranya, P., & Asha, P. (2019). Survey on big data analytics in health care. In 2019 International Conference on Smart Systems and Inventive Technology (ICSSIT) (pp. 46-51). IEEE

  8. Lv Z, Qiao L (2020) Analysis of healthcare big data. Futur Gener Comput Syst 109:103–110

    Article  Google Scholar 

  9. Chen PT, Lin CL, Wu WN (2020) Big data management in healthcare: Adoption challenges and implications. Int J Inf Manag 53:102078

    Article  Google Scholar 

  10. Jagadeeswari V, Subramaniyaswamy V, Logesh R, Vijayakumar V (2018) A study on medical Internet of Things and Big Data in personalized healthcare system. Health Inform Sci Syst 6:1–20

    Google Scholar 

  11. Alexandru A, Alexandru C, Coardos D, Tudora E (2016) Healthcare, big data and cloud computing. management 1(2)

  12. Mohamed A, Najafabadi MK, Wah YB, Zaman EAK, Maskat R (2020) The state of the art and taxonomy of big data analytics: view from new big data framework. Artif Intell Rev 53:989–1037

    Article  Google Scholar 

  13. Gupta YK (2020) Aspect of Big Data in Medical Imaging to Extract the Hidden Information Using HIPI in HDFS Environment. Advance Mach Intel Interact Med Image Analys:19–40

  14. Kulkarni AJ, Siarry P, Singh PK, Abraham A, Zhang M, Zomaya A, Baki F, (Eds.). (2020) Big Data Analytics in Healthcare. Springer

    Book  Google Scholar 

  15. Ramani R, Vimala Devi K, Ruba Soundar K (2020) RETRACTED ARTICLE: MapReduce-based big data framework using modified artificial neural network classifier for diabetic chronic disease prediction. Soft Comput 24(21):16335–16345

    Article  Google Scholar 

  16. Abualigah L, Masri BA (2021). Advances in MapReduce big data processing: platform, tools, and algorithms. Artificial intelligence and IoT: Smart convergence for eco-friendly topography, pp. 105-128

  17. Tariq MI, Tayyaba S, Ashraf MW, Balas VE (2020) Deep learning techniques for optimizing medical big data. In: Deep Learning Techniques for Biomedical and Health Informatics. Academic Press, pp 187–211

    Google Scholar 

  18. Saeed MM, Al Aghbari Z, Alsharidah M (2020) Big data clustering techniques based on spark: a literature review. PeerJ Comput Sci 6:e321

    Article  Google Scholar 

  19. Farooqi MM, Shah MA, Wahid A, Akhunzada A, Khan F, ul Amin, N., & Ali, I. (2019) Big data in healthcare: A survey. Appli Intel Technol Health:143–152

  20. Lo'ai AT, Saldamli G (2021) Reconsidering big data security and privacy in cloud and mobile cloud systems. J King Saud Univ-Comput InformSci 33(7):810–819

    Google Scholar 

  21. Essa YM, Hemdan EED, El-Mahalawy A, Attiya G, El-Sayed A (2019) IFHDS: intelligent framework for securing healthcare bigdata. J Med Syst 43:1–13

    Google Scholar 

  22. Khan M, Ansari MD (2019) Security and privacy issue of big data over the cloud computing: a comprehensive analysis. IJRTE-Scopus Indexed 7(6s):413–417

    Google Scholar 

  23. Deepa N, Pham QV, Nguyen DC, Bhattacharya S, Prabadevi B, Gadekallu TR, ... & Pathirana PN (2022). A survey on blockchain for big data: approaches, opportunities, and future directions. Future Generation Computer Systems

  24. Stergiou CL, Plageras AP, Psannis KE, Gupta BB (2020) Secure machine learning scenario from big data in cloud computing via internet of things network. Principles and Paradigms, Handbook of Computer Networks and Cyber Security, pp 525–554

    Google Scholar 

  25. Premkamal PK, Pasupuleti SK, Alphonse PJA (2019) A new verifiable outsourced ciphertext-policy attribute based encryption for big data privacy and access control in cloud. J Ambient Intell Humaniz Comput 10:2693–2707

    Article  Google Scholar 

  26. Namasudra S, Devi D, Kadry S, Sundarasekar R, Shanthini A (2020) Towards DNA based data security in the cloud computing environment. Comput Commun 151:539–547

    Article  Google Scholar 

  27. Subramanian EK, Tamilselvan L (2020) Elliptic curve Diffie–Hellman cryptosystem in big data cloud security. Clust Comput 23:3057–3067

    Article  Google Scholar 

  28. Ramachandra MN, Srinivasa Rao M, Lai WC, Parameshachari BD, Ananda Babu J, Hemalatha KL (2022) An efficient and secure big data storage in cloud environment by using triple data encryption standard. Big Data Cognit Comput 6(4):101

    Article  Google Scholar 

  29. Djemaiel Y, Berrahal S, Boudriga N (2018) A novel graph-based approach for the management of health data on cloud-based WSANs. J Grid Comput 16:317–344

    Article  Google Scholar 

  30. Alabdulatif A, Khalil I, Yi X (2020) Towards secure big data analytic for cloud-enabled applications with fully homomorphic encryption. J Paral Distributed Comput 137:192–204

    Article  Google Scholar 

  31. Venkatesh R, Balasubramanian C, Kaliappan M (2019) Development of big data predictive analytics model for disease prediction using machine learning technique. J Med Syst 43:1–8

    Article  Google Scholar 

  32. Vaishali G, Kalaivani V (2016) Big data analysis for heart disease detection system using map reduce technique. In 2016 International Conference on Computing Technologies and Intelligent Data Engineering (ICCTIDE'16) (pp. 1-6). IEEE

  33. Mohiyuddin A, Javed AR, Chakraborty C, Rizwan M, Shabbir M, Nebhen J (2022) Secure cloud storage for medical IoT data using adaptive neuro-fuzzy inference system. Int J Fuzzy Syst 24(2):1203–1215

    Article  Google Scholar 

  34. Mohan S, Thirumalai C, Srivastava G (2019) Effective heart disease prediction using hybrid machine learning techniques. IEEE access 7:81542–81554

    Article  Google Scholar 

  35. Choi SY, Chung K (2020) Knowledge process of health big data using MapReduce-based associative mining. Pers Ubiquit Comput 24:571–581

    Article  Google Scholar 

  36. Harb H, Mroue H, Mansour A, Nasser A, Motta Cruz E (2020) A hadoop-based platform for patient classification and disease diagnosis in healthcare applications. Sensors 20(7):1931

    Article  Google Scholar 

  37. Rajendran S, Khalaf OI, Alotaibi Y, Alghamdi S (2021) MapReduce-based big data classification model using feature subset selection and hyperparameter tuned deep belief network. Sci Rep 11(1):24138

    Article  Google Scholar 

  38. Zhou Y, Varzaneh MG (2022) Efficient and scalable patients clustering based on medical big data in cloud platform. J Cloud Comput 11(1):1–10

    Article  Google Scholar 

  39. Game PS, Vaze V, Emmanuel M (2019) Optimized Decision tree rules using divergence based grey wolf optimization for big data classification in health care. Evol Intel:1–17

  40. Li JP, Haq AU, Din SU, Khan J, Khan A, Saboor A (2020) Heart disease identification method using machine learning classification in e-healthcare. IEEE access 8:107562–107582

    Article  Google Scholar 

  41. Jain P, Gyanchandani M, Khare N (2019) Enhanced secured map reduce layer for big data privacy and security. J Big Data 6(1):1–17

    Article  Google Scholar 

  42. Masud M, Gaba GS, Choudhary K, Alroobaea R, Hossain MS (2021) A robust and lightweight secure access scheme for cloud-based E-healthcare services. Peer-to-peer Network Appli 14(5):3043–3057

    Article  Google Scholar 

  43. Thamrin A, Xu H (2020). Cloud based secure and reliable bigdata storage service in healthcare system. in proceedings of international conference on Service Oriented System Engineering, https://doi.org/10.1109/SOSE52839.2021.00015

  44. Nguyen DC, Pathirana PN, Ding M, Seneviratne A (2019) Blockchain for secure ehrs sharing of mobile cloud-based e-health systems. IEEE access 7:66792–66806

    Article  Google Scholar 

  45. Vione E, Darmawan JB (2019, April) Performance of K-means in Hadoop Using MapReduce Programming Model. In: In Proceedings of the 1st International Conference on Science and Technology for an Internet of Things, 20 October 2018. Indonesia, Yogyakarta

    Google Scholar 

  46. Arora S, Singh S (2019) Butterfly optimization algorithm: a novel approach for global optimization. Soft Comput 23:715–734

    Article  Google Scholar 

  47. Parthasarathy PR, Yee HW, Loong SS, Rajamanickam L, Ayyappan P (2019) Implementation of RSA Algorithm to Secure Data in Cloud Computing. Int J Innov Sci, Engin Technol 6(4)

  48. Chen X, Li TH, Zhao Y, Wang CC, Zhu CC (2021) Deep-belief network for predicting potential miRNA-disease associations. Brief Bioinform 22(3):bbaa186

    Article  Google Scholar 

  49. Zhang H, Li Z, Shu W, Chou J (2019) Ant colony optimization algorithm based on mobile sink data collection in industrial wireless sensor networks. EURASIP J Wirel Commun Netw 2019(1):1–10

    Article  Google Scholar 

Download references

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. Rajeshkumar.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rajeshkumar, K., Dhanasekaran, S. & Vasudevan, V. Efficient and secure medical big data management system using optimal map-reduce framework and deep learning. Multimed Tools Appl 83, 47111–47138 (2024). https://doi.org/10.1007/s11042-023-17381-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-17381-8

Keywords