Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1109/SERVICES.2014.41guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Challenges for MapReduce in Big Data

Published: 27 June 2014 Publication History

Abstract

In the Big Data community, MapReduce has been seen as one of the key enabling approaches for meeting continuously increasing demands on computing resources imposed by massive data sets. The reason for this is the high scalability of the MapReduce paradigm which allows for massively parallel and distributed execution over a large number of computing nodes. This paper identifies MapReduce issues and challenges in handling Big Data with the objective of providing an overview of the field, facilitating better planning and management of Big Data projects, and identifying opportunities for future research in this field. The identified challenges are grouped into four main categories corresponding to Big Data tasks types: data storage (relational databases and NoSQL stores), Big Data analytics (machine learning and interactive analytics), online processing, and security and privacy. Moreover, current efforts aimed at improving and extending MapReduce to address identified challenges are presented. Consequently, by identifying issues and challenges MapReduce faces when handling Big Data, this study encourages future Big Data research.

Cited By

View all
  • (2024)Role of IoT technologies in big data management systemsPervasive and Mobile Computing10.1016/j.pmcj.2024.101905100:COnline publication date: 1-May-2024
  • (2023)A Novel Query Method for Spatial Database Based on Improved K-Nearest Neighbor AlgorithmInternational Journal of Decision Support System Technology10.4018/IJDSST.33277316:1(1-15)Online publication date: 14-Nov-2023
  • (2021)Small Files in HDFS and Their Impact on Hadoop PerformanceThe 23rd International Conference on Information Integration and Web Intelligence10.1145/3487664.3487717(385-390)Online publication date: 29-Nov-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
SERVICES '14: Proceedings of the 2014 IEEE World Congress on Services
June 2014
496 pages
ISBN:9781479950690

Publisher

IEEE Computer Society

United States

Publication History

Published: 27 June 2014

Author Tags

  1. Big Data
  2. Big Data Analytics
  3. Interactive Analytics
  4. Machine Learning
  5. MapReduce
  6. NoSQL
  7. Online Processing
  8. Privacy
  9. Security

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Role of IoT technologies in big data management systemsPervasive and Mobile Computing10.1016/j.pmcj.2024.101905100:COnline publication date: 1-May-2024
  • (2023)A Novel Query Method for Spatial Database Based on Improved K-Nearest Neighbor AlgorithmInternational Journal of Decision Support System Technology10.4018/IJDSST.33277316:1(1-15)Online publication date: 14-Nov-2023
  • (2021)Small Files in HDFS and Their Impact on Hadoop PerformanceThe 23rd International Conference on Information Integration and Web Intelligence10.1145/3487664.3487717(385-390)Online publication date: 29-Nov-2021
  • (2019)A Systematic Review on Big Data Analytics Frameworks for Higher Education - Tools and AlgorithmsProceedings of the 2019 2nd International Conference on E-Business, Information Management and Computer Science10.1145/3377817.3377836(1-9)Online publication date: 12-Aug-2019
  • (2019)Parallel Computing of Support Vector MachinesACM Computing Surveys10.1145/328098951:6(1-38)Online publication date: 28-Jan-2019
  • (2018)A Binary Cuckoo Search Big Data Algorithm Applied to Large-Scale Crew Scheduling ProblemsComplexity10.1155/2018/83951932018Online publication date: 1-Jan-2018
  • (2017)A strategy for scheduling reduce task based on intermediate data locality of the MapReduceCluster Computing10.1007/s10586-017-0972-720:4(2821-2831)Online publication date: 1-Dec-2017
  • (2016)Attributed Graph Rewriting for Complex Event Processing Self-ManagementACM Transactions on Autonomous and Adaptive Systems10.1145/296749911:3(1-39)Online publication date: 20-Sep-2016
  • (2016)A security and cost aware scheduling algorithm for heterogeneous tasks of scientific workflow in cloudsFuture Generation Computer Systems10.1016/j.future.2015.12.01465:C(140-152)Online publication date: 1-Dec-2016
  • (2016)A general perspective of Big DataThe Journal of Supercomputing10.1007/s11227-015-1501-172:8(3073-3113)Online publication date: 1-Aug-2016
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media