Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Flower: a data analytics <u>flow</u> elasticity manager

Published: 01 August 2017 Publication History

Abstract

A data analytics flow typically operates on three layers: ingestion, analytics, and storage, each of which is provided by a data-intensive system. These systems are often available as cloud managed services, enabling the users to have pain-free deployment of data analytics flow applications such as click-stream analytics. Despite straightforward orchestration, elasticity management of the flows is challenging. This is due to: a) heterogeneity of workloads and diversity of cloud resources such as queue partitions, compute servers and NoSQL throughputs capacity, b) workload dependencies between the layers, and c) different performance behaviours and resource consumption patterns.
In this demonstration, we present Flower, a holistic elasticity management system that exploits advanced optimization and control theory techniques to manage elasticity of complex data analytics flows on clouds. Flower analyzes statistics and data collected from different data-intensive systems to provide the user with a suite of rich functionalities, including: workload dependency analysis, optimal resource share analysis, dynamic resource provisioning, and cross-platform monitoring. We will showcase various features of Flower using a real-world data analytics flow. We will allow the audience to explore Flower by visually defining and configuring a data analytics flow elasticity manager and get hands-on experience with integrated data analytics flow management.

References

[1]
Amazon auto scaling. https://aws.amazon.com/autoscaling/.
[2]
Amazon cloudwatch. https://aws.amazon.com/cloudwatch/.
[3]
Amazon dynamodb. https://aws.amazon.com/dynamodb.
[4]
Amazon kinesis. http://aws.amazon.com/kinesis.
[5]
Ganglia monitoring system http://ganglia.sourceforge.net/.
[6]
Apache storm. http://storm.apache.org/.
[7]
R. Bhartia. Amazon kinesis and apache storm: Building a real-time sliding-window dashboard over streaming data. Technical report, Amazon Web Services, October 2014.
[8]
K. Deb, A. Pratap, S. Agarwal, and T. Meyarivan. A fast and elitist multiobjective genetic algorithm: NSGA-II. TEVC, 6(2):182--197, 2002.
[9]
A. Khoshkbarforoushha, A. Khosravian, and R. Ranjan. Elasticity management of Streaming Data Analytics Flows on clouds, Journal of Computer System Sciences, 2016.
[10]
A. Khoshkbarforoushha, M. Wang, R. Ranjan, L. Wang, L. Alem, S. U. Khan, and B. Benatallah. Dimensions for evaluating cloud resource orchestration frameworks. Computer, 49(2):24--33, 2016.
[11]
I. Konstantinou, E. Angelou, D. Tsoumakos, C. Boumpouka, N. Koziris, and S. Sioutas. TIRAMOLA: elastic NoSQL provisioning through a cloud management platform. In SIGMOD, pages 725--728. ACM, 2012.
[12]
H. C. Lim, S. Babu, and J. S. Chase. Automated control for elastic storage. In ICAC, pages 1--10. ACM, 2010.
[13]
J. Ortiz, B. Lee, and M. Balazinska. Perfenforce demonstration: Data analytics with performance guarantees. SIGMOD, 2016.
[14]
P. Padala, K. G. Shin, X. Zhu, M. Uysal, Z. Wang, S. Singhal, A. Merchant, and K. Salem. Adaptive control of virtualized resources in utility computing environments. In ACM SIGOPS Operating Systems Review, volume 41, pages 289--302. 2007.
[15]
T. Zhu, A. Gandhi, M. Harchol-Balter, and M. A. Kozuch. Saving cash by using less cache. In HotCloud, 2012.

Cited By

View all
  • (2018)MoiraProceedings of the International Workshop on Real-Time Business Intelligence and Analytics10.1145/3242153.3242160(1-10)Online publication date: 27-Aug-2018

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 10, Issue 12
August 2017
427 pages
ISSN:2150-8097
Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 01 August 2017
Published in PVLDB Volume 10, Issue 12

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2018)MoiraProceedings of the International Workshop on Real-Time Business Intelligence and Analytics10.1145/3242153.3242160(1-10)Online publication date: 27-Aug-2018

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media