Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2486767acmconferencesBook PagePublication PagesmodConference Proceedingsconference-collections
DanaC '13: Proceedings of the Second Workshop on Data Analytics in the Cloud
ACM2013 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
SIGMOD/PODS'13: International Conference on Management of Data New York New York 23 June 2013
ISBN:
978-1-4503-2202-7
Published:
23 June 2013
Sponsors:

Reflects downloads up to 10 Nov 2024Bibliometrics
Skip Abstract Section
Abstract

Data analytics has the potential to be a transformer of scientific research, and data-driven business decisions. By effectively analyzing huge volumes of data, scientific research can be transformed from hypothesis-driven to data-driven, where forming scientific hypotheses will be aided by discovering patterns in vast quantities of data. For most technology companies that operate on a Web scale, analyzing customer data can provide insights on customer behavior, and lead to answers for critical business decisions.

Cloud computing has emerged as a cost-effective and elastic computing paradigm. Cloud infrastructures scale to massive numbers of commodity computing nodes and provide adaptive provisioning without prohibitive initial investments. Data analytics has the potential to be a significant cloud application, and to constitute a large fraction of the workload of modern data centers. Designing the infrastructures and systems for data management in the new computing environments remains an open challenge.

Skip Table Of Content Section
research-article
Don't match twice: redundancy-free similarity computation with MapReduce

To improve the effectiveness of pair-wise similarity computation, state-of-the-art approaches assign objects to multiple overlapping clusters. This introduces redundant pair comparisons when similar objects share more than one cluster. We propose an ...

research-article
Multi-objective optimization of data flows in a multi-cloud environment

As cloud-based solutions have become one of the main choices for intensive data analysis both for business decision making and scientific purposes, users face the problem of choosing among different cloud providers. In this work, we deal with data ...

research-article
ScyPer: elastic OLAP throughput on transactional data

Ever increasing main memory sizes and the advent of multi-core parallel processing have fostered the development of in-core databases. Even the transactional data of large enterprises can be retained in-memory on a single server. Modern in-core ...

research-article
Scalable I/O-bound parallel incremental gradient descent for big data analytics in GLADE

Incremental gradient descent is a general technique to solve a large class of convex optimization problems arising in many machine learning tasks. GLADE is a parallel infrastructure for big data analytics providing a generic task specification ...

research-article
A vision for personalized service level agreements in the cloud

Public Clouds today provide a variety of services for data analysis such as Amazon Elastic MapReduce and Google BigQuery. Each service comes with a pricing model and service level agreement (SLA). Today's pricing models and SLAs are described at the ...

research-article
Towards a workload for evolutionary analytics

Emerging data analysis involves the ingestion and exploration of new data sets, application of complex functions, and frequent query revisions based on observing prior query answers. We call this new type of analysis evolutionary analytics and identify ...

research-article
GPText: Greenplum parallel statistical text analysis framework

Many companies keep large amounts of text data inside of relational databases. Several challenges exist in using state-of-the-art systems to perform analysis on such datasets. First, expensive big data transfer cost must be paid up front to move data ...

research-article
Enabling secure query processing in the cloud using fully homomorphic encryption

The database community, at least for the last decade, has been grappling with querying encrypted data, which would enable secure database as a service solutions. A recent breakthrough in the cryptographic community (in 2009) related to fully homomorphic ...

research-article
A case for dynamic memory partitioning in data centers

Leveraging distributed main memory is becoming an increasingly popular approach to speed up large-scale data-intensive cluster applications. However, despite the growing number of possible performance benefits, recent studies indicate that the static ...

Contributors
  • Technical University of Berlin
  • Unravel Data

Recommendations

Acceptance Rates

DanaC '13 Paper Acceptance Rate 9 of 16 submissions, 56%;
Overall Acceptance Rate 19 of 34 submissions, 56%
YearSubmittedAcceptedRate
DanaC'156467%
DanaC'1412650%
DanaC '1316956%
Overall341956%