Massively Parallel Data Analysis with PACTs on Nephele

A Alexandrov, M Heimel, V Markl, D Battré… - Proceedings of the …, 2010 - dl.acm.org
Proceedings of the VLDB Endowment, 2010
Large-scale data analysis applications require processing and analyzing terabytes or
even petabytes of data, particularly in the areas of web analysis and scientific data
management. This trend was discussed as "web-scale data management" in a panel at
VLDB 2009. Formerly, parallel data processing was the domain of parallel database
systems. Today, novel requirements such as scaling out to thousands of machines, improved
fault tolerance, and schema-free processing have made a case for new approaches.