Abstract
After Google published their first paper on their software infrastruc-ture in October 2003, the open-source community quickly began working on similar free solutions. Yahoo! is now able to process terabytes of data daily using Hadoop, which is a scalable distributed file system and an open-source implementation of Google’s MapReduce. HBase, a distributed database that uses Hadoop, enables the reliable storage of structured data, just like Google’s Bigtable which powers applications like Google Maps and Google Analytics, to name only two. Many companies are tempted to use these technologies, but it is currently difficult to compare today’s systems with systems built on top of HBase. This paper presents this new technology and, a list of proposed comparison elements to existing database technology as well as proposed comparison assessment criteria.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Carr, D.: How Google Works, http://www.baselinemag.com/c/a/Projects-Networks-and-Storage/How-Google-Works-%5B1%5D/
HADOOP.COM. Product page, http://www.hadoop.com
HBASE.ORG. Product page, http://www.hbase.org
Chang, F., Dean, J., Ghemawat, S., et al.: Bigtable: A Distributed Storage System for Structured Data. In: 7th Symposium on Operating Systems Design and Implementation (OSDI 2006), Seattle, WA, USA, November 6-8, pp. 205–218 (2006)
Cordes, K.: YouTube scalability Talk (July 14, 2007), http://kylecordes.com/2007/07/12/youtube-scalability/
Ghemawat, S., Gobioff, H., Leung, S.-T.: The Google file system. In: Proc. of the 19th ACM SOSP, December 2003, pp. 29–43 (2003)
Dean, J., Ghemawat, S.: MapReduce: Simplified data processing on large clusters. In: Proc. of the 6th OSDI, December 2004, pp. 137–150 (2004)
Burrows, M.: The Chubby lock service for loosely coupled distributed systems. In: Proc. of the 7th OSDI (November 2006)
YAHOO.COM. Product page, http://research.yahoo.com/node/1849
APACHE.ORG. Product page, http://lucene.apache.org/nutch
Baldeschwieler, E.: Yahoo! Launches world’s biggest Hadoop production application (February 19, 2008), http://developer.yahoo.com/blogs/hadoop/2008/02/yahoo-worlds-largest-production-hadoop.html
Borthakur, D.: The Hadoop Distributed File System: Architecture and Design (May 21, 2008), http://hadoop.apache.org/core/docs/r0.17.0/hdfs_design.html
Dean, J., Ghemawat, S.: MapReduce: Simplified data processing on large clusters. Communications of the ACM 51(1) (January 2008)
Hood, S.: MapReduce at Rackspace (January 23, 2008), http://blog.racklabs.com/?p=66
List of companies using Hadoop (June 6, 2008), http://wiki.apache.org/hadoop/PoweredBy
Reed, B.: Zookeeper (March 25, 2008), http://research.yahoo.com/node/2120
GOOGLE.COM. Product page, http://code.google.com/p/pyshards/wiki/Pyshards
Delap, S.: HBase Leads Discuss Hadoop, BigTable and Distributed Databases (April 28, 2008), http://www.infoq.com/news/2008/04/hbase-interview
Hoff, T.: How to learn to stop worrying and use lots of disk space to scale (May 21, 2008), http://highscalability.com/how-i-learned-stop-worrying-and-love-using-lot-disk-space-scale
FACEBOOK.COM. Product page, http://developers.facebook.com/thrift/
HADOOP.CA. Product page, http://www.hadoop.ca
Unknown author, Hadoop MapReduce Tutorial (May 21, 2008), http://hadoop.apache.org/core/docs/r0.17.0/mapred_tutorial.html
ISO/IEC 9126:1999. Software Engineering – Product quality. Int. Org. for Standardization, ISO 9126 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cryans, JD., April, A., Abran, A. (2008). Criteria to Compare Cloud Computing with Current Database Technology. In: Dumke, R.R., Braungarten, R., Büren, G., Abran, A., Cuadrado-Gallego, J.J. (eds) Software Process and Product Measurement. Mensura MetriKon IWSM 2008 2008 2008. Lecture Notes in Computer Science, vol 5338. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89403-2_11
Download citation
DOI: https://doi.org/10.1007/978-3-540-89403-2_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89402-5
Online ISBN: 978-3-540-89403-2
eBook Packages: Computer ScienceComputer Science (R0)