Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Are quorums an alternative for data replication?

Published: 01 September 2003 Publication History

Abstract

Data replication is playing an increasingly important role in the design of parallel information systems. In particular, the widespread use of cluster architectures often requires to replicate data for performance and availability reasons. However, maintaining the consistency of the different replicas is known to cause severe scalability problems. To address this limitation, quorums are often suggested as a way to reduce the overall overhead of replication. In this article, we analyze several quorum types in order to better understand their behavior in practice. The results obtained challenge many of the assumptions behind quorum based replication. Our evaluation indicates that the conventional read-one/write-all-available approach is the best choice for a large range of applications requiring data replication. We believe this is an important result for anybody developing code for computing clusters as the read-one/write-all-available strategy is much simpler to implement and more flexible than quorum-based approaches. In this article, we show that, in addition, it is also the best choice using a number of other selection criteria.

References

[1]
Agrawal, D. and Abbadi, A. E. 1989. An efficient solution to the mutual exclusion problem. In Proceedings of the ACM Symposium on Principles of Distributed Computing. (Edmonton, Alb., Canada). ACM, New York, 193--200.
[2]
Agrawal, D. and Abbadi, A. E. 1990a. Exploiting logical structures in replicated databases. Information Processing Letters 33, 5, 250--260.
[3]
Agrawal, D. and Abbadi, A. E. 1990b. The tree quorum protocol: An eficient approach for managing replicated data. In Proceedings of the International Conference on Very Large Databases (VLDB) (Brisbane, Australia). Morgan-Kaufmann, San Mateo, Calif., 243--254.
[4]
Agrawal, D. and Abbadi, A. E. 1991. An efficient and fault-tolerant solution for distributed mutual exclusion. ACM Trans. Comput. Syst. 9, 1 (Feb.), 1--20.
[5]
Agrawal, D. and Abbadi, A. E. 1992. The generalized tree quorum protocol: An efficient approach for managing replicated data. ACM Trans. Datab. Syst. 17, 4 (Dec.), 689--717.
[6]
Ahamad, M. and Ammar, M. H. 1989. Performance characterization of quorum-consensus algorithms for replicated data. IEEE Trans. Softw. Eng. 15, 4 (Apr.), 492--496.
[7]
Amir, Y. and Wool, A. 1996. Evaluating quorum systems over the internet. In Proceedings of the IEEE International Conference on Fault-Tolerant Computing Systems (FTCS) (Sendai, Japan). IEEE Computer Society Press, Los Alamitos, Calif., 26--35.
[8]
Amir, Y. and Wool, A. 1998. Optimal availability quorums systems: Theory and practice. Inf. Proc. Lett. 65, 5 (Mar.), 223--228.
[9]
Anderson, T. A., Breitbart, Y., Korth, H. F., and Wool, A. 1998. Replication, consistency, and practicality: Are these mutually exclusive? In Proceedings of the ACM SIGMOD Conference (Seattle, Wash.). ACM, New York, 484--495.
[10]
Bacon, J. 1997. Concurrency Systems. Addison-Wesley, Reading, Mass.
[11]
Barbará, D. and Garcia-Molina, H. 1987. The reliability of vote mechanisms. IEEE Trans. Comput. 36, 10 (Oct.), 1197--1208.
[12]
Barbará, D., Garcia-Molina, H., and Spauster, A. 1989. Increasing availability under mutual exclusion constraints with dynamic vote reassignment. ACM Trans. Comput. Syst. 7, 4 (Nov.), 394--426.
[13]
Bernstein, P. A., Hadzilacos, V., and Goodman, N. 1987. Concurrency Control and Recovery in Database Systems. Addison-Wesley, Reading, Mass.
[14]
Bernstein, P. A., Shipman, D., and Rothnie, J. B. 1980. Concurrency control in a system for distributed databases (SDD-1). ACM Trans. Datab. Syst. 5, 1 (Mar.), 18--51.
[15]
Carey, M. J. and Livny, M. 1986. Conflict detection tradeoffs for replicated data. ACM Trans. Datab. Syst. 16, 4 (Dec.), 703--746.
[16]
Carey, M. J. and Livny, M. 1988. Distributed concurrency control performance: A study of algorithms, distribution, and replication. In Proceedings of the International Conference on Very Large Databases (VLDB). Morgan-Kaufmann, San Mateo Calif., 13--25.
[17]
Cheung, S. Y., Ahamad, M., and Ammar, M. H. 1990. The grid protocol: a high performance scheme for maintaining replicated data. In Proceedings of the IEEE International Conference on Data Engineering (ICDE). IEEE Computer Society Press, Los Alamitos, Calif., 438--445.
[18]
Cheung, S. Y., Ammar, M. H., and Ahamad, M. 1991. Multidimensional voting. ACM Trans. Comput. Syst. 9, 4 (Nov.), 399--431.
[19]
Ciciani, B., Dias, D. M., and Yu, P. S. 1990. Analysis of replication in distributed database systems. IEEE Trans. Knowl. Data Eng. 2, 2 (June), 247--261.
[20]
Coulouris, G., Dollimore, J., and Kindberg, T. 2000. Distributed Systems. Concepts and Design. 3rd edition. Addison Wesley, Reading, Mass.
[21]
Eager, D. L. and Sevcik, K. C. 1983. Achieving robustness in distributed database systems. ACM Trans. Datab. Syst 8, 3 (Sept.), 354--381.
[22]
Erdös, P. and Lovász, L. 1975. Problems and results on 3-chromatic hypergraphs and some related questions. Colloq. Math. Soc. János Bolyai 10, 609--627.
[23]
Gifford, D. K. 1979. Weighted Voting for Replicated Data. In Proceedings of the 7th ACM Symposium on Operating Systems (Pacific Grove, Calif.). ACM, New York, 150--162.
[24]
Gray, J., Helland, P., O'Neil, P., and Shasha, D. 1996. The dangers of replication and a solution. In Proceedings of the ACM SIGMOD Conference (Montreal, Ont., Canada). ACM, New York, 173--182.
[25]
Gray, J. and Reuter, A. 1993. Transaction Processing: Concepts and Techniques. Morgan-Kaufmann Publishers, San Mateo, Calif.
[26]
Herlihy, M. 1987. Dynamic quorum adjustment for partitioned data. ACM Trans. Datab. Syst. 12, 2 (June), 170--194.
[27]
Hwang, S., Lee, K. K. S., and Chin, Y. H. 1996. Data replication in a distributed system: A performance study. In Proceedings of the 7th Database and Expert Systems Applications (Zurich, Switzerland). Lecture Notes in Computer Science, vol. 1134. Springer-Verlag, New York, 708--717.
[28]
Ingols, K. and Keidar, I. 2001. Availability study of dynamic voting systems. In Proceedings of the IEEE Internationl Conference on Distributed Computing Systems (ICDCS) (Phoenix, Ariz.). IEEE Computer Society Press, Los Alamitos. Calif., 247--254.
[29]
Jajodia, S. and Mutchler, D. 1990. Dynamic voting algorithms for maintaining the consistency of a replicated database. ACM Trans. Datab. Syst. 15, 2 (June), 230--280.
[30]
Jiménez-Peris, R., Patiño-Martínez, M., Alonso, G., and Kemme, B. 2001. How to select a replication protocol according to scalability, availability, and communication overhead. In Proceedings of the International Symposium on Reliable Distributed Systems (SRDS) (New Orleans, La.). IEEE Computer Society Press, Los Alamitos, Calif., 24--33.
[31]
Jiménez-Peris, R., Patiño-Martínez, M., Alonso, G., and Kemme, B. 2002. Scalable database replication middleware. In Proceedings of the IEEE International Conference on Distributed Computing Systems (ICDCS) (Vienna, Austria). IEEE Computer Users Society, Los Alamitos, Calif.
[32]
Kemme, B. 2000. Database Replication for Clusters of Workstations. Ph.D. dissertation. Dept. of Computer Science, Swiss Federal Institute of Technology Zurich.
[33]
Kemme, B. and Alonso, G. 2000. Don't be lazy, be consistent: Postgres-R, A new way to implement Database Replication. In Proceedings of the IEEE International Conference (VLDB) (Cairo, Egypt). Morgan-Kaufmann, San Mateo, Calif., pp. 134--143.
[34]
Keum, C. S., Choi, W., Hong, E. K., Kim, W. Y., and Whang, Y. 1995. Performance evaluation of replica control algorithms in a locally distributed database system. In Proceedings of the 4th International Conference on Database Systems for Advanced Applications (Singapore). World Scientific Press, 388--396.
[35]
Kumar, A. 1990. Performance of a Hierarchical Quorum Consensus Algorithm for Replicated Objects. In Proceedings of the IEEE International Conference on Distributed Computing Systems (ICDCS) (Paris, France). IEEE Computer Society Press, Los Alamitos, Calif., 378--385.
[36]
Kumar, A. 1991. Hierarchical quorum consensus: A new algorithm for managing replicated data. IEEE Trans. Comput. 40, 9 (Sept.), 996--1004.
[37]
Kumar, A. and Segev, A. 1993. Cost and availability tradeoffs in replicated data concurrency control. ACM Trans. Datab. Syst. 18, 1 (Mar.), 102--131.
[38]
Kumar, A., Rabinovich, M., and Sinha, R. K. 1993. A performance study of general grid structures for replicated data. In Proceedings of the IEEE International Conference on Distributed Computing Systems (ICDCS) (Pittsburgh, Pa.), IEEE Computer Society Press, Los Alamitos, Calif., 178--185.
[39]
Lewis, P., Bernstein, A., and Kifer, M. 2002. Databases and Transaction Processing. Addison-Wesley, Reading, Mass.
[40]
Liu, M. L., Agrawal, D., and Abbadi, A. E. 1997. The performance of replica control protocols in the presence of site failures. Distrib. Syst. Eng. 4, 2 (June), 59--77.
[41]
Lovász, L. 1973. Coverings and colorings in hypergraphs. In Proceedings of 4th South Eastern Conference on Combinatorics, Graph Theory and Computing. Utilitas Math., Winnipeg, B.C., Canada, 3--12.
[42]
Maekawa, M. 1985. A &Nsqrt; algorithm for mutual exclusion in decentralized systems. ACM Trans. Comput. Syst. 3, 2 (May), 145--159.
[43]
Naor, M. and Wool, A. 1998. The load, capacity, and availability of quorum systems. SIAM J. Comput. 27, 2 (Apr.), 423--447.
[44]
Nicola, M. and Jarke, M. 2000. Performance modeling of distributed and replicated databases. IEEE Trans. Knowl. Data Eng. 12, 4 (July), 645--672.
[45]
Paris, J. F. 1986. Voting with witnesses: A consistency scheme for replicated files. In Proceedings of the IEEE International Conference on Distributed Computing Systems (ICDCS) (Cambridge, Mass.). IEEE Computer Society Press, Los Alamitos, Calif., 606--612.
[46]
Paris, J. F. 1989. Voting with bystanders. In Proceedings of the IEEE International Conference on Distributed Computing Systems (ICDCS) (Newport Beach, Calif.). IEEE Computer Society Press, Los Alamitos, Calif., 394--405.
[47]
Paris, J. F. and Long, D. E. 1991. Voting with regenerable volatile witnesses. In Proceedings of the IEEE International Conference on Data Engineering (ICDE) (Kobe, Japan). IEEE Computer Society Press, Los Alamitos, Calif., 112--119.
[48]
Peleg, D. and Wool, A. 1995. The availability of quorum systems. Inf. Comput. 123, 2 (Dec.), 210--223.
[49]
Peleg, D. and Wool, A. 1997. Crumbling walls: A class of practical and efficient quorum systems. Distrib. Comput. 10, 2, 87--97.
[50]
Rangarajan, S., Jalote, P., and Tripathi, S. K. 1993. Capacity of voting systems. IEEE Trans. Softw. Eng. 19, 7 (July), 698--706.
[51]
Rangarajan, S., Setia, S., and Tripathi, S. K. 1995. A fault-tolerant algorithm for replicated data management. IEEE Trans. Parall. Distrib. Syst. 6, 12 (Dec.), 1271--1282.
[52]
Rothnie, J. B., Bernstein, P. A., Fox, P. A., Goodman, N., et al. 1980. Introduction to a system for distributed databases (SDD-1). ACM Trans. Datab. Syst. 5, 1 (Mar.), 1--17.
[53]
Saha, D., Rangarajan, S., and Tripathi, S. K. 1996. An analysis of the average message overhead in replica control protocols. IEEE Trans. Parall. and Distrib. Syst. 7, 10 (Oct.), 1026--1034.
[54]
Spasojevic, M. and Berman, P. 1994. Voting as the optimal static pessimistic scheme for managing replicated data. IEEE Trans. Parall. Distrib. Syst. 5, 1 (Jan.), 64--73.
[55]
Theel, O. and Pagnia-Koch, H. 1995. General design of grid-based data replication schemes using graphs and a few rules. In Proceedings of the IEEE International Conference on Distributed Computing Systems (ICDCS) (Vancouver, B.C., Canada). IEEE Computer Society Press, Los Alamitos, Calif., 395--403.
[56]
Thomas, R. H. 1979. A majority consensus approach to concurrency control for multiple copy databases. ACM Trans. Datab. Syst. 4, 9 (June), 180--209.
[57]
Tong, Z. and Kain, R. Y. 1988. Vote Assignments in Weighted Voting Mechanisms. In Proceedings of the International Symposium on Reliable Distributed Systems (SRDS). (Columbus, Ohio). IEEE Computer Society Press, Los Alamitos, Calif., 138--143.
[58]
van Renesse, R. and Tanenbaum, A. S. 1988. Voting with ghosts. In Proceedings of the IEEE International Conference on Distributed Computing Systems (ICDCS) (San Jose, Calif.). IEEE Computer Society Press, Los Alamitos, Calif., 456--462.
[59]
Weikum, G. and Vossen, G. 2001. Transactional Information Systems: Theory, Algorithms, and the Practice of Concurrency Control and Recovery. Morgan-Kaufmann Publishers, San Mateo, Calif.
[60]
Wool, A. 1996. Quorum systems for distributed control protocols. Ph.D. dissertation, The Weizmann Institute of Science, Rehovot, Israel.
[61]
Wool, A. 1998. Quorum systems in replicated databases: Science or fiction? Data Eng. Bull. 21, 4 (Dec.), 3--11.
[62]
Wu, C. and Belford, G. G. 1992. The triangular lattice protocol: A highly fault tolerant and highly efficient protocol for replicated data. In Proceedings of the International Symposium on Reliable Distributed Systems (SRDS). (Houston, Tex.). IEEE Computer Society Press, Los Alamitos, Calif., 66--73.
[63]
Yu, P. S., Dias, D. M., and Lavenberg, S. S. 1993. On the Analytical Modeling of Database Concurrency Control. J. ACM 40, 4 (Sept.), 831--872.

Cited By

View all
  • (2022)Accelerating kubernetes with in-network cachingProceedings of the SIGCOMM '22 Poster and Demo Sessions10.1145/3546037.3546058(40-42)Online publication date: 22-Aug-2022
  • (2022)Addressing the Read-Performance Impact of Reconfigurations in Replicated Key-Value StoresIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2021.313513733:9(2106-2119)Online publication date: 1-Sep-2022
  • (2022)Elastic scalable transaction processing in LeanXcaleInformation Systems10.1016/j.is.2022.102043108:COnline publication date: 1-Sep-2022
  • Show More Cited By

Index Terms

  1. Are quorums an alternative for data replication?

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Database Systems
    ACM Transactions on Database Systems  Volume 28, Issue 3
    September 2003
    86 pages
    ISSN:0362-5915
    EISSN:1557-4644
    DOI:10.1145/937598
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 September 2003
    Published in TODS Volume 28, Issue 3

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Data replication
    2. availability
    3. distributed transactions.
    4. quorums
    5. scalability

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)27
    • Downloads (Last 6 weeks)8
    Reflects downloads up to 15 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Accelerating kubernetes with in-network cachingProceedings of the SIGCOMM '22 Poster and Demo Sessions10.1145/3546037.3546058(40-42)Online publication date: 22-Aug-2022
    • (2022)Addressing the Read-Performance Impact of Reconfigurations in Replicated Key-Value StoresIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2021.313513733:9(2106-2119)Online publication date: 1-Sep-2022
    • (2022)Elastic scalable transaction processing in LeanXcaleInformation Systems10.1016/j.is.2022.102043108:COnline publication date: 1-Sep-2022
    • (2021)Advanced consistency management of highly-distributed transactional database in a hybrid cloud environment using novel R-TBC/RTA approachJournal of Cloud Computing: Advances, Systems and Applications10.1186/s13677-021-00230-010:1Online publication date: 16-Apr-2021
    • (2021)Practical Experience Report: Cassandra+: Trading-Off Consistency, Latency, and Fault-tolerance in CassandraProceedings of the 22nd International Conference on Distributed Computing and Networking10.1145/3427796.3427816(191-195)Online publication date: 5-Jan-2021
    • (2021)Search for an object with a recognition error probability in a Desktop Grid environment2021 Ivannikov Ispras Open Conference (ISPRAS)10.1109/ISPRAS53967.2021.00018(96-99)Online publication date: Dec-2021
    • (2020)Hermes: A Fast, Fault-Tolerant and Linearizable Replication ProtocolProceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Systems10.1145/3373376.3378496(201-217)Online publication date: 9-Mar-2020
    • (2020)Probabilistic Consistency Guarantee in Partial Quorum-Based Data StoreIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2020.297361931:8(1815-1827)Online publication date: 1-Aug-2020
    • (2020)Risky Search with Increasing Complexity by a Desktop GridSupercomputing10.1007/978-3-030-64616-5_53(622-633)Online publication date: 4-Dec-2020
    • (2018)Fast and strongly-consistent per-item resilience in key-value storesProceedings of the Thirteenth EuroSys Conference10.1145/3190508.3190536(1-14)Online publication date: 23-Apr-2018
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media