Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Approximation Algorithms for Min-Max Generalization Problems

Published: 25 August 2014 Publication History

Abstract

We provide improved approximation algorithms for the min-max generalization problems considered by Du, Eppstein, Goodrich, and Lueker [Du et al. 2009]. Generalization is widely used in privacy-preserving data mining and can also be viewed as a natural way of compressing a dataset. In min-max generalization problems, the input consists of data items with weights and a lower bound wlb, and the goal is to partition individual items into groups of weight at least wlb while minimizing the maximum weight of a group. The rules of legal partitioning are specific to a problem. Du et al. consider several problems in this vein: (1) partitioning a graph into connected subgraphs, (2) partitioning unstructured data into arbitrary classes, and (3) partitioning a two-dimensional array into contiguous rectangles (subarrays) that satisfy these weight requirements.
We significantly improve approximation ratios for all the problems considered by Du et al. and provide additional motivation for these problems. Moreover, for the first problem, whereas Du et al. give approximation algorithms for specific graph families, namely, 3-connected and 4-connected planar graphs, no approximation algorithm that works for all graphs was known prior to this work.

References

[1]
S. F. Assmann, David S. Johnson, Daniel J. Kleitman, and Joseph Y.-T. Leung. 1984. On a dual version of the one-dimensional bin packing problem. Journal of Algorithms 5, 4 (1984), 502--525.
[2]
Nikhil Bansal and Maxim Sviridenko. 2006. The Santa Claus problem. In STOC’06: Proceedings of the 38th Annual ACM Symposium on Theory of Computing. ACM, New York, NY, 31--40.
[3]
Piotr Berman, Bhaskar DasGupta, and S. Muthukrishnan. 2002. Slice and dice: A simple, improved approximate tiling recipe. In SODA, David Eppstein (Ed.). ACM/SIAM, 455--464.
[4]
Piotr Berman, Bhaskar DasGupta, and S. Muthukrishnan. 2003. Approximation algorithms for MAX-MIN tiling. Journal of Algorithms 47, 2 (2003), 122--134.
[5]
Piotr Berman, Bhaskar DasGupta, S. Muthukrishnan, and Suneeta Ramaswami. 2001. Improved approximation algorithms for rectangle tiling and packing. In SODA. 427--436.
[6]
Piotr Berman and Sofya Raskhodnikova. 2010. Approximation algorithms for min-max generalization problems. In APPROX-RANDOM(Lecture Notes in Computer Science), Maria J. Serna, Ronen Shaltiel, Klaus Jansen, and José D. P. Rolim (Eds.), Vol. 6302. Springer, 53--66.
[7]
N. Chiba and T. Nishizeki. 1989. The Hamiltonian cycle problem is linear-time solvable for 4-connected planar graphs. Journal of Algorithms 10, 2 (1989), 187--211.
[8]
V. Ciriani, S. De Capitani di Vimercati, S. Foresti, and P. Samarati. 2008. k-anonymous data mining: A survey. In Privacy-Preserving Data Mining: Models and Algorithms, Charu C. Aggarwal and Philip S. Yu (Eds.). Springer, Chapter 5.
[9]
János Csirik, David S. Johnson, and Claire Kenyon. 2001. Better approximation algorithms for bin covering. In SODA, S. Rao Kosaraju (Ed.). ACM/SIAM, 557--566.
[10]
Wenliang Du, David Eppstein, Michael T. Goodrich, and George S. Lueker. 2009. On the approximability of geometric and geographic generalization and the min-max bin covering problem. In WADS, 242--253.
[11]
Y. J. Garcia, M. A. Lopez, and S. T. Leutenegger. 1998. A greedy algorithm for bulk loading R-trees. In GIS’98: Proceedings of the 1998 ACM International Symposium on Advances in Geographic Information Systems. ACM, 163--164.
[12]
R. L. Graham, E. L. Lawler, J. K. Lenstra, and A. H. G. Rinnoy Kan. 1979. Optimization and approximation in deterministic sequencing and scheduling: A survey. Annals of Discrete Mathematics 5 (1979), 287--326.
[13]
Klaus Jansen and Roberto Solis-Oba. 2003. An asymptotic fully polynomial time approximation scheme for bin covering. Theoretical Computer Science 306, 1--3 (2003), 543--551.
[14]
Sanjeev Khanna, S. Muthukrishnan, and Mike Paterson. 1998. On approximating rectangle tiling and packing. In SODA, Howard J. Karloff (Ed.). ACM/SIAM, 384--393.
[15]
Jon M. Kleinberg and Éva Tardos. 2006. Algorithm Design. Addison-Wesley. I--XXIII, 1--838.
[16]
J. K. Lenstra, D. B. Shmoys, and É. Tardos. 1990. Approximation algorithms for scheduling unrelated parallel machines. Mathematical Programming 46, 3 (1990), 259--271.
[17]
F. Manne. 1993. Load Balancing in Parallel Sparse Matrix Computation. Ph.D. Dissertation. University of Bergen, Norway.
[18]
S. Muthukrishnan, Viswanath Poosala, and Torsten Suel. 1999. On rectangular partitionings in two dimensions: Algorithms, complexity, and applications. In ICDT. 236--256.
[19]
Jonathan P. Sharp. 1999. Tiling multi-dimensional arrays. In FCT(Lecture Notes in Computer Science), Gabriel Ciobanu and Gheorghe Păun (Eds.), Vol. 1684. Springer, 500--511.
[20]
Adam Smith and Subhash Suri. 2000. Rectangular tiling in multidimensional arrays. Journal of Algorithms 37, 2 (2000), 451--467.
[21]
W. T. Tutte. 1956. A theorem on planar graphs. Transactions of the American Mathematical Society 82 (1956), 99--116.

Cited By

View all
  • (2021)Distributed Memetic Algorithm for Outsourced Database FragmentationIEEE Transactions on Cybernetics10.1109/TCYB.2020.302796251:10(4808-4821)Online publication date: Oct-2021
  • (2019)A benefit-driven genetic algorithm for balancing privacy and utility in database fragmentationProceedings of the Genetic and Evolutionary Computation Conference10.1145/3321707.3321778(771-776)Online publication date: 13-Jul-2019

Index Terms

  1. Approximation Algorithms for Min-Max Generalization Problems

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Algorithms
      ACM Transactions on Algorithms  Volume 11, Issue 1
      October 2014
      183 pages
      ISSN:1549-6325
      EISSN:1549-6333
      DOI:10.1145/2660578
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 25 August 2014
      Accepted: 01 October 2013
      Revised: 01 October 2013
      Received: 01 December 2011
      Published in TALG Volume 11, Issue 1

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. k-anonymity
      2. Generalization problems
      3. bin covering
      4. rectangle tiling

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Funding Sources

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)3
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 09 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2021)Distributed Memetic Algorithm for Outsourced Database FragmentationIEEE Transactions on Cybernetics10.1109/TCYB.2020.302796251:10(4808-4821)Online publication date: Oct-2021
      • (2019)A benefit-driven genetic algorithm for balancing privacy and utility in database fragmentationProceedings of the Genetic and Evolutionary Computation Conference10.1145/3321707.3321778(771-776)Online publication date: 13-Jul-2019

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media