Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1250790.1250863acmconferencesArticle/Chapter ViewAbstractPublication PagesstocConference Proceedingsconference-collections
Article

Testing k-wise and almost k-wise independence

Published: 11 June 2007 Publication History

Abstract

In this work, we consider the problems of testing whether adistribution over (0,1n) is k-wise (resp. (ε,k)-wise) independentusing samples drawn from that distribution.
For the problem of distinguishing k-wise independent distributions from those that are δ-far from k-wise independence in statistical distance, we upper bound the number ofrequired samples by Õ(nk2) and lower bound it by Ω(nk-1/2/δ) (these bounds hold for constantk, and essentially the same bounds hold for general k). Toachieve these bounds, we use Fourier analysis to relate adistribution's distance from k-wise independence to its biases, a measure of the parity imbalance it induces on a setof variables. The relationships we derive are tighter than previouslyknown, and may be of independent interest.
To distinguish (ε,k)-wise independent distributions from thosethat are δ-far from (ε,k)-wise independence in statistical distance, we upper bound thenumber of required samples by O(k log n / δ2ε2) and lower bound it by Ω(√ k log n / 2k(ε+δ)√ log 1/2k(ε+δ)). Although these bounds are anexponential improvement (in terms of n and k) over thecorresponding bounds for testing k-wise independence, we give evidence thatthe time complexity of testing (ε,k)-wise independence isunlikely to be poly(n,1/ε,1/δ) for k=Θ(log n),since this would disprove a plausible conjecture concerning the hardness offinding hidden cliques in random graphs. Under the conjecture, ourresult implies that for, say, k = log n and ε = 1 / n0.99,there is a set of (ε,k)-wise independent distributions, and a set of distributions at distance δ=1/n0.51 from (ε,k)-wiseindependence, which are indistinguishable by polynomial time algorithms.

References

[1]
N. Alon. Problems and results in extremal combinatorics (Part I). Discrete Math., 273:31--53, 2003.
[2]
N. Alon, L. Babai, and A. Itai. A fast and simple randomized algorithm for the maximal independent set problem. J. of Algorithms, 7:567--583, 1986.
[3]
N. Alon, O. Goldreich, J. Hastad, and R. Peralta. Simple constructions of almost k-wise independent random variables. Random Structures and Algorithms, 3(3):289--304, 1992. Earlier version in FOCS'90.
[4]
N. Alon, O. Goldreich, and Y. Mansour. Almost k-wise independence versus k-wise independence. Inform. Process. Lett., 88:107--110, 2003.
[5]
N. Alon, M. Krivelevich, and B. Sudakov. Finding a large hidden clique in a random graph. Random Structures and Algorithms, 13:457--466, 1998.
[6]
N. Alon and J. Spencer. The Probabilistic Method. John Wiley and Sons, second edition, 2000.
[7]
T. Batu, E. Fisher, L. Fortnow, R. Kumar, R. Rubinfeld, and P. White. Testing random variables for independence and identity. In Proc. 42nd Annual IEEE Symposium on Foundations of Computer Science, pages 442--451, 2001.
[8]
T. Batu, L. Fortnow, R. Rubinfeld, W.D. Smith, and P. White. Testing that distributions are close. In Proc. 41st Annual IEEE Symposium on Foundations of Computer Science, pages 189--197, 2000.
[9]
T. Batu, R. Kumar, and R. Rubinfeld. Sublinear algorithms for testing monotone and unimodal distributions. In Proc. 36th Annual ACM Symposium on the Theory of Computing, pages 381--390, New York, NY, USA, 2004. ACM Press.
[10]
W. Beckner. Inequalities in fourier analysis. Annals of Mathematics, 102:159--182, 1975.
[11]
S. Bernstein. The Theory of Probabilities. Gostehizdat Publishing House, Moscow, 1946.
[12]
A. Bonami. Etude des coefficients fourier des fonctiones de lp(g). Ann. Inst. Fourier (Grenoble), 20(2):335--402, 1970.
[13]
B. Chor, J. Friedman, O. Goldreich, J. Hastad, S. Rudich, and R. Smolensky. The bit extraction problem and t-resilient functions. In Proc. 26th Annual IEEE Symposium on Foundations of Computer Science, pages 396--407, 1985.
[14]
I. Dinur, E. Friedgut, G. Kindler, and R. O'Donnell. On the Fourier tails of bounded functions over the discrete cube. In Proc. 38th Annual ACM Symposium on the Theory of Computing, pages 437--446, New York, NY, USA, 2006. ACM Press.
[15]
U. Feige and R. Krauthgamer. Finding and certifying a large hidden clique in a semirandom graph. Random Structures and Algorithms, 16:195--208, 2000.
[16]
A. Frieze and C. McDiarmid. Algorithmic theory of random graphs. Random Structures and Algorithms, 10(1-2):5--42, 1997.
[17]
O. Goldreich and D. Ron. On testing expansion in bounded-degree graphs. Technical Report TR00-020, Electronic Colloquium on Computational Complexity, 2000.
[18]
M. Jerrum. Large cliques elude the metropolis process. Random Structures and Algorithms, 3(4):347--359, 1992.
[19]
A. Joffe. On a set of almost deterministic k-independent random variables. Annals of Probability, 2:161--162, 1974.
[20]
A. Juels and M. Peinado. Hiding cliques for cryptographic security. Des. Codes Cryptography, 20(3):269--280, 2000.
[21]
R. Karp and A. Wigderson. A fast parallel algorithm for the maximal independent set problem. Journal of the ACM, 32(4):762--773, 1985.
[22]
R.M. Karp. The probabilistic analysis of some combinatorial search algorithms. In JF. Traub, editor, Algorithms and Complexity: New directions and Recent Results, pages 1--19, New York, 1976. Academic Press.
[23]
L. Kucera. Expected complexity of graph partitioning problems. Disc. Applied Math., 57(2-3):193--212, 1995.
[24]
M. Luby. A simple parallel algorithm for the maximal independent set problem. SIAM J. on Comput., 15(4):1036--1053, 1986. Earlier version in STOC'85.
[25]
J. Naor and M. Naor. Small-bias probability spaces: efficient constructions and applications. SIAM J. on Comput., 22(4):838--856, 1993. Earlier version in STOC'90.
[26]
R. Rubinfeld and R.A. Servedio. Testing monotone high-dimensional distributions. In Proc. 37th Annual ACM Symposium on the Theory of Computing, pages 147--156, New York, NY, USA, 2005. ACM Press.

Cited By

View all
  • (2024)Planted Clique Conjectures Are EquivalentProceedings of the 56th Annual ACM Symposium on Theory of Computing10.1145/3618260.3649751(358-366)Online publication date: 10-Jun-2024
  • (2024) Succinct Quantum Testers for Closeness and k -Wise Uniformity of Probability Distributions IEEE Transactions on Information Theory10.1109/TIT.2024.339375670:7(5092-5103)Online publication date: Jul-2024
  • (2023)Hardness Self-Amplification: Simplified, Optimized, and UnifiedProceedings of the 55th Annual ACM Symposium on Theory of Computing10.1145/3564246.3585189(70-83)Online publication date: 2-Jun-2023
  • Show More Cited By

Index Terms

  1. Testing k-wise and almost k-wise independence

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    STOC '07: Proceedings of the thirty-ninth annual ACM symposium on Theory of computing
    June 2007
    734 pages
    ISBN:9781595936318
    DOI:10.1145/1250790
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 June 2007

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Fourier analysis
    2. almost k-wise independence
    3. hidden clique
    4. k-wise independence
    5. property testing

    Qualifiers

    • Article

    Conference

    STOC07
    Sponsor:
    STOC07: Symposium on Theory of Computing
    June 11 - 13, 2007
    California, San Diego, USA

    Acceptance Rates

    Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)25
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 22 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Planted Clique Conjectures Are EquivalentProceedings of the 56th Annual ACM Symposium on Theory of Computing10.1145/3618260.3649751(358-366)Online publication date: 10-Jun-2024
    • (2024) Succinct Quantum Testers for Closeness and k -Wise Uniformity of Probability Distributions IEEE Transactions on Information Theory10.1109/TIT.2024.339375670:7(5092-5103)Online publication date: Jul-2024
    • (2023)Hardness Self-Amplification: Simplified, Optimized, and UnifiedProceedings of the 55th Annual ACM Symposium on Theory of Computing10.1145/3564246.3585189(70-83)Online publication date: 2-Jun-2023
    • (2023)Testing Distributional Assumptions of Learning AlgorithmsProceedings of the 55th Annual ACM Symposium on Theory of Computing10.1145/3564246.3585117(1643-1656)Online publication date: 2-Jun-2023
    • (2023)The Hsu–Robbins–Erdös theorem for the maximum partial sums of quadruplewise independent random variablesJournal of Mathematical Analysis and Applications10.1016/j.jmaa.2022.126896521:1(126896)Online publication date: May-2023
    • (2023)Cryptography from Planted Graphs: Security with Logarithmic-Size MessagesTheory of Cryptography10.1007/978-3-031-48615-9_11(286-315)Online publication date: 27-Nov-2023
    • (2022)Independence testing for bounded degree Bayesian networksProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3601363(15027-15038)Online publication date: 28-Nov-2022
    • (2022)Tensor clustering with planted structures: Statistical optimality and computational limitsThe Annals of Statistics10.1214/21-AOS212350:1Online publication date: 1-Feb-2022
    • (2022)A Planted Clique Perspective on Hypothesis PruningIEEE Robotics and Automation Letters10.1109/LRA.2022.31551987:2(5167-5174)Online publication date: Apr-2022
    • (2021)Concentration bounds for almost k-wise independence with applications to non-uniform securityProceedings of the Thirty-Second Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3458064.3458207(2404-2423)Online publication date: 10-Jan-2021
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media