Near-Optimal Learning of Tree-Structured Distributions by Chow-Liu

Bhattacharyya, Arnab; Gayen, Sutanu; Price, Eric; Vinodchandran, N. V.

arXiv:2011.04144 (cs)

[Submitted on 9 Nov 2020 (v1), last revised 22 Jul 2021 (this version, v2)]

Abstract: We provide finite sample guarantees for the classical Chow-Liu algorithm (IEEE Trans. Inform. Theory, 1968) to learn a tree-structured graphical model of a distribution. For a distribution $P$ on $\Sigma^n$ and a tree $T$ on $n$ nodes, we say $T$ is an $\varepsilon$-approximate tree for $P$ if there is a $T$-structured distribution $Q$ such that $D(P\;||\;Q)$ is at most $\varepsilon$ more than the KL divergence achieved by the best possible tree-structured distribution for $P$. We show that if $P$ itself is tree-structured, then the Chow-Liu algorithm with the plug-in estimator for mutual information with $\widetilde{O}(|\Sigma|^3 n\varepsilon^{-1})$ i.i.d. samples outputs an $\varepsilon$-approximate tree for $P$ with constant probability. In contrast, for a general $P$ (which may not be tree-structured), $\Omega(n^2\varepsilon^{-2})$ samples are necessary to find an $\varepsilon$-approximate tree. Our upper bound is based on a new conditional independence tester that addresses an open problem posed by Canonne, Diakonikolas, Kane, and Stewart (STOC, 2018): we prove that for three random variables $X,Y,Z$ each over $\Sigma$, testing if $I(X; Y \mid Z)$ is $0$ or $\geq \varepsilon$ is possible with $\widetilde{O}(|\Sigma|^3/\varepsilon)$ samples. Finally, we show that for a specific tree $T$, with $\widetilde{O}(|\Sigma|^2n\varepsilon^{-1})$ samples from a distribution $P$ over $\Sigma^n$, one can efficiently learn the closest $T$-structured distribution in KL divergence by applying the add-1 estimator at each node.
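
For readers unfamiliar with the procedure the abstract analyzes, the following is a minimal sketch of Chow-Liu with the plug-in (empirical) mutual information estimator: estimate $I(X_i; X_j)$ for every pair of coordinates from the samples, then take a maximum-weight spanning tree. It is an illustration under stated assumptions, not the authors' code. The function names (`plugin_mutual_information`, `chow_liu_tree`, `sample_markov_chain`), the use of Kruskal's algorithm for the spanning tree, and the binary Markov-chain test distribution are all choices made here for the example.

```python
import math
import random
from collections import Counter
from itertools import combinations


def plugin_mutual_information(samples, i, j):
    """Plug-in estimate of I(X_i; X_j) in nats, computed from empirical counts."""
    m = len(samples)
    joint = Counter((s[i], s[j]) for s in samples)
    count_i = Counter(s[i] for s in samples)
    count_j = Counter(s[j] for s in samples)
    mi = 0.0
    for (a, b), c in joint.items():
        # (c/m) * log( (c/m) / ((count_i[a]/m) * (count_j[b]/m)) )
        mi += (c / m) * math.log(c * m / (count_i[a] * count_j[b]))
    return mi


def chow_liu_tree(samples):
    """Edges of a maximum-weight spanning tree under plug-in MI weights.

    Uses Kruskal's algorithm with a small union-find (an implementation
    choice for this sketch; any maximum spanning tree routine would do).
    """
    n = len(samples[0])
    weighted_pairs = sorted(
        ((plugin_mutual_information(samples, i, j), i, j)
         for i, j in combinations(range(n), 2)),
        reverse=True,
    )
    parent = list(range(n))

    def find(u):
        while parent[u] != u:
            parent[u] = parent[parent[u]]  # path halving
            u = parent[u]
        return u

    edges = []
    for _, i, j in weighted_pairs:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:  # adding (i, j) does not create a cycle
            parent[root_i] = root_j
            edges.append((i, j))
    return edges


def sample_markov_chain(num_samples, length=4, flip=0.1, seed=0):
    """Hypothetical test distribution: a binary chain X_0 -> X_1 -> ... where
    each variable copies its predecessor and flips with probability `flip`."""
    rng = random.Random(seed)
    samples = []
    for _ in range(num_samples):
        x = [rng.randint(0, 1)]
        for _ in range(length - 1):
            x.append(x[-1] ^ (rng.random() < flip))
        samples.append(tuple(x))
    return samples


if __name__ == "__main__":
    data = sample_markov_chain(5000)
    print(sorted(chow_liu_tree(data)))
```

On a path-structured distribution such as this chain, adjacent variables have strictly larger mutual information than non-adjacent ones, so with enough samples the recovered edges should be those of the path. The sample sizes at which this is guaranteed are what the abstract's bounds quantify; the sketch itself makes no such guarantee.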

Comments:	33 pages, 3 figures
Subjects:	Data Structures and Algorithms (cs.DS); Information Theory (cs.IT); Machine Learning (cs.LG)
Cite as:	arXiv:2011.04144 [cs.DS]
	(or arXiv:2011.04144v2 [cs.DS] for this version)
	https://doi.org/10.48550/arXiv.2011.04144

Submission history

From: Sutanu Gayen
[v1] Mon, 9 Nov 2020 02:08:56 UTC (408 KB)
[v2] Thu, 22 Jul 2021 06:37:12 UTC (1,614 KB)
