Banyan: Improved Representation Learning with Explicit Structure

Opper, Mattia; Siddharth, N.

arXiv:2407.17771 (cs)

[Submitted on 25 Jul 2024]

Authors: Mattia Opper, N. Siddharth

Abstract: We present Banyan, an improved model for learning semantic representations by inducing explicit structure over data. In contrast to prior approaches that use structure spanning single sentences, Banyan learns by resolving multiple constituent structures into a shared one that explicitly incorporates global context. Combined with an improved message-passing scheme inspired by Griffin, Banyan learns significantly better representations, avoids spurious false negatives with contrastive learning, and drastically improves memory efficiency in such explicitly structured models. Using the Self-StrAE framework, we show that Banyan (a) outperforms baselines using sentential structure across various settings, (b) matches or outperforms unstructured baselines like GloVe (+augmentations) and a RoBERTa medium (+SimCSE) pre-trained on 100M tokens, despite having just a handful of (non-embedding) parameters, and (c) learns effective representations across several low-resource (Asian and African) languages, as measured on SemRel tasks.

Comments:	First Draft
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2407.17771 [cs.CL]
	(or arXiv:2407.17771v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.17771

Submission history

From: Mattia Opper
[v1] Thu, 25 Jul 2024 04:58:08 UTC (326 KB)
