BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase Embeddings

Zhang, Biao; Xiong, Deyi; Su, Jinsong

Computer Science > Computation and Language

arXiv:1605.07874 (cs)

[Submitted on 25 May 2016 (v1), last revised 25 Nov 2016 (this version, v2)]

Title:BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase Embeddings

Authors:Biao Zhang, Deyi Xiong, Jinsong Su

View PDF

Abstract:In this paper, we propose a bidimensional attention based recursive autoencoder (BattRAE) to integrate clues and sourcetarget interactions at multiple levels of granularity into bilingual phrase representations. We employ recursive autoencoders to generate tree structures of phrases with embeddings at different levels of granularity (e.g., words, sub-phrases and phrases). Over these embeddings on the source and target side, we introduce a bidimensional attention network to learn their interactions encoded in a bidimensional attention matrix, from which we extract two soft attention weight distributions simultaneously. These weight distributions enable BattRAE to generate compositive phrase representations via convolution. Based on the learned phrase representations, we further use a bilinear neural model, trained via a max-margin method, to measure bilingual semantic similarity. To evaluate the effectiveness of BattRAE, we incorporate this semantic similarity as an additional feature into a state-of-the-art SMT system. Extensive experiments on NIST Chinese-English test sets show that our model achieves a substantial improvement of up to 1.63 BLEU points on average over the baseline.

Comments:	7 pages, accepted by AAAI 2017
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1605.07874 [cs.CL]
	(or arXiv:1605.07874v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1605.07874

Submission history

From: Biao Zhang [view email]
[v1] Wed, 25 May 2016 13:29:07 UTC (206 KB)
[v2] Fri, 25 Nov 2016 03:26:45 UTC (180 KB)

Computer Science > Computation and Language

Title:BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase Embeddings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase Embeddings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators