DIANet: Dense-and-Implicit Attention Network

Huang, Zhongzhan; Liang, Senwei; Liang, Mingfu; Yang, Haizhao

Computer Science > Computer Vision and Pattern Recognition

arXiv:1905.10671 (cs)

[Submitted on 25 May 2019 (v1), last revised 23 Sep 2019 (this version, v2)]

Title:DIANet: Dense-and-Implicit Attention Network

Authors:Zhongzhan Huang, Senwei Liang, Mingfu Liang, Haizhao Yang

View PDF

Abstract:Attention networks have successfully boosted the performance in various vision problems. Previous works lay emphasis on designing a new attention module and individually plug them into the networks. Our paper proposes a novel-and-simple framework that shares an attention module throughout different network layers to encourage the integration of layer-wise information and this parameter-sharing module is referred as Dense-and-Implicit-Attention (DIA) unit. Many choices of modules can be used in the DIA unit. Since Long Short Term Memory (LSTM) has a capacity of capturing long-distance dependency, we focus on the case when the DIA unit is the modified LSTM (refer as DIA-LSTM). Experiments on benchmark datasets show that the DIA-LSTM unit is capable of emphasizing layer-wise feature interrelation and leads to significant improvement of image classification accuracy. We further empirically show that the DIA-LSTM has a strong regularization ability on stabilizing the training of deep networks by the experiments with the removal of skip connections or Batch Normalization in the whole residual network. The code is released at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1905.10671 [cs.CV]
	(or arXiv:1905.10671v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1905.10671

Submission history

From: Zhongzhan Huang [view email]
[v1] Sat, 25 May 2019 20:51:07 UTC (735 KB)
[v2] Mon, 23 Sep 2019 08:23:50 UTC (743 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-05

Change to browse by:

cs
cs.AI
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Zhongzhan Huang
Senwei Liang
Mingfu Liang
Haizhao Yang

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:DIANet: Dense-and-Implicit Attention Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DIANet: Dense-and-Implicit Attention Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators