Ultrafast Video Attention Prediction with Coupled Knowledge Distillation

Fu, Kui; Shi, Peipei; Song, Yafei; Ge, Shiming; Lu, Xiangju; Li, Jia

arXiv:1904.04449 (cs)

[Submitted on 9 Apr 2019 (v1), last revised 2 Jan 2020 (this version, v2)]

Abstract: Large convolutional neural network models have recently demonstrated impressive performance on video attention prediction. Conventionally, these models require intensive computation and a large memory footprint. To address these issues, we design an extremely lightweight network with ultrafast speed, named UVA-Net. The network is built on depth-wise convolutions and takes low-resolution images as input. However, this straightforward acceleration method decreases performance dramatically. To counter this, we propose a coupled knowledge distillation strategy to augment and train the network effectively. With this strategy, the model can automatically discover and emphasize implicit useful cues contained in the data. Both the spatial and the temporal knowledge learned by high-resolution complex teacher networks can also be distilled and transferred into the proposed low-resolution lightweight spatiotemporal network. Experimental results show that the performance of our model is comparable to 11 state-of-the-art models in video attention prediction, while it has a memory footprint of only 0.68 MB and runs at about 10,106 FPS on GPU and 404 FPS on CPU, which is 206 times faster than previous models.
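
The two ideas named in the abstract, depth-wise convolutions and distillation from a teacher, can be illustrated with a small sketch. It is not the UVA-Net architecture or the paper's coupled distillation loss. The layer sizes, the pairing of the depth-wise convolution with a 1x1 point-wise convolution, the mean-squared-error loss, and the `alpha` weighting are all assumptions chosen for illustration.

```python
def standard_conv_params(c_in, c_out, k):
    """Weight count of a standard k x k convolution (bias omitted)."""
    return c_in * c_out * k * k


def depthwise_separable_params(c_in, c_out, k):
    """Weight count of a depth-wise k x k convolution followed by a
    1x1 point-wise convolution (an assumed pairing, bias omitted)."""
    depthwise = c_in * k * k   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1x1 convolution that mixes channels
    return depthwise + pointwise


def mse(a, b):
    """Mean squared error between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def distillation_loss(student, teacher, target, alpha=0.5):
    """Generic distillation objective (hypothetical form): a weighted sum of
    the student's error against the teacher and against the ground truth."""
    return alpha * mse(student, teacher) + (1 - alpha) * mse(student, target)


# Hypothetical layer sizes, not taken from the paper.
c_in, c_out, k = 64, 128, 3
std = standard_conv_params(c_in, c_out, k)
sep = depthwise_separable_params(c_in, c_out, k)
print(std, sep, round(std / sep, 2))  # 73728 8768 8.41

# Toy two-pixel "saliency maps" for the loss.
loss = distillation_loss([0.5, 0.5], [1.0, 0.0], [0.5, 0.5])
print(loss)  # 0.125
```

For these assumed sizes, the depth-wise variant uses roughly one eighth of the weights of the standard layer, which is the kind of saving that motivates a lightweight student network. The loss shows how a teacher's prediction can act as an additional training signal alongside the ground truth.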

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1904.04449 [cs.CV]
	(or arXiv:1904.04449v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1904.04449

Submission history

From: Jia Li [view email]
[v1] Tue, 9 Apr 2019 03:32:08 UTC (2,008 KB)
[v2] Thu, 2 Jan 2020 07:35:44 UTC (2,014 KB)
