Video Action Recognition Via Neural Architecture Searching

Peng, Wei; Hong, Xiaopeng; Zhao, Guoying

arXiv:1907.04632 (cs)

[Submitted on 10 Jul 2019]

Abstract: Deep neural networks have achieved great success in video analysis and understanding. However, designing a high-performance neural architecture requires substantial effort and expertise. In this paper, we make the first attempt to let an algorithm automatically design neural networks for video action recognition. Specifically, a spatio-temporal network is developed in a differentiable space modeled by a directed acyclic graph, so that a gradient-based strategy can be used to search for an optimal architecture. Nonetheless, the search is computationally expensive, because evaluating each architecture candidate is still costly. To alleviate this issue on the input side, we introduce a temporal segment approach that reduces the computational cost without losing global video information. On the architecture side, we search in an efficient space built from pseudo 3D operators. Experiments show that, when trained from scratch on the challenging UCF101 dataset, our architecture outperforms popular neural architectures while, surprisingly, using only around one percent of the parameters of its manually designed counterparts.
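
The three ingredients named in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration rather than the authors' implementation: the abstract does not state the number of segments, the sampling rule, the relaxation used on each graph edge, or the exact operator shapes. The sketch assumes one randomly chosen frame per equal-length segment, a softmax-weighted mixture of candidate operators (a common choice in gradient-based architecture search), and a pseudo 3D operator that factorizes a k x k x k convolution into a 1 x k x k spatial convolution followed by a k x 1 x 1 temporal one. All function names are hypothetical.

```python
import math
import random


def sample_segment_indices(num_frames, num_segments, rng=None):
    """Pick one frame index from each of `num_segments` equal-length chunks.

    Assumed sampling rule: uniform random choice inside every chunk, so the
    sampled frames span the whole video at a fixed cost.
    """
    if num_segments <= 0 or num_frames < num_segments:
        raise ValueError("need at least one frame per segment")
    rng = rng or random.Random()
    # Integer chunk boundaries; each chunk holds at least one frame.
    bounds = [i * num_frames // num_segments for i in range(num_segments + 1)]
    return [rng.randrange(bounds[i], bounds[i + 1]) for i in range(num_segments)]


def mixed_output(alphas, candidate_outputs):
    """Softmax-weighted sum of candidate operator outputs on one graph edge.

    Assumed relaxation: the architecture weights `alphas` are continuous, so
    the choice among operators becomes differentiable.
    """
    peak = max(alphas)  # subtract the max for numerical stability
    exps = [math.exp(a - peak) for a in alphas]
    total = sum(exps)
    return sum(e / total * out for e, out in zip(exps, candidate_outputs))


def conv3d_weights(c_in, c_out, k=3):
    """Weight count of a full k x k x k convolution (bias ignored)."""
    return c_in * c_out * k * k * k


def pseudo3d_weights(c_in, c_out, k=3):
    """Weight count of the assumed factorization (bias ignored)."""
    spatial = c_in * c_out * k * k   # 1 x k x k convolution
    temporal = c_out * c_out * k     # k x 1 x 1 convolution
    return spatial + temporal


if __name__ == "__main__":
    print(sample_segment_indices(num_frames=300, num_segments=8))
    print(mixed_output([0.0, 0.0], [2.0, 4.0]))
    print(conv3d_weights(64, 64), pseudo3d_weights(64, 64))
```

Under these assumptions, a 64-channel layer with k = 3 needs 110592 weights as a full 3D convolution and 49152 in factorized form. This only illustrates why such operators shrink the search space; it does not reproduce the parameter reduction reported in the abstract.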

Comments:	Accepted by IEEE ICIP2019
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1907.04632 [cs.CV]
	(or arXiv:1907.04632v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1907.04632

Submission history

From: Wei Peng
[v1] Wed, 10 Jul 2019 11:44:28 UTC (350 KB)
