A Unified Method for First and Third Person Action Recognition

Javidani, Ali; Mahmoudi-Aznaveh, Ahmad

doi:10.1109/ICEE.2018.8472580

arXiv:1801.00192 (cs)

[Submitted on 30 Dec 2017 (v1), last revised 8 Apr 2018 (this version, v2)]

Abstract: This paper proposes a video classification method that applies to both first- and third-person videos. The main idea is to capture complementary appearance and motion information efficiently by processing each video in two independent streams. The first stream captures long-term motion from short-term motion by tracking how elements of the optical flow images change over time. The optical flow images are described by networks pre-trained on large-scale image datasets, and placing these per-frame descriptions side by side yields a set of multi-channel time series. Motion features are extracted from these time series using the PoT representation together with a novel pooling operator, a combination chosen for its several advantages. The second stream extracts appearance features, which are vital for video classification. The method is evaluated on both first- and third-person datasets, and the results show that it reaches state-of-the-art performance.
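To make the motion stream more concrete, the following is a minimal sketch of pooling a multi-channel time series into a fixed-length feature vector. It assumes that PoT refers to the pooled time series representation and uses four commonly described operators (max, sum, and the sums of positive and negative temporal gradients). The function name `pooled_time_series` and the input layout are hypothetical, and the paper's novel pooling operator is not specified in the abstract, so it is not reproduced here.

```python
from typing import List, Sequence


def pooled_time_series(series: Sequence[Sequence[float]]) -> List[float]:
    """Pool a (T x D) time series into a 4*D feature vector.

    `series[t][d]` is assumed to hold channel d of the descriptor computed
    for the optical flow image at time step t (hypothetical layout).
    """
    if len(series) < 2:
        raise ValueError("need at least two time steps to compute gradients")

    # Transpose so that each entry is the full time series of one channel.
    channels = list(zip(*series))

    max_pool, sum_pool, pos_grad, neg_grad = [], [], [], []
    for channel in channels:
        # Temporal gradient: difference between consecutive time steps.
        diffs = [b - a for a, b in zip(channel, channel[1:])]
        max_pool.append(max(channel))
        sum_pool.append(sum(channel))
        # Total amount of increase and of decrease over time.
        pos_grad.append(sum(d for d in diffs if d > 0))
        neg_grad.append(sum(-d for d in diffs if d < 0))

    # Concatenate the four pooled descriptions into one vector.
    return max_pool + sum_pool + pos_grad + neg_grad


if __name__ == "__main__":
    # Three time steps, two channels (toy values, not real descriptors).
    toy = [[0.0, 1.0], [2.0, 0.0], [1.0, 3.0]]
    print(pooled_time_series(toy))
```

In this sketch the output length depends only on the number of channels, not on the number of frames, so videos of different durations would map to vectors of the same size before being passed to a classifier.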

Comments:	5 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1801.00192 [cs.CV]
	(or arXiv:1801.00192v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1801.00192
Related DOI:	https://doi.org/10.1109/ICEE.2018.8472580

Submission history

From: Ali Javidani
[v1] Sat, 30 Dec 2017 21:03:13 UTC (596 KB)
[v2] Sun, 8 Apr 2018 14:42:37 UTC (514 KB)
