Effective human action recognition by combining manifold regularization and pairwise constraints

Ma, Xueqi; Tao, Dapeng; Liu, Weifeng

doi:10.1007/s11042-017-5172-1

Effective human action recognition by combining manifold regularization and pairwise constraints

Published: 03 September 2017

Volume 78, pages 13313–13329, (2019)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Xueqi Ma¹,
Dapeng Tao² &
Weifeng Liu¹

490 Accesses
Explore all metrics

Abstract

The ever-growing popularity of mobile networks and electronics has prompted intensive research on multimedia data (e.g. text, image, video, audio, etc.) management. This leads to the researches of semi-supervised learning that can incorporate a small number of labeled and a large number of unlabeled data by exploiting the local structure of data distribution. Manifold regularization and pairwise constraints are representative semi-supervised learning methods. In this paper, we introduce a novel local structure preserving approach by considering both manifold regularization and pairwise constraints. Specifically, we construct a new graph Laplacian that takes advantage of pairwise constraints compared with the traditional Laplacian. The proposed graph Laplacian can better preserve the local geometry of data distribution and achieve the effective recognition. Upon this, we build the graph regularized classifiers including support vector machines and kernel least squares as special cases for action recognition. Experimental results on a multimodal human action database (CAS-YNU-MHAD) show that our proposed algorithms outperform the general algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Human action recognition based on the Grassmann multi-graph embedding

Article 06 September 2018

Learning shared subspace regularization with linear discriminant analysis for multi-label action recognition

Article 29 January 2020

A compact discriminant hierarchical clustering approach for action recognition

Article 18 April 2017

References

Ballan L, Bertini M, Del Bimbo A, Seidenari L, Serra G (2012) Effective codebooks for human action representation and classification in unconstrained videos. IEEE Trans Multimedia 14(4):1234–1245
Article Google Scholar
Bar-Hillel A, Hertz T, Shental N, Weinshall D (2005) Learning a mahalanobis metric from equivalence constraints. J Mach Learn Res 6(6):937–965
MathSciNet MATH Google Scholar
Belkin M, Niyogi P (2001) Laplacian eigenmaps and spectral techniques for embedding and clustering. Int Conf Neural Inf Proces Syst: Nat and Synth MIT Press 14(6):585–591
Google Scholar
Belkin M, Niyogi P, Sindhwani V (2006) Manifold regularization: a geometric framework for learning from labeled and unlabeled examples. J Mach Learn Res 7(1):2399–2434
MathSciNet MATH Google Scholar
Bernstein M, De Silva V, Langford JC, Tenenbaum JB (2001) Graph approximations to geodesics on embedded manifolds. Tech Rep, Standard University 24(9):153–158
Google Scholar
Cevikalp H, Verbeek J, Jurie F, Klaser A (2008) Semi-supervised dimensionality reduction using pairwise equivalence constraints. Int Conf Comput Vis Theory Appl 1:489–496
Google Scholar
Chapelle O, Schölkopf B, Zien A (2006) Semi-supervised learning. MIT Press, Cambridge
Book Google Scholar
Chen C, Jafari R, Kehtarnavaz N (2015) Improving human action recognition using fusion of depth camera and inertial sensors. IEEE Trans Hum-Mach Syst 45(1):51–61
Article Google Scholar
Coyte JL, Stirling D, Haiping D, Ros M (2016) Seated whole-body vibration analysis, technologies, and modeling: a survey. IEEE Trans Syst Man Cybern Syst 46(6):725–739
Article Google Scholar
Ding S, Jia H, Zhang L, Jin F (2014) Research of semi-supervised spectral clustering algorithm based on pairwise constraints. Neural Comput & Applic 24(1):211–219
Article Google Scholar
Donoho DL, Grimes C (2003) Hessian eigenmaps: new locally linear embedding techniques for high-dimensional data. Natl Acad Sci USA 100(10):5591–5596
Article MathSciNet MATH Google Scholar
Gong C, Liu T, Tao D, Keren F, Enmei T, Yang J (2015) Deformed graph Laplacian for semisupervised learning. IEEE Trans Neural Netw Learn Syst 26(10):2261–2274
Article MathSciNet Google Scholar
Guo Y, Tao D, Liu W, Cheng J (2017) Multiview Cauchy estimator feature embedding for depth and inertial sensor-based human action recognition. IEEE Trans Syst Man Cybern Syst 47(4):617–627
Article Google Scholar
Hong C, Yu J, Tao D, Wang M (2015) Image-based three-dimensional human pose recovery by Multiview locality-sensitive sparse retrieval. IEEE Trans Ind Electron 62(6):3742–3751
Google Scholar
Hong C, Yu J, Wan J, Tao D, Wang M (2015) Multimodal deep autoencoder for human pose recovery. IEEE Trans Image Process 24(12):5659–5670
Article MathSciNet MATH Google Scholar
Hong C, Yu J, You J, Chen X, Tao D (2015) Multi-view ensemble manifold regularization for 3D object recognition. Inf Sci 320:395–405
Article MathSciNet Google Scholar
Huang K, Wang C, Tao D (2015) High-order topology modeling of visual words for image classification. IEEE Trans Image Process 24(11):3598–3608
Article MathSciNet MATH Google Scholar
Jalal A, Uddin MZ, Kim T-S (2012) Depth video-based human activity recognition system using translation and scaling invariant features for life logging at smart home. IEEE Trans Consum Electron 58(3):863–871
Article Google Scholar
Ji X, Zhaojie J, Wang C, Wang C (2015) Multi-view transition HMMs based view-invariant human action recognition method. Multimed Tools Appl 75(19):1–18
Google Scholar
Jiang J, Hu R, Wang Z, Cai Z (2016) CDMMA: coupled discriminant multi-manifold analysis for matching low-resolution face images. Signal Process 124:162–172
Article Google Scholar
Khan AM, Lee Y-K, Lee SY, Kim T-S (2010) A triaxial accelerometer-based physical-activity recognition via augmented-signal features and a hierarchical recognizer. IEEE Trans Inf Technol Biomed 14(5):1166–1172
Article Google Scholar
Li L, Dai S (2017) Action recognition with spatio-temporal augmented descriptor and fusion method. Multimed Tools Appl 76(12):13953–13969
Liu T, Tao D (2016) Classification with noisy labels by importance reweighting. IEEE Trans Pattern Anal Mach Intell 38(3):447–461
Article Google Scholar
Liu M, Zhang D (2016) Pairwise constraint-guided sparse learning for feature selection. IEEE Trans Cybern 46(1):298–310
Article MathSciNet Google Scholar
Liu W, Liu H, Tao D, Wang Y, Lu K (2014) Multiview hessian regularized logistic regression for action recognition. Signal Process 110:101–107
Article Google Scholar
Liu A, Yuting S, Jia P, Gao Z, Hao T, Yang Z (2015) Multiple/single-view human action recognition via part-induced multitask structural learning. IEEE Trans Cybern 45(6):1194–1208
Article Google Scholar
Luo Y, Tao D, Ramamohanarao K, Xu C, Wen Y (2015) Tensor canonical correlation analysis for multi-view dimension reduction. IEEE Trans Knowl Data Eng 27(11):3111–3124
Article Google Scholar
Luo Y, Wen Y, Tao D, Gui J, Xu C (2016) Large margin multi-modal multi-task feature extraction for image classification. IEEE Trans Image Process 25(1):414–427
Article MathSciNet MATH Google Scholar
Luo Y, Wen Y, Tao D (2016) On Combining Side Information and Unlabeled Data for Heterogeneous Multi-Task Metric Learning, International Joint Conference on Artificial Intelligence , pp. 1809–1815
Mignon A, Jurie F (2012) PCCA: A new approach for distance learning from sparse pairwise constraints, IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, pp. 2666–2672
Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500):2323–2326
Article Google Scholar
Sang J, Deng Z, Lu D, Xu C (2015) Cross-OSN user modeling by homogeneous behavior quantification and local social regularization. IEEE Trans Multimed 17(12):2259–2270
Article Google Scholar
Schiller H, Chaudhuri BB (1990) Efficient coding of side information in a low bit rate hybrid image coder. Signal Process 19(1):61–73
Article Google Scholar
Seeger M (2000) Learning with labeled and unlabeled data. Technical report. University of Edinburgh, Edinburgh
Tentori M, Favela J (2008) Activity-aware computing for healthcare. IEEE Pervasive Comput 7(2):51–57
Article Google Scholar
Tosato D, Spera M, Cristani M, Murino V (2013) Characterizing humans on riemannian manifolds. IEEE Trans Pattern Anal Mach Intell 35(8):1972–1984
Article Google Scholar
Wagstaff K, Cardie C (2000) Clustering with instance-level constraints, International Conference on Machine Learning DBLP, pp. 1103–1110
Wang M, Ni B, Hua X-S, Chua T-S, (2012) Assistive tagging: a survey of multimedia tagging with human-computer joint exploration. ACM Comput Surv (CSUR) 44(4):25
Xia L, Aggarwa JK (2013) Spatio-temporal depth cuboid similarity feature for activity recognition using depth camera, IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, pp. 2834–2841
Yan M, Sang J, Xu C, Shamim Hossain M (2015) YouTube video promotion by cross-network association: @Britney to advertise Gangnam style. IEEE Trans Multimed 17(8):1248–1261
Article Google Scholar
Yang X, Zhang C, Tian Y (2012) Recognizing actions using depth motion maps-based histograms of oriented gradients, Proceedings of the ACM international conference on Multimedia, pp. 1057–1060
Yu J, Rui Y, Tang Y, Tao D (2014) High-order distance based Multiview stochastic learning in image classification. IEEE Trans Cybern 44(12):2431–2442
Article Google Scholar
Zhang D, Zhou Z-H, Chen S, (2007) Semi-supervised dimensionality reduction, Siam International Conference on Data Mining DBLP, 22, pp. 11–393
Zhang D, Chen S, Zhou Z-H (2008) Constraint score: a new filter method for feature selection with pairwise constraints. Pattern Recogn 41(5):1440–1451
Article MATH Google Scholar
Zhang T, Liu S, Xu C, Lu H (2013) Mining semantic context information for intelligent video surveillance of traffic scenes. IEEE Trans Ind Inf 9(1):149–160
Article Google Scholar
Zhang J, Han Y, Tang J, Hu Q, Jiang J (2017) Semi-supervised image-to-video adaptation for video action recognition. IEEE Trans Cybern 47(4):960–973
Article Google Scholar
Zheng J, Jiang Z, Chellappa R (2016) Cross-view action recognition via transferable dictionary learning. IEEE Trans Image Process 25(6):2542–2556
Article MathSciNet MATH Google Scholar
Zhenyong F, Lu Z, Ip HHS, Lu H, Wang Y (2015) Local similarity learning for pairwise constraint propagation. Multimed Tools Appl 74(11):3739–3758
Article Google Scholar
Zhu X (2008) Semi-supervised learning literature survey. Comput Sci 37(1):63–77
MathSciNet Google Scholar

Download references

Acknowledgements

This paper is partly supported by the National Natural Science Foundation of China (Grant No. 61671480), the Fundamental Research Funds for the Central Universities, China University of Petroleum (East China) (Grant No. 14CX02203A, YCX2017059).

Author information

Authors and Affiliations

China University of Petroleum (East China), Qingdao, 266580, China
Xueqi Ma & Weifeng Liu
Yunnan University, Kunming, 650091, China
Dapeng Tao

Authors

Xueqi Ma
View author publications
You can also search for this author in PubMed Google Scholar
Dapeng Tao
View author publications
You can also search for this author in PubMed Google Scholar
Weifeng Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Weifeng Liu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ma, X., Tao, D. & Liu, W. Effective human action recognition by combining manifold regularization and pairwise constraints. Multimed Tools Appl 78, 13313–13329 (2019). https://doi.org/10.1007/s11042-017-5172-1

Download citation

Received: 07 April 2017
Revised: 05 June 2017
Accepted: 29 August 2017
Published: 03 September 2017
Issue Date: 30 May 2019
DOI: https://doi.org/10.1007/s11042-017-5172-1

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Effective human action recognition by combining manifold regularization and pairwise constraints

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Human action recognition based on the Grassmann multi-graph embedding

Learning shared subspace regularization with linear discriminant analysis for multi-label action recognition

A compact discriminant hierarchical clustering approach for action recognition

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Effective human action recognition by combining manifold regularization and pairwise constraints

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Human action recognition based on the Grassmann multi-graph embedding

Learning shared subspace regularization with linear discriminant analysis for multi-label action recognition

A compact discriminant hierarchical clustering approach for action recognition

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation