Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Partially-supervised learning from facial trajectories for face recognition in video surveillance

Published: 01 July 2015 Publication History

Abstract

Face recognition (FR) is employed in several video surveillance applications to determine if facial regions captured over a network of cameras correspond to a target individuals. To enroll target individuals, it is often costly or unfeasible to capture enough high quality reference facial samples a priori to design representative facial models. Furthermore, changes in capture conditions and physiology contribute to a growing divergence between these models and faces captured during operations. Adaptive biometrics seek to maintain a high level of performance by updating facial models over time using operational data. Adaptive multiple classifier systems (MCSs) have been successfully applied to video-to-video FR, where the face of each target individual is modeled using an ensemble of 2-class classifiers (trained using target vs. non-target samples). In this paper, a new adaptive MCS is proposed for partially-supervised learning of facial models over time based on facial trajectories. During operations, information from a face tracker and individual-specific ensembles is integrated for robust spatio-temporal recognition and for self-update of facial models. The tracker defines a facial trajectory for each individual that appears in a video, which leads to the recognition of a target individual if the positive predictions accumulated along a trajectory surpass a detection threshold for an ensemble. When the number of positive ensemble predictions surpasses a higher update threshold, then all target face samples from the trajectory are combined with non-target samples (selected from the cohort and universal models) to update the corresponding facial model. A learn-and-combine strategy is employed to avoid knowledge corruption during self-update of ensembles. In addition, a memory management strategy based on Kullback-Leibler divergence is proposed to rank and select the most relevant target and non-target reference samples to be stored in memory as the ensembles evolves. For proof-of-concept, a particular realization of the proposed system was validated with videos from Face in Action dataset. Initially, trajectories captured from enrollment videos are used for supervised learning of ensembles, and then videos from various operational sessions are presented to the system for FR and self-update with high-confidence trajectories. At a transaction level, the proposed approach outperforms baseline systems that do not adapt to new trajectories, and provides comparable performance to ideal systems that adapt to all relevant target trajectories, through supervised learning. Subject-level analysis reveals the existence of individuals for which self-updating ensembles with unlabeled facial trajectories provides a considerable benefit. Trajectory-level analysis indicates that the proposed system allows for robust spatio-temporal video-to-video FR, and may therefore enhance security and situation analysis in video surveillance.

References

[1]
F. Li, H. Wechsler, Open set face recognition using transduction, IEEE Trans. Pattern Anal. Mach. Intell., 27 (2005) 1686-1697.
[2]
M. De-la Torre, E. Granger, P.V.W. Radtke, R. Sabourin, D.O. Gorodnichy, Incremental update of biometric models in face-based video surveillance, in: Proceedings on International Joint Conference on Neural Networks, Brisbane, Australia, 2012, pp. 1-8.
[3]
F. Matta, J.-L. Dugelay, Person recognition using facial video information: a state of the art, J. Vis. Lang. Comput., 20 (2009) 180-187.
[4]
V. Despiegel, S. Gentric, J. Fondeur, Border control: From technical to operational evaluation, in: Proceedings on International Biometric Performance Testing Conference, Gaithersburg, Maryland, US, 2012.
[5]
H.K. Ekenel, J. Stallkamp, R. Stiefelhagen, A video-based door monitoring system using local appearance-based face models, Comput. Vis. Image Underst., 114 (2010) 596-608.
[6]
S. Zhou, V. Krueger, R. Chellappa, Probabilistic recognition of human faces from video, Comput. Vis. Image Underst., 91 (2003) 214-245.
[7]
R. Polikar, L. Udpa, S.S. Udpa, V. Honavar, Learn++: an incremental learning algorithm for MLP networks, IEEE Trans. Syst. Man Cybern., 31 (2001) 497-508.
[8]
R. Singh, M. Vatsa, A. Ross, A. Noore, Biometric classifier update using online learning: a case study in near infrared face verification, Image Vis. Comput., 28 (2010) 1098-1105.
[9]
J.-F. Connolly, E. Granger, R. Sabourin, Evolution of heterogeneous ensembles through dynamic particle swarm optimization for video-based face recognition, Pattern Recogn., 45 (2012) 2460-2477.
[10]
C. Pagano, E. Granger, R. Sabourin, D.O. Gorodnichy, Detector ensembles for face recognition in video surveillance, in: Proceedings on International Joint Conference on Neural Networks, Brisbane, Australia, 2012, pp. 1-8.
[11]
A. Rattani, Adaptive Biometric System Based on Template Update Procedures, Ph.D. Thesis, University of Cagliari, 2010.
[12]
F. Roli, G.L. Marcialis, Semi-supervised PCA-based face recognition using self-training, in: Proceedings on Joint International Association for Pattern Recognition - International Workshop on Structural and Syntactical Pattern Recognition and Statistical Techniques in Pattern Recognition, vol. 4109, Springer, Hong Kong, China, 2006, pp. 560-568.
[13]
A. Franco, D. Maio, D. Maltoni, Incremental template updating for face recognition in home environments, Pattern Recogn., 43 (2010) 2891-2903.
[14]
F. Roli, L. Didaci, G. Marcialis, Template co-update in multimodal biometric systems, in: Proceedings on International Conference on Biometrics, vol. 4642, Seoul, Korea, 2007, pp. 1194 - 1202.
[15]
A. Merati, N. Poh, J. Kittler, Extracting discriminative information from cohort models, in: Proceedings on IEEE International Conference on Biometrics: Theory Applications and Systems, 2010, pp. 1-6.
[16]
P. Hart, The condensed nearest neighbor rule (correspondence), IEEE Trans. Inf. Theory, 14 (1968) 515-516.
[17]
A. KachitesMcCallum, K. Nigam, Employing EM and pool-based active learning for text classification, in: Proceedings on International Conference on Machine Learning, San Francisco, USA, 1998, pp. 350-358.
[18]
A. Yilmaz, O. Javed, M. Shah, Object tracking: a survey, ACM Comput. Surv., 38 (2006) 1-45.
[19]
D. Tax, R. Duin, Growing a multi-class classifier with a reject option, Pattern Recogn., 29 (2008) 1565-1570.
[20]
A.K. Jain, A. Ross, Learning user-specific parameters in a multibiometric system, in: Proceedings on International Conference on Image Processing, 2002, pp. 57-60.
[21]
B. Kamgar-Parsi, W. Lawson, B. Kamgar-Parsi, Toward development of a face recognition system for watchlist surveillance, IEEE Trans. Pattern Anal. Mach. Intell., 33 (2011) 1925-1937.
[22]
Y. Zhang, A. Martinez, From stills to video: face recognition using a probabilistic approach, in: Proceedings on Workshop Conference on Computer Vision and Pattern Recognition, 2004, p. 78. http://dx.doi.org/10.1109/CVPR.2004.75.
[23]
X. Liu, T. Cheng, Video-based face recognition using adaptive Hidden Markov Models, in: Proceedings IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, Los Alamitos, CA, USA, 2003, pp. 340-345.
[24]
Y.-C. Chen, V.M. Patel, S. Shekhar, R. Chellappa, P.J. Phillips, Video-based face recognition via joint sparse representation, in: Proceedings on International Conference on Automatic Face and Gesture Recognition, Shangai, China, 2013.
[25]
B. Freni, G.L. Marcialis, F. Roli, Template selection by editing algorithms: A case study in face recognition, in: Proceedings on Joint International Association of Pattern Recognition, vol. 5342, Orlando, USA, 2008, pp. 745-754.
[26]
D.D. Lewis, J. Catlett, Heterogeneous uncertainty sampling for supervised learning, in: Proceedings on International Conference on Machine Learning, Morgan Kaufmann, 1994, pp. 148-156.
[27]
W. Liu, J.C. Principe, HaykinSimon, Kernel Adaptive Filtering: A Comprehensive Introduction, Wiley, 2010.
[28]
T. Scheffer, C. Decomain, S. Wrobel, Active Hidden Markov models for information extraction, in: Proceedings on International Conference in Advances in Intelligent Data Analysis, vol. 2189, Berlin, Germany, 2001, pp. 309-318.
[29]
C.E. Shannon, A mathematical theory of communication, Bell Syst. Techn. J., 27 (1948) 623-656.
[30]
I. Dagan, S. Engelson, Committee-based sampling for training probabilistic classifiers, in: Proceedings on International Conference on Machine Learning, San Francisco, USA, 1995, pp. 150-157.
[31]
E. Tang, P. Suganthan, X. Yao, An analysis of diversity measures, Mach. Learn., 65 (2006) 247-271.
[32]
X. Guo, Y. Yin, C. Dong, G. Yang, G. Zhou, On the class imbalance problem, in: Proceedings on International Conference on Natural Computation, vol. 4, Piscaatway, NJ, USA, 2008, pp. 192-201.
[33]
M. Galar, A. Fernandez, E. Barrenechea, H. Bustince, F. Herrera, A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches, IEEE Trans. Syst. Man Cybern., 42 (2011) 463-484.
[34]
I. Tomek, Two modifications of CNN, IEEE Trans. Syst. Man Cybern., 6 (1976) 769-772.
[35]
P. Flach, E. Matsubara, On classification, ranking, and probability estimation, in: Proceedings on Probabilistic, Logical and Relational Learning - A Further Synthesis, No. 07161 in Dagstuhl Seminar Proceedings, Dagstuhl, Germany, 2008, pp. 1-10.
[36]
F. Roli, L. Didaci, G.L. Marcialis, Adaptive biometric systems that can improve with use, in: Advances in Biometrics: Sensors, Systems and Algorithms, Springer, 2008, pp. 447-471.
[37]
K. Okada, L. Kite, C. vonder Malsburg, An adaptive person recognition system, in: Proceedings on IEEE International Workshop on Robot and Human Interactive Communication, Piscataway, NJ, USA, 2001, pp. 436-441.
[38]
A. Rattani, G. Marcialis, F. Roli, Capturing large intra-class variations of biometric data by template co-updating, in: Proceedings on IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Piscataway, NJ, USA, 2008, pp. 1-6.
[39]
A. Rattani, B. Freni, G.L. Marcialis, F. Roli, Template update methods in adaptive biometric systems: a critical review, in: Lecture Notes in Computer Science (Included Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5558, Alghero, Italy, 2009, pp. 847-856.
[40]
I. Cohen, F.G. Cozman, N. Sebe, M.C. Cirelo, T.S. Huang, Semisupervised learning of classifiers: theory, algorithms, and their application to human-computer interaction, IEEE Trans. Pattern Anal. Mach. Intell., 26 (2004) 1553-1567.
[41]
K. Lu, Z. Ding, J. Zhao, Y. Wu, A novel semi-supervised face recognition for video, in: Proceedings of the International Conference on Intelligent Control and Information Processing, 2010, pp. 313-316.
[42]
L. Didaci, F. Roli, Using co-training and self-training in semi-supervised multiple classifier systems, in: Lecture Notes on Computer Sciences, vol. 4109, Hong Kong, China, 2006, pp. 522-530.
[43]
N. ElGayar, S.A. Shaban, S. Hamdy, Face recognition with semi-supervised learning and multiple classifiers, in: Proceedings on World Scientific Engineering Academy and Society International Conference on Computational Intelligence, Man-Machine Systems and Cybernetics, USA, 2006, pp. 296-301.
[44]
G. Yu, G. Zhang, C. Domeniconi, Z. Yu, J. YouZ, Semi-supervised classification based on random subspace dimensionality reduction, Pattern Recogn., 45 (2012) 1119-1135.
[45]
R. Hewitt, S. Belongie, Active learning in face recognition: Using tracking to build a face model, in: Proceedings IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York, NY, United States, 2006, p. 157.
[46]
T. Fawcett, An introduction to ROC analysis, Pattern Recogn. Lett., 27 (2006) 861-874.
[47]
L. Kuncheva, Combining Pattern Classifiers: Methods and Algorithms, Wiley, 2004.
[48]
R. Goh, L. Liu, X. Liu, T. Chen, The CMU face in action database, in: Analysis and Modelling of Faces and Gestures, Carnegie Mellon University, 2005, pp. 255-263.
[49]
P. Viola, M. Jones, Robust real-time face detection, Int. J. Comput. Vis., 2 (2004) 137-154.
[50]
G.R. Bradski, Computer vision face tracking for use in a perceptual user interface, Intel Technol. J., Q2 (1998) 1-15.
[51]
T. Ojala, M. Pietikainen, T. Maenpaa, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns, IEEE Trans. Pattern Anal. Mach. Intell., 24 (2002) 971-987.
[52]
C.P. Lim, R.F. Harrison, Probabilistic fuzzy ARTMAP: an autonomous neural network architecture for bayesian probability estimation, in: Proceedings International Conference on Artificial Neural Networks, 1995, pp. 148-153.
[53]
W. Khreich, E. Granger, A. Miri, R. Sabourin, Iterative Boolean combination of classifiers in the ROC space: an application to anomaly detection with HMMs, Pattern Recogn., 43 (2010) 2732-2752.
[54]
G. Doddington, W. Liggett, A. Martin, M. Przybocki, D. Reynolds, Sheep, goats, lambs and wolves: a statistical analysis of speaker performance, in: Proceedings on International Conference on spoken language processing, 1998, pp. 1351-1354.
[55]
I. Oh, C.I. Suen, A class-modular feedforward neural network for handwriting recognition, Pattern Recogn., 35 (2002) 229-244.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Information Fusion
Information Fusion  Volume 24, Issue C
July 2015
168 pages

Publisher

Elsevier Science Publishers B. V.

Netherlands

Publication History

Published: 01 July 2015

Author Tags

  1. Adaptive biometrics
  2. Incremental learning
  3. Multiple classifier systems
  4. Video surveillance
  5. Video-to-video face recognition

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 22 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2023)3D Face Reconstruction: The Road to ForensicsACM Computing Surveys10.1145/362528856:3(1-38)Online publication date: 21-Oct-2023
  • (2023)Exploring fusion strategies for accurate RGBT visual object trackingInformation Fusion10.1016/j.inffus.2023.10188199:COnline publication date: 1-Nov-2023
  • (2022)Incremental Learning from Low-labelled Stream Data in Open-Set Video Face RecognitionPattern Recognition10.1016/j.patcog.2022.108885131:COnline publication date: 1-Nov-2022
  • (2021)Towards a self-sufficient face verification systemExpert Systems with Applications: An International Journal10.1016/j.eswa.2021.114734174:COnline publication date: 15-Jul-2021
  • (2021)Face recognition using particle swarm optimization based block ICAMultimedia Tools and Applications10.1007/s11042-021-10792-580:28-29(35685-35695)Online publication date: 1-Nov-2021
  • (2020)Image set-based face recognition using pose estimation with facial landmarksMultimedia Tools and Applications10.1007/s11042-019-08408-079:27-28(19493-19507)Online publication date: 1-Jul-2020
  • (2019)Incremental Learning Techniques Within a Self-updating Approach for Face Verification in Video-SurveillancePattern Recognition and Image Analysis10.1007/978-3-030-31321-0_3(25-37)Online publication date: 1-Jul-2019
  • (2018)Multi-layer CNN Features Fusion and Classifier Optimization for Face RecognitionProceedings of the 2018 2nd International Conference on Computer Science and Artificial Intelligence10.1145/3297156.3297208(273-276)Online publication date: 8-Dec-2018
  • (2018)Incremental on-line learningNeurocomputing10.1016/j.neucom.2017.06.084275:C(1261-1274)Online publication date: 31-Jan-2018
  • (2018)Real-time foreground detection approach based on adaptive ensemble learning with arbitrary algorithms for changing environmentsInformation Fusion10.1016/j.inffus.2017.05.00139:C(154-167)Online publication date: 1-Jan-2018
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media