Abstract
Face recognition has many important applications in areas such as public surveillance and security, identity verification in the digital world, and modeling techniques in multimedia data management. Facial expression recognition is also important for targeted marketing, medical analysis, and human–robot interaction. In this paper, we survey a few techniques for facial analysis. We compare the cloud platform AWS Rekognition, convolutional neural networks, transfer learning from pre-trained neural nets, and traditional feature extraction using facial landmarks for this analysis. Although not comprehensive, this survey covers a lot of ground in the state-of-the-art solutions for facial analysis. We show that to get high accuracy, good-quality data and processing power must be provided in large quantities. We present the results of our experiments which have been conducted over six different public as well as proprietary image data sets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Facial recognition system. (2017, October). https://en.wikipedia.org/wiki/Facial_recognition_system.
Amazon Rekognition. (2017). https://aws.amazon.com/rekognition/.
Karpathy, A. Convolutional neural networks (CNNs/ConvNets). http://cs231n.stanford.edu/.
Karpathy, A. Transfer learning. http://cs231n.github.io/transfer-learning/.
Feature engineering. (2017, September). https://en.wikipedia.org/wiki/Feature_engineering.
Kazemi, V., & Sullivan, J. (2014). One millisecond face alignment with an ensemble of regression trees. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2014, pp. 1867–1874.
Rowley, H. A., Baluja, S., & Kanade, T. (1998, January). Neural network-based face detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(1), 23–38.
Viola, P., & Jones, M. (2001). Rapid object detection using a boosted cascade of simple features. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001. Kauai, HI, USA: IEEE.
Zhang, C., & Zhang, Z. (2010, June). A survey of recent advances in face detection (Technical Report). Redmond, WA 98052: Microsoft Research.
Principal component analysis. (2017, October). https://en.wikipedia.org/wiki/Principal_component_analysis.
Linear discriminant analysis. (2017, September). https://en.wikipedia.org/wiki/Linear_discriminant_analysis.
Gabor wavelet. (2015, October). https://en.wikipedia.org/wiki/Gabor_wavelet.
Local binary patterns. (2017, September). https://en.wikipedia.org/wiki/Local_binary_patterns.
Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, July 2005. San Diego, CA, USA: IEEE.
Scale-invariant feature transform. (2017, September). https://en.wikipedia.org/wiki/Scaleinvariant_feature_transform.
Microsoft Developer Guide. Microsoft Kinect. (2017). https://developer.microsoft.com/en-us/windows/kinect.
Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet classification with deep convolutional neural networks. In NIPS’12 Proceedings of the 25th International Conference on Neural Information Processing Systems—Volume 1, Lake Tahoe, Nevada, December 03–06, 2012, pp. 1097–1105.
Zeiler, M. D. & Fergus, R. (2013). Visualizing and understanding convolutional networks. In CoRR.
Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. In CoRR.
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., et al. (2015). Going deeper with convolutions. In Computer Vision and Pattern Recognition (CVPR). IEEE.
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016. Las Vegas, NV, USA: IEEE.
Stanford Vision Lab, Stanford University, Princeton University. (2016). ImageNet.
Lyons, M. J., Akamatsu, S., Kamachi, M., & Gyoba, J. (1998). Coding facial expressions with Gabor Wavelets. In Proceedings, Third IEEE International Conference on Automatic Face and Gesture Recognition, April 14–16, 1998, pp. 200–205. Nara Japan: IEEE Computer Society.
Kanade, T., Cohn, J. F., & Tian, Y. (2000). Comprehensive database for facial expression analysis. In Fourth IEEE International Conference on Automatic Face and Gesture Recognition.
Challenges in Representation Learning: Facial Expression Recognition Challenge. (2013). https://www.kaggle.com/c/challenges-in-representation-learning-facial-expression-recognition-challenge/data.
Rothe, R., Timofte, R., & Van Gool, L. (n.d.). IMDB-WIKI—500 k + face images with age and gender labels.
Amazon Web Services, Inc. Amazon S3. (2017). https://aws.amazon.com/s3/.
Acknowledgements
We would like to thank our colleagues—Dr. Siddhartha Chatterjee and Aashis Tiwari—at Persistent Systems for valuable discussions and feedback during this work.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Mane, S., Shah, G. (2019). Facial Recognition, Expression Recognition, and Gender Identification. In: Balas, V., Sharma, N., Chakrabarti, A. (eds) Data Management, Analytics and Innovation. Advances in Intelligent Systems and Computing, vol 808. Springer, Singapore. https://doi.org/10.1007/978-981-13-1402-5_21
Download citation
DOI: https://doi.org/10.1007/978-981-13-1402-5_21
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1401-8
Online ISBN: 978-981-13-1402-5
eBook Packages: EngineeringEngineering (R0)