Abstract
This paper presents techniques for image reconstruction and recognition using autoencoders. Experiments are conducted to compare the performances of three types of autoencoder neural networks based on their efficiency of reconstruction and recognition. Reconstruction error and recognition rate are determined in all the three cases using the same architecture configuration and training algorithm. The results obtained with autoencoders are also compared with those obtained using principal component analysis method. Instead of whole images, image patches are used for training, and this leads to much simpler autoencoder architectures and reduced training time.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313:504–507
Bengio Y, Lamblin P, Popovici D, Larochelle H (2006) Greedy layer-wise training of deep networks. Advances in Neural Information Processing Systems 19. MIT Press, Cambridge, Tech Rep 1282, Aug 2006
Tan CC, Eswaran C (2009) Using autoencoders for mammogram compression. J Med Syst. doi:10.1007/s10916-009-9340-3
Polak E, Ribiére G (1969) Note sur la convergence de methods de directions conjures. Revue Francais Information Recherche Operationnelle 16:35–43
Larochelle H, Erhan D, Courville A, Bergstra J, Bengio Y (2007) An empirical evaluation of deep architectures on problems with many factors of variation. In: Proceedings of the 24th international conference on machine learning (ICLM). ACM, Corvalis, Oregon, USA, pp 473–480
Hinton GE, Boltzmann Machine. Online. Scholarpedia, 2(5):1668. Available: http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-5586
Tee YW, Hinton GE (2000) Rate-coded Restricted Boltzmann Machines for face recognition. In: Neural information processing systems 13. MIT Press, Cambridge, pp 908–914
Salakhutdinov R, Mnih A, Hinton G (2007) Restricted Boltzmann machines for collaborative filtering. In: Proceedings of the 24th International Conference on Machine Learning. Corvallis, OR, USA, pp 791–798
Cover T, Hart P Nearest neighbor pattern classification. IEEE Trans Inf Theory 13(1):21–27
Hinton GE, Revow M, Dayan P (1995) Recognizing handwritten digits using mixtures of linear models. In: Tesauro G, Touretzky DS, Leen TK (eds) Advances in neural information processing systems 7. MIT Press, Cambridge, pp 1015–1022
Hinton GE, Dayan P, Revow M (1997) Modeling the manifolds of images of handwritten digits. IEEE Trans Neural Netw 8(1):65–74
Hinton GE, Salakhutdinov RR (2006) Supporting online material for reducing the dimensionality of data with neural networks. Science 313(5786), Online. Available: http://www.sciencemag.org/cgi/content/full/313/5786/504/DC1
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Tan, C.C., Eswaran, C. Reconstruction and recognition of face and digit images using autoencoders. Neural Comput & Applic 19, 1069–1079 (2010). https://doi.org/10.1007/s00521-010-0378-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-010-0378-4