Abstract
Convolutional neural networks (CNNs) have exhibited great potential in the field of image classification in the past few years. In this paper, we present a novel strategy named cross-level to improve the existing CNNs’ architecture in which different levels of feature representation in a network are merely connected in series. The basic idea of cross-level is to establish a convolutional layer between two nonadjacent levels, aiming to learn more sufficient feature representations. The proposed cross-level strategy can be naturally combined into a CNN without any change on its original architecture, which makes this strategy very practical and convenient. Three popular CNNs for image classification are employed to illustrate its implementation in detail. Experimental results on the dataset adopted by the ImageNet Large-Scale Visual Recognition Challenge (ILSVRC) verify the effectiveness of the proposed cross-level strategy on image classification. Furthermore, a new CNN with cross-level architecture is introduced in this paper to demonstrate the value of the proposed strategy in the future CNN design.
Chapter PDF
Similar content being viewed by others
Keywords
References
LeCun, Y., Boser, B., Denker, J.S., et al.: Backpropagation applied to handwritten zip code recognition. Neural Computation 1, 541–551 (1989)
LeCun, Y., Kavukcuoglu K., Farabet C., et al.: Convolutional networks and applications in vision. In: IEEE International Symposium on Circuits and Systems, pp. 254–256 (2010)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convoluntional neural networks. Advances in Neural Information Processing Systems 25, 1106–1114 (2012)
Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part I. LNCS, vol. 8689, pp. 818–833. Springer, Heidelberg (2014)
Lin, M., Chen Q., Yan, S.: Network in network (2013). arXiv: 1312.4400 [cs.NE]
He, K., Zhang, X., Ren, S., et al.: Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition (2014). arXiv: 1406.4729 [cs.CV]
Szegedy, C., Liu, W., Jia Y., et al.: Going deeper with convolutions (2014). arXiv: 1409.4842 [cs.CV]
He, K., Zhang, X., Ren, S., et al.: Delving deep into rectifiers: Surpassing human-Level performance on imageNet classification (2015). arXiv: 1502.01852 [cs.CV]
Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006)
Bengio, Y., Courville, A., Vincent, P.: Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 1798–1828 (2013)
Desimone, R., Duncan, J.: Neural mechanisms of selective visual attention. Annual review of neuroscience 18, 193–222 (1995)
Spirkovska, L., Reid, M.B.: Robust position, scale, and rotation invariant object recognition using higher-order neural networks. Pattern Recognition 25, 975–985 (1992)
Fan, J., Xu, W., Wu, Y., et al.: Human tracking using convolutional neural networks. IEEE Transactions on Neural Networks 21, 1610–1623 (2010)
Sermanet, P., LeCun, Y.: Traffic sign recognition with multi-scale convolutional networks. In: International Joint Conference on Neural Networks, pp. 2809–2813 (2011)
Sun, Y., Wang, X., Tang, X.: Deep learning face representation from predicting 10,000 classes. In: IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1891–1898 (2014)
Jia, Y., Shelhamer, E., Donahue, J., et al.: Caffe: Convolutional architecture for fast feature embedding. In: ACM International Conference on Multimedia, pp. 675–678 (2014)
Caffe website. http://caffe.berkeleyvision.org/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, Y., Yin, B., Yu, J., Wang, Z. (2015). Cross-Level: A Practical Strategy for Convolutional Neural Networks Based Image Classification. In: Zha, H., Chen, X., Wang, L., Miao, Q. (eds) Computer Vision. CCCV 2015. Communications in Computer and Information Science, vol 546. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-48558-3_40
Download citation
DOI: https://doi.org/10.1007/978-3-662-48558-3_40
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-48557-6
Online ISBN: 978-3-662-48558-3
eBook Packages: Computer ScienceComputer Science (R0)