Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3373724.3373730acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicraiConference Proceedingsconference-collections
research-article

Enhance the Efficacy of Deep CNN with Auxiliary Labels

Published: 03 February 2020 Publication History

Abstract

Auxiliary attributes can improve the performance of deep CNNs for image classification. However, not all auxiliary tasks are helpful for main task performance. Therefore, in this paper we focus on improving the efficacy of deep CNN models with the support of auxiliary attributes. On the principle of minimizing cross entropy loss, we derive a new algorithm to iteratively train the deep CNN model on target and auxiliary labels. We demonstrate that by introducing auxiliary attributes to training images, uncertainty can be reduced in target classification tasks, and adversarial effects avoided in multi-task formulation. We evaluated our learning approach on three categories of overlapping image sets for identical, partial overlapping, and disjoint situations. We performed three group of experiments on three popular deep CNN networks, and three challenging datasets. The results show that our method is able to improve efficacy for target tasks with auxiliary labels in situations where the multi-task learning fails, or is not applicable.

References

[1]
Alonso, H. and Plank, B. 2017. When is multitask learning effective? Semantic sequence prediction under varying data conditions. In EACL.
[2]
Bjerva, J. 2017. Will my auxiliary tagging task help? Estimating auxiliary tasks effectivity in multi-task learning. In NCCL. 216--220.
[3]
Kendall, A., Gal, Y., and Cipolla, R. 2018. Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In CVPR.
[4]
Liebel, L. and Körner, M. 2018. Auxiliary tasks in multi-task learning. CoRR, abs/1805.06334.
[5]
Liang, K., Guo, Y., Chang, H., and Chen, X. 2017. Incomplete attribute learning with auxiliary labels. In IJCAI. 2252--2258.
[6]
Russakovsky, O. and Fei-Fei, L. 2010. Attribute learning in large-scale datasets. In ECCV Workshop.
[7]
Romera-Paredes, B., Argyriou, A., Berthouze, N., and Pontil, M. 2012. Exploiting unrelated tasks in multi-task learning. In ICAIS. 951--959.
[8]
Ruder, S. 2017. An overview of multi-task learning in deep neural networks. arXiv:1706.05098.
[9]
Zhang, Z., Luo, P., Loy, C., and Tang, X. 2016. Learning deep representation for faced alignment with auxiliary attributes. TPAMI, 38, 918--930
[10]
Huh, M.-Y., Agrawal, P., and Efros, A. A. 2016. What makes imagenet good for transfer learning? CoRR, abs/1608.08614.
[11]
Khan, S., Hayat, M., and Porikli, F. 2017. Scene categorization with spectral features. In ICCV.
[12]
Hinton, G., Vinyals, O., and Dean, J. 2015. Distilling the knowledge in a neural network. NIPS Deep Learning and Representation Learning Workshop.
[13]
Pereyra, G., Tucker, G., Chorowski, J., Kaiser, L., and Hinton, G. 2017. Regularizing neural networks by penalizing confient output distributions. In ICLR Workshop.
[14]
Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., and Darrell, T. 2014. Caffe: Convolutional architecture for fast feature embedding. ACM MM.
[15]
Krause, J., Sapp, B., Howard, A., Zhou, H., Toshev, A., Duerig, T., Philbin, J., and Fei-Fei, L. 2016. The unreasonable effectiveness of noisy data for fine-grained recognition. In ECCV.
[16]
Krause, J., Stark, M., Deng, J., and Fei-Fei, L. 2013. 3d object representations for fine-grained categorization. In ICCV Workshops. 554--561.
[17]
Zheng, H., Fu, J., Mei, T., and Luo, J. 2017. Learning multi-attention convolutional neural network for fine-grained image recognition. In ICCV.
[18]
Khosla, A., Jayadevaprakash, N., Yao, B., and Fei-Fei, L. 2011. Novel dataset for fine-grained image categorization. In CVPR.
[19]
Zhang, L., Yang, F., Zhang, Y. D., and Zhu, Y. J., 2016. Road crack detection using deep convolutional neural network. In ICIP.
[20]
Xie, S., Yang, T., Wang, X., and Lin, Y. 2015. Hyper-class augmented and regularized deep learning for fine-grained image classification. In CVPR.
[21]
Krizhevsky, A., Sutskever, I., and Hinton, G. 2012. Imagenet classification with deep convolutional neural networks. In NIPS.
[22]
Simonyan, K. and Zisserman, A. 2014. Very deep convolutional networks for large-scale image recognition. In CVPR.
[23]
Zhou, F. and Lin, Y. 2016. Fine-grained image classification by exploring bipartite-graph labels. In CVPR.
[24]
Hou, S., Liu, X., and Wang, Z. 2017. Dualnet: Learn complementary features for image recognition. ICCV.
[25]
Hu, Y. and Zhao, C. 2010. A local binary pattern based methods for pavement crack detection. Journal of Pattern Recognition Research. 5, 140--147.
[26]
Kapela, R., Sniataa, P., Turkot, A., Rybarczyk, A., Pozarycki, A., Rydzewski, P., Wyczaek, M., and Bloch, A. 2015. Asphalt surfaced pavement cracks detection based on histograms of oriented gradients. Int. Conf. on Mixed Design of Integrated Circuits and Systems. 579--584.
[27]
Oliveira, H. and Correia, P. 2009. Automatic road crack segmentation using entropy and image dynamic thresholding. In Eur Conference on Signal Process. 622--626.
[28]
Salman, M., Mathavan, S., Kamal, K., and Rahman, M. 2013. Pavement crack detection using the gabor filter. In International Conference on Intelligent Transportation Systems. 2039--2044. IEEE.
[29]
van-der Maaten, L. and Hinton, G. 2008. Visualizing data using t-sne. Journal of Machine Learning Research. 9, 2579--2605.

Index Terms

  1. Enhance the Efficacy of Deep CNN with Auxiliary Labels

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICRAI '19: Proceedings of the 5th International Conference on Robotics and Artificial Intelligence
    November 2019
    108 pages
    ISBN:9781450372350
    DOI:10.1145/3373724
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 03 February 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Auxiliary attributes
    2. Deep CNN
    3. Iterative training

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICRAI '19

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 143
      Total Downloads
    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media