Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Fine-Grained Adversarial Semi-Supervised Learning

Published: 25 January 2022 Publication History
  • Get Citation Alerts
  • Editorial Notes

    The authors have requested minor, non-substantive changes to the VoR and, in accordance with ACM policies, a Corrected Version of Record was published on March 18, 2022. For reference purposes, the VoR may still be accessed via the Supplemental Material section on this citation page.

    Abstract

    In this article, we exploit Semi-Supervised Learning (SSL) to increase the amount of training data to improve the performance of Fine-Grained Visual Categorization (FGVC). This problem has not been investigated in the past in spite of prohibitive annotation costs that FGVC requires. Our approach leverages unlabeled data with an adversarial optimization strategy in which the internal features representation is obtained with a second-order pooling model. This combination allows one to back-propagate the information of the parts, represented by second-order pooling, onto unlabeled data in an adversarial training setting. We demonstrate the effectiveness of the combined use by conducting experiments on six state-of-the-art fine-grained datasets, which include Aircrafts, Stanford Cars, CUB-200-2011, Oxford Flowers, Stanford Dogs, and the recent Semi-Supervised iNaturalist-Aves. Experimental results clearly show that our proposed method has better performance than the only previous approach that examined this problem; it also obtained higher classification accuracy with respect to the supervised learning methods with which we compared.

    Supplementary Material

    3485473-vor (3485473-vor.pdf)
    Version of Record for "Fine-Grained Adversarial Semi-Supervised Learning" by Mugnai et al., ACM Transactions on Multimedia Computing, Communications, and Applications, Volume 18, No. 1s (TOMM 18:1s).

    References

    [1]
    Connor Anderson, Matt Gwilliam, Adam Teuscher, Andrew Merrill, and Ryan Farrell. 2020. Facing the hard problems in FGVC. arXiv:2006.13190. https://arxiv.org/abs/2006.13190.
    [2]
    Ben Athiwaratkun, Marc Finzi, Pavel Izmailov, and Andrew Gordon Wilson. 2018. There are many consistent explanations of unlabeled data: Why you should average. In International Conference on Learning Representations.
    [3]
    Thomas Berg, Jiongxin Liu, Seung Woo Lee, Michelle L. Alexander, David W. Jacobs, and Peter N. Belhumeur. 2014. Birdsnap: Large-scale fine-grained visual categorization of birds. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
    [4]
    David Berthelot, Nicholas Carlini, Ian Goodfellow, Nicolas Papernot, Avital Oliver, and Colin A. Raffel. 2019. Mixmatch: A holistic approach to semi-supervised learning. In Advances in Neural Information Processing Systems.
    [5]
    Paola Cascante-Bonilla, Fuwen Tan, Yanjun Qi, and Vicente Ordonez. 2021. Curriculum labeling: Revisiting pseudo-labeling for semi-supervised learning. Proceedings of the AAAI Conference on Artificial Intelligence 35, 8 (May 2021), 6912–6920.
    [6]
    Ting Chen, Simon Kornblith, Kevin Swersky, Mohammad Norouzi, and Geoffrey E. Hinton. 2020. Big self-supervised models are strong semi-supervised learners. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020 (NeurIPS’20), virtual.
    [7]
    Wei Chen, Yu Liu, Weiping Wang, Erwin Bakker, Theodoros Georgiou, Paul Fieguth, Li Liu, and Michael S. Lew. 2021. Deep image retrieval: A survey. arXiv:2101.11282. https://arxiv.org/abs/2101.11282.
    [8]
    Wei Chen, Yu Liu, Weiping Wang, Tinne Tuytelaars, Erwin M. Bakker, and Michael S. Lew. 2020. On the exploration of incremental learning for fine-grained image retrieval. In 31st British Machine Vision Conference 2020 (BMVC’20), virtual event. BMVA Press.
    [9]
    Cheng Cui, Zhi Ye, Yangxi Li, Xinjian Li, Min Yang, Kai Wei, Bing Dai, Yanmei Zhao, Zhongji Liu, and Rong Pang. 2020. Semi-supervised recognition under a noisy and fine-grained dataset. arXiv:2006.10702. https://arxiv.org/abs/2006.10702.
    [10]
    Yin Cui, Yang Song, Chen Sun, Andrew Howard, and Serge Belongie. 2018. Large scale fine-grained categorization and domain-specific transfer learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
    [11]
    Hal Daumé III, Abhishek Kumar, and Avishek Saha. 2010. Frustratingly easy semi-supervised domain adaptation. In Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing.
    [12]
    Jia Deng, Jonathan Krause, and Li Fei-Fei. 2013. Fine-grained crowdsourcing for fine-grained recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
    [13]
    Jeff Donahue, Judy Hoffman, Erik Rodner, Kate Saenko, and Trevor Darrell. 2013. Semi-supervised domain adaptation with instance constraints. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
    [14]
    Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, and Neil Houlsby. 2020. An Image is Worth 16 \( \times \) 16 Words: Transformers for Image Recognition at Scale. arXiv preprint arXiv:2010.11929. https://arxiv.org/abs/2010.11929.
    [15]
    Yaroslav Ganin and Victor Lempitsky. 2015. Unsupervised domain adaptation by backpropagation. In International Conference on Machine Learning. PMLR 37, 1180–1189.
    [16]
    Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, and Victor Lempitsky. 2016. Domain-adversarial training of neural networks. The Journal of Machine Learning Research 17, 59 (2016), 1–35.
    [17]
    Weifeng Ge, Xiangru Lin, and Yizhou Yu. 2019. Weakly supervised complementary parts models for fine-grained image classification from the bottom up. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
    [18]
    Yves Grandvalet and Yoshua Bengio. 2005. Semi-supervised learning by entropy minimization. In Advances in Neural Information Processing Systems.
    [19]
    Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2020. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.
    [20]
    Zhenwei He and Lei Zhang. 2019. Multi-adversarial faster-RCNN for unrestricted object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 6668–6677.
    [21]
    Olivier J. Hénaff, Aravind Srinivas, Jeffrey De Fauw, Ali Razavi, Carl Doersch, S. M. Eslami, and Aaron van den Oord. 2019. Data-efficient image recognition with contrastive predictive coding. arXiv:1905.09272. https://arxiv.org/abs/1905.09272.
    [22]
    Nicholas J. Higham. 2008. Functions of Matrices: Theory and Computation. SIAM. https://arxiv.org/abs/1503.02531.
    [23]
    Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv:1503.02531.
    [24]
    Tao Hu, Honggang Qi, Qingming Huang, and Yan Lu. 2019. See better before looking closer: Weakly supervised data augmentation network for fine-grained visual classification. arXiv:1901.09891. https://arxiv.org/abs/1901.09891.
    [25]
    Catalin Ionescu, Orestis Vantzos, and Cristian Sminchisescu. 2015. Matrix backpropagation for deep networks with structured layers. In Proceedings of the IEEE International Conference on Computer Vision.
    [26]
    Mehran Javanmardi and Tolga Tasdizen. 2018. Domain adaptation for biomedical image segmentation using adversarial training. In 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI’18). IEEE, 554–558.
    [27]
    Hiroharu Kato and Tatsuya Harada. 2019. Learning view priors for single-view 3D reconstruction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 9778–9787.
    [28]
    Aditya Khosla, Nityananda Jayadevaprakash, Bangpeng Yao, and Li Fei-Fei. 2011. Novel dataset for fine-grained image categorization. In 1st Workshop on Fine-Grained Visual Categorization, IEEE Conference on Computer Vision and Pattern Recognition.
    [29]
    Dimitri Korsch, Paul Bodesheim, and Joachim Denzler. 2019. Classification-specific parts for improving fine-grained visual categorization. In German Conference on Pattern Recognition.
    [30]
    Dimitri Korsch, Paul Bodesheim, and Joachim Denzler. 2020. End-to-end learning of a Fisher vector encoding for part features in fine-grained recognition. arXiv:2007.02080. https://arxiv.org/abs/2007.02080.
    [31]
    Jonathan Krause, Benjamin Sapp, Andrew Howard, Howard Zhou, Alexander Toshev, Tom Duerig, James Philbin, and Li Fei-Fei. 2016. The unreasonable effectiveness of noisy data for fine-grained recognition. In European Conference on Computer Vision.
    [32]
    Jonathan Krause, Michael Stark, Jia Deng, and Li Fei-Fei. 2013. 3D object representations for fine-grained categorization. In Proceedings of the IEEE International Conference on Computer Vision Workshops.
    [33]
    Alex Krizhevsky et al. 2009. Learning multiple layers of features from tiny images. Technical Report. University of Toronto.
    [34]
    Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet classification with deep convolutional neural networks. In NIPS.
    [35]
    Abhishek Kumar, Avishek Saha, and Hal Daume. 2010. Co-regularization based semi-supervised domain adaptation. In Advances in Neural Information Processing Systems.
    [36]
    Dong-Hyun Lee. 2013. Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks. In ICML 2013 Workshop: Challenges in Representation Learning (WREPL’13).
    [37]
    Peihua Li, Jiangtao Xie, Qilong Wang, and Zilin Gao. 2018. Towards faster training of global covariance pooling networks by iterative matrix square root normalization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
    [38]
    Yan Li, Junge Zhang, Kaiqi Huang, and Jianguo Zhang. 2018. Mixed supervised object detection with robust objectness transfer. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 3 (2018), 639–653.
    [39]
    Yu-Feng Li and De-Ming Liang. 2019. Safe semi-supervised learning: A brief introduction. Frontiers of Computer Science 13, 4 (2019), 669–676.
    [40]
    Tsung-Yu Lin and Subhransu Maji. 2017. Improved bilinear pooling with CNNs. In Proceedings of the British Machine Vision Conference (BMVC’17). BMVA Press.
    [41]
    Tsung-Yu Lin, Aruni RoyChowdhury, and Subhransu Maji. 2015. Bilinear CNN models for fine-grained visual recognition. In Proceedings of the IEEE International Conference on Computer Vision.
    [42]
    Bin Liu, Zhirong Wu, Han Hu, and Stephen Lin. 2019. Deep metric transfer for label propagation with limited annotated data. In Proceedings of the IEEE International Conference on Computer Vision Workshops.
    [43]
    Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, and Le Song. 2017. Sphereface: Deep hypersphere embedding for face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
    [44]
    Weiyang Liu, Yandong Wen, Zhiding Yu, and Meng Yang. 2016. Large-margin softmax loss for convolutional neural networks. In Proceedings of the 33rd International Conference on International Conference on Machine Learning, Vol. 8.
    [45]
    Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research 9, 86 (2008), 2579–2605.
    [46]
    Subhransu Maji, Esa Rahtu, Juho Kannala, Matthew Blaschko, and Andrea Vedaldi. 2013. Fine-grained visual classification of aircraft. arXiv:1306.5151. https://arxiv.org/abs/1306.5151.
    [47]
    Marc Masana, Xialei Liu, Bartlomiej Twardowski, Mikel Menta, Andrew D. Bagdanov, and Joost van de Weijer. 2020. Class-incremental learning: Survey and performance evaluation. arXiv:2010.15277. https://arxiv.org/abs/2010.15277.
    [48]
    Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, and Shin Ishii. 2018. Virtual adversarial training: A regularization method for supervised and semi-supervised learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 8 (2018), 1979–1993.
    [49]
    Daniele Mugnai, Federico Pernici, Francesco Turchini, and Alberto Del Bimbo. 2021. Soft pseudo-labeling semi-supervised learning applied to fine-grained visual classification. In Proceedings of the ICPR International Workshops and Challenges on Pattern Recognition. Part IV, virtual event. Springer International Publishing, 102–110.
    [50]
    Obed Tettey Nartey, Guowu Yang, Jinzhao Wu, and Sarpong Kwadwo Asare. 2019. Semi-supervised learning for fine-grained classification with self-training. IEEE Access 8 (2019), 2109–2121.
    [51]
    Yuval Netzer, Tao Wang, Adam Coates, Alessandro Bissacco, Bo Wu, and Andrew Y. Ng. 2011. Reading digits in natural images with unsupervised feature learning. In NIPS Workshop on Deep Learning and Unsupervised Feature Learning.
    [52]
    Jiquan Ngiam, Daiyi Peng, Vijay Vasudevan, Simon Kornblith, Quoc V. Le, and Ruoming Pang. 2018. Domain adaptive transfer learning with specialist models. arXiv:1811.07056. https://arxiv.org/abs/1811.07056.
    [53]
    Maria-Elena Nilsback and Andrew Zisserman. 2008. Automated flower classification over a large number of classes. In Indian Conference on Computer Vision, Graphics and Image Processing.
    [54]
    Avital Oliver, Augustus Odena, Colin A. Raffel, Ekin Dogus Cubuk, and Ian Goodfellow. 2018. Realistic evaluation of deep semi-supervised learning algorithms. In Advances in Neural Information Processing Systems.
    [55]
    Chapelle Olivier, Scholkopf Bernhard, and Zien Alexander. 2006. Semi-supervised learning. MIT Press.
    [56]
    Yassine Ouali, Céline Hudelot, and Myriam Tami. 2020. An overview of deep semi-supervised learning. arXiv:2006.05278. https://arxiv.org/abs/2006.05278.
    [57]
    Federico Pernici, Federico Bartoli, Matteo Bruni, and Alberto Del Bimbo. 2018. Memory based online learning of deep representations from video streams. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2324–2334.
    [58]
    F. Pernici, M. Bruni, C. Baecchi, and A. D. Bimbo. 2021. Regular polytope networks. IEEE Transactions on Neural Networks and Learning Systems (2021), 1–15. DOI:
    [59]
    Federico Pernici, Matteo Bruni, Claudio Baecchi, and Alberto Del Bimbo. 2019. Maximally compact and separated features with regular polytope networks. In CVPR Workshops. 46–53.
    [60]
    Federico Pernici, Matteo Bruni, Claudio Baecchi, Francesco Turchini, and Alberto Del Bimbo. 2020. Class-incremental learning with pre-allocated fixed classifiers. In 25th International Conference on Pattern Recognition (ICPR’20). IEEE Computer Society.
    [61]
    Federico Pernici, Matteo Bruni, and Alberto Del Bimbo. 2020. Self-supervised on-line cumulative learning from video streams. Computer Vision and Image Understanding 197 (2020), 102983.
    [62]
    Federico Pernici and Alberto Del Bimbo. 2017. Unsupervised incremental learning of deep descriptors from video streams. In 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW’17). IEEE, 477–482.
    [63]
    Nan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, and Michael S. Lew. 2021. Lifelong person re-identification via adaptive knowledge accumulation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’21). 7901–7910.
    [64]
    Esteban Real, Alok Aggarwal, Yanping Huang, and Quoc V. Le. 2019. Regularized evolution for image classifier architecture search. In Proceedings of the AAAI Conference on Artificial Intelligence.
    [65]
    Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, et al. 2015. ImageNet large scale visual recognition challenge. International Journal of Computer Vision 115 (2015), 211–252.
    [66]
    Kuniaki Saito, Donghyun Kim, Stan Sclaroff, Trevor Darrell, and Kate Saenko. 2019. Semi-supervised domain adaptation via minimax entropy. In Proceedings of the IEEE International Conference on Computer Vision.
    [67]
    Yantao Shen, Yuanjun Xiong, Wei Xia, and Stefano Soatto. 2020. Towards backward-compatible representation learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 6368–6377.
    [68]
    Yusuke Shinohara. 2016. Adversarial multi-task learning of deep neural networks for robust speech recognition. In Interspeech. 2369–2372.
    [69]
    Marcel Simon, Yang Gao, Trevor Darrell, Joachim Denzler, and Erik Rodner. 2017. Generalized orderless pooling performs implicit salient matching. In Proceedings of the IEEE International Conference on Computer Vision.
    [70]
    Marcel Simon, Erik Rodner, Trevor Darrell, and Joachim Denzler. 2018. The whole is more than its parts? From explicit to implicit pose normalization. IEEE Transactions on Pattern Analysis and Machine Intelligence (2018).
    [71]
    Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, and Colin Raffel. 2020. Fixmatch: Simplifying semi-supervised learning with consistency and confidence. arXiv:2001.07685. https://arxiv.org/abs/2006.05278.
    [72]
    Jong-Chyi Su, Zezhou Cheng, and Subhransu Maji. 2021. A realistic evaluation of semi-supervised learning for fine-grained classification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 12966–12975.
    [73]
    Jong-Chyi Su and Subhransu Maji. 2021. The Semi-Supervised iNaturalist-Aves Challenge at FGVC7 Workshop.
    [74]
    Chen Sun, Abhinav Shrivastava, Saurabh Singh, and Abhinav Gupta. 2017. Revisiting unreasonable effectiveness of data in deep learning era. In Proceedings of the IEEE International Conference on Computer Vision.
    [75]
    Antti Tarvainen and Harri Valpola. 2017. Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results. In Advances in Neural Information Processing Systems.
    [76]
    Hugo Touvron, Andrea Vedaldi, Matthijs Douze, and Hervé Jégou. 2019. Fixing the train-test resolution discrepancy. In Advances in Neural Information Processing Systems.
    [77]
    Jesper E. Van Engelen and Holger H. Hoos. 2020. A survey on semi-supervised learning. Machine Learning 109 (2020), 373–440.
    [78]
    Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems.
    [79]
    Catherine Wah, Steve Branson, Peter Welinder, Pietro Perona, and Serge Belongie. 2011. The Caltech-UCSD birds-200-2011 dataset.
    [80]
    Mei Wang and Weihong Deng. 2021. Deep face recognition: A survey. Neurocomputing 429 (2021), 215–244. DOI:
    [81]
    Q. Wang, J. Xie, W. Zuo, L. Zhang, and P. Li. 2020. Deep CNNs meet global covariance pooling: Better representation and generalization. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 8 (2020), 2582–2597.
    [82]
    Yunyun Wang and Songcan Chen. 2013. Safety-aware semi-supervised classification. IEEE Transactions on Neural Networks and Learning Systems 24, 11 (2013), 1763–1772.
    [83]
    Xiu-Shen Wei, Jianxin Wu, and Quan Cui. 2019. Deep learning for fine-grained image analysis: A survey. arXiv:1907.03069. https://arxiv.org/abs/1907.03069.
    [84]
    Tianjun Xiao, Yichong Xu, Kuiyuan Yang, Jiaxing Zhang, Yuxin Peng, and Zheng Zhang. 2015. The application of two-level attention models in deep convolutional neural network for fine-grained image classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
    [85]
    Qizhe Xie, Zihang Dai, Eduard Hovy, Minh-Thang Luong, and Quoc V. Le. 2019. Unsupervised data augmentation for consistency training. arXiv:1904.12848. https://arxiv.org/abs/1904.12848.
    [86]
    I. Zeki Yalniz, Hervé Jégou, Kan Chen, Manohar Paluri, and Dhruv Mahajan. 2019. Billion-scale semi-supervised learning for image classification. arxiv:1905.00546 [cs.CV]. https://arxiv.org/abs/1905.00546.
    [87]
    Ze Yang, Tiange Luo, Dong Wang, Zhiqiang Hu, Jun Gao, and Liwei Wang. 2018. Learning to navigate for fine-grained classification. In Proceedings of the European Conference on Computer Vision (ECCV’18).
    [88]
    Ting Yao, Yingwei Pan, Chong-Wah Ngo, Houqiang Li, and Tao Mei. 2015. Semi-supervised domain adaptation with subspace learning for visual recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
    [89]
    Mang Ye, Jianbing Shen, Gaojie Lin, Tao Xiang, Ling Shao, and Steven C. H. Hoi. 2021. Deep learning for person re-identification: A survey and outlook. IEEE Transactions on Pattern Analysis and Machine Intelligence.
    [90]
    Sangdoo Yun, Dongyoon Han, Sanghyuk Chun, Seong Joon Oh, Youngjoon Yoo, and Junsuk Choe. 2008. CutMix: Regularization strategy to train strong classifiers with localizable features. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV’19). 6022–6031.
    [91]
    Xiaohua Zhai, Avital Oliver, Alexander Kolesnikov, and Lucas Beyer. 2019. S4l: Self-supervised semi-supervised learning. In Proceedings of the IEEE International Conference on Computer Vision.
    [92]
    Fan Zhang, Guisheng Zhai, Meng Li, and Yizhao Liu. 2020. Three-branch and multi-scale learning for fine-grained image recognition (TBMSL-Net). arXiv:2003.09150. https://arxiv.org/abs/2003.09150.
    [93]
    Han Zhang, Tao Xu, Mohamed Elhoseiny, Xiaolei Huang, Shaoting Zhang, Ahmed Elgammal, and Dimitris Metaxas. 2016. SPDA-CNN: Unifying semantic part detection and abstraction for fine-grained recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
    [94]
    Jian Zhang, Runsheng Zhang, Yaping Huang, and Qi Zou. 2019. Unsupervised part mining for fine-grained image classification. arXiv:1902.09941. https://arxiv.org/abs/1902.09941.
    [95]
    Lianbo Zhang, Shaoli Huang, Wei Liu, and Dacheng Tao. 2019. Learning a mixture of granularity-specific experts for fine-grained categorization. In Proceedings of the IEEE International Conference on Computer Vision.
    [96]
    Ning Zhang, Jeff Donahue, Ross Girshick, and Trevor Darrell. 2014. Part-based R-CNNs for fine-grained category detection. In European Conference on Computer Vision. Springer.
    [97]
    Heliang Zheng, Jianlong Fu, Tao Mei, and Jiebo Luo. 2017. Learning multi-attention convolutional neural network for fine-grained image recognition. In Proceedings of the IEEE International Conference on Computer Vision.
    [98]
    Heliang Zheng, Jianlong Fu, Zheng-Jun Zha, and Jiebo Luo. 2019. Learning deep bilinear transformation for fine-grained image representation. In Advances in Neural Information Processing Systems.
    [99]
    Xiaojin Zhu, Zoubin Ghahramani, and John D. Lafferty. 2003. Semi-supervised learning using Gaussian fields and harmonic functions. In Proceedings of the 20th International Conference on Machine Learning (ICML’03).
    [100]
    Peiqin Zhuang, Yali Wang, and Yu Qiao. 2020. Learning attentive pairwise interaction for fine-grained classification. In AAAI.

    Cited By

    View all
    • (2024)Backdoor Two-Stream Video Models on Federated LearningACM Transactions on Multimedia Computing, Communications, and Applications10.1145/3651307Online publication date: 7-Mar-2024
    • (2023)Semi-supervised and un-supervised clusteringInformation Systems10.1016/j.is.2023.102178114:COnline publication date: 1-Mar-2023
    • (2022)SETTI: A Self-supervised AdvErsarial Malware DeTection ArchiTecture in an IoT EnvironmentACM Transactions on Multimedia Computing, Communications, and Applications10.1145/353642518:2s(1-21)Online publication date: 6-Oct-2022
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Multimedia Computing, Communications, and Applications
    ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 18, Issue 1s
    February 2022
    352 pages
    ISSN:1551-6857
    EISSN:1551-6865
    DOI:10.1145/3505206
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 25 January 2022
    Accepted: 01 September 2021
    Revised: 01 July 2021
    Received: 01 March 2021
    Published in TOMM Volume 18, Issue 1s

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Fine-grained visual categorization
    2. deep neural networks
    3. semi-supervised learning
    4. adversarial learning

    Qualifiers

    • Research-article
    • Refereed

    Funding Sources

    • Italian MIUR within PRIN 2017
    • Leonardo Finmeccanica S.p.A

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)95
    • Downloads (Last 6 weeks)9
    Reflects downloads up to 26 Jul 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Backdoor Two-Stream Video Models on Federated LearningACM Transactions on Multimedia Computing, Communications, and Applications10.1145/3651307Online publication date: 7-Mar-2024
    • (2023)Semi-supervised and un-supervised clusteringInformation Systems10.1016/j.is.2023.102178114:COnline publication date: 1-Mar-2023
    • (2022)SETTI: A Self-supervised AdvErsarial Malware DeTection ArchiTecture in an IoT EnvironmentACM Transactions on Multimedia Computing, Communications, and Applications10.1145/353642518:2s(1-21)Online publication date: 6-Oct-2022
    • (undefined)Leveraging Frame- and Feature-Level Progressive Augmentation for Semi-supervised Action RecognitionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/3655025

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media