Abstract
Active learning focuses on building competitive models using less data by utilizing intelligent sampling, thereby reducing the effort and cost associated with manual data annotation. In this paper an active learning method based on a balanced sampling of images with high and low confidence object detection scores for training an object detector is proposed. Images with higher object prediction scores are sampled using the uncertainty measure proposed by Yu et al. which utilizes detection, classification and distribution statistics. Though this method encourages balanced distribution in sampling, a deeper look into the sampled distribution reveals that the under-represented classes in the initial labeled pool remain skewed throughout the subsequent active learning cycles. To mitigate this problem, in each active learning cycle, we propose to sample an equal proportion of images with high and low confidence object prediction scores from the model trained in the last cycle, where the low confidence prediction sample selection is based on the model’s prediction scores. Experiments conducted on the UEC Food 100 dataset show that the proposed method performs better than the baseline random sampling, CALD and low confidence prediction sampling method by +4.7, +8.7, and +3.1 mean average precision (mAP), respectively. Moreover, consistently superior performance of the proposed method is also demonstrated on the PASCAL VOC’07 and PASCAL VOC’12 datasets.
Work done as intern at Swiggy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bengar, J.Z., van de Weijer, J., Fuentes, L.L., Raducanu, B.: Class-balanced active learning for image classification. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 1536–1545 (2022)
Yoo, D., Kweon, I.S.: Learning loss for active learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 93–102 (2019)
Sener, O., Savarese, S.: Active learning for convolutional neural networks: a core-set approach. arXiv preprint arXiv:1708.00489 (2017)
Kao, C.-C., Lee, T.-Y., Sen, P., Liu, M.-Y.: Localization-aware active learning for object detection. In: Asian Conference on Computer Vision, pp. 506–522. Springer, Berlin (2018)
Haussmann, E., Fenzi, M., Chitta, K., Ivanecky, J., Xu, H., Roy, D., Mittel, A., Koumchatzky, N., Farabet, C., Alvarez, J.M.: Scalable active learning for object detection. IEEE Intell. Veh. Symp. (iv). IEEE 2020, 1430–1435 (2020)
Wang, K, Lin, Liang, Yan, Xiaopeng, Chen, Ziliang, Zhang, Dongyu, Zhang, Lei: Cost-effective object detection: active sample mining with switchable selection criteria. IEEE Trans. Neural Netw. Learn. Syst. 30(3), 834–850 (2018)
Yu, W., Zhu, S., Yang, T., Chen, C., Liu, M.: Consistency-based active learning for object detection. arXiv preprint arXiv:2103.10374 (2021)
Wang, K., Yan, X., Zhang, D., Zhang, L., Lin, L.: Towards human-machine cooperation: Self-supervised sample mining for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1605–1613 (2018)
Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisser- man, A.: The PASCAL visual object classes challenge 2007 (VOC2007) results (2007). http://www.pascal-network.org/challenges/VOC/voc2007/workshop/index.html
Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisser-man, A.: The PASCAL Visual object classes challenge 2012 (VOC2012) results (2012). http://www.pascal-network.org/challenges/VOC/voc2012/workshop/index.html
Matsuda, Y., Hoashi, H., Yanai, K.: Recognition of multiple-food images by detecting candidate regions. In: Proceeding of the IEEE International Conference on Multi- media and Expo (ICME) (2012)
Ren, S., He, K., Girshick, R., Sun, J.: Faster r-cnn: Towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst. 28 (2015)
Lin, T.-Y., Dollar, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2117–2125 (2017)
Loshchilov, l., Hutter, F.: Sgdr: Stochastic gradient descent with warm restarts. arXiv preprint arXiv:1608.03983 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Aithal, S.J., Adhikari, S.P., Ghorai, M., Misra, H. (2023). Balanced Sampling-Based Active Learning for Object Detection. In: Tistarelli, M., Dubey, S.R., Singh, S.K., Jiang, X. (eds) Computer Vision and Machine Intelligence. Lecture Notes in Networks and Systems, vol 586. Springer, Singapore. https://doi.org/10.1007/978-981-19-7867-8_26
Download citation
DOI: https://doi.org/10.1007/978-981-19-7867-8_26
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-7866-1
Online ISBN: 978-981-19-7867-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)