A Two-Stage Approach for Bag Detection in Pedestrian Images

Du, Yuning; Ai, Haizhou; Lao, Shihong

doi:10.1007/978-3-319-16817-3_33

Yuning Du¹⁷,
Haizhou Ai¹⁷ &
Shihong Lao¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9006))

Included in the following conference series:

Asian Conference on Computer Vision

2378 Accesses
1 Citations

Abstract

Bag detection in pedestrian images is a very practical visual surveillance problem. It is challenging because bag appearance may vary greatly. In this paper, we propose a novel two-stage approach for bag detection in pedestrian images. Firstly, we utilize two stripe vocabulary forests to check whether a pedestrian is with a bag. Secondly, we locate the bag location by ranking the generated bottom-up region proposals. The ranker is learned with a convolutional neural network (CNN). Experiments are performed on a subset of CUHK person re-identification dataset that show the effectiveness of our approach for bag detection in pedestrian images. Although developed for a specific problem, our approach could be applied to detect other carrying objects in pedestrian images.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Bi-box Regression for Pedestrian Detection and Occlusion Estimation

Detection and Recognition of Badgers Using Deep Learning

Bag Detection and Retrieval in Street Shots

References

Zhao, R., Ouyang, W., Wang, X.: Unsupervised salience learning for person re-identification. In: CVPR (2013)
Google Scholar
Li, W., Wang, X.: Locally aligned feature transforms across views. In: CVPR (2013)
Google Scholar
Ma, B., Su, Y., Jurie, F.: BiCov: a novel image representation for person re-identification and face verification. In: BMVC (2012)
Google Scholar
Li, W., Zhao, R., Wang, X.: Human reidentification with transferred metric learning. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds.) ACCV 2012, Part I. LNCS, vol. 7724, pp. 31–44. Springer, Heidelberg (2013)
Chapter Google Scholar
Zheng, W., Gong, S., Xiang, T.: Transfer re-identification: from person to set-based verification. In: CVPR (2012)
Google Scholar
Hirzer, M., Roth, P.M., Köstinger, M., Bischof, H.: Relaxed pairwise learned metric for person re-identification. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 780–793. Springer, Heidelberg (2012)
Chapter Google Scholar
Wu, Y., Minoh, M., Mukunoki, M., Lao, S.: Set based discriminative ranking for recognition. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part III. LNCS, vol. 7574, pp. 497–510. Springer, Heidelberg (2012)
Chapter Google Scholar
Gray, D., Tao, H.: Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 262–275. Springer, Heidelberg (2008)
Chapter Google Scholar
Zheng, W., Gong, S., Xiang, T.: Person re-identification by probabilistic relative distance comparison. In: CVPR (2011)
Google Scholar
Satta, R., Fumera, G., Roli, F.: A general method for appearance-based people search based on textual queries. In: Fusiello, A., Murino, V., Cucchiara, R. (eds.) ECCV 2012. LNCS, pp. 453–461. Springer, Heidelberg (2012)
Chapter Google Scholar
Layne, R., Hospedales, T.M., Gong, S.: Towards person identification and re-identification with attributes. In: Fusiello, A., Murino, V., Cucchiara, R. (eds.) ECCV 2012. LNCS, vol. 7583, pp. 402–412. Springer, Heidelberg (2012)
Chapter Google Scholar
Damen, D., Hogg, D.: Detecting carried objects from sequences of walking pedestrians. IEEE Trans. PAMI 34, 1056–1067 (2012)
Article Google Scholar
Damen, D., Hogg, D.C.: Detecting carried objects in short video sequences. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 154–167. Springer, Heidelberg (2008)
Chapter Google Scholar
BenAbdelkader, C., Davis, L.: Detection of people carrying objects: a motion-based recognition approach. In: FG (2002)
Google Scholar
Haritaoglu, I., Cutler, R., Harwood, D., Davis, L.: Backpack: detection of people carrying objects using silhouettes. In: ICCV (1999)
Google Scholar
Uijlings, J., Sande, K., Gevers, T., Smeulders, A.: Selective search for object recognition. Int. J. Comput. Vis. 104, 154–171 (2013)
Article Google Scholar
Bourdev, L., Maji, S., Malik, J.: Describing people: a poselet-based approach to attribute classification. In: ICCV (2011)
Google Scholar
Baltieri, D., Vezzani, R., Cucchiara, R.: People orientation recognition by mixtures of wrapped distributions on random trees. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 270–283. Springer, Heidelberg (2012)
Chapter Google Scholar
Cao, L., Dikmen, M., Fu, Y., Huang, T.: Gender recognition from body. In: ACM MM (2008)
Google Scholar
Alexe, B., Deselaers, T., Ferrari, V.: Measuring the objectness of image windows. IEEE Trans. PAMI 34, 2189–2202 (2012)
Article Google Scholar
Endres, I., Hoiem, D.: Category independent object proposals. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 575–588. Springer, Heidelberg (2010)
Chapter Google Scholar
Alex, K., Ilya, S., Geoffrey, H.: Imagenet classification with deep convolutional neural networks. In: NIPS (2012)
Google Scholar
Alex, K.: Cuda-convnet. (https://code.google.com/p/cuda-convnet/)
Wang, X., Hua, G., Han, T.: Detection by detections: non-parametric detector adaptation for a video. In: CVPR (2012)
Google Scholar
Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: CVPR (2006)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR (2005)
Google Scholar
Hsu, C., Chang, C., Lin, C.: A practical guide to support vector classification. Technical report, Department of Computer Science, National Taiwan University (2003)
Google Scholar
Fan, R., Chang, K., Hsieh, C., Wang, X., Lin, C.: LIBLINEAR: a library for large linear classification. JMLR 9, 1871–1874 (2008)
MATH Google Scholar
Amer, M.R., Xie, D., Zhao, M., Todorovic, S., Zhu, S.-C.: Cost-sensitive top-down/bottom-up inference for multiscale activity recognition. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part IV. LNCS, vol. 7575, pp. 187–200. Springer, Heidelberg (2012)
Chapter Google Scholar
Zhu, Y., Nayak, N., Roy-Chowdhury, A.: Context-aware modeling and recognition of activities in video. In: CVPR (2013)
Google Scholar
Bhargava, M., Chen, C., Ryoo, M., Aggarwal, J.: Detection of object abandonment using temporal logic. Mach. Vis. Appl. 20, 271–281 (2009)
Article Google Scholar

Download references

Acknowledgement

This work is supported in part by National Basic Research Program of China under Grant No.2011CB302203, and it is also supported by a grant from OMRON Corporation.

Author information

Authors and Affiliations

Computer Science and Technology Department, Tsinghua University, Beijing, China
Yuning Du & Haizhou Ai
OMRON Social Solutions Co., LTD, Tokyo, Japan
Shihong Lao

Authors

Yuning Du
View author publications
You can also search for this author in PubMed Google Scholar
Haizhou Ai
View author publications
You can also search for this author in PubMed Google Scholar
Shihong Lao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuning Du .

Editor information

Editors and Affiliations

Technische Universität München, Garching, Bayern, Germany
Daniel Cremers
University of Adelaide, Adelaide, South Australia, Australia
Ian Reid
Keio University, Yokohama, Kanagawa, Japan
Hideo Saito
University of California at Merced, Merced, California, USA
Ming-Hsuan Yang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Du, Y., Ai, H., Lao, S. (2015). A Two-Stage Approach for Bag Detection in Pedestrian Images. In: Cremers, D., Reid, I., Saito, H., Yang, MH. (eds) Computer Vision -- ACCV 2014. ACCV 2014. Lecture Notes in Computer Science(), vol 9006. Springer, Cham. https://doi.org/10.1007/978-3-319-16817-3_33

Download citation

DOI: https://doi.org/10.1007/978-3-319-16817-3_33
Published: 17 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16816-6
Online ISBN: 978-3-319-16817-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Two-Stage Approach for Bag Detection in Pedestrian Images

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Bi-box Regression for Pedestrian Detection and Occlusion Estimation

Detection and Recognition of Badgers Using Deep Learning

Bag Detection and Retrieval in Street Shots

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A Two-Stage Approach for Bag Detection in Pedestrian Images

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Bi-box Regression for Pedestrian Detection and Occlusion Estimation

Detection and Recognition of Badgers Using Deep Learning

Bag Detection and Retrieval in Street Shots

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation