Abstract
Detecting humans in films and videos is a challenging problem owing to the motion of the subjects, the camera and the background and to variations in pose, appearance, clothing, illumination and background clutter. We develop a detector for standing and moving people in videos with possibly moving cameras and backgrounds, testing several different motion coding schemes and showing empirically that orientated histograms of differential optical flow give the best overall performance. These motion-based descriptors are combined with our Histogram of Oriented Gradient appearance descriptors. The resulting detector is tested on several databases including a challenging test set taken from feature films and containing wide ranges of pose, motion and background variations, including moving cameras and backgrounds. We validate our results on two challenging test sets containing more than 4400 human examples. The combined detector reduces the false alarm rate by a factor of 10 relative to the best appearance-based detector, for example giving false alarm rates of 1 per 20,000 windows tested at 8% miss rate on our Test Set 1.
Chapter PDF
Similar content being viewed by others
References
Baker, S., Nayar, S.: Pattern rejection. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, San Francisco, California, USA (1996)
Comaniciu, D.: An algorithm for data-driven bandwidth selection. IEEE Transactions on Pattern Analysis and Machine Intelligence 25(2), 281–288 (2003)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, San Diego, California, USA, pp. 886–893 (2005)
Efros, A., Berg, A., Mori, G., Malik, J.: Recognizing action at a distance. In: Proceedings of the 9th International Conference on Computer Vision, Nice, France, vol. II, pp. 726–733 (2003)
Fleet, D., Jepson, A.: Stability of phase information. IEEE Transactions on Pattern Analysis and Machine Intelligence 15(12), 1253–1268 (1993)
Galvin, B., McCane, B., Novins, K., Mason, D., Mills, S.: Recovering motion fields: An evaluation of eight optical flow algorithms. In: Proceedings of the ninth British Machine Vision Conference, Southampton, England (1998), http://www.cs.otago.ac.nz/research/vision
Gavrila, D.M.: The visual analysis of human movement: A survey. Computer Vision and Image Understanding 73(1), 82–98 (1999)
Gavrila, D.M., Giebel, J., Munder, S.: Vision-based pedestrian detection: the protector+ system. In: Proc. of the IEEE Intelligent Vehicles Symposium, Parma, Italy (2004)
Gavrila, D.M., Philomin, V.: Real-time object detection for smart vehicles. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, Fort Collins, Colorado, USA, pp. 87–93 (1999)
Haritaoglu, I., Harwood, D., Davis, L.: W4: Real-time surveillance of people and their activities. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8), 809–830 (2000)
Horn, K.P., Schunck, G.: Determining optical flow. Artificial Intelligence 17, 185–203 (1981)
Joachims, T.: Making large-scale svm learning practical. In: Schölkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning, MIT Press, Cambridge (1999)
Leibe, B., Seemann, E., Schiele, B.: Pedestrian detection in crowded scenes. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, San Diego, California, USA, June 2005, pp. 876–885 (2005)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Mikolajczyk, K., Schmid, C., Zisserman, A.: Human detection based on a probabilistic assembly of robust part detectors. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 69–82. Springer, Heidelberg (2004)
Mohan, A., Papageorgiou, C., Poggio, T.: Example-based object detection in images by components. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(4), 349–361 (2001)
Papageorgiou, C., Poggio, T.: A trainable system for object detection. International Journal of Computer Vision 38(1), 15–33 (2000)
Proesmans, M., Van Gool, L., Pauwels, E., Oosterlinck, A.: Determination of optical flow and its discontinuities using non-linear diffusion. In: Eklundh, J.-O. (ed.) ECCV 1994. LNCS, vol. 801, pp. 295–304. Springer, Heidelberg (1994)
Schapire, R.E.: The boosting approach to machine learning, an overview. In: MSRI Workshop on Nonlinear Estimation and Classification (2002)
Schölkopf, B., Smola, A.: Learning with Kernels. The MIT Press, Cambridge (2002)
Sun, J., Rehg, J.M., Bobick, A.: Automatic cascade training with perturbation bias. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, Washington, DC, USA, vol. II, pp. 276–283 (2004)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the Conference on Computer Vision and Pattern Recognition, Kauai, Hawaii, USA, vol. I, pp. 511–518 (2001)
Viola, P., Jones, M.J., Snow, D.: Detecting pedestrians using patterns of motion and appearance. In: Proceedings of the 9th International Conference on Computer Vision, Nice, France, vol. 1, pp. 734–741 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dalal, N., Triggs, B., Schmid, C. (2006). Human Detection Using Oriented Histograms of Flow and Appearance. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3952. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744047_33
Download citation
DOI: https://doi.org/10.1007/11744047_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33834-5
Online ISBN: 978-3-540-33835-2
eBook Packages: Computer ScienceComputer Science (R0)