Abstract
This paper presents a framework for event detection and video content analysis for visual surveillance applications. The system is able to coordinate the tracking of objects between multiple camera views, which may be overlapping or non-overlapping. The key novelty of our approach is that we can automatically learn a semantic scene model for a surveillance region, and have defined data models to support the storage of tracking data with different layers of abstraction into a surveillance database. The surveillance database provides a mechanism to generate video content summaries of objects detected by the system across the entire surveillance region in terms of the semantic scene model. In addition, the surveillance database supports spatio-temporal queries, which can be applied for event detection and notification applications.
Similar content being viewed by others
References
Black J, Ellis TJ (2002) Multi-view image surveillance and tracking. IEEE Workshop on Motion and Video Computing, Orlando, December 2002, pp 169–174
Black J, Ellis TJ, Rosin P (2003) A novel method for video tracking performance evaluation. The Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS). Nice, France, October 2003
Black J, Ellis TJ, Makris D (2004) A hierarchical database for visual surveillance applications. IEEE International Conference on Multimedia and Expo (ICME2004). Taipei, Taiwan
Cai Q, Aggarwal JK (1999) Tracking human motion in structured environments using a distributed camera system. IEEE Pattern Analysis and Machine Intelligence (PAMI) 2(11):1241–1247
Cai X, Ali F, Stipidis E (2003) MPEG4 over local area mobile surveillance system. IEE Intelligent Distributed Surveillance Systems London, UK
Chang T, Gong S (2001) Tracking multiple people with a multi-camera system. IEEE Workshop on Multi-Object Tracking (WOMOT01), Vancouver, Canada, July 2001, 19–28
Collins RT, Lipton AJ, Fujiyoshi H, Kanade T (2001) Algorithms for cooperative multisensor surveillance. Proc IEEE 89(10):1456–1476
Guler S, Liang WH, Pushee I (2003) A video event detection and mining framework. In: IEEE workshop on event mining detection and recognition of events in video, June 2003
Ivanov YA, Bobick AF (2000) Recognition of visual activities and interactions by stochastic parsing. IEEE Pattern Anal Mach Intell (PAMI) 22(8):852–872
Javed O, Shah M (2003) KNIGHT: a multi-camera surveillance system. In: IEEE international conference on multimedia and expo, July 2003, Baltimore
Katz B, Lin J, Stauffer C, Grimson E (2003) Answering questions about moving objects in surveillance videos. In: Proceedings of the 2003 Spring symposium on new directions in question answering, March 2003
Kogut GT, Trivedi MM (2001) Maintaining the identity of multiple vehicles as they travel through a video network. In: IEEE Workshop on Multi-Object Tracking (WOMOT01), Vancouver, Canada, July 2001, pp 29–34
Lee L, Romano R, Stein G (2000) Monitoring activities from multiple video streams: establishing a common coordinate frame. IEEE Pattern Analysis and Machine Intelligence (PAMI) 22(8):758–767
Makris D, Ellis T (2001) “Finding Paths in Video Sequences”, British machine vision conference BMVC2001, vol 1. University of Manchester, UK, pp 263–272
Makris D, Ellis T (2002) Path detection in video surveillance. Image and Vision Computing Journal. Elsevier, 20:895–903
Makris D, Ellis T (2002) Spatial and probabilistic modelling of pedestrian behaviour. British Machine Vision Conference (BMVC2002), vol. 2. Cardiff University, UK, pp 557–566
Makris D, Ellis T (2003) Automatic learning of an activity-based semantic scene model. IEEE Conference on Advanced Video and Signal Based Surveillance (AVSB 2003) Miami, July 2003
Martinez JM, Koenen R, Pereira F (2002) MPEG-7 the generic multimedia content description standard, part 1. IEEE Multimedia 9(2):78–87
Medioni G, Cohen I, Bremod F, Hongeng S, Nevatia R (2001) Event detection and analysis from video streams. IEEE Pattern Analysis and Machine Intelligence (PAMI) 23(8):873–889
Trivedi M, Bhonsel S, Gupta A (2001) Database architecture for autonomous transportation agents for on-scene networked incident management (ATON). International Conference on Pattern Recognition (ICPR) Barcelona, Spain, pp 4664–4667
Remagnino P, Jones GA (2001) Classifying surveillance events and attributes and behavior. British Machine Vision Conference (BMVC2001), Manchester, pp 685–694
Xu M, Ellis TJ (2001) Illumination-invariant motion detection using color mixture models. British Machine Vision Conference (BMVC 2001) Manchester, pp 163–172
Xu M, Ellis TJ (2002) Partial observation vs blind tracking through occlusion. British Machine Vision Conference (BMVC 2002), Cardiff, pp 777–786
Acknowledgements
We would like to acknowledge support from the UK Engineering and Physical Science Research Council (EPSRC) under grant number GR/M58030. Thanks also to Ming Xu.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix
Biographies
Dr James Black received the BEng and PhD degrees in computer systems engineering and information engineering in 1994 and 2004, respectively, from City University. He has worked both in industry and academia. His research has primarily focussed on multi-view image surveillance using a combination of motion and colour cues, and performance evaluation of video-tracking algorithms. His research interests are in computer vision, pattern recognition and machine learning. He is an associate member of the IEEE computer society.
Dr Dimitrios Makris received his first degree (Diploma) in Electrical and Computer Engineering from Aristotle University of Thessaloniki in 1999 and his PhD in Computer Vision from City University, London in 2004. He joined the School of Computing and Information Systems at Kingston University as a research assistant in October 2003 and he was promoted to lecturer in August 2004. His research interests are in the area of image processing, computer vision and machine learning. His experience is mainly in the area of motion and activity analysis, where he worked on 3D vision and on automated visual surveillance. He is a member of the Institute of Electrical& Electronic Engineers (IEEE), the Institute of Electrical Engineers (IEE) and the British Machine Vision Association (BMVA).
Professor Tim Ellis was appointed professor and Head of School of Computing and Information Systems at Kingston University in August 2003, where he is a member of the Digital Imaging Research Centre (DIRC). His research interests include: multi-camera video surveillance, real-time vision hardware, feature extraction in complex environments, colour-based object recognition, object tracking, real-time implementation of algorithms and the use of multiple cameras for extracting 3D measurements. His research has been funded by: an SERC Advanced Fellowship in Intelligent Instrumentation (1984–1999); the Scientific Research and Development Branch of the Home Office for a project in security surveillance (1986–1988); EPSRC IMCASM—Intelligent Multi-Camera Surveillance and Monitoring (GR/GR/M58030) and Hybrid Vision and Active Sensor Approach to Monitoring and Inspection Applications (GR/H41898), the Royal Society (1995–1997) and Electricite de France (1988–1991). He has been a member of the Executive Committee of the British Machine Vision Association since 1991 and was Chairman from 1997–2000. He was Co-Chair for BMVC2004 at Kingston University.
Rights and permissions
About this article
Cite this article
Black, J., Makris, D. & Ellis, T. Hierarchical database for a multi-camera surveillance system. Pattern Anal Applic 7, 430–446 (2004). https://doi.org/10.1007/s10044-005-0243-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-005-0243-8