Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Anomaly detection using manifold embedding and its applications in transportation corridors

Published: 01 August 2009 Publication History
  • Get Citation Alerts
  • Abstract

    The formation of secure transportation corridors, where cargoes and shipments from points of entry can be dispatched safely to highly sensitive and secure locations, is a high national priority. One of the key tasks of the program is the detection of anomalous cargo based on sensor readings in truck weigh stations. Due to the high variability, dimensionality, and/or noise content of sensor data in transportation corridors, appropriate feature representation is crucial to the success of anomaly detection methods in this domain. In this paper, we empirically investigate the usefulness of manifold embedding methods for feature representation in anomaly detection problems in the domain of transportation corridors. We focus on both linear methods, such as multi-dimensional scaling (MDS), as well as nonlinear methods, such as locally linear embedding (LLE) and isometric feature mapping (ISOMAP). Our study indicates that such embedding methods provide a natural mechanism for keeping anomalous points away from the dense/normal regions in the embedding of the data. We illustrate the efficacy of manifold embedding methods for anomaly detection through experiments on simulated data as well as real truck data from weigh stations.

    References

    [1]
    A.M. Andrew, Statistical pattern recognition, Robotica 18(2) (2000), 219-223.
    [2]
    A. Banerjee, S. Merugu, I. Dhillon and J. Ghosh, Clustering with bregman divergences, JMLR 6 (2005), 1705-1749.
    [3]
    D. Barbara, Y. Li, J. Couto, J.-L. Lin and S. Jajodia, Bootstrapping a data mining intrusion detection system, In Proceedings of the 2003 ACM Symposium on Applied Computing, pages 421-425. ACM Press, 2003.
    [4]
    M. Bernstein, V. de Silva, J. Langford and J. Tenenbaum, Graph approximations to geodesics on embedded manifolds. Technical report, Stanford University, 2000.
    [5]
    I. Borg and P. Groenen, Modern Multi-dimensioanl Scaling, pringer, 1996.
    [6]
    M.M. Breunig, H.-P. Kriegel, R.T. Ng and Jörg Sander, LOF: identifying density-based local outliers, In ACM International Conference on Management of Data (SIGMOD), pages 93-104, 2000.
    [7]
    C. Chang and C. Lin, Libsvm: a library for support vector machines, 2001.
    [8]
    M. Collins, S. Dasgupta and R. Schapire, A generalization of principal component analysis to the exponential family, In Proc. of the 14th Annual Conference on Neural Information Processing Systems (NIPS), 2001.
    [9]
    D. de Ridder and R. Duin, Locally linear embedding for classification. Technical Report PH-2002-01, Delft University of Technology, 2002.
    [10]
    D. Donoho and C. Grimes, Hessian eigenmaps: Locally linear embedding techniques for highdimensional data, Proceedinsg of the National Academy of Science 100(10) (2003).
    [11]
    R.O. Duda, P.E. Hart and D.G. Stork, Pattern Classification, John Wiley & Sons, 2001.
    [12]
    S. Forrest, S.A. Hofmeyr, A. Somayaji and T.A. Longstaff, A sense of self for unix processes, In Proceedinges of the IEEE Symposium on Research in Security and Privacy, pages 120-128. IEEE Computer Society Press, 1996.
    [13]
    J. Forster and M.K. Warmuth, Relative expected instantaneous loss bounds, In Proc. of the 13th Annual Conference on Computational Learing Theory (COLT), pages 90-99, 2000.
    [14]
    O.C. Jenkins and M.J. Mataric, A spatio-temporal extension to Isomap nonlinear dimensionality reduction, In Proceedings of the 21st International Conference on Machine Learning, 2004.
    [15]
    I. Joliffe, Principal Component Analysis, Springer-Verlag, 1996.
    [16]
    L.H. Kim and D.H. Finkel, Hyperspectral image processing using locally linear embedding. In Conference Proceedings. First International IEEE EMBS Conference on Neural Engineering, pages 316- 319, 2003.
    [17]
    E.M. Knorr and R.T. Ng, A unified notion of outliers: Properties and computation, In Knowledge Discovery and Data Mining, pages 219-222, 1997.
    [18]
    E.M. Knorr and R.T. Ng, Algorithms for mining distance-based outliers in large datasets. In Proc. 24th International Conference on Very Large Data Bases, VLDB, pages 392-403, 24-27, 1998.
    [19]
    D. Kulpinski, Lle and Isomap Analysis of Spectral and Color Images, Master's thesis, Simon Fraser University, 2002.
    [20]
    D. Marchette, A statistical method for profiling network traffic, In First USENIX Workshop on Intrusion Detection and Network Monitoring, pages 119-128, Santa Clara, CA, April 9-12, 1999.
    [21]
    N. Patwari, A.O. Hero and A. Pacholski, Manifold learning visualization of network traffic data, In SIGCOMM Workshop on Mining Network Data, 2005.
    [22]
    C. Phua, D. Alahakoon and V. Lee, Minority report in fraud detection: classification of skewed data, SIGKDD Explor Newsl 6(1) (2004), 50-59.
    [23]
    R. Pless, Image spaces and video trajectories: Using Isomap to explore video sequences, In Proceedings of IEEE International Conference on Computer Vision, 2003.
    [24]
    S. Roweis and L. Saul, Nonlinear dimensionality reduction by locally linear embedding, Science 290 (2000), 2323-2326.
    [25]
    B. Schoelkopf, J.C. Platt, J.C. Shawe-Taylor, A.J. Smola and R.C. Williamson, Estimating the support of a high-dimensional distribution, Neural Comput 13(7) (2001), 1443-1471.
    [26]
    J. Tanenbaum, V. de Silva and J. Langford, A global geometric framework for nonlinear dimensionality reduction, Science 290 (2000), 2319-2323.
    [27]
    T. Tangkuampien and T.-J. Chin, Locally linear embedding for markerless human motion capture using multiple cameras, In Proccedings of Digital Image Computing: Techniques and Applications, 2005.
    [28]
    D. Tax and R. Duin, Data domain description by support vectors, In Proceedings of ESANN, pages 251-256, 1999.
    [29]
    Introduction to Statistical Pattern Recognition. K. Fukunaga. Academic Press, 1990.
    [30]
    V. Chandola, A. Banerjee and V. Kumar, Outlier detection - A survey, In Preparation, 2007.
    [31]
    R. Vilalta and S. Ma, Predicting rare events in temporal domains, In Proceedings of ICDM'02, page 474, Washington, DC, USA, 2002. IEEE Computer Society.
    [32]
    N. Wu and J. Zhang, Factor analysis based anomaly detection, In IEEE Workshop on Information Assurance, West Point, NY, USA, June 2003. United States Military Academy.
    [33]
    X. Yang, H. Fu, H. Zha and J. Barlow, Semi-supervised nonlinear dimensionality reduction, In Proceedings of International Conference on Machine Learning (ICML), 2006.
    [34]
    H. Zha and Z. Zhang, Isometric embedding and continuum ISOMAP, In Proceedings of International Conference on Machine Learning (ICML), 2003.
    [35]
    Lawrence K. Saul and Sam T. Roweis, Think globally, fit locally: unsupervised learning of low dimensional manifolds, In Journal of Machine Learning Research (JMLR), 2003.

    Cited By

    View all
    • (2017)Noisy-free Length Discriminant Analysis with cosine hyperbolic framework for dimensionality reductionExpert Systems with Applications: An International Journal10.1016/j.eswa.2017.03.03481:C(88-107)Online publication date: 15-Sep-2017
    • (2016)Entity embedding-based anomaly detection for heterogeneous categorical eventsProceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence10.5555/3060621.3060815(1396-1403)Online publication date: 9-Jul-2016
    • (2013)Anomaly detection on ITS data via view associationProceedings of the ACM SIGKDD Workshop on Outlier Detection and Description10.1145/2500853.2500859(22-30)Online publication date: 11-Aug-2013
    • Show More Cited By

    Index Terms

    1. Anomaly detection using manifold embedding and its applications in transportation corridors
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Intelligent Data Analysis
      Intelligent Data Analysis  Volume 13, Issue 3
      Knowledge Discovery from Data Streams
      August 2009
      92 pages

      Publisher

      IOS Press

      Netherlands

      Publication History

      Published: 01 August 2009

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 26 Jul 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2017)Noisy-free Length Discriminant Analysis with cosine hyperbolic framework for dimensionality reductionExpert Systems with Applications: An International Journal10.1016/j.eswa.2017.03.03481:C(88-107)Online publication date: 15-Sep-2017
      • (2016)Entity embedding-based anomaly detection for heterogeneous categorical eventsProceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence10.5555/3060621.3060815(1396-1403)Online publication date: 9-Jul-2016
      • (2013)Anomaly detection on ITS data via view associationProceedings of the ACM SIGKDD Workshop on Outlier Detection and Description10.1145/2500853.2500859(22-30)Online publication date: 11-Aug-2013
      • (2013)Mining irregularities in maritime container itinerariesProceedings of the Joint EDBT/ICDT 2013 Workshops10.1145/2457317.2457325(45-51)Online publication date: 18-Mar-2013

      View Options

      View options

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media