Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2339530.2339634acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Testing the significance of spatio-temporal teleconnection patterns

Published: 12 August 2012 Publication History

Abstract

Dipoles represent long distance connections between the pressure anomalies of two distant regions that are negatively correlated with each other. Such dipoles have proven important for understanding and explaining the variability in climate in many regions of the world, e.g., the El Nino climate phenomenon is known to be responsible for precipitation and temperature anomalies over large parts of the world. Systematic approaches for dipole detection generate a large number of candidate dipoles, but there exists no method to evaluate the significance of the candidate teleconnections. In this paper, we present a novel method for testing the statistical significance of the class of spatio-temporal teleconnection patterns called as dipoles. One of the most important challenges in addressing significance testing in a spatio-temporal context is how to address the spatial and temporal dependencies that show up as high autocorrelation. We present a novel approach that uses the wild bootstrap to capture the spatio-temporal dependencies, in the special use case of teleconnections in climate data. Our approach to find the statistical significance takes into account the autocorrelation, the seasonality and the trend in the time series over a period of time. This framework is applicable to other problems in spatio-temporal data mining to assess the significance of the patterns.

Supplementary Material

JPG File (307_t_talk_2.jpg)
MP4 File (307_t_talk_2.mp4)

References

[1]
J. M. Wallace and D. S. Gutzler. Téléconnexions in the geopotential height field during the northern hemisphere winter. Mon. Wea. Rev, 109:784--812, 1981.
[2]
H. Von Storch and F. W. Zwiers. Statistical analysis in climate research. Cambridge Univ Pr, 2002.
[3]
J. Kawale, S. Liess, A. Kumar, M. Steinbach, A. Ganguly, N. F. Samatova, F. Semazzi, P. Snyder, and V. Kumar. Data guided discovery of dynamic climate dipoles. In CIDU, pages 30--44, 2011.
[4]
J. Kawale, M. Steinbach, and V. Kumar. Discovering dynamic dipoles in climate data. In SIAM Conference on Data Mining, SDM, pages 107--118, 2011.
[5]
M. Steinbach, P.N. Tan, V. Kumar, S. Klooster, and C. Potter. Discovery of climate indices using clustering. In Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 446--455. ACM, 2003.
[6]
E. S. Edgington. Randomization tests, volume 147. CRC Press, 1995.
[7]
S. Hanhijarvi, G. Garriga, and K. Puolamaki. Randomization techniques for graphs. 2009.
[8]
A. Gionis, H. Mannila, T. Mielikäinen, and P. Tsaparas. Assessing data mining results via swap randomization. ACM Transactions on Knowledge Discovery from Data (TKDD), 1(3):14--es, 2007.
[9]
Nuno Castro and Paulo J. Azevedo. Time series motifs statistical significance. In SDM, pages 687--698, 2011.
[10]
P. Ferreira, P. Azevedo, C. Silva, and R. Brito. Mining approximate motifs in time series. In Discovery Science, pages 89--101. Springer, 2006.
[11]
T. Oates. Peruse: An unsupervised algorithm for finding recurrig patterns in time series. 2002.
[12]
W. Ulrich and N.J. Gotelli. Null model analysis of species nestedness patterns. Ecology, 88(7):1824--1831, 2007.
[13]
A. Waldron. Null models of geographic range size evolution reaffirm its heritability. American Naturalist, pages 221--231, 2007.
[14]
J. A. Veech. A null model for detecting nonrandom patterns of species richness along spatial gradients. Ecology, 81(4):1143--1149, 2000.
[15]
J. Besag and P. J. Diggle. Simple monte carlo tests for spatial pattern. Applied Statistics, pages 327--333, 1977.
[16]
D. S. Wilks. Statistical methods in the atmospheric sciences, volume 100. Academic press, 2011.
[17]
R. F. Engle and C. W. J. Granger. Co-integration and error correction: representation, estimation, and testing. Econometrica: Journal of the Econometric Society, pages 251--276, 1987.
[18]
C. W. J. Granger. Some properties of time series data and their use in econometric model specification. Journal of econometrics, 16(1):121--130, 1981.
[19]
E. C. Weatherhead, G. C. Reinsel, G. C. Tiao, X. L. Meng, D. Choi, W. K. Cheang, T. Keller, J. DeLuisi, D. J. Wuebbles, J. B. Kerr, et al. Factors affecting the detection of trends: Statistical considerations and applications to environmental data. Journal of Geophysical Research, 103(D14):17--149, 1998.
[20]
C. F. J. Wu. Jackknife, bootstrap and other resampling methods in regression analysis. The Annals of Statistics, 14(4):1261--1295, 1986.
[21]
E. Mammen. Bootstrap and wild bootstrap for high dimensional linear models. The Annals of Statistics, 21(1):255--285, 1993.
[22]
R. A. Fisher et al. On the "probable error" of a coefficient of correlation deduced from a small sample. Metron, 1:3--32, 1921.
[23]
W. Hardle and E. Mammen. Comparing nonparametric versus parametric regression fits. The Annals of Statistics, 21(4):1926--1947, 1993.
[24]
Y. Benjamini and D. Yekutieli. The control of the false discovery rate in multiple testing under dependency. Annals of statistics, pages 1165--1188, 2001.
[25]
E. Kalnay and et al. The ncep/ncar 40-year reanalysis project. Bull. Amer. Meteor. Soc., 77:437--471, 1996.
[26]
Climate prediction centre, http://www.cpc.ncep.noaa.gov/.
[27]
J. E. Janowiak. An investigation of interannual rainfall variability in africa. Journal of Climate, 1:240--255, 1988.
[28]
K. M. Hines, D. H. Bromwich, and G. J. Marshall. Artificial surface pressure trends in the ncep-ncar reanalysis over the southern ocean and antarctica. J. Climate, 13(22):3940--3952, 2000.

Cited By

View all
  • (2020)Exploring Correlation Network for Cheating DetectionACM Transactions on Intelligent Systems and Technology10.1145/336422111:1(1-23)Online publication date: 17-Jan-2020
  • (2020)Discovering Interesting Subpaths with Statistical Significance from Spatiotemporal DatasetsACM Transactions on Intelligent Systems and Technology10.1145/335418911:1(1-24)Online publication date: 9-Jan-2020
  • (2020)Data-Driven Approaches for Spatio-Temporal Analysis: A Survey of the State-of-the-ArtsJournal of Computer Science and Technology10.1007/s11390-020-9349-035:3(665-696)Online publication date: 29-May-2020
  • Show More Cited By

Index Terms

  1. Testing the significance of spatio-temporal teleconnection patterns

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2012
    1616 pages
    ISBN:9781450314626
    DOI:10.1145/2339530
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 August 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tag

    1. significance testing

    Qualifiers

    • Research-article

    Conference

    KDD '12
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)12
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 07 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)Exploring Correlation Network for Cheating DetectionACM Transactions on Intelligent Systems and Technology10.1145/336422111:1(1-23)Online publication date: 17-Jan-2020
    • (2020)Discovering Interesting Subpaths with Statistical Significance from Spatiotemporal DatasetsACM Transactions on Intelligent Systems and Technology10.1145/335418911:1(1-24)Online publication date: 9-Jan-2020
    • (2020)Data-Driven Approaches for Spatio-Temporal Analysis: A Survey of the State-of-the-ArtsJournal of Computer Science and Technology10.1007/s11390-020-9349-035:3(665-696)Online publication date: 29-May-2020
    • (2019) Dimensionality Reduction and Network Inference for Climate Data Using δ ‐MAPS: Application to the CESM Large Ensemble Sea Surface Temperature Journal of Advances in Modeling Earth Systems10.1029/2019MS00165411:6(1479-1515)Online publication date: 4-Jun-2019
    • (2018)Network Structure Inference, A SurveyACM Computing Surveys10.1145/315452451:2(1-39)Online publication date: 17-Apr-2018
    • (2018)Galaxy: Towards Scalable and Interpretable Explanation on High-Dimensional and Spatio-Temporal Correlated Climate Data2018 IEEE International Conference on Big Knowledge (ICBK)10.1109/ICBK.2018.00027(146-153)Online publication date: Nov-2018
    • (2018)Enhanced shared nearest neighbor clustering approach using fuzzy for teleconnection analysisSoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-017-2767-422:24(8243-8258)Online publication date: 1-Dec-2018
    • (2017)Transdisciplinary Foundations of Geospatial Data ScienceISPRS International Journal of Geo-Information10.3390/ijgi61203956:12(395)Online publication date: 1-Dec-2017
    • (2016)WISDOM: Weighted incremental spatio-temporal multi-task learning via tensor decomposition2016 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2016.7840643(522-531)Online publication date: Dec-2016
    • (2015)Spatiotemporal Data Mining: A Computational PerspectiveISPRS International Journal of Geo-Information10.3390/ijgi40423064:4(2306-2338)Online publication date: 28-Oct-2015
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media