Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2623330.2623624acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

FUNNEL: automatic mining of spatially coevolving epidemics

Published: 24 August 2014 Publication History

Abstract

Given a large collection of epidemiological data consisting of the count of d contagious diseases for l locations of duration n, how can we find patterns, rules and outliers? For example, the Project Tycho provides open access to the count infections for U.S. states from 1888 to 2013, for 56 contagious diseases (e.g., measles, influenza), which include missing values, possible recording errors, sudden spikes (or dives) of infections, etc. So how can we find a combined model, for all these diseases, locations, and time-ticks? In this paper, we present FUNNEL, a unifying analytical model for large scale epidemiological data, as well as a novel fitting algorithm, FUNNELFIT, which solves the above problem. Our method has the following properties: (a) Sense-making: it detects important patterns of epidemics, such as periodicities, the appearance of vaccines, external shock events, and more; (b) Parameter-free: our modeling framework frees the user from providing parameter values; (c) Scalable: FUNNELFIT is carefully designed to be linear on the input size; (d) General: our model is general and practical, which can be applied to various types of epidemics, including computer-virus propagation, as well as human diseases. Extensive experiments on real data demonstrate that FUNNELFIT does indeed discover important properties of epidemics: (P1) disease seasonality, e.g., influenza spikes in January, Lyme disease spikes in July and the absence of yearly periodicity for gonorrhea; (P2) disease reduction effect, e.g., the appearance of vaccines; (P3) local/state-level sensitivity, e.g., many measles cases in NY; (P4) external shock events, e.g., historical flu pandemics; (P5) detect incongruous values, i.e., data reporting errors.

Supplementary Material

MP4 File (p105-sidebyside.mp4)

References

[1]
Promotion of healthy swimming after a statewide outbreak of cryptosporidiosis associated with recreational water venues--utah, 2008-2009. MMWR Morb Mortal Wkly Rep, 61(19):348--52, 2012.
[2]
R. M. Anderson and R. M. May. Infectious Diseases of Humans Dynamics and Control. Oxford University Press, 1992.
[3]
C. Böhm, C. Faloutsos, J.-Y. Pan, and C. Plant. Ric: Parameter-free noise-robust clustering. TKDD, 1(3), 2007.
[4]
G. E. Box, G. M. Jenkins, and G. C. Reinsel. Time Series Analysis: Forecasting and Control. Prentice Hall, Englewood Cliffs, NJ, 3rd edition, 1994.
[5]
D. CC. Smallpox in the united states: It's decline and geographic distribution. Public Health Reports, 55(50):2303--2312, 1940.
[6]
L. Chen and R. T. Ng. On the marriage of lp-norms and edit distance. In VLDB, pages 792--803, 2004.
[7]
I. N. Davidson, S. Gilpin, O. T. Carmichael, and P. B. Walker. Network discovery via constrained tensor analysis of fmri data. In KDD, pages 194--202, 2013.
[8]
D. J. Earn, P. Rohani, B. M. Bolker, and B. T. Grenfell. A simple model for complex dynamical transitions in epidemics. Science, 287(5453):667--70, 2000.
[9]
B. T. Grenfell, O. N. Bjornstad, and J. Kappey. Travelling waves and spatial hierarchies in measles epidemics. Nature, 414:716, 2001.
[10]
A. Jain, E. Y. Chang, and Y.-F. Wang. Adaptive stream resource management using kalman filters. In SIGMOD, pages 11--22, 2004.
[11]
J. Kephart and S. White. Directed-graph epidemiological models of computer viruses. In Research in Security and Privacy, 1991. Proceedings., 1991 IEEE Computer Society Symposium on, pages 343--359, May 1991.
[12]
R. Kumar, M. Mahdian, and M. McGlohon. Dynamics of conversations. In KDD, pages 553--562, 2010.
[13]
J.-G. Lee, J. Han, and K.-Y. Whang. Trajectory clustering: a partition-and-group framework. In SIGMOD, pages 593--604, 2007.
[14]
J. Leskovec, L. Backstrom, R. Kumar, and A. Tomkins. Microscopic evolution of social networks. In KDD, pages 462--470, 2008.
[15]
L. Li, B. A. Prakash, and C. Faloutsos. Parsimonious linear fingerprinting for time series. PVLDB, 3(1):385--396, 2010.
[16]
Y. Matsubara, L. Li, E. E. Papalexakis, D. Lo, Y. Sakurai, and C. Faloutsos. F-trail: Finding patterns in taxi trajectories. In PAKDD (1), pages 86--98, 2013.
[17]
Y. Matsubara, Y. Sakurai, and C. Faloutsos. Autoplait: Automatic mining of co-evolving time sequences. In SIGMOD, 2014.
[18]
Y. Matsubara, Y. Sakurai, C. Faloutsos, T. Iwata, and M. Yoshikawa. Fast mining and forecasting of complex time-stamped events. In KDD, pages 271--279, 2012.
[19]
Y. Matsubara, Y. Sakurai, B. A. Prakash, L. Li, and C. Faloutsos. Rise and fall patterns of information diffusion: model and implications. In KDD, pages 6--14, 2012.
[20]
F. NM, G. AP, and B. RM. Ecological and immunological determinants of influenza evolution. Nature, 422(6930):428--33, 2003.
[21]
S. Papadimitriou and P. S. Yu. Optimal multi-scale patterns in time series streams. In SIGMOD Conference, pages 647--658, 2006.
[22]
F. PE and C. JA. Measles in england and wales--i: An analysis of factors underlying seasonal patterns. Epidemiol, 11(1):5--14, 1982.
[23]
B. A. Prakash, A. Beutel, R. Rosenfeld, and C. Faloutsos. Winner takes all: competing viruses or ideas on fair-play networks. In WWW, pages 1037--1046, 2012.
[24]
B. A. Prakash, D. Chakrabarti, M. Faloutsos, N. Valler, and C. Faloutsos. Threshold conditions for arbitrary cascade models on arbitrary networks. In ICDM, pages 537--546, 2011.
[25]
T. Rakthanmanon, B. J. L. Campana, A. Mueen, G. E. A. P. A. Batista, M. B. Westover, Q. Zhu, J. Zakaria, and E. J. Keogh. Searching and mining trillions of time series subsequences under dynamic time warping. In KDD, pages 262--270, 2012.
[26]
Y. Sakurai, S. Papadimitriou, and C. Faloutsos. Braid: Stream mining th rough group lag correlations. In SIGMOD, pages 599--610, 2005.
[27]
D. SF. Seasonal variation in host susceptibility and cycles of certain infectious diseases. Emerg Infect Dis., 7(3):369--74, 2001.
[28]
M. SM, E. RJ, M. A, and M. P. Seasonality in six enterically transmitted diseases and ambient temperature. Am J Trop Med Hyg., 2014.
[29]
L. Stone, R. Olinky, and A. Huppert. Seasonal dynamics of recurrent epidemics. Nature, 446:533--536, March 2007.
[30]
J. Sun, D. Tao, and C. Faloutsos. Beyond streams and graphs: dynamic tensor analysis. In KDD, pages 374--383, 2006.
[31]
Y. Tao, C. Faloutsos, D. Papadias, and B. Liu. Prediction and indexing of moving objects with unknown motion patterns. In SIGMOD, pages 611--622, 2004.
[32]
W. G. van Panhuis, J. Grefenstette, S. Y. Jung, N. S. Chok, A. Cross, H. Eng, B. Y. Lee, V. Zadorozhny, S. Brown, D. Cummings, and D. S. Burke. Contagious diseases in the united states from 1888 to the present. NEJM, 369(22):2152--2158, 2013.
[33]
M. Vlachos, D. Gunopulos, and G. Kollios. Discovering similar multidimensional trajectories. In ICDE, pages 673--684, 2002.

Cited By

View all
  • (2024)Predicting Assembly Geometric Errors Based on Transformer Neural NetworksMachines10.3390/machines1203016112:3(161)Online publication date: 27-Feb-2024
  • (2024)TS-Fastformer: Fast Transformer for Time-series ForecastingACM Transactions on Intelligent Systems and Technology10.1145/363063715:2(1-20)Online publication date: 22-Feb-2024
  • (2024)Asformer: Learning From Adjacent ScaleICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10445968(5900-5904)Online publication date: 14-Apr-2024
  • Show More Cited By

Index Terms

  1. FUNNEL: automatic mining of spatially coevolving epidemics

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '14: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2014
    2028 pages
    ISBN:9781450329569
    DOI:10.1145/2623330
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 August 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. automatic mining
    2. epidemics
    3. time-series

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    KDD '14
    Sponsor:

    Acceptance Rates

    KDD '14 Paper Acceptance Rate 151 of 1,036 submissions, 15%;
    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)34
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 30 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Predicting Assembly Geometric Errors Based on Transformer Neural NetworksMachines10.3390/machines1203016112:3(161)Online publication date: 27-Feb-2024
    • (2024)TS-Fastformer: Fast Transformer for Time-series ForecastingACM Transactions on Intelligent Systems and Technology10.1145/363063715:2(1-20)Online publication date: 22-Feb-2024
    • (2024)Asformer: Learning From Adjacent ScaleICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP48485.2024.10445968(5900-5904)Online publication date: 14-Apr-2024
    • (2024)GRAformer: A gated residual attention transformer for multivariate time series forecastingNeurocomputing10.1016/j.neucom.2024.127466581(127466)Online publication date: May-2024
    • (2024)MDCNet: Long-term time series forecasting with mode decomposition and 2D convolutionKnowledge-Based Systems10.1016/j.knosys.2024.111986(111986)Online publication date: May-2024
    • (2023)Limited resource allocation in a non-Markovian worldProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/660(5950-5958)Online publication date: 19-Aug-2023
    • (2023)A Recurrent Neural Network based Generative Adversarial Network for Long Multivariate Time Series ForecastingProceedings of the 2023 ACM International Conference on Multimedia Retrieval10.1145/3591106.3592306(181-189)Online publication date: 12-Jun-2023
    • (2023)KAE-Informer: A Knowledge Auto-Embedding Informer for Forecasting Long-Term Workloads of MicroservicesProceedings of the ACM Web Conference 202310.1145/3543507.3583288(1551-1561)Online publication date: 30-Apr-2023
    • (2023)Look Ahead: Improving the Accuracy of Time-Series Forecasting by Previewing Future Time FeaturesProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3592013(2134-2138)Online publication date: 19-Jul-2023
    • (2023)MLGNet: A Multi-Period Local and Global Temporal Dynamic Pattern Integration Network for Long-Term Forecasting2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC)10.1109/SMC53992.2023.10394530(4028-4033)Online publication date: 1-Oct-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media