Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3437963.3441670acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
extended-abstract

Interpretability and Effectiveness of Machine Learning Methods for Sequence Mining in Various Domains

Published: 08 March 2021 Publication History
  • Get Citation Alerts
  • Abstract

    There is a diverse variety of demographic data that can be analyzed with modern methods of data mining to achieve better results. On the one hand, the main chosen task is to compare different methods for the next event prediction and gender prediction, on the other hand, we pay special attention to interpretable patterns describing demographic behavior in the studied problems. There were considered interpretable methods as decision trees and their ensembles and semi- or non-interpretable methods, such as the SVM method with different customized kernels tailored for demographers' needs and neural networks, respectively. The best accuracy results were obtained with two-channel Convolutional Neural Networks.

    References

    [1]
    Francesco C. Billari, Johannes Fürnkranz, and Alexia Prskawetz. 2006. Timing, Sequencing, and Quantum of Life Course Events: A Machine Learning Approach. European Journal of Population / Revue européenne de Démographie, Vol. 22, 1 (2006), 37--65. https://doi.org/10.1007/s10680-005--5549-0
    [2]
    Aleksey Buzmakov, Elias Egho, Nicolas Jay, Sergei O. Kuznetsov, Amedeo Napoli, and Chedy Ra"i ssi. 2016. On mining complex sequential data by means of FCA and pattern structures. Int. J. Gen. Syst., Vol. 45, 2 (2016), 135--159. https://doi.org/10.1080/03081079.2015.1072925
    [3]
    Guozhu Dong and Jinyan Li. 2018. Emerging Patterns .Springer New York, New York, NY, 1294--1297. https://doi.org/10.1007/978--1--4614--8265--9_145
    [4]
    Danil Gizdatullin, Jaume Baixeries, Dmitry I. Ignatov, Ekaterina Mitrofanova, Anna Muratova, and Thomas H. Espy. 2019. Learning Interpretable Prefix-Based Patterns from Demographic Sequences. In Intelligent Data Processing, Vadim V. Strijov, Dmitry I. Ignatov, and Konstantin V. Vorontsov (Eds.). Springer International Publishing, Cham, 74--91.
    [5]
    Dmitry I. Ignatov, Ekaterina Mitrofanova, Anna Muratova, and Danil Gizdatullin. 2015. Pattern Mining and Machine Learning for Demographic Sequences. In Knowledge Engineering and Semantic Web, Pavel Klinov and Dmitry Mouromtsev (Eds.). Springer International Publishing, Cham, 225--239.
    [6]
    Mehdi Kaytoue, V'i ctor Codocedo, Aleksey Buzmakov, Jaume Baixeries, Sergei O. Kuznetsov, and Amedeo Napoli. 2015. Pattern Structures and Concept Lattices for Data Mining and Knowledge Processing. In Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2015, Porto, Portugal, September 7--11, 2015, Proceedings, Part III (Lecture Notes in Computer Science, Vol. 9286), Albert Bifet, Michael May, Bianca Zadrozny, Ricard Gavaldà, Dino Pedreschi, Francesco Bonchi, Jaime S. Cardoso, and Myra Spiliopoulou (Eds.). Springer, 227--231. https://doi.org/10.1007/978--3--319--23461--8_19
    [7]
    Scott M Lundberg and Su-In Lee. 2017. A Unified Approach to Interpreting Model Predictions. In Advances in Neural Information Processing Systems 30, I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.). Curran Associates, Inc., 4765--4774. http://papers.nips.cc/paper/7062-a-unified-approach-to-interpreting-model-predictions.pdf
    [8]
    Christoph Molnar. 2019. Interpretable Machine Learning . https://christophm.github.io/interpretable-ml-book/.
    [9]
    Anna Muratova, Danil Gizdatullin, Dmitry I Ignatov, and Ekaterina Mitrofanova. 2016. Revealing knowledge in demographic sequences. In Sociology and society: social inequality and social justice (Yekaterinburg, October 19--21 2016). Materials of the V All-Russian Sociological Congress / Under total. ed .: V.A.Mansurov. M.: Russian Society of Sociologists, pp. 9601--9615, 2016 .
    [10]
    Anna Muratova, Dmitry I. Ignatov, and Ekaterina Mitrofanova. 2020. Machine Learning Methods for Demographic Data Analysis. In Communications in Computer and Information Science (accepted to AIST 2020) .
    [11]
    Anna Muratova, Robiul Islam, Ekaterina Mitrofanova, and Dmitry I Ignatov. 2019. Searching for Interpretable Demographic Patterns. In Proceedings of the Fifth Workshop on Experimental Economics and Machine Learning at the National Research University Higher School of Economics co-located with the Seventh International Conference on Applied Research in Economics (iCare7), Perm, Russia, September 26, 2019. http://ceur-ws.org/Vol-2479/paper2.pdf
    [12]
    Anna Muratova, Pavel Sushko, and Thomas H. Espy. 2017. Black-Box Classification Techniques for Demographic Sequences : from Customised SVM to RNN. In Proceedings of the Fourth Workshop on Experimental Economics and Machine Learning (EEML 2017), Dresden, Germany, September 17--18, 2017 / Ed. by R. Tagiew, D. I. Ignatov, A. Hilbert, K. Heinrich, R. Delhibabu. Aachen : CEUR Workshop Proceedings, 2017. P. 31--40. http://ceur-ws.org/Vol-1968/paper4.pdf
    [13]
    Jiaxi Tang and Ke Wang. 2018. Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining (Marina Del Rey, CA, USA) (WSDM '18). Association for Computing Machinery, New York, NY, USA, 565--573. https://doi.org/10.1145/3159652.3159656
    [14]
    An Yan, Shuo Cheng, Wang-Cheng Kang, Mengting Wan, and Julian McAuley. 2019. CosRec: 2D Convolutional Neural Networks for Sequential Recommendation. arxiv: 1908.09972 [cs.IR]

    Index Terms

    1. Interpretability and Effectiveness of Machine Learning Methods for Sequence Mining in Various Domains

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      WSDM '21: Proceedings of the 14th ACM International Conference on Web Search and Data Mining
      March 2021
      1192 pages
      ISBN:9781450382977
      DOI:10.1145/3437963
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 08 March 2021

      Check for updates

      Author Tags

      1. classification
      2. data mining
      3. life trajectories
      4. neural networks
      5. sequence mining

      Qualifiers

      • Extended-abstract

      Funding Sources

      • Basic Research Program at the National Research University Higher School of Economics and funded by the Russian Academic Excelle
      • Faculty of Social Sciences National Research University Higher School of Economics

      Conference

      WSDM '21

      Acceptance Rates

      Overall Acceptance Rate 498 of 2,863 submissions, 17%

      Upcoming Conference

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 106
        Total Downloads
      • Downloads (Last 12 months)11
      • Downloads (Last 6 weeks)1

      Other Metrics

      Citations

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media