Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3635059.3635065acmotherconferencesArticle/Chapter ViewAbstractPublication PagespciConference Proceedingsconference-collections
research-article
Open access

Explainable Graph Neural Networks on Linked Statistical Data for Predicting Scottish House Prices

Published: 14 February 2024 Publication History

Abstract

In the complex landscape of the real estate market, accurately estimating house prices is of paramount importance for a wide array of stakeholders. While traditional statistical methods have been employed in the past, machine learning and deep learning models are increasingly demonstrating their efficacy in predicting house prices by capturing intricate patterns and relationships between features of the housing market, such as neighborhood characteristics and geographical proximity to amenities. Recently, Graph Neural Networks have shown great performance in various predictive tasks, effectively capturing complex spatial dependencies in graph-strucured data. Towards this direction, this paper utilizes a Graph Convolutional Network and linked Open Government Data from the official Scottish data portal to predict house prices in Scotland. The results showed that the Graph Neural Network model achieved an accuracy of 0.84 on the test set. The GNN’s decisions are also interpreted using the GNNexplainer model, that identified important graph structures as well as node features that are most influential for the prediction. Comparative Illness Factor, an indicator of health conditions, is identified as the feature that primarily influences the model’s decisions for a particular data zone.

References

[1]
Luc Anselin and Nancy Lozano-Gracia. 2009. Spatial hedonic models. In Palgrave handbook of econometrics. Springer, 1213–1250.
[2]
Steven Bourassa, Eva Cantoni, and Martin Hoesli. 2010. Predicting house prices with spatial dependence: A comparison of alternative methods. Journal of Real Estate Research 32, 2 (2010), 139–160.
[3]
Steven C. Bourassa, Eva Cantoni, and Martin Hoesli. 2007. Spatial Dependence, Housing Submarkets, and House Price Prediction. The Journal of Real Estate Finance and Economics 35, 2 (01 Aug 2007), 143–160. https://doi.org/10.1007/s11146-007-9036-8
[4]
Petros Brimos, Areti Karamanou, Evangelos Kalampokis, and Konstantinos Tarabanis. 2023. Graph Neural Networks and Open-Government Data to Forecast Traffic Flow. Information 14, 4 (2023). https://doi.org/10.3390/info14040228
[5]
Petros Brimos, Areti Karamanou, Evangelos Kalampokis, and Konstantinos Tarabanis. 2023. Traffic Flow Prediction with Swiss Open Data: A Deep Learning Approach. In Electronic Government, Ida Lindgren, Csaba Csáki, Evangelos Kalampokis, Marijn Janssen, Gabriela Viale Pereira, Shefali Virkar, Efthimios Tambouris, and Anneke Zuiderwijk (Eds.). Springer Nature Switzerland, Cham, 313–328.
[6]
Jie Chen, Tengfei Ma, and Cao Xiao. 2018. FastGCN: Fast Learning with Graph Convolutional Networks via Importance Sampling. In International Conference on Learning Representations. https://openreview.net/forum?id=rytstxWAW
[7]
Jianbo Chen, Le Song, Martin Wainwright, and Michael Jordan. 2018. Learning to explain: An information-theoretic perspective on model interpretation. In International Conference on Machine Learning. PMLR, 883–892.
[8]
Sarkar Snigdha Sarathi Das, Mohammed Eunus Ali, Yuan-Fang Li, Yong-Bin Kang, and Timos Sellis. 2021. Boosting house price predictions using geo-spatial network embedding. Data Mining and Knowledge Discovery 35, 6 (01 Nov 2021), 2221–2250. https://doi.org/10.1007/s10618-021-00789-x
[9]
Michaël Defferrard, Xavier Bresson, and Pierre Vandergheynst. 2016. Convolutional neural networks on graphs with fast localized spectral filtering. Advances in neural information processing systems 29 (2016).
[10]
Eurostat. 2022. Housing in Europe — 2022 interactive edition. https://doi.org/10.2785/798608
[11]
Thorben Funke, Megha Khosla, and Avishek Anand. 2021. Hard Masking for Explaining Graph Neural Networks. https://openreview.net/forum?id=uDN8pRAdsoC
[12]
Isabella Gollini, Binbin Lu, Martin Charlton, Christopher Brunsdon, and Paul Harris. 2015. GWmodel: An R Package for Exploring Spatial Heterogeneity Using Geographically Weighted Models. Journal of Statistical Software 63, 17 (2015), 1–50. https://doi.org/10.18637/jss.v063.i17
[13]
Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. Advances in neural information processing systems 30 (2017).
[14]
Qiang Huang, Makoto Yamada, Yuan Tian, Dinesh Singh, and Yi Chang. 2022. Graphlime: Local interpretable model explanations for graph neural networks. IEEE Transactions on Knowledge and Data Engineering (2022).
[15]
Evangelos Kalampokis, Efthimios Tambouris, and Konstantinos Tarabanis. 2016. Linked Open Cube Analytics Systems: Potential and Challenges. IEEE Intelligent Systems 31, 5 (2016), 89–92. https://doi.org/10.1109/MIS.2016.82
[16]
Areti Karamanou, Petros Brimos, Evangelos Kalampokis, and Konstantinos Tarabanis. 2022. Exploring the Quality of Dynamic Open Government Data Using Statistical and Machine Learning Methods. Sensors 22, 24 (2022). https://doi.org/10.3390/s22249684
[17]
Areti Karamanou, Evangelos Kalampokis, and Konstantinos Tarabanis. 2022. Linked Open Government Data to Predict and Explain House Prices: The Case of Scottish Statistics Portal. Big Data Research 30 (2022), 100355. https://doi.org/10.1016/j.bdr.2022.100355
[18]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[19]
Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In International Conference on Learning Representations. https://openreview.net/forum?id=SJU4ayYgl
[20]
Scott M Lundberg and Su-In Lee. 2017. A unified approach to interpreting model predictions. Advances in neural information processing systems 30 (2017).
[21]
Dongsheng Luo, Wei Cheng, Dongkuan Xu, Wenchao Yu, Bo Zong, Haifeng Chen, and Xiang Zhang. 2020. Parameterized explainer for graph neural network. Advances in neural information processing systems 33 (2020), 19620–19631.
[22]
Juan Manuel Perez Martinez, Rafael Berlanga, Maria Jose Aramburu, and Torben Bach Pedersen. 2008. Integrating Data Warehouses with Web Data: A Survey. IEEE Transactions on Knowledge and Data Engineering 20, 7 (2008), 940–955. https://doi.org/10.1109/TKDE.2007.190746
[23]
Phillip E Pope, Soheil Kolouri, Mohammad Rostami, Charles E Martin, and Heiko Hoffmann. 2019. Explainability methods for graph convolutional neural networks. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 10772–10781.
[24]
Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. 2016. " Why should i trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. 1135–1144.
[25]
Juan Ramón Rico-Juan and Paloma Taltavull de La Paz. 2021. Machine learning with explainability or spatial hedonics tools? An analysis of the asking prices in the housing market in Alicante, Spain. Expert Systems with Applications 171 (2021), 114590. https://doi.org/10.1016/j.eswa.2021.114590
[26]
Alvaro Sanchez-Gonzalez, Nicolas Heess, Jost Tobias Springenberg, Josh Merel, Martin Riedmiller, Raia Hadsell, and Peter Battaglia. 2018. Graph networks as learnable physics engines for inference and control. In International Conference on Machine Learning. PMLR, 4470–4479.
[27]
Thomas Schnake, Oliver Eberle, Jonas Lederer, Shinichi Nakajima, Kristof T Schütt, Klaus-Robert Müller, and Grégoire Montavon. 2021. Higher-order explanations of graph neural networks via relevant walks. IEEE transactions on pattern analysis and machine intelligence 44, 11 (2021), 7581–7596.
[28]
Robert Schwarzenberg, Marc Hübner, David Harbecke, Christoph Alt, and Leonhard Hennig. 2019. Layerwise Relevance Visualization in Convolutional Text Graph Classifiers. In Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing (TextGraphs-13). Association for Computational Linguistics, Hong Kong, 58–62. https://doi.org/10.18653/v1/D19-5308
[29]
Ziyi Sun and Jing Zhang. 2022. Research on Prediction of Housing Prices Based on GA-PSO-BP Neural Network Model: Evidence from Chongqing, China. INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE 33, 06N07 (SEP 2022), 805–818. https://doi.org/10.1142/S0129054122420163
[30]
Efthimios Tambouris and Konstantinos Tarabanis. 2021. Towards Inclusive Integrated Public Service (IPS) Co-Creation and Provision. In DG.O2021: The 22nd Annual International Conference on Digital Government Research (Omaha, NE, USA) (DG.O’21). Association for Computing Machinery, New York, NY, USA, 458–462. https://doi.org/10.1145/3463677.3463726
[31]
Petar Velickovic, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, Yoshua Bengio, 2017. Graph attention networks. stat 1050, 20 (2017), 10–48550.
[32]
Minh Vu and My T Thai. 2020. Pgm-explainer: Probabilistic graphical model explanations for graph neural networks. Advances in neural information processing systems 33 (2020), 12225–12235.
[33]
Yongji Wu, Defu Lian, Yiheng Xu, Le Wu, and Enhong Chen. 2020. Graph convolutional networks with markov random field reasoning for social spammer detection. In Proceedings of the AAAI conference on artificial intelligence, Vol. 34. 1054–1061.
[34]
Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and Philip S. Yu. 2021. A Comprehensive Survey on Graph Neural Networks. IEEE Transactions on Neural Networks and Learning Systems 32, 1 (2021), 4–24. https://doi.org/10.1109/TNNLS.2020.2978386
[35]
Keyulu Xu, Weihua Hu, Jure Leskovec, and Stefanie Jegelka. 2019. How Powerful are Graph Neural Networks?. In International Conference on Learning Representations. https://openreview.net/forum?id=ryGs6iA5Km
[36]
Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L Hamilton, and Jure Leskovec. 2018. Graph convolutional neural networks for web-scale recommender systems. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining. 974–983.
[37]
Zhitao Ying, Dylan Bourgeois, Jiaxuan You, Marinka Zitnik, and Jure Leskovec. 2019. Gnnexplainer: Generating explanations for graph neural networks. Advances in neural information processing systems 32 (2019).
[38]
Hao Yuan, Jiliang Tang, Xia Hu, and Shuiwang Ji. 2020. Xgnn: Towards model-level explanations of graph neural networks. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 430–438.
[39]
Hao Yuan, Haiyang Yu, Shurui Gui, and Shuiwang Ji. 2022. Explainability in graph neural networks: A taxonomic survey. IEEE Transactions on Pattern Analysis and Machine Intelligence (2022).
[40]
Jiani Zhang, Xingjian Shi, Junyuan Xie, Hao Ma, Irwin King, and Dit-Yan Yeung. 2018. GaAN: Gated Attention Networks for Learning on Large and Spatiotemporal Graphs. In Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence. 339–349.
[41]
Yue Zhang, David Defazio, and Arti Ramesh. 2021. Relex: A model-agnostic relational model explainer. In Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society. 1042–1049.
[42]
Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, and Maosong Sun. 2020. Graph neural networks: A review of methods and applications. AI Open 1 (2020), 57–81. https://doi.org/10.1016/j.aiopen.2021.01.001

Index Terms

  1. Explainable Graph Neural Networks on Linked Statistical Data for Predicting Scottish House Prices

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      PCI '23: Proceedings of the 27th Pan-Hellenic Conference on Progress in Computing and Informatics
      November 2023
      304 pages
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 14 February 2024

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Explainable Graph Neural Networks
      2. Graph Neural Networks
      3. house price prediction
      4. linked statistical data

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Funding Sources

      • Horizon 2020 Framework Programme award number(s)

      Conference

      PCI 2023

      Acceptance Rates

      Overall Acceptance Rate 190 of 390 submissions, 49%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 85
        Total Downloads
      • Downloads (Last 12 months)85
      • Downloads (Last 6 weeks)30
      Reflects downloads up to 03 Sep 2024

      Other Metrics

      Citations

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media