DOI: 10.1145/3662006.3662064
Research article · Open access

Are Large Language Models Capable of Causal Reasoning for Sensing Data Analysis?

Published: 11 June 2024
Abstract

Correlation analysis between socioeconomic factors and environmental impact is essential for policy making that ensures sustainability and economic development simultaneously. With the development of the Internet of Things (IoT), citizen-science IoT monitoring provides valuable environmental measurements, such as PM2.5 for air quality monitoring. However, socioeconomic factors are usually interconnected and confound each other, making accurate correlation analysis challenging. To isolate the effect of an individual socioeconomic factor, we need to mitigate the confounding effect of the other factors on the environmental sensing data, e.g., via propensity score matching. Large language models (LLMs) have shown remarkable capabilities in data reasoning, raising the question of whether they can conduct causal reasoning and answer questions like "What is the most important socioeconomic factor that impacts regional air quality?"
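To make the deconfounding step concrete, below is a minimal propensity-score-matching sketch in Python. It is illustrative only, not the paper's implementation: the DataFrame layout (a binary `treatment` column for the factor of interest, confounder columns, a `pm25` outcome) and the 1-nearest-neighbor matching are assumptions.

```python
# Illustrative sketch only (not the paper's implementation): estimate the
# effect of one binary socioeconomic "treatment" (e.g., high vs. low
# median income) on PM2.5 after matching away the other, confounding
# factors. Column names "treatment" and "pm25" are assumptions.
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression

def matched_effect(df: pd.DataFrame, confounders: list[str]) -> float:
    # 1. Propensity score: P(treatment = 1 | confounders).
    model = LogisticRegression(max_iter=1000)
    model.fit(df[confounders], df["treatment"])
    df = df.assign(ps=model.predict_proba(df[confounders])[:, 1])

    treated = df[df["treatment"] == 1]
    control = df[df["treatment"] == 0]

    # 2. 1-nearest-neighbor matching (with replacement) on the score.
    dist = np.abs(treated["ps"].to_numpy()[:, None]
                  - control["ps"].to_numpy()[None, :])
    matched = control.iloc[dist.argmin(axis=1)]

    # 3. The average PM2.5 difference across matched pairs approximates
    #    the deconfounded effect of the treatment factor.
    return float(np.mean(treated["pm25"].to_numpy()
                         - matched["pm25"].to_numpy()))
```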
In this paper, we present a new evaluation framework named "Order-of-Thought," based on Bloom's Taxonomy pedagogical framework, to quantify LLMs' ability to perform causal reasoning. We apply this framework with both natural-language-based and program-based prompting strategies. Our evaluation uncovers the exceptional potential of LLMs in causal reasoning for sensing data analysis, offers valuable insights into their capabilities and limitations, and points to useful directions for achieving higher-order thought.
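As a hypothetical sketch of what the program-based strategy might look like (the prompt wording, the `complete` callable, and the sandboxed-execution step are illustrative assumptions, not the paper's protocol), the model is asked to emit executable analysis code, and the executed result, rather than free-form text, is taken as its answer:

```python
# Hypothetical program-based ("program-of-thought"-style) prompt: the LLM
# returns runnable analysis code instead of a natural-language answer.
# `complete` stands in for any text-completion API; prompt wording is an
# assumption for illustration.
from typing import Callable

PROMPT_TEMPLATE = """\
You are given a pandas DataFrame `df` with columns {columns}.
Write Python code that estimates the causal effect of `{factor}`
on `pm25`, adjusting for the remaining columns as confounders.
Store the final numeric estimate in a variable named `effect`."""

def program_based_query(complete: Callable[[str], str],
                        columns: list[str], factor: str) -> str:
    """Return model-generated analysis code; the executed `effect`,
    not the model's prose, would then be graded as the answer."""
    return complete(PROMPT_TEMPLATE.format(columns=columns, factor=factor))
```

Under a natural-language strategy, by contrast, the model would be asked the same question directly and its textual answer graded as-is.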



    Published In

    EdgeFM '24: Proceedings of the Workshop on Edge and Mobile Foundation Models
    June 2024
    44 pages
    ISBN:9798400706639
    DOI:10.1145/3662006
This work is licensed under a Creative Commons Attribution 4.0 International License.


    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. Causal Data Reasoning
    2. Large Language Model

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    • UC Merced Spring 2023 Climate Action Seed Competition Grant

    Conference

    MOBISYS '24


