Abstract
ChatGPT represents a significant advancement in the field of Artificial Intelligence (AI), showcasing the development of a robust AI system capable of multitasking and generating human-like language. At present, many scholars have done evaluations on ChatGPT in terms of language, reasoning, and scientific knowledge abilities, based on benchmarks or well-crafted questions. However, to the best of our knowledge, there is currently no existing comparative analysis from a cognitive perspective that directly assesses ChatGPT alongside humans. Metaphor, serving as a manifestation of linguistic creativity, provides a valuable avenue for examining cognition. This is due to the mapping relationship it establishes between the target and source conceptual domains, reflecting distinct cognitive patterns. In this paper, we use a metaphor processing tool, MetaPro, to analyze the cognitive differences between ChatGPT and humans through the metaphorical expressions in ChatGPT- and human-generated text. We illustrate the preferences in metaphor usage, concept mapping, and cognitive pattern variances across different domains. The methodology utilized in this study makes a valuable contribution to the task-agnostic evaluation of AI systems and cognitive research. The insights garnered from this research prove instrumental in comprehending the cognitive distinctions between ChatGPT and humans, facilitating the identification of potential cognitive biases within ChatGPT.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data Availability
The data used in this study will be made available upon request.
Notes
Ecological validity pertains to how well the design or evaluation setup aligns with the authentic work context of the user. It focuses on the accuracy with which the design or evaluation mirrors the pertinent characteristics of the interaction’s ecology, capturing its context in the real world or environment [6].
We define metaphorical cognition as the reflection of cognition through metaphors, encompassing elements such as the cognition of target concepts, source concepts, and their mappings.
While dictionaries may contain the meanings of numerous conventional metaphors, their mere inclusion is not a feature to identify the metaphoricity of a lexical unit. According to Metaphor Identification Procedure [12], a metaphor is identified through the semantic contrast between its contextual and basic meanings. The basic meaning of a metaphor is typically more concrete, related to bodily action, more precise, and historically older.
Italics denote metaphors; small capital words denote concepts.
In this work, the representation of a concept mapping takes the form of “a target concept is a source concept”.
A word association test involves the presentation of a stimulus word to a participant, who subsequently provides the initial word that comes to mind in response.
The thematic apperception test is a projective psychological evaluation that requires individuals to furnish interpretations for scenes characterized by ambiguity.
The Rorschach test is a projective technique used in psychological assessment, involving the individual’s task of describing their interpretations of ten inkblots. These inkblots consist of a combination of black or gray elements and others featuring patches of color.
Questions in the dataset were generated by humans only. Thus, we did not parse questions.
References
Shidiq M. The use of artificial intelligence-based Chat-GPT and its challenges for the world of education; from the viewpoint of the development of creative writing skills. In: Proceeding of International Conference on Education, Society and Humanity; 2023;1:353–357
Méndez G, Gervás P. Using ChatGPT for story sifting in narrative generation. In: Proceedings of The 14th International Conference on Computational Creativity; 2023
Qin C, Zhang A, Zhang Z, Chen J, Yasunaga M, Yang D. Is ChatGPT a general-purpose natural language processing task solver? 2023 arXiv:2302.06476
Soni M, Wade V Comparing abstractive summaries generated by ChatGPT to real summaries through blinded reviewers and text classification algorithms. 2023 arXiv:2303.17650
Mao R, Chen G, Zhang X, Guerin F, Cambria E. GPTEval: a survey on assessments of ChatGPT and GPT-4. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), Torino, Italia; 2024. pp. 7844–7866
Hartson R, Pyla PS. Chapter 14 - rigorous empirical evaluation: preparation. In: Hartson R, Pyla PS (eds.) The UX Book; 2012. pp. 503–536. Morgan Kaufmann, Boston
Rapaport D, Gill M, Schafer R. Diagnostic psychological testing: the theory, statistical evaluation, and diagnostic application of a battery of tests. 1946:2
Crossley SA, Kyle K, McNamara DS. Sentiment analysis and social cognition engine (SEANCE): an automatic tool for sentiment, social cognition, and social-order analysis. Behav Res Methods. 2017;49:803–21.
Naskar D, Singh S.R, Kumar D, Nandi S, Rivaherrera EOdl. Emotion dynamics of public opinions on Twitter. ACM Trans Inf Syst (TOIS). 2020;38(2):1–24
Orabi AH, Buddhitha P, Orabi MH, Inkpen D. Deep learning for depression detection of Twitter users. In: Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: from Keyboard to Clinic; 2018. pp. 88–97
Priva UC, Austerweil JL. Analyzing the history of cognition using topic models. Cognit. 2015;135:4–9.
Pragglejaz G. MIP: a method for identifying metaphorically used words in discourse. Metaphor Symb. 2007;22(1):1–39.
Lakoff G, Johnson M. Metaphors we live by; 1980
Lakoff G. The contemporary theory of metaphor. Metaphor and thought; 1993:202–251
Mao R, Li X, He K, Ge M, Cambria E. MetaPro online: a computational metaphor processing online system. In: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (System Demonstrations); 2023;3:127–135
Guo B, Zhang X, Wang Z, Jiang M, Nie J, Ding Y, Yue J, Wu Y. How close is ChatGPT to human experts? Comparison corpus, evaluation, and detection. 2023 arXiv:2301.07597
Ge M, Mao R, Cambria E. A survey on computational metaphor processing techniques: from identification, interpretation, generation to application. Artif Intell Rev. 2023;56:1829–95.
Mao R, Liu Q, He K, Li W, Cambria E. The biases of pre-trained language models: an empirical study on prompt-based sentiment analysis and emotion detection. IEEE Trans Affect Comput. 2023;14(3):1743–53.
Fan C, Lin J, Mao R, Cambria E. Fusing pairwise modalities for emotion recognition in conversations. Inf Fusion. 2024;106:102306.
William D, Suhartono D. Text-based depression detection on social media posts: a systematic literature review. Procedia Comput Sci. 2021;179:582–9.
Ji S, Pan S, Li X, Cambria E, Long G, Huang Z. Suicidal ideation detection: a review of machine learning methods and applications. IEEE Trans Comput Soc Syst. 2020;8(1):214–26.
Karami A, Zhu M, Goldschmidt B, Boyajieff HR, Najafabadi MM. COVID-19 vaccine and social media in the US: exploring emotions and discussions on Twitter. Vaccines. 2021;9(10):1059.
Xia E, Yue H, Liu H. Tweet sentiment analysis of the 2020 US presidential election. In: Companion Proceedings of the Web Conference 2021; 2021. pp. 367–371
Duong C, Liu Q, Mao R, Cambria E. Saving earth one tweet at a time through the lens of artificial intelligence. In: 2022 International Joint Conference on Neural Networks (IJCNN); 2022. pp. 1–9
Duong C, Raghuram VC, Lee A, Mao R, Mengaldo G, Cambria E. Neurosymbolic AI for mining public opinions about wildfires. Cognit Comput. 2023;16:1531–53.
Garcia MB, Cunanan-Yabut A. Public sentiment and emotion analyses of Twitter data on the 2022 Russian invasion of Ukraine. In: 2022 9th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE); 2022. pp. 242–247. IEEE
Guntuku SC, Preotiuc-Pietro D, Eichstaedt JC, Ungar LH. What twitter profile and posted images reveal about depression and anxiety. In: Proceedings of the International AAAI Conference on Web and Social Media; 2019;13:236–246
Wu X, Pan F, Nguyen T, Feng Y, Liu C, Nguyen CD, Luu AT. On the affinity, rationality, and diversity of hierarchical topic modeling. In: Proceedings of the AAAI Conference on Artificial Intelligence; 2024;38:19261–19269
Wu X, Dong X, Nguyen TT, Luu AT. Effective neural topic modeling with embedding clustering regularization. In: International Conference on Machine Learning; 2023. pp. 37335–37357. PMLR
Taleb NN. The black swan: the impact of the highly improbable. 2007:2
Glette-Iversen I, Aven T. On the meaning of and relationship between dragon-kings, black swans and related concepts. Reliab Eng Syst Saf. 2021;211:107625.
Arrese Á. The use of ‘bubble’ as an economic metaphor in the news: the case of the ‘real estate bubble’ in Spain. Lang Commun. 2021;78:100–8.
Ruane E, Farrell S, Ventresque A. User perception of text-based chatbot personality. In: Chatbot Research and Design: 4th International Workshop; 2021. pp. 32–47. Springer
Liu B, Sundar SS. Should machines express sympathy and empathy? Experiments with a health advice chatbot. Cyberpsychology Behav Soc Netw. 2018;21(10):625–36.
Santo L.E, Cardoso A, Wiggins G. Theoretical learning creators and creative scientists. In: 13th International Conference on Computational Creativity. 2022. Association for Computational Creativity
Moghaddam SR, Honey CJ. Boosting theory-of-mind performance in large language models via prompting. 2023. arXiv:2304.11490
Hutson M. How does ChatGPT ‘think’? Psychology and neuroscience crack open AI large language models. Nat. 2024;629(8014):986–8.
Mao R, He K, Ong CB, Liu Q, Cambria E. MetaPro 2.0: computational metaphor processing on the effectiveness of anomalous language modeling. In: Findings of the Association for Computational Linguistics: ACL; 2024. pp. 9891–9908. Association for Computational Linguistics, Bangkok, Thailand
Manro R, Mao R, Dahiya L, Ma Y, Cambria E. A cognitive analysis of CEO speeches and their effects on stock markets. In: Proceedings of the 5th International Conference on Financial Technology (ICFT), 2024. Singapore
Prosser J, Cohen LJ, Steinfeld M, Eisenberg D, London ED, Galynker II. Neuropsychological functioning in opiate-dependent subjects receiving and following methadone maintenance treatment. Drug Alcohol Depend. 2006;84(3):240–7.
Wiggins JS. Paradigms of personality assessment. 2003
De Vos GA. Boyer LB. Symbolic analysis cross-culturally: the rorschach test; 2021.
Han S, Mao R, Cambria E. Hierarchical attention network for explainable depression detection on Twitter aided by metaphor concept mappings. In: Proceedings of the 29th International Conference on Computational Linguistics (COLING); 2022. pp. 94–104
Mao R, Zhang T, Liu Q, Hussain A, Cambria E. Unveiling diplomatic narratives: analyzing United Nations Security Council debates through metaphorical cognition. In: Proceedings of the Annual Meeting of the Cognitive Science Society (CogSci); 2024. pp. 1709–1716. Rotterdam, the Netherlands
Mao R, Li X. Bridging towers of multi-task learning with a gating mechanism for aspect-based sentiment analysis and sequential metaphor identification. Proceed AAAI Conf Artif Intell. 2021;35(15):13534–42.
Mao R, Li X, Ge M, Cambria E. MetaPro: a computational metaphor processing model for text pre-processing. Inf Fusion. 2022;86–87:30–43.
Ge M, Mao R, Cambria E. Explainable metaphor identification inspired by conceptual metaphor theory. Proceed AAAI Conf Artif Intell. 2022;36(10):10681–9.
Steen GJ, Dorst AG, Herrmann JB, Kaal A, Krennmayr T, Pasma T. A method for linguistic metaphor identification: from MIP to MIPVU. 2010;2010:14.
Fellbaum C. WordNet: an electronic lexical database. 1998
Liu Y, Ott M, Goyal N, Du J, Joshi M, Chen D, Levy O, Lewis M, Zettlemoyer L, Stoyanov V. RoBERTa: a robustly optimized BERT pretraining approach. 2019. arXiv:1907.11692
Sackett DL. Evidence-based medicine. In: Seminars in Perinatology; 1997;21:3–5. Elsevier
Mao R, Du K, Ma Y, Zhu L, Cambria E. Discovering the cognition behind language: financial metaphor analysis with MetaPro. In: 2023 IEEE International Conference on Data Mining (ICDM); 2023. pp. 1211–1216. IEEE
Jamrozik A, McQuire M, Cardillo ER, Chatterjee A. Metaphor: bridging embodiment to abstraction. Psychon Bull Rev. 2016;23:1080–9.
Sperber D, Wilson D. Relevance: communication and cognition. 2nd ed. 1995
Pennington J, Socher R, Manning CD. GloVe: global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing; 2014. pp. 1532–1543
Lloyd S. Least squares quantization in PCM. IEEE Trans Inf Theory. 1982;28(2):129–37.
Cambria E, Mao R, Chen M, Wang Z, Ho SB. Seven pillars for the future of artificial intelligence. IEEE Intell Syst. 2023;38(6):62–9.
Zhu L, Li W, Mao R, Pandelea V, Cambria E. PAED: zero-shot persona attribute extraction in dialogues. In: Proceedings of the 61st annual meeting of the association for Computational Linguistics (ACL); 2023;1:9771–9787
Zhu L, Mao R, Cambria E, Jansen BJ. Neurosymbolic AI for personalized sentiment analysis. In: Proceedings of international conference on human-computer Interaction (HCII); 2024. Washington DC, USA
Funding
This research/project is supported by the Ministry of Education, Singapore, under its MOE Academic Research Fund Tier 2 (STEM RIE2025 Award MOE-T2EP20123-0005). Guanyi Chen is supported by the Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning and the National Language Resources Monitoring and Research Center for Network Media of Central China Normal University in Wuhan, China.
Author information
Authors and Affiliations
Contributions
Rui Mao: conceptualization, methodology, software, formal analysis, visualization, writing—original draft, writing—review and editing. Guanyi Chen: formal analysis, visualization, writing—review and editing. Xiao Li: investigation, validation, writing—review and editing. Mengshi Ge: investigation, validation, writing—review and editing. Erik Cambria: conceptualization, data curation, writing—review and editing, supervision.
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Mao, R., Chen, G., Li, X. et al. A Comparative Analysis of Metaphorical Cognition in ChatGPT and Human Minds. Cogn Comput 17, 35 (2025). https://doi.org/10.1007/s12559-024-10393-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12559-024-10393-y