Abstract
Sarcasm detection is challenging in natural language processing since its peculiar linguistic expression. Thanks in part to the availability of considerable annotated resources for some datasets, current supervised learning-based approaches can achieve promising performance in sarcasm detection. In real-world scenarios, annotating data for the peculiar language expression of sarcasm proves challenging. Consequently, recent studies have delved into unsupervised learning approaches for sarcasm detection, seeking to mitigate the labor-intensive process of annotation. In this paper, we present a novel unsupervised sarcasm detection method leveraging abundant unlabeled social media data. Our approach revolves around employing prompts as a cornerstone. Initially, we gathered approximately 3 million texts from Twitter through targeted hashtag-based searches, segregating them into sarcasm and non-sarcasm categories based on associated hashtags. Subsequently, these collected texts undergo training using a pre-trained BERT model, customized for masked language modeling and coined as SarcasmBERT. This step aims to enhance the model’s grasp of sarcastic cues within the text. Finally, we devise prompts tailored for the unlabeled data to execute unsupervised sarcasm detection effectively. Our experimental findings across six benchmark datasets highlight the superiority of our method over state-of-the-art unsupervised baselines. Additionally, the integration of our SarcasmBERT into established BERT-based sarcasm detection methods showcases a direct avenue for enhancing performance, thereby illustrating its potential for immediate and substantial improvements.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Agrawal, A., An, A.: Affective representations for sarcasm detection. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 1029–1032 (2018)
Agrawal, A., An, A., Papagelis, M.: Leveraging transitions of emotions for sarcasm detection. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1505–1508 (2020)
Bamman, D., Smith, N.: Contextualized sarcasm detection on twitter. In: Proceedings of the International AAAI Conference on Web and Social Media vol. 9, pp. 574–577 (2015)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Cambria, E., Li, Y., Xing, F.Z., Poria, S., Kwok, K.: Senticnet 6: ensemble application of symbolic and subsymbolic AI for sentiment analysis. In: Proceedings of the 29th ACM International Conference on Information and Knowledge Management, pp. 105–114 (2020)
Cambria, E., Speer, R., Havasi, C., Hussain, A.: Senticnet: a publicly available semantic resource for opinion mining. In: 2010 AAAI Fall Symposium Series (2010)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171–4186 (2019)
González-Ibánez, R., Muresan, S., Wacholder, N.: Identifying sarcasm in twitter: a closer look. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 581–586 (2011)
Jena, A.K., Sinha, A., Agarwal, R.: C-net: contextual network for sarcasm detection. In: Proceedings of the Second Workshop on Figurative Language Processing, pp. 61–66 (2020)
Joshi, A., Agrawal, S., Bhattacharyya, P., Carman, M.J.: Expect the Unexpected: harnessing sentence completion for sarcasm detection. In: Hasida, K., Pa, W.P. (eds.) PACLING 2017. CCIS, vol. 781, pp. 275–287. Springer, Singapore (2018). https://doi.org/10.1007/978-981-10-8438-6_22
Khodak, M., Saunshi, N., Vodrahalli, K.: A large self-annotated corpus for sarcasm. arXiv preprint arXiv:1704.05579 (2017)
Kinga, D., Adam, J.B.: Adam: a method for stochastic optimization (2015)
Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28(2), 129–137 (1982)
Lou, C., Liang, B., Gui, L., He, Y., Dang, Y., Xu, R.:. Affective dependency graph for sarcasm detection. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1844–1849 (2021)
Lukin, S., Walker, M.: Really? well. apparently bootstrapping improves the performance of sarcasm and nastiness classifiers for online dialogue. arXiv preprint arXiv:1708.08572 (2017)
Panayotov, V., Chen, G., Povey, D., Khudanpur, S.: Librispeech: an asr corpus based on public domain audio books. In: 2015 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp. 5206–5210. IEEE (2015)
áš Ptáček, T., Habernal, I., Hong, J.: Sarcasm detection on Czech and English twitter. In: Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical papers, pp. 213–223 (2014)
Riloff, E., Qadir, A., Surve, P., De Silva, L., Gilbert, N., Huang, R.: Sarcasm as contrast between a positive sentiment and negative situation. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 704–714 (2013)
Karen Sparck Jones: A statistical interpretation of term specificity and its application in retrieval. J. Document. 28(1), 11–21 (1972)
Vaswani, A.: Attention is all you need. In: Advances in Neural Information Processing Systems, 30 (2017)
Wang, R., et al.: Masking and generation: an unsupervised method for sarcasm detection. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 2172–2177 (2022)
Acknowledgment
This work was partially supported by the Key Technologies Research and Development Program of Shenzhen JSGG20210802154400001, Shenzhen Foundational Research Funding JCYJ20220818102415032, and the Joint Lab of HITSZ and China Merchants Securities.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Lin, Q. et al. (2024). An Unsupervised Method for Sarcasm Detection with Prompts. In: Pan, X., Jin, T., Zhang, LJ. (eds) Cognitive Computing – ICCC 2023. ICCC 2023. Lecture Notes in Computer Science, vol 14207. Springer, Cham. https://doi.org/10.1007/978-3-031-51671-9_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-51671-9_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-51670-2
Online ISBN: 978-3-031-51671-9
eBook Packages: Computer ScienceComputer Science (R0)