Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3477495.3531979acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article
Open access

Few-Shot Stance Detection via Target-Aware Prompt Distillation

Published: 07 July 2022 Publication History

Abstract

Stance detection aims to identify whether the author of a text is in favor of, against, or neutral to a given target. The main challenge of this task comes two-fold: few-shot learning resulting from the varying targets and the lack of contextual information of the targets. Existing works mainly focus on solving the second issue by designing attention-based models or introducing noisy external knowledge, while the first issue remains under-explored. In this paper, inspired by the potential capability of pre-trained language models (PLMs) serving as knowledge bases and few-shot learners, we propose to introduce prompt-based fine-tuning for stance detection. PLMs can provide essential contextual information for the targets and enable few-shot learning via prompts. Considering the crucial role of the target in stance detection task, we design target-aware prompts and propose a novel verbalizer. Instead of mapping each label to a concrete word, our verbalizer maps each label to a vector and picks the label that best captures the correlation between the stance and the target. Moreover, to alleviate the possible defect of dealing with varying targets with a single hand-crafted prompt, we propose to distill the information learned from multiple prompts. Experimental results show the superior performance of our proposed model in both full-data and few-shot scenarios.

Supplementary Material

MP4 File (SIGIR22-fp1052.mp4)
Stance detection aims to identify whether the author of a text is in favor of, against, or neutral to a given target. There are two main challenges in this task: few-shot learning caused by the varying targets, and a lack of contextual information about the targets. Existing research focuses mostly on solving the second issue by designing attention-based models or introducing noisy external knowledge, while the first issue remains under-explored. In this paper, we propose to introduce prompt-based fine-tuning for stance detection. Considering the crucial role of the target in stance detection task, we design target-aware prompts and propose a novel verbalizer. Furthermore, we propose to distill the information learned from multiple prompts. Experimental results show the superior performance of our proposed model in both full-data and few-shot scenarios.

References

[1]
Henrik Bøhler, Petter Asla, Erwin Marsi, and Rune Sætre. 2016. Idi@ ntnu at semeval-2016 task 6: Detecting stance in tweets using shallow features and glove vectors for word representation. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016). 445--450.
[2]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 4171--4186.
[3]
Kuntal Dey, Ritvik Shrivastava, and Saroj Kaushik. 2018. Topical stance detection for Twitter: A two-phase LSTM model using attention. In European Conference on Information Retrieval. Springer, 529--536.
[4]
Jiachen Du, Lin Gui, Ruifeng Xu, Yunqing Xia, and Xuan Wang. 2020. Commonsense knowledge enhanced memory network for stance classification. IEEE Intelligent Systems 35, 4 (2020), 102--109.
[5]
Jiachen Du, Ruifeng Xu, Yulan He, and Lin Gui. 2017. Stance classification with target-specific neural attention networks. International Joint Conferences on Artificial Intelligence.
[6]
Tianyu Gao, Adam Fisch, and Danqi Chen. 2021. Making Pre-trained Language Models Better Few-shot Learners. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 3816--3830.
[7]
Kazuaki Hanawa, Akira Sasaki, Naoaki Okazaki, and Kentaro Inui. 2019. Stance detection attending external knowledge from wikipedia. Journal of Information Processing 27 (2019), 499--506.
[8]
Momchil Hardalov, Arnav Arora, Preslav Nakov, and Isabelle Augenstein. 2021. Cross-Domain Label-Adaptive Stance Detection. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 9011--9028.
[9]
Momchil Hardalov, Arnav Arora, Preslav Nakov, and Isabelle Augenstein. 2021. Few-shot cross-lingual stance detection with sentiment-based pre-training. arXiv preprint arXiv:2109.06050 (2021).
[10]
Tomás Hercig, Peter Krejzl, Barbora Hourová, Josef Steinberger, and Ladislav Lenc. 2017. Detecting Stance in Czech News Commentaries. In ITAT. 176--180.
[11]
Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015).
[12]
Mirko Lai, Viviana Patti, Giancarlo Ruffo, and Paolo Rosso. 2018. Stance evolution and twitter interactions in an italian political debate. In International Conference on Applications of Natural Language to Information Systems. Springer, 15--27.
[13]
Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, and Radu Soricut. 2019. Albert: A lite bert for self-supervised learning of language representations. arXiv preprint arXiv:1909.11942 (2019).
[14]
David MJ Lazer, Matthew A Baum, Yochai Benkler, Adam J Berinsky, Kelly M Greenhill, Filippo Menczer, Miriam J Metzger, Brendan Nyhan, Gordon Pennycook, David Rothschild, et al. 2018. The science of fake news. Science 359, 6380 (2018), 1094--1096.
[15]
Teven Le Scao and Alexander M Rush. 2021. How many data points is a prompt worth?. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2627--2636.
[16]
Yingjie Li and Cornelia Caragea. 2019. Multi-task stance detection with sentiment and stance lexicons. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 6299--6305.
[17]
Bin Liang, Yonghao Fu, Lin Gui, Min Yang, Jiachen Du, Yulan He, and Ruifeng Xu. 2021. Target-adaptive Graph for Cross-target Stance Detection. In Proceedings of the Web Conference 2021. 3453--3464.
[18]
Rui Liu, Zheng Lin, Yutong Tan, and Weiping Wang. 2021. Enhancing zero-shot and few-shot stance detection with commonsense knowledge graph. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021. 3152--3157.
[19]
Xiao Liu, Yanan Zheng, Zhengxiao Du, Ming Ding, Yujie Qian, Zhilin Yang, and Jie Tang. 2021. GPT Understands, Too. arXiv preprint arXiv:2103.10385 (2021).
[20]
Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019).
[21]
Xiaofei Ma, Zhiguo Wang, Patrick Ng, Ramesh Nallapati, and Bing Xiang. 2019. Universal text representation from bert: An empirical study. arXiv preprint arXiv:1910.07973 (2019).
[22]
Saif Mohammad, Svetlana Kiritchenko, Parinaz Sobhani, Xiaodan Zhu, and Colin Cherry. 2016. Semeval-2016 task 6: Detecting stance in tweets. In Proceedings of the 10th international workshop on semantic evaluation (SemEval-2016). 31--41.
[23]
Akiko Murakami and Rudy Raymond. 2010. Support or oppose? Classifying positions in online debates from reply activities and opinion expressions. In Coling 2010: Posters. 869--875.
[24]
Fabio Petroni, Tim Rocktäschel, Sebastian Riedel, Patrick Lewis, Anton Bakhtin, Yuxiang Wu, and Alexander Miller. 2019. Language Models as Knowledge Bases?. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 2463--2473.
[25]
Kashyap Popat, Subhabrata Mukherjee, Andrew Yates, and Gerhard Weikum. 2019. STANCY: Stance Classification Based on Consistency Cues. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLPIJCNLP). 6413--6418.
[26]
Alec Radford, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever. 2018. Improving language understanding by generative pre-training. (2018).
[27]
Gayathri Rajendran, Bhadrachalam Chitturi, and Prabaharan Poornachandran. 2018. Stance-in-depth deep neural approach to stance classification. Procedia computer science 132 (2018), 1646--1653.
[28]
Carlos Abel Córdova Sáenz and Karin Becker. 2021. Interpreting BERT-based stance classification: a case study about the Brazilian COVID vaccination. In Anais do XXXVI Simpósio Brasileiro de Bancos de Dados. SBC, 73--84.
[29]
Timo Schick and Hinrich Schütze. 2021. Exploiting Cloze-Questions for FewShot Text Classification and Natural Language Inference. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. 255--269.
[30]
Timo Schick and Hinrich Schütze. 2021. It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2339--2352.
[31]
Anirban Sen, Manjira Sinha, Sandya Mannarswamy, and Shourya Roy. 2018. Stance classification of multi-perspective consumer health information. In Proceedings of the ACM India Joint International Conference on Data Science and Management of Data. 273--281.
[32]
Ronald Seoh, Ian Birle, Mrinal Tak, Haw-Shiuan Chang, Brian Pinette, and Alfred Hough. 2021. Open Aspect Target Sentiment Classification with Natural Language Prompts. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 6311--6322.
[33]
Umme Aymun Siddiqua, Abu Nowshed Chy, and Masaki Aono. 2019. Tweet stance detection using an attention based neural ensemble model. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 1868--1873.
[34]
Swapna Somasundaran and Janyce Wiebe. 2009. Recognizing stances in online debates. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP. 226--234
[35]
Dhanya Sridhar, Lise Getoor, and Marilyn Walker. 2014. Collective stance classification of posts in online debate forums. In Proceedings of the Joint Workshop on Social Dynamics and Personal Attributes in Social Media. 109--117.
[36]
Christian Stab, Tristan Miller, Benjamin Schiller, Pranav Rai, and Iryna Gurevych. 2018. Cross-topic Argument Mining from Heterogeneous Sources. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 3664--3674.
[37]
Qingying Sun, Zhongqing Wang, Shoushan Li, Qiaoming Zhu, and Guodong Zhou. 2019. Stance detection via sentiment information and neural network model. Frontiers of Computer Science 13, 1 (2019), 127--138.
[38]
Yuqing Sun and Yang Li. 2021. Stance Detection with Knowledge Enhanced BERT. In CAAI International Conference on Artificial Intelligence. Springer, 239--250.
[39]
Matt Thomas, Bo Pang, and Lillian Lee. 2006. Get out the vote: Determining support or opposition from Congressional floor-debate transcripts. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing. 327--335.
[40]
Martin Tutek, Ivan Sekuli, Paula Gombar, Ivan Paljak, Filip ulinovi, Filip Boltui, Mladen Karan, Domagoj Alagi, and Jan najder. 2016. Takelab at semeval-2016 task 6: Stance classification in tweets using a genetic algorithm based ensemble. In Proceedings of the 10th international workshop on semantic evaluation (SemEval-2016). 464--468.
[41]
Limin Wang and Dexin Wang. 2021. Solving Stance Detection on Tweets as Multi-Domain and Multi-Task Text Classification. IEEE Access 9 (2021), 157780-- 157789.
[42]
Chang Xu, Cécile Paris, Surya Nepal, Ross Sparks, Chong Long, and Yafang Wang. 2020. DAN: Dual-View Representation Learning for Adapting Stance Classifiers to New Domains. arXiv preprint arXiv:2003.06514 (2020).
[43]
Wenpeng Yin, Jamaal Hay, and Dan Roth. 2019. Benchmarking Zero-shot Text Classification: Datasets. Evaluation, and Entailment Approach. In emnlp (2019).
[44]
Bowen Zhang, Min Yang, Xutao Li, Yunming Ye, Xiaofei Xu, and Kuai Dai. 2020. Enhancing cross-target stance detection with transferable semantic-emotion knowledge. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 3188--3197.
[45]
Xin Zhang, Jianhua Yuan, Yanyan Zhao, and Bing Qin. 2021. Knowledge Enhanced Target-Aware Stance Detection on Tweets. In China Conference on Knowledge Graph and Semantic Computing. Springer, 171--184.
[46]
Arkaitz Zubiaga, Ahmet Aker, Kalina Bontcheva, Maria Liakata, and Rob Procter. 2018. Detection and resolution of rumours in social media: A survey. ACM Computing Surveys (CSUR) 51, 2 (2018), 1--36.

Cited By

View all
  • (2025)Large Language Model Enhanced Logic Tensor Network for Stance DetectionNeural Networks10.1016/j.neunet.2024.106956183(106956)Online publication date: Mar-2025
  • (2024)Distantly Supervised Explainable Stance Detection via Chain-of-Thought SupervisionMathematics10.3390/math1207111912:7(1119)Online publication date: 8-Apr-2024
  • (2024)Leveraging Chain-of-Thought to Enhance Stance Detection with Prompt-TuningMathematics10.3390/math1204056812:4(568)Online publication date: 13-Feb-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2022
3569 pages
ISBN:9781450387323
DOI:10.1145/3477495
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 July 2022

Check for updates

Author Tags

  1. few-shot learning
  2. prompt-based fine-tuning
  3. stance detection

Qualifiers

  • Research-article

Funding Sources

Conference

SIGIR '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)429
  • Downloads (Last 6 weeks)40
Reflects downloads up to 23 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2025)Large Language Model Enhanced Logic Tensor Network for Stance DetectionNeural Networks10.1016/j.neunet.2024.106956183(106956)Online publication date: Mar-2025
  • (2024)Distantly Supervised Explainable Stance Detection via Chain-of-Thought SupervisionMathematics10.3390/math1207111912:7(1119)Online publication date: 8-Apr-2024
  • (2024)Leveraging Chain-of-Thought to Enhance Stance Detection with Prompt-TuningMathematics10.3390/math1204056812:4(568)Online publication date: 13-Feb-2024
  • (2024)Topic-Specific Political Stance Inference in Social Networks With Case StudiesIEEE Access10.1109/ACCESS.2024.336048712(21921-21935)Online publication date: 2024
  • (2024)Commonsense-based adversarial learning framework for zero-shot stance detectionNeurocomputing10.1016/j.neucom.2023.126943563:COnline publication date: 1-Jan-2024
  • (2024)Joint contrastive learning for prompt-based few-shot language learnersNeural Computing and Applications10.1007/s00521-024-09502-736:14(7861-7875)Online publication date: 21-Feb-2024
  • (2023)Knowledge-enhanced Prompt-tuning for Stance DetectionACM Transactions on Asian and Low-Resource Language Information Processing10.1145/358876722:6(1-20)Online publication date: 16-Jun-2023
  • (2023)Topic-Aware Contrastive Learning and K-Nearest Neighbor Mechanism for Stance DetectionProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3615085(2362-2371)Online publication date: 21-Oct-2023
  • (2023)Adversarial Topic-Aware Memory Network for Cross-Lingual Stance Detection2023 IEEE International Conference on Intelligence and Security Informatics (ISI)10.1109/ISI58743.2023.10297287(1-6)Online publication date: 2-Oct-2023
  • (2023)What are Pros and Cons? Stance Detection and Summarization on Feature Request2023 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM)10.1109/ESEM56168.2023.10304865(1-12)Online publication date: 26-Oct-2023
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media