DOI: 10.1145/3583780.3614984
research-article

MultiPLe: Multilingual Prompt Learning for Relieving Semantic Confusions in Few-shot Event Detection

Published: 21 October 2023

    Abstract

    Event detection (ED) is a challenging task in information extraction. Because they rely on monolingual text and face many confusing triggers, traditional ED models suffer from semantic confusions arising from polysemy and synonymy, which lead to severe detection mistakes. These confusions are further exacerbated in practical settings where scarce labeled data cannot provide sufficient semantic clues. To mitigate this bottleneck, we propose a multilingual prompt learning (MultiPLe) framework for few-shot event detection (FSED) with three components: a multilingual prompt module, a hierarchical prototype module, and a quadruplet contrastive learning module. To ease polysemy confusion, the multilingual prompt module develops the in-context semantics of triggers through multilingual disambiguation and the prior knowledge in pretrained language models. The hierarchical prototype module then diminishes synonym confusion by connecting the captured inmost semantics of fuzzy triggers with labels at a fine granularity. Finally, the quadruplet contrastive learning module tackles insufficient label representation and potential noise. Experiments on two public datasets show that MultiPLe outperforms state-of-the-art baselines in weighted F1-score, with a maximum improvement of 13.63% for FSED.
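
    The abstract reports results in weighted F1-score. As a minimal sketch of how that metric is commonly computed (the support-weighted average of per-class F1), and not the authors' evaluation code, the following may help. The function name `weighted_f1` and the event-type labels in the usage note are hypothetical and chosen only for illustration.

```python
from collections import Counter


def weighted_f1(y_true, y_pred):
    """Support-weighted average of per-class F1 scores.

    Illustrative sketch only: each class's F1 is weighted by the
    number of gold instances of that class (its support).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("y_true must not be empty")

    support = Counter(y_true)  # gold instances per class
    total = len(y_true)
    score = 0.0
    for label, n in support.items():
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = n - tp
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        score += (n / total) * f1  # weight by class support
    return score
```

    For example, `weighted_f1(["Attack", "Attack", "Attack", "Meet"], ["Attack", "Attack", "Meet", "Meet"])` combines a per-class F1 of 0.8 (support 3) with one of 2/3 (support 1). Classes that appear only in the predictions have zero support and so contribute nothing to the average.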


      Published In

      CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management
      October 2023
      5508 pages
      ISBN:9798400701245
      DOI:10.1145/3583780
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. few-shot event detection
      2. prompt learning
      3. semantic confusions

      Qualifiers

      • Research-article

      Funding Sources

      • Scientific Research Project of National University of Defense Technology

      Conference

      CIKM '23
      Acceptance Rates

      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Article Metrics

      • Total Citations: 0
      • Total Downloads: 115
      • Downloads (last 12 months): 115
      • Downloads (last 6 weeks): 3

      Reflects downloads up to 09 Aug 2024
