Abstract
Automatic monitoring of adverse drug events (ADEs) or reactions (ADRs) is currently receiving significant attention from the biomedical community. In recent years, user-generated data on social media has become a valuable resource for this task. Neural models have achieved impressive performance on automatic text classification for ADR detection. Yet, training and evaluation of these methods are carried out on user-generated texts about a targeted drug. In this paper, we assess the robustness of state-of-the-art neural architectures across different drug groups. We investigate several strategies to use pseudo-labeled data in addition to a manually annotated train set. Out-of-dataset experiments diagnose the bottleneck of supervised models in terms of breakdown performance, while additional pseudo-labeled data improves overall results regardless of the text selection strategy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
The terms ADEs and adverse drug reactions (ADRs) are often used interchangeably.
- 2.
References
Alimova, I., Tutubalina, E.: Automated detection of adverse drug reactions from social media posts with machine learning. In: van der Aalst, W.M.P., et al. (eds.) AIST 2017. LNCS, vol. 10716, pp. 3–15. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-73013-4_1
Alimova, I., Tutubalina, E.: Multiple features for clinical relation extraction: a machine learning approach. J. Biomed. Inform. 103, 103382 (2020)
Alimova, I., Tutubalina, E.: Entity-level classification of adverse drug reaction: a comparative analysis of neural network models. Program. Comput. Softw. 45(8), 439–447 (2019)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171–4186 (2019)
Gupta, S., Gupta, M., Varma, V., Pawar, S., Ramrakhiyani, N., Palshikar, G.K.: Co-training for extraction of adverse drug reaction mentions from tweets. In: Pasi, G., Piwowarski, B., Azzopardi, L., Hanbury, A. (eds.) ECIR 2018. LNCS, vol. 10772, pp. 556–562. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-76941-7_44
Gupta, S., Pawar, S., Ramrakhiyani, N., Palshikar, G.K., Varma, V.: Semi-supervised recurrent neural network for adverse drug reaction mention extraction. BMC Bioinform. 19(8), 212 (2018). https://doi.org/10.1186/s12859-018-2192-4
Karimi, S., Metke-Jimenez, A., Kemp, M., Wang, C.: CADEC: a corpus of adverse drug event annotations. J. Biomed. Inform. 55, 73–81 (2015)
Klein, A., et al.: overview of the fifth social media mining for health applications (# smm4h) shared tasks at COLING 2020. In: Proceedings of the Fifth Social Media Mining for Health Applications Workshop and Shared Task, pp. 27–36 (2020)
Lee, J., Yoon, W., Kim, S., Kim, D., So, C., Kang, J.: BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36(4), 1234–1240 (2020)
Lee, K., et al.: Adverse drug event detection in tweets with semi-supervised convolutional neural networks. In: Proceedings of the 26th International Conference on World Wide Web, pp. 705–714 (2017)
Li, Z., Yang, Z., Luo, L., Xiang, Y., Lin, H.: Exploiting adversarial transfer learning for adverse drug reaction detection from texts. J. Biomed. Inform. 106, 103431 (2020)
Ma, D., Li, S., Zhang, X., Wang, H.: Interactive attention networks for aspect-level sentiment classification. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 4068–4074 (2017)
Magge, A., et al.: Overview of the sixth social media mining for health applications (# smm4h) shared tasks at NAACL 2021. In: Proceedings of the Sixth Social Media Mining for Health (# SMM4H) Workshop and Shared Task, pp. 21–32 (2021)
Perez, A., Weegar, R., Casillas, A., Gojenola, K., Oronoz, M., Dalianis, H.: Semi-supervised medical entity recognition: a study on Spanish and Swedish clinical corpora. J. Biomed. Inform. 71, 16–30 (2017)
Rakhsha, M., Keyvanpour, M.R., Shojaedini, S.V.: Detecting adverse drug reactions from social media based on multichannel convolutional neural networks modified by support vector machine. In: 2021 7th International Conference on Web Research (ICWR), pp. 48–52. IEEE (2021)
Tutubalina, E., Nikolenko, S.: Exploring convolutional neural networks and topic models for user profiling from drug reviews. Multimed. Tools Appl. 77(4), 4791–4809 (2018)
Wu, L., et al.: Study of serious adverse drug reactions using FDA-approved drug labeling and MedDRA. BMC Bioinform. 20(2), 129–139 (2019)
Zolnoori, M., et al.: A systematic approach for developing a corpus of patient reported adverse drug events: a case study for SSRI and SNRI medications. J. Biomed. Inform. 90, 103091 (2019)
Acknowledgments
This work was supported by the Russian Science Foundation grant # 18-11-00284.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Alimova, I., Tutubalina, E. (2022). Selection of Pseudo-Annotated Data for Adverse Drug Reaction Classification Across Drug Groups. In: Burnaev, E., et al. Analysis of Images, Social Networks and Texts. AIST 2021. Lecture Notes in Computer Science, vol 13217. Springer, Cham. https://doi.org/10.1007/978-3-031-16500-9_4
Download citation
DOI: https://doi.org/10.1007/978-3-031-16500-9_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16499-6
Online ISBN: 978-3-031-16500-9
eBook Packages: Computer ScienceComputer Science (R0)