Abstract
Adverse drug reaction (ADR) is a major health concern. Identifying text that mentions ADRs from a large volume of social media data discussing other topics is a key preliminary but nontrivial task for drug-ADR pair detection. This task suffers from severe imbalance issue. Moreover, prior studies have overlooked the simultaneous use of high-level abstract information contained in data and the domain-specific information embedded in knowledge bases. Therefore, we propose a novel multi-view active learning approach, in which a selection strategy is tailored to the imbalanced dataset and various document representations are regarded as multi views. We capture data-driven and domain-specific information by resorting to deep learning methods and handcrafted feature engineering, respectively. Experimental results demonstrate the effectiveness of our proposed approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
References
Sarker, A., et al.: Utilizing social media data for pharmacovigilance: a review. J. Biomed. Inform. 54, 202–212 (2015)
Zhang, Y., Cui, S., Gao, H.: Adverse drug reaction detection on social media with deep linguistic features. J. Biomed. Inf. 106, 103437 (2020)
Hazell, L., Shakir, S.A.: Under-reporting of adverse drug reactions. Drug Saf. 29(5), 385–396 (2006)
Amante, D.J., et al.: Access to care and use of the Internet to search for health information: results from the US national health interview survey. J. Med. Internet Res. 17(4), e106 (2015)
Wu, C., et al.: Detecting tweets mentioning drug name and adverse drug reaction with hierarchical tweet representation and multi-head self-attention. In: Empirical Methods in Natural Language Processing (2018)
Fan, B., et al.: Adverse drug event detection and extraction from open data: A deep learning approach. Inf. Process. Manage. 57(1), 102131 (2020)
Dai, H., Wang, C.: Classifying adverse drug reactions from imbalanced twitter data. Int. J. Med. Inf. 129, 122–132 (2019)
Kim, D., et al.: Multi-co-training for document classification using various document representations: TF–IDF, LDA, and Doc2Vec. Inf. Sci. 477, 15–29 (2019)
Yang, M., Kiang, M., Shang, W.: Filtering big data from social media–building an early warning system for adverse drug reactions. J. Biomed. Inf. 54, 230–240 (2015)
Sarker, A., Gonzalez, G.: Portable automatic text classification for adverse drug reaction detection via multi-corpus training. J. Biomed. Inf. 53, 196–207 (2015)
Liu, J., Wang, G., Chen, G.: Identifying adverse drug events from social media using an improved semi-supervised method. IEEE Intell. Syst. 34, 1 (2019)
Emadzadeh, E., et al.: Hybrid semantic analysis for mapping adverse drug reaction mentions in tweets to medical terminology. In: American Medical Informatics Association, Washington, D.C (2017)
Chowdhury, S., Zhang, C., Yu, P.S.: Multi-task pharmacovigilance mining from social media posts. In: Proceedings of the 2018 World Wide Web Conference (2018)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1974)
Li, Y., et al.: Imbalanced text sentiment classification using universal and domain-specific knowledge. Knowl. Based Syst. 160, 1–15 (2018)
Mikolov, T., et al.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, Harrahs and Harveys, Lake Tahoe (2013)
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: International Conference on Machine Learning, Beijing, China (2014)
Cao, N., et al.: A deceptive review detection framework: combination of coarse and fine-grained features. Expert Syst. Appl. 156, 113465 (2020)
Guo, C., Xie, L., Liu, G., Wang, X.: A text representation model based on convolutional neural network and variational auto encoder. In: Wang, G., Lin, X., Hendler, J., Song, W., Xu, Z., Liu, G. (eds.) WISA 2020. LNCS, vol. 12432, pp. 225–235. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60029-7_21
Li, Y., et al.: Incorporating knowledge into neural network for text representation. Expert Syst. Appl. 96, 103–114 (2018)
Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Eleventh Annual Conference on Computational Learning Theory. ACM Madison (1998)
Bengio, Y., et al.: Greedy layer-wise training of deep networks. In: Advances in Neural Information Processing Systems. Vancouver, B.C., Canada (2007)
Bengio, Y., et al.: A neural probabilistic language model. J. Mach. Learn. Res. 3, 1137–1155 (2003)
Muslea, I., Minton, S., Knoblock, C.A.: Active learning with multiple views. J. Artif. Intell. Res. 27, 203–233 (2006)
Acknowledgments
This work is partially supported by the National Natural Science Foundation of China (Nos. 71701142 and 71971067), and China Postdoctoral Science Foundation (No. 2018M640346).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Liu, J., Huang, L., Zhang, C. (2021). An Active Learning Approach for Identifying Adverse Drug Reaction-Related Text from Social Media Using Various Document Representations. In: Xing, C., Fu, X., Zhang, Y., Zhang, G., Borjigin, C. (eds) Web Information Systems and Applications. WISA 2021. Lecture Notes in Computer Science(), vol 12999. Springer, Cham. https://doi.org/10.1007/978-3-030-87571-8_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-87571-8_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87570-1
Online ISBN: 978-3-030-87571-8
eBook Packages: Computer ScienceComputer Science (R0)