Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

An Active Learning Approach for Identifying Adverse Drug Reaction-Related Text from Social Media Using Various Document Representations

  • Conference paper
  • First Online:
Web Information Systems and Applications (WISA 2021)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12999))

Included in the following conference series:

  • 2862 Accesses

Abstract

Adverse drug reaction (ADR) is a major health concern. Identifying text that mentions ADRs from a large volume of social media data discussing other topics is a key preliminary but nontrivial task for drug-ADR pair detection. This task suffers from severe imbalance issue. Moreover, prior studies have overlooked the simultaneous use of high-level abstract information contained in data and the domain-specific information embedded in knowledge bases. Therefore, we propose a novel multi-view active learning approach, in which a selection strategy is tailored to the imbalanced dataset and various document representations are regarded as multi views. We capture data-driven and domain-specific information by resorting to deep learning methods and handcrafted feature engineering, respectively. Experimental results demonstrate the effectiveness of our proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://www.pewresearch.org/internet/fact-sheet/social-media/.

  2. 2.

    https://www.nlm.nih.gov/research/umls/index.html.

  3. 3.

    https://mmtx.nlm.nih.gov/.

  4. 4.

    https://www.nlm.nih.gov/research/umls/sourcereleasedocs/current/CST/index.html.

  5. 5.

    http://sideeffects.embl.de/.

  6. 6.

    https://www.canada.ca/en/health-canada/services/drugs-health-products/medeffect-canada.html.

  7. 7.

    https://www.nlm.nih.gov/research/umls/sourcereleasedocs/current/CHV/index.html.

  8. 8.

    https://code.google.com/p/negex/.

  9. 9.

    https://wordnet.princeton.edu/.

  10. 10.

    http://mallet.cs.umass.edu/.

  11. 11.

    http://diego.asu.edu/Publications/ADRClassify.html.

  12. 12.

    https://radimrehurek.com/gensim/.

  13. 13.

    https://github.com/rajarsheem/libsdae-autoencoder-tensorflow.

  14. 14.

    http://diego.asu.edu/Publications/ADRClassify.html.

  15. 15.

    http://www.cs.waikato.ac.nz/ml/weka/.

References

  1. Sarker, A., et al.: Utilizing social media data for pharmacovigilance: a review. J. Biomed. Inform. 54, 202–212 (2015)

    Article  Google Scholar 

  2. Zhang, Y., Cui, S., Gao, H.: Adverse drug reaction detection on social media with deep linguistic features. J. Biomed. Inf. 106, 103437 (2020)

    Article  Google Scholar 

  3. Hazell, L., Shakir, S.A.: Under-reporting of adverse drug reactions. Drug Saf. 29(5), 385–396 (2006)

    Article  Google Scholar 

  4. Amante, D.J., et al.: Access to care and use of the Internet to search for health information: results from the US national health interview survey. J. Med. Internet Res. 17(4), e106 (2015)

    Article  Google Scholar 

  5. Wu, C., et al.: Detecting tweets mentioning drug name and adverse drug reaction with hierarchical tweet representation and multi-head self-attention. In: Empirical Methods in Natural Language Processing (2018)

    Google Scholar 

  6. Fan, B., et al.: Adverse drug event detection and extraction from open data: A deep learning approach. Inf. Process. Manage. 57(1), 102131 (2020)

    Article  Google Scholar 

  7. Dai, H., Wang, C.: Classifying adverse drug reactions from imbalanced twitter data. Int. J. Med. Inf. 129, 122–132 (2019)

    Article  Google Scholar 

  8. Kim, D., et al.: Multi-co-training for document classification using various document representations: TF–IDF, LDA, and Doc2Vec. Inf. Sci. 477, 15–29 (2019)

    Article  Google Scholar 

  9. Yang, M., Kiang, M., Shang, W.: Filtering big data from social media–building an early warning system for adverse drug reactions. J. Biomed. Inf. 54, 230–240 (2015)

    Article  Google Scholar 

  10. Sarker, A., Gonzalez, G.: Portable automatic text classification for adverse drug reaction detection via multi-corpus training. J. Biomed. Inf. 53, 196–207 (2015)

    Article  Google Scholar 

  11. Liu, J., Wang, G., Chen, G.: Identifying adverse drug events from social media using an improved semi-supervised method. IEEE Intell. Syst. 34, 1 (2019)

    Article  Google Scholar 

  12. Emadzadeh, E., et al.: Hybrid semantic analysis for mapping adverse drug reaction mentions in tweets to medical terminology. In: American Medical Informatics Association, Washington, D.C (2017)

    Google Scholar 

  13. Chowdhury, S., Zhang, C., Yu, P.S.: Multi-task pharmacovigilance mining from social media posts. In: Proceedings of the 2018 World Wide Web Conference (2018)

    Google Scholar 

  14. Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1974)

    Article  Google Scholar 

  15. Li, Y., et al.: Imbalanced text sentiment classification using universal and domain-specific knowledge. Knowl. Based Syst. 160, 1–15 (2018)

    Article  Google Scholar 

  16. Mikolov, T., et al.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, Harrahs and Harveys, Lake Tahoe (2013)

    Google Scholar 

  17. Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: International Conference on Machine Learning, Beijing, China (2014)

    Google Scholar 

  18. Cao, N., et al.: A deceptive review detection framework: combination of coarse and fine-grained features. Expert Syst. Appl. 156, 113465 (2020)

    Article  Google Scholar 

  19. Guo, C., Xie, L., Liu, G., Wang, X.: A text representation model based on convolutional neural network and variational auto encoder. In: Wang, G., Lin, X., Hendler, J., Song, W., Xu, Z., Liu, G. (eds.) WISA 2020. LNCS, vol. 12432, pp. 225–235. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60029-7_21

    Chapter  Google Scholar 

  20. Li, Y., et al.: Incorporating knowledge into neural network for text representation. Expert Syst. Appl. 96, 103–114 (2018)

    Article  Google Scholar 

  21. Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Eleventh Annual Conference on Computational Learning Theory. ACM Madison (1998)

    Google Scholar 

  22. Bengio, Y., et al.: Greedy layer-wise training of deep networks. In: Advances in Neural Information Processing Systems. Vancouver, B.C., Canada (2007)

    Google Scholar 

  23. Bengio, Y., et al.: A neural probabilistic language model. J. Mach. Learn. Res. 3, 1137–1155 (2003)

    MATH  Google Scholar 

  24. Muslea, I., Minton, S., Knoblock, C.A.: Active learning with multiple views. J. Artif. Intell. Res. 27, 203–233 (2006)

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgments

This work is partially supported by the National Natural Science Foundation of China (Nos. 71701142 and 71971067), and China Postdoctoral Science Foundation (No. 2018M640346).

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Liu, J., Huang, L., Zhang, C. (2021). An Active Learning Approach for Identifying Adverse Drug Reaction-Related Text from Social Media Using Various Document Representations. In: Xing, C., Fu, X., Zhang, Y., Zhang, G., Borjigin, C. (eds) Web Information Systems and Applications. WISA 2021. Lecture Notes in Computer Science(), vol 12999. Springer, Cham. https://doi.org/10.1007/978-3-030-87571-8_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-87571-8_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-87570-1

  • Online ISBN: 978-3-030-87571-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics