Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Linking Semantic Fingerprints of Literature – from Simple Neural Embeddings Towards Contextualized Pharmaceutical Networks

  • Conference paper
  • First Online:
Digital Libraries for Open Knowledge (TPDL 2019)

Abstract

The exponential growth of publications in medical digital libraries requires new access paths that go beyond term-based searches, as these increasingly lead to thousands of results. An effective approach for this problem is to extract important pharmaceutical entities and their relations to each other in order to reveal the embedded knowledge in digital libraries. State-of-the-art approaches in the field of neural-language models (NLMs) enable progress in learning and predicting such relations in terms of semantic quality, scalability, and performance and already now make them valuable for important research tasks such as hypothesis generation. However, in the field of pharmacy a simple list of (predicted) associations is often challenging to interpret because, between typical pharmaceutical entities, such as active substances, diseases, and genes, complex associations will exist. A contextualized network of pharmaceutical entities can support the exploration of these associations and will help to assess and interpret predicted relationships. On the other hand, the prerequisite for building meaningful entity networks is an answer to the question: When is an NLM-learned entity relation meaningful? In this paper, we investigate this question for important pharmaceutical entity relations in the form of drug-disease associations (DDAs). To do so, we present a new methodology to determine entity-specific thresholds for the existence of associations. Such entity-specific thresholds open-up the possibility of automatically constructing (meaningful) embedded pharmaceutical networks, which can then be used to explore and to explain learned relationships between pharmaceutical entities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://deeplearning4j.org/.

  2. 2.

    https://www.ncbi.nlm.nih.gov/pubmed/.

  3. 3.

    https://www.drugbank.ca/.

  4. 4.

    https://lucene.apache.org/.

  5. 5.

    https://deeplearning4j.org/word2vec.

References

  1. Greene, D., Cunningham, P.: Producing a unified graph representation from multiple social network views. In Proceedings of the 5th Annual ACM Web Science Conference, pp. 118–121. ACM, May 2013

    Google Scholar 

  2. MEDLINE®/PubMed® Data Element (Field) Descriptions. (U.S. National Library of Medicine). https://www.nlm.nih.gov/bsd/mms/medlineelements.html#ab. Accessed 4 Apr 2019

  3. Zhang, W., et al.: Predicting drug-disease associations based on the known association bipartite network. In: 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 503–509. IEEE, November 2017

    Google Scholar 

  4. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

  5. Dudley, J.T., Deshpande, T., Butte, A.J.: Exploiting drug–disease relationships for computational drug repositioning. Briefings Bioinform. 12(4), 303–311 (2011)

    Article  Google Scholar 

  6. Wawrzinek, J., Balke, W.-T.: Measuring the semantic world – how to map meaning to high-dimensional entity clusters in PubMed? In: Dobreva, M., Hinze, A., Žumer, M. (eds.) ICADL 2018. LNCS, vol. 11279, pp. 15–27. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-04257-8_2

    Chapter  Google Scholar 

  7. Hill, F., Reichart, R., Korhonen, A.: SimLex-999: Evaluating semantic models with (genuine) similarity estimation. Comput. Linguist. 41(4), 665–695 (2015)

    Article  MathSciNet  Google Scholar 

  8. Elekes, Á., Schäler, M., Böhm, K.: On the various semantics of similarity in word embedding models. In 2017 ACM/IEEE Joint Conference on Digital Libraries (JCDL), pp. 1–10. IEEE, June 2017

    Google Scholar 

  9. Al-Natsheh, Hussein T., Martinet, L., Muhlenbach, F., Rico, F., Zighed, D.A.: Metadata enrichment of multi-disciplinary digital library: a semantic-based approach. In: Méndez, E., Crestani, F., Ribeiro, C., David, G., Lopes, J.C. (eds.) TPDL 2018. LNCS, vol. 11057, pp. 32–43. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00066-0_3

    Chapter  Google Scholar 

  10. Wawrzinek, J., Pinto, J.M., Balke, W.T.: Linking semantic fingerprints of literature – from simple neural embeddings towards contextualized pharmaceutical networks (supplement) (2019). http://www.ifis.cs.tu-bs.de/sites/default/files/wawrzinek-pinto-balke-Technical-Report.pdf. Accessed 18 June 2019

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Janus Wawrzinek .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wawrzinek, J., González Pinto, J.M., Balke, WT. (2019). Linking Semantic Fingerprints of Literature – from Simple Neural Embeddings Towards Contextualized Pharmaceutical Networks. In: Doucet, A., Isaac, A., Golub, K., Aalberg, T., Jatowt, A. (eds) Digital Libraries for Open Knowledge. TPDL 2019. Lecture Notes in Computer Science(), vol 11799. Springer, Cham. https://doi.org/10.1007/978-3-030-30760-8_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-30760-8_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-30759-2

  • Online ISBN: 978-3-030-30760-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics