A Standard Lexical-Terminological Resource for the Bio Domain

Quochi, Valeria; Del Gratta, Riccardo; Sassolini, Eva; Bartolini, Roberto; Monachini, Monica; Calzolari, Nicoletta

doi:10.1007/978-3-642-04235-5_28


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5603)

Included in the following conference series:

Language and Technology Conference


Abstract

This paper describes a large-scale lexical resource for the biology domain, designed for both human and machine use. The lexicon aims at semantic interoperability and extensibility through the adoption of the ISO-LMF standard for lexical representation and through a granular, distributed encoding of the relevant information. The first part of the paper focuses on three aspects of the model that are of particular interest to the biology community: the treatment of term variants, the representation of bio-events, and the alignment with a domain ontology. The second part describes the physical implementation of the model: a relational database equipped with a set of automatic uploading procedures. A distinctive feature of the BioLexicon is that it combines properties of both terminologies and lexicons. A set of verbs relevant to the domain is also represented, with full details of their syntactic and semantic argument structure.



Author information

Authors and Affiliations

CNR - Istituto di Linguistica Computazionale, Area della Ricerca CNR, Via Moruzzi 1, 56100, Pisa, Italy
Valeria Quochi, Riccardo Del Gratta, Eva Sassolini, Roberto Bartolini, Monica Monachini & Nicoletta Calzolari


Editor information

Editors and Affiliations

Faculty of Mathematics and Computer Science, Adam Mickiewicz University in Poznań, ul. Umultowska 87, P.O. Box, 61614, Poznań, Poland
Zygmunt Vetulani
Language Technology Lab, German Research Center for Artificial Intelligence (DFKI), Campus D 3 1, Stuhlsatzenhausweg 3, D-66123, Saarbrücken, Germany
Hans Uszkoreit


Cite this paper

Quochi, V., Del Gratta, R., Sassolini, E., Bartolini, R., Monachini, M., Calzolari, N. (2009). A Standard Lexical-Terminological Resource for the Bio Domain. In: Vetulani, Z., Uszkoreit, H. (eds) Human Language Technology. Challenges of the Information Society. LTC 2007. Lecture Notes in Computer Science, vol 5603. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04235-5_28


DOI: https://doi.org/10.1007/978-3-642-04235-5_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04234-8
Online ISBN: 978-3-642-04235-5
eBook Packages: Computer Science, Computer Science (R0)
