Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Recursive Named Entity Recognition

  • Chapter
  • First Online:
Advances in Knowledge Discovery and Management

Abstract

Named entity recognition (NER) seeks to locate and classify named entities into predefined categories (persons, organizations, brandnames, sports teams, etc.). NER is often considered as one of the main modules designed to structure a text. We describe our system which is characterized by (1) the use of limited resources, and (2) the embedding of results from other modules such as coreference resolution and relation extraction. The system is based on the output of a dependency parser that adopts an iterative execution flow that embeds results from other modules. At each iteration, candidate categories are generated and are all considered in subsequent iterations. The main advantage of such a system is to select the best candidate only at the end of the process, taking into account all the elements provided by the different modules. Another advantage is that the system does not need a large amount of resources. The system is compared to state-of-the-art academic and industrial systems and obtains the best results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    All in all, we tested sixteen systems including: Google, Alchemy (IBM), Gate (Tablan et al. 2013), SEM (Dupont and Tellier 2014), NERC-fr (Azpeitia et al. 2014), Polyglot-Ner (Al-Rfou et al. 2015), AllenNLP (Gardner et al. 2018), and Spacy: https://spacy.io.

  2. 2.

    http://nerd.eurecom.fr/ontology/.

  3. 3.

    cf. Lopez et al. (2016).

  4. 4.

    This class already exists in NERD.

  5. 5.

    From Lopez et al. (2017).

  6. 6.

    Thanks to the “Person, to phone, Person” triple.

  7. 7.

    These scores are: Acronym: 0.8; Completion: 0.4; Coordination: 0.6; Coreference: 0.3; Descriptor: 0.2; Left and right context: 1.0; Relation: 0.6; Relation: 0.6; Comparison: 0.9; Local expression: 1.0; Projection by memory: 0.2; Meta-rules: 1.0.

  8. 8.

    To compare, DBpedia contains hundreds of thousands of people, places and organizations.

  9. 9.

    http://www.jeuxdemots.org.

  10. 10.

    Many equivalent classes have been added manually to cover the ontologies used by DBpedia.

  11. 11.

    http://tln.lifat.univ-tours.fr/Tln_Corpus80jours.html.

  12. 12.

    https://www.emvista.com.

  13. 13.

    https://cloud.google.com/natural-language/.

  14. 14.

    https://www.ibm.com/watson/developercloud/alchemy-language.html.

  15. 15.

    https://spacy.io.

  16. 16.

    The goal of SRL is to determine “who does what to whom”, “when”, “where” etc.

References

  • Al-Rfou, R., Kulkarni, V., Perozzi, B., & Skiena, S. (2015). Polyglot-ner: Massive multilingual named entity recognition. In Proceedings of the 2015 SIAM International Conference on Data Mining (pp. 586–594). SIAM.

    Google Scholar 

  • Azpeitia, A., Cuadros, M., Gaines, S., & Rigau, G. (2014). Nerc-fr: Supervised named entity recognition for French. In International Conference on Text, Speech, and Dialogue (pp. 158–165). Springer.

    Google Scholar 

  • Baldwin, T., de Marneffe, M.-C., Han, B., Kim, Y.-B., Ritter, A., & Xu, W. (2015). Shared tasks of the 2015 workshop on noisy user-generated text: Twitter lexical normalization and named entity recognition. In Proceedings of the Workshop on Noisy User-generated Text (pp. 126–135).

    Google Scholar 

  • Consortium, L. D. et al. (2004). Ace (automatic content extraction) English annotation guidelines for entities, version 5.6. 1 2005.05. 23.

    Google Scholar 

  • Dupont, Y. & Tellier, I. (2014). A named entity recognizer for French (un reconnaisseur d’entités nommées du français) [in french]. Proceedings of TALN 2014 (Volume 3: System Demonstrations) 3, 40–41.

    Google Scholar 

  • Ezzat, M. (2014). Acquisition de relations entre entités nommées à partir de corpus. PhD thesis, Paris, INALCO.

    Google Scholar 

  • Galliano, S., Gravier, G., & Chaubard, L. (2009). The ester 2 evaluation campaign for the rich transcription of French radio broadcasts. In Tenth Annual Conference of the International Speech Communication Association.

    Google Scholar 

  • Gardner, M., Grus, J., Neumann, M., Tafjord, O., Dasigi, P., Liu, N., Peters, M., Schmitz, M., & Zettlemoyer, L. (2018). Allennlp: A deep semantic natural language processing platform. http://arxiv.org/abs/1803.07640.

  • Gildea, D., & Jurafsky, D. (2002). Automatic labeling of semantic roles. Computational Linguistics, 28(3), 245–288.

    Article  Google Scholar 

  • Kong, L., Alberti, C., Andor, D., Bogatyy, I., & Weiss, D. (2017). DRAGNN: A transition-based framework for dynamically connected neural networks. arXiv:abs/1703.04474.

  • Lafourcade, M., & Joubert, A. (2008). Jeuxdemots: un prototype ludique pour l’émergence de relations entre termes. In JADT’08: Journées internationales d’Analyse statistiques des Données Textuelles (pp. 657–666).

    Google Scholar 

  • Lecuit, É., Maurel, D., & Vitas, D. (2011). Les noms propres se traduisent-ils?? étude d’un corpus multilingue. Corpus, 10, 201–218.

    Article  Google Scholar 

  • Lopez, C., Nooralahzadeh, F., Cabrio, E., Segond, F., & Gandon, F. (2016). Provoc: une ontologie pour décrire des produits sur le web. In IC2016: 27es Journées francophones d’Ingénierie des Connaissances.

    Google Scholar 

  • Lopez, C., Partalas, I., Balikas, G., Derbas, N., Martin, A., Reutenauer, C., Segond, F., & Amini, M.-R. (2017). Cap 2017 challenge: Twitter named entity recognition. http://arxiv.org/abs/1707.07568.

  • Lopez, C., Segond, F., Hondermarck, O., Curtoni, P., & Dini, L. (2014). Generating a resource for products and brandnames recognition. application to the cosmetic domain. In LREC (pp. 2559–2564).

    Google Scholar 

  • McDonald, D. D. (1996). Internal and external evidence in the identification and semantic categorization of proper names, corpus processing for lexical acquisition.

    Google Scholar 

  • Mikheev, A., Moens, M., & Grover, C. (1999). Named entity recognition without gazetteers. In Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics (pp. 1–8). Association for Computational Linguistics.

    Google Scholar 

  • Nouvel, D., Antoine, J.-Y., Friburger, N., & Soulet, A. (2011). Recognizing named entities using automatically extracted transduction rules. In Language & Technology Conference (LTC’11).

    Google Scholar 

  • Nouvel, D., Antoine, J.-Y., Friburger, N., & Soulet, A. (2012). Coupling knowledge-based and data-driven systems for named entity recognition. In Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data (pp. 69–77). Association for Computational Linguistics.

    Google Scholar 

  • Nouvel, D., Ehrmann, M., & Rosset, S. (2016). Named Entities for Computational Linguistics. Hoboken: Wiley.

    Google Scholar 

  • Rizzo, G., & Troncy, R. (2012). Nerd: A framework for unifying named entity recognition and disambiguation extraction tools. In Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics (pp. 73–76). Association for Computational Linguistics.

    Google Scholar 

  • Sagot, B., & Stern, R. (2012). Aleda, a free large-scale entity database for French. In LREC 2012: Eighth International Conference on Language Resources and Evaluation (p. 4).

    Google Scholar 

  • Sekine, S., & Nobata, C. (2004). Definition, dictionaries and tagger for extended named entity hierarchy. In LREC (pp. 1977–1980). Lisbon, Portugal.

    Google Scholar 

  • Strötgen, J., & Gertz, M. (2010). Heideltime: High quality rule-based extraction and normalization of temporal expressions. In Proceedings of the 5th International Workshop on Semantic Evaluation (pp. 321–324). Association for Computational Linguistics.

    Google Scholar 

  • Tablan, V., Roberts, I., Cunningham, H., & Bontcheva, K. (2013). Gatecloud.net: A platform for large-scale, open-source text processing on the cloud. Philosophical Transactions of the Royal Society A, 371(1983), 20120071.

    Google Scholar 

  • Wakao, T., Gaizauskas, R., & Wilks, Y. (1996). Evaluation of an algorithm for the recognition and classification of proper names. In Proceedings of the 16th conference on Computational linguistics-Volume 1 (pp. 418–423). Association for Computational Linguistics.

    Google Scholar 

  • Zirikly, A., & Diab, M. (2015). Named entity recognition for Arabic social media. In Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing (pp. 176–185).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cédric Lopez .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Lopez, C. et al. (2022). Recursive Named Entity Recognition. In: Jaziri, R., Martin, A., Rousset, MC., Boudjeloud-Assala, L., Guillet, F. (eds) Advances in Knowledge Discovery and Management. Studies in Computational Intelligence, vol 1004. Springer, Cham. https://doi.org/10.1007/978-3-030-90287-2_2

Download citation

Publish with us

Policies and ethics