Abstract
Named entity recognition (NER) seeks to locate and classify named entities into predefined categories (persons, organizations, brandnames, sports teams, etc.). NER is often considered as one of the main modules designed to structure a text. We describe our system which is characterized by (1) the use of limited resources, and (2) the embedding of results from other modules such as coreference resolution and relation extraction. The system is based on the output of a dependency parser that adopts an iterative execution flow that embeds results from other modules. At each iteration, candidate categories are generated and are all considered in subsequent iterations. The main advantage of such a system is to select the best candidate only at the end of the process, taking into account all the elements provided by the different modules. Another advantage is that the system does not need a large amount of resources. The system is compared to state-of-the-art academic and industrial systems and obtains the best results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
cf. Lopez et al. (2016).
- 4.
This class already exists in NERD.
- 5.
From Lopez et al. (2017).
- 6.
Thanks to the “Person, to phone, Person” triple.
- 7.
These scores are: Acronym: 0.8; Completion: 0.4; Coordination: 0.6; Coreference: 0.3; Descriptor: 0.2; Left and right context: 1.0; Relation: 0.6; Relation: 0.6; Comparison: 0.9; Local expression: 1.0; Projection by memory: 0.2; Meta-rules: 1.0.
- 8.
To compare, DBpedia contains hundreds of thousands of people, places and organizations.
- 9.
- 10.
Many equivalent classes have been added manually to cover the ontologies used by DBpedia.
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.
The goal of SRL is to determine “who does what to whom”, “when”, “where” etc.
References
Al-Rfou, R., Kulkarni, V., Perozzi, B., & Skiena, S. (2015). Polyglot-ner: Massive multilingual named entity recognition. In Proceedings of the 2015 SIAM International Conference on Data Mining (pp. 586–594). SIAM.
Azpeitia, A., Cuadros, M., Gaines, S., & Rigau, G. (2014). Nerc-fr: Supervised named entity recognition for French. In International Conference on Text, Speech, and Dialogue (pp. 158–165). Springer.
Baldwin, T., de Marneffe, M.-C., Han, B., Kim, Y.-B., Ritter, A., & Xu, W. (2015). Shared tasks of the 2015 workshop on noisy user-generated text: Twitter lexical normalization and named entity recognition. In Proceedings of the Workshop on Noisy User-generated Text (pp. 126–135).
Consortium, L. D. et al. (2004). Ace (automatic content extraction) English annotation guidelines for entities, version 5.6. 1 2005.05. 23.
Dupont, Y. & Tellier, I. (2014). A named entity recognizer for French (un reconnaisseur d’entités nommées du français) [in french]. Proceedings of TALN 2014 (Volume 3: System Demonstrations) 3, 40–41.
Ezzat, M. (2014). Acquisition de relations entre entités nommées à partir de corpus. PhD thesis, Paris, INALCO.
Galliano, S., Gravier, G., & Chaubard, L. (2009). The ester 2 evaluation campaign for the rich transcription of French radio broadcasts. In Tenth Annual Conference of the International Speech Communication Association.
Gardner, M., Grus, J., Neumann, M., Tafjord, O., Dasigi, P., Liu, N., Peters, M., Schmitz, M., & Zettlemoyer, L. (2018). Allennlp: A deep semantic natural language processing platform. http://arxiv.org/abs/1803.07640.
Gildea, D., & Jurafsky, D. (2002). Automatic labeling of semantic roles. Computational Linguistics, 28(3), 245–288.
Kong, L., Alberti, C., Andor, D., Bogatyy, I., & Weiss, D. (2017). DRAGNN: A transition-based framework for dynamically connected neural networks. arXiv:abs/1703.04474.
Lafourcade, M., & Joubert, A. (2008). Jeuxdemots: un prototype ludique pour l’émergence de relations entre termes. In JADT’08: Journées internationales d’Analyse statistiques des Données Textuelles (pp. 657–666).
Lecuit, É., Maurel, D., & Vitas, D. (2011). Les noms propres se traduisent-ils?? étude d’un corpus multilingue. Corpus, 10, 201–218.
Lopez, C., Nooralahzadeh, F., Cabrio, E., Segond, F., & Gandon, F. (2016). Provoc: une ontologie pour décrire des produits sur le web. In IC2016: 27es Journées francophones d’Ingénierie des Connaissances.
Lopez, C., Partalas, I., Balikas, G., Derbas, N., Martin, A., Reutenauer, C., Segond, F., & Amini, M.-R. (2017). Cap 2017 challenge: Twitter named entity recognition. http://arxiv.org/abs/1707.07568.
Lopez, C., Segond, F., Hondermarck, O., Curtoni, P., & Dini, L. (2014). Generating a resource for products and brandnames recognition. application to the cosmetic domain. In LREC (pp. 2559–2564).
McDonald, D. D. (1996). Internal and external evidence in the identification and semantic categorization of proper names, corpus processing for lexical acquisition.
Mikheev, A., Moens, M., & Grover, C. (1999). Named entity recognition without gazetteers. In Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics (pp. 1–8). Association for Computational Linguistics.
Nouvel, D., Antoine, J.-Y., Friburger, N., & Soulet, A. (2011). Recognizing named entities using automatically extracted transduction rules. In Language & Technology Conference (LTC’11).
Nouvel, D., Antoine, J.-Y., Friburger, N., & Soulet, A. (2012). Coupling knowledge-based and data-driven systems for named entity recognition. In Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data (pp. 69–77). Association for Computational Linguistics.
Nouvel, D., Ehrmann, M., & Rosset, S. (2016). Named Entities for Computational Linguistics. Hoboken: Wiley.
Rizzo, G., & Troncy, R. (2012). Nerd: A framework for unifying named entity recognition and disambiguation extraction tools. In Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics (pp. 73–76). Association for Computational Linguistics.
Sagot, B., & Stern, R. (2012). Aleda, a free large-scale entity database for French. In LREC 2012: Eighth International Conference on Language Resources and Evaluation (p. 4).
Sekine, S., & Nobata, C. (2004). Definition, dictionaries and tagger for extended named entity hierarchy. In LREC (pp. 1977–1980). Lisbon, Portugal.
Strötgen, J., & Gertz, M. (2010). Heideltime: High quality rule-based extraction and normalization of temporal expressions. In Proceedings of the 5th International Workshop on Semantic Evaluation (pp. 321–324). Association for Computational Linguistics.
Tablan, V., Roberts, I., Cunningham, H., & Bontcheva, K. (2013). Gatecloud.net: A platform for large-scale, open-source text processing on the cloud. Philosophical Transactions of the Royal Society A, 371(1983), 20120071.
Wakao, T., Gaizauskas, R., & Wilks, Y. (1996). Evaluation of an algorithm for the recognition and classification of proper names. In Proceedings of the 16th conference on Computational linguistics-Volume 1 (pp. 418–423). Association for Computational Linguistics.
Zirikly, A., & Diab, M. (2015). Named entity recognition for Arabic social media. In Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing (pp. 176–185).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Lopez, C. et al. (2022). Recursive Named Entity Recognition. In: Jaziri, R., Martin, A., Rousset, MC., Boudjeloud-Assala, L., Guillet, F. (eds) Advances in Knowledge Discovery and Management. Studies in Computational Intelligence, vol 1004. Springer, Cham. https://doi.org/10.1007/978-3-030-90287-2_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-90287-2_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-90286-5
Online ISBN: 978-3-030-90287-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)