Abstract
The process of document annotation for the Semantic Web is complex and time consuming, as it requires a great deal of manual annotation. Information extraction from texts (IE) is a technology used by some very recent systems for reducing the burden of annotation. The integration of IE systems in annotation tools is quite a new development and there is still the necessity of thinking the impact of the IE system on the whole annotation process. In this paper we initially discuss a number of requirements for the use of IE as support for annotation. Then we present and discuss a model of interaction that addresses such issues and Melita, an annotation framework that implements a methodology for active annotation for the Semantic Web based on IE. Finally we present an experiment that quantifies the gain in using IE as support to human annotators.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
Reference
F. Ciravegna, A. Lavelli, G. Satta: “Bringing information extraction out of the labs: the Pinocchio Environment”, in ECAI2000, Proc. of the 14th European Conference on Artificial Intelligence, ed., W. Horn, Amsterdam, 2000. IOS Press
P. Kogut and W. Holmes: “Applying Information Extraction to Generate DAML Annotations from Web Pages”, K-CAP 2001 Workshop Knowledge Markup & Semantic Annotation, Victoria B.C., Canada (2001).
M. E. Califf, D. Freitag, N. Kushmerick and I. Muslea (eds.): AAAI-99 Workshop on Machine Learning for Information Extraction, Orlando Florida (1999), http://www.isi.edu/~muslea/RISE/ML4IE/
R. Basili, F. Ciravegna, R. Gaizauskas (eds.) ECAI2000 Workshop on Machine Learning for IE, Berlin (2000), http://www.dcs.shef.ac.uk/~fabio/ecai-workshop.html
F. Ciravegna, N. Kushmerick, R. Mooney and I. Muslea (eds.), IJCAI-2001 Workshop on Adaptive Text Extraction and Mining held in conjunction with the 17th International Conference on Artificial Intelligence, Seattle, (2001), http://www.smi.ucd.ie/ATEM2001/
M. Vargas-Vera, Enrico Motta, J. Domingue, M. Lanzoni, A. Stutt and F. Ciravegna: “MnM: Ontology driven semi-automatic or automatic support for semantic markup”, Proc. of the 13th International Conference on Knowledge Engineering and Knowledge Management, EKAW02, Sigiienza, Spain (2002).
S. Handschuh, S. Staab and F. Ciravegna: “S-CREAM-Semi-automatic CREAtion of Metadata”, Proc. of the 13th International Conference on Knowledge Engineering and Knowledge Management, EKAW02, Sigiienza, Spain, (2002).
F. Ciravegna and D. Petrelli: “User Involvement in Adaptive Information Extraction: Position Paper” in Proceedings of the IJCAI-2001 Workshop on Adaptive Text Extraction and Mining held in conjunction with the 17th International Conference on Artificial Intelligence, Seattle (2001).
D. Maynard, V. Tablan, H. Cunningham, C. Ursu, H. Saggion, K. Bontcheva and Y. Wilks: “Architectural Elements of Language Engineering Robustness”, Journal of Natural Language Engineering, Special Issue on Robust Methods in Analysis of Natural Language Data, forthcoming in 2002.
F. Ciravegna: “Adaptive Information Extraction from Text by Rule Induction and Generalisation” in Proceedings of 17th International Joint Conference on Artificial Intelligence (2001).
F. Ciravegna: “(LP)2, an Adaptive Algorithm for Information Extraction from Web-related Texts” in Proceedings of the IJCAI-2001 Workshop on Adaptive Text Extraction and Mining held in conjunction with the 17th International Conference on Artificial Intelligence (IJCAI-01), Seattle, August, 2001
N. Kushmerick, D. Weld and R. Doorenbos: ‘Wrapper induction for information extraction’, Proc. of 15th International Conference on Artificial Intelligence, Japan (1997).
F. Ciravegna: “Challenges in Information Extraction from Text for Knowledge Management”, IEEE Intelligent Systems and Their Applications, 16–6, November, (2001).
M. E. Califf: ‘Relational Learning Techniques for Natural Language’ IE, PhD. thesis, Univ. Texas, Austin, (1998), http://www.cs.utexas.edu/users/mecaliff
D. Freitag and N. Kushmerick, ‘Boosted wrapper induction’, in R. Basili, F. Ciravegna, R. Gaizauskas (eds). ECAI2000 Workshop on Machine Learning for Information Extraction, Berlin, 2000, http://www.dcs.shef.ac.uk/~fabio/ecai-workshop.html.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ciravegna, F., Dingli, A., Petrelli, D., Wilks, Y. (2002). User-System Cooperation in Document Annotation Based on Information Extraction. In: Gómez-Pérez, A., Benjamins, V.R. (eds) Knowledge Engineering and Knowledge Management: Ontologies and the Semantic Web. EKAW 2002. Lecture Notes in Computer Science(), vol 2473. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45810-7_15
Download citation
DOI: https://doi.org/10.1007/3-540-45810-7_15
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44268-4
Online ISBN: 978-3-540-45810-4
eBook Packages: Springer Book Archive