Improving Web Data Annotations with Spreading Activation

Gelgi, Fatih; Vadrevu, Srinivas; Davulcu, Hasan

doi:10.1007/11581062_8

Fatih Gelgi²¹,
Srinivas Vadrevu²¹ &
Hasan Davulcu²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3806))

Included in the following conference series:

International Conference on Web Information Systems Engineering

1220 Accesses

Abstract

The Web has established itself as the largest public data repository ever available. Even though the vast majority of information on the Web is formatted to be easily readable by the human eye, “meaningful information” is still largely inaccessible for the computer applications. In this paper, we present automated algorithms to gather meta-data and instance information by utilizing global regularities on the Web and incorporating the contextual information. Our system is distinguished since it does not require domain specific engineering. Experimental evaluations were successfully performed on the TAP knowledge base and the faculty-course home pages of computer science departments containing 16,861 Web pages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

PrettyTags: An Open-Source Tool for Easy and Customizable Textual MultiLevel Semantic Annotations

Active Learning and Crowdsourcing: A Survey of Optimization Methods for Data Labeling

Article 01 November 2018

ADnOTO: A Self-adaptive System for Automatic Ontology-Based Annotation of Unstructured Documents

References

Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison Wesley, Reading (1999)
Google Scholar
Davulcu, H., Vadrevu, S., Nagarajan, S., Ramakrishnan, I.V.: Ontominer: Bootstrapping and populating ontologies from domain specific web sites. IEEE Intelligent Systems 18(5) (September 2003)
Google Scholar
Vadrevu, S., Nagarajan, S., Gelgi, F., Davulcu, H.: Automated metadata and instance extraction from news web sites. In: The 2005 IEEE/WIC/ACM International Conference on Web Intelligence, Compiegne University of Technology, France (2005) (to appear)
Google Scholar
Ashish, N., Knoblock, C.A.: Semi-automatic wrapper generation for internet information sources. In: Conference on Cooperative Information Systems, pp. 160–169 (1997)
Google Scholar
Kushmerick, N., Weld, D.S., Doorenbos, R.B.: Wrapper induction for information extraction. In: Intl. Joint Conference on Artificial Intelligence, pp. 729–737 (1997)
Google Scholar
Crescenzi, V., Mecca, G., Merialdo, P.: Roadrunner: Towards automatic data extraction from large web sites. In: Proceedings of 27th International Conference on Very Large Data Bases, pp. 109–118 (2001)
Google Scholar
Arasu, A., Garcia-Molina, H.: Extracting structured data from web pages. In: ACM SIGMOD, San Diego, USA (2003)
Google Scholar
Etzioni, O., Cafarella, M., Downey, D., Kok, S., Popescu, A.-M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Web-scale information extraction in knowitall. In: Intl. World Wide Web Conf. (2004)
Google Scholar
Ciravegna, F., Chapman, S., Dingli, A., Wilks, Y.: Learning to harvest information for the semantic web. In: Bussler, C.J., Davies, J., Fensel, D., Studer, R. (eds.) ESWS 2004. LNCS, vol. 3053, pp. 312–326. Springer, Heidelberg (2004)
Chapter Google Scholar
Dill, S., Tomlin, J.A., Zien, J.Y., Eiron, N., Gibson, D., Gruhl, D., Guha, R., Jhingran, A., Kanungo, T., Rajagopalan, S., Tomkins, A.: Semtag and seeker: Bootstrapping the semantic web via automated semantic annotation. In: Twelth International Conference on World Wide Web, pp. 178–186 (2003)
Google Scholar
Collins, A.M., Loftus, E.F.: A spreading activation theory of semantic processing. Psychological Review (82), 407–428 (1975)
Google Scholar
Salton, G., Buckley, C.: On the use of spreading activation methods in automatic information. In: Proceedings of the 11th international ACM SIGIR conference on Research and development in information retrieval, pp. 147–160. ACM Press, New York (1988)
Chapter Google Scholar
Guha, R.V., McCool, R.: Tap: A semantic web toolkit. Semantic Web Journal (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Arizona State University, Tempe, AZ, 85287, USA
Fatih Gelgi, Srinivas Vadrevu & Hasan Davulcu

Authors

Fatih Gelgi
View author publications
You can also search for this author in PubMed Google Scholar
Srinivas Vadrevu
View author publications
You can also search for this author in PubMed Google Scholar
Hasan Davulcu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Texas State University, San Marcos, TX,
Anne H. H. Ngu
Institute of Industrial Science, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, 153-8505, Tokyo, Japan
Masaru Kitsuregawa
University of Vienna, Vienna, Austria
Erich J. Neuhold
IBM Research Division, Thomas J. Watson Research Center, P.O. Box 218, 10598, New York, Yorktown Heights, USA
Jen-Yao Chung
School of Computer Science and Engineering, University of New South Wales, NSW 2052, Sydney, Australia
Quan Z. Sheng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gelgi, F., Vadrevu, S., Davulcu, H. (2005). Improving Web Data Annotations with Spreading Activation. In: Ngu, A.H.H., Kitsuregawa, M., Neuhold, E.J., Chung, JY., Sheng, Q.Z. (eds) Web Information Systems Engineering – WISE 2005. WISE 2005. Lecture Notes in Computer Science, vol 3806. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581062_8

Download citation

DOI: https://doi.org/10.1007/11581062_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30017-5
Online ISBN: 978-3-540-32286-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Improving Web Data Annotations with Spreading Activation

Abstract

Access this chapter

Preview

Similar content being viewed by others

PrettyTags: An Open-Source Tool for Easy and Customizable Textual MultiLevel Semantic Annotations

Active Learning and Crowdsourcing: A Survey of Optimization Methods for Data Labeling

ADnOTO: A Self-adaptive System for Automatic Ontology-Based Annotation of Unstructured Documents

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Improving Web Data Annotations with Spreading Activation

Abstract

Access this chapter

Preview

Similar content being viewed by others

PrettyTags: An Open-Source Tool for Easy and Customizable Textual MultiLevel Semantic Annotations

Active Learning and Crowdsourcing: A Survey of Optimization Methods for Data Labeling

ADnOTO: A Self-adaptive System for Automatic Ontology-Based Annotation of Unstructured Documents

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation