Abstract
Semantic Web Mining aims at combining the two fast-developing research areas Semantic Web and Web Mining. The idea is to improve, on the one hand, the results of Web Mining by exploiting the new semantic structures in the Web; and to make use of Web Mining, on the other hand, for building up the Semantic Web. This paper gives an overview of where the two areas meet today, and sketches ways of how a closer integration could be profitable.
Chapter PDF
Similar content being viewed by others
Keywords
- Association Rule
- Association Rule Mining
- Inductive Logic Programming
- Formal Concept Analysis
- Usage Mining
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
B. Berendt. Using site semantics to analyze, visualize and support navigation. Data Mining and Knowledge Discovery, 6:37–59, 2002.
B. Berendt and M. Spiliopoulou. Analysing navigation behaviour in web sites integrating multiple information systems. The VLDB Journal, 9(1):56–75, 2000.
S. Chakrabarti. Data mining for hypertext: A tutorial survey. SIGKDD Explorations, 1:1–11, 2000.
S. Chakrabarti, B. Dom, D. Gibson, J. Kleinberg, P. Raghavan, and S. Rajagopalan. Automatic resource compilation by analyzing hyperlink structure and associated text. In Proceedings of the 7th World-wide web conference (WWW7), 30(1–7), pages 65–74, 1998.
S. Chakrabarti, M. van den Berg, and B. Dom. Focused crawling: A new approach to topic-specific web resource discovery. In Proceedings of the 8th World-wide web conference (WWW8), 31(11–16), pages 1623–1640, Toronto, May 1999.
Hans Chalupsky. Ontomorph: A translation system for symbolic knowledge. In Principles of Knowledge Representation and Reasoning: Proceedings of the Seventh International Conference (KR2000), pages 471–482, 2000.
G. Chang, M.J. Healey, J.A.M. McHugh, and J.T.L. Wang. Mining the World Wide Web. An Information Search Approach. Boston: Kluwer Academic Publishers, 2001.
E.H. Chi, P. Pirolli, and J. Pitkow. The scent of a site: a system for analyzing and predicting information scent, usage, and usability of a web site. In Proceedings of the ACM CHI 2000 Conference on Human Factors in Computing Systems, pages 161–168, Amsterdam: ACM Press., 2000.
Richard Cole and Gerd Stumme. Cem-a conceptual email manager. In Bernhard Ganter and Guy W. Mineau, editors, Proc. ICCS 2000, volume 1867 of LNAI, pages 438–452. Springer, 2000.
R. Cooley. Web Usage Mining: Discovery and Application of Interesting Patterns from Web Data. PhD thesis, University of Minnesota, Faculty of the Graduate School, 2000.
R. Cooley, B. Mobasher, and J. Srivastava. Data preparation for mining world wide web browsing patterns. Journal of Knowledge and Information Systems, 1(1):5–32, 1999.
M. Craven, D. DiPasquo, D. Freitag, A. McCallum, T. Mitchell, K. Nigam, and S. Slattery. Learning to construct knowledge bases from the world wide web. Artificial Intelligence, 118(1–2):69–113, 2000.
L. Dehaspe and H. Toivonen. Discovery of frequent datalog patterns. Data Mining and Knowledge Discovery, 3(1):7–36, 1999.
Saso Dzeroski and Nada Lavrac, editors. Relational Data Mining. Springer, 2001.
M. Fernández, D. Fiorescu, A. Levi, and D. Sucin. Declarative specification of web sites with strudel. The VLDB Journal, 9:38–55, 2000.
B. Ganter. Attribute exploration with background knowledge. TCS, 217(2):215–233, 1999.
B. Ganter and G. Stumme. Creation and merging of ontology top-levels. In Proc. ECAI02. submitted, 2002.
B. Ganter and R. Wille. Formal Concept Analysis: Mathematical Foundations. Springer, Berlin-Heidelberg, 1999.
D. Hand, H. Mannila, and P. Smyth. Principles of Data Mining. Cambridge, MA: MIT Press, 2001.
Siegfried Handschuh and Steffen Staab. Authoring and annotation of web pages in CREAM. In Proc. Of WWW11. to appear, 2002.
Jerry Hobbs, Douglas Appelt, John Bear, David Israel, Megumi Kameyama, Mark Stickel, and Mabry Tyson. Fastus: A cascaded finite-state transducer for extracting information from natural-language text. In E. Roche and Y. Schabes, editors, Finite State Devices for Natural Language Processing. MIT Press, Cambridge MA, 1996.
A. Hotho, A. Maedche, and S. Staab. Ontology-based text clustering. In Proceedings of the IJCAI-2001 Workshop “Text Learning: Beyond Supervision”, August, Seattle, USA, 2001.
A. Hotho, A. Maedche, S. Staab, and R. Studer. SEAL-II — the soft spot between richly structured and unstructured knowledge. Journal of Universal Computer Science (J.UCS), 7(7):566–590, 2001.
E.H. Hovy. Combining and standardizing large-scale, practical ontologies for machine translation and other uses. In Proc. 1st Intl. Conf. on Language Resources and Evaluation (LREC), Granada, 1998.
H. Kato, T. Nakayama, and Y. Yamane. Navigation analysis tool based on the correlation between contents distribution and access patterns. In Working Notes of the Workshop on Web Mining for E-Commerce-Challenges and Opportunities (WebKDD 2000) at the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 95–104, Boston, MA, 2000.
Michael Kifer, Georg Lausen, and James Wu. Logical foundations of object-oriented and frame-based languages. Journal of the ACM, 42:741–843, 1995.
Jon M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604–632, 1999.
W. Lin, S.A. Alvarez, and C. Ruiz. Efficient adaptive-support association rule mining for recommender systems. Data Mining and Knowledge Discovery, 6:83–105, 2002.
A. Maedche. Ontology Learning for the Semantic Web. Kluwer, 2002.
A. Maedche, M. Ehrig, S. Handschuh, L. Stojanovic, and R. Volz. Ontology-focused crawling of documents and relational metadata. In Proceedings of the Eleventh International World Wide Web Conference WWW-2002, Hawaii, 2002.
A. Maedche and S. Staab. Discovering conceptual relations from text. In ECAI-2000-European Conference on Artificial Intelligence. Proceedings of the 13th European Conference on Artificial Intelligence, pages 321–325. IOS Press, Amsterdam, 2000.
A. Maedche and S. Staab. Ontology learning for the semantic web. IEEE Intelligent Systems, 16(2):72–79, 2001.
D. McGuinness, R. Fikes, J. Rice, and S. Wilder. An environment for merging and testing large ontologies. In In the Proceedings of the Seventh International Conference on Principles of Knowledge Representation and Reasoning (KR2000), pages 483–493, Breckenridge, Colorado, USA, 2000.
B. Mobasher, R. Cooley, and J. Srivastava. Automatic personalization based on web usage mining. Communications of the ACM, 43(8):142–151, 2000.
B. Mobasher, H. Dai, T. Luo, Y. Sun, and J. Zhu. Integrating web usage and content mining for more effective personalization. In Proceedings of the International Conference on E-Commerce and Web Technologies (ECWeb2000), pages 165–176, Greenwich, UK, 2000.
N. Noy and M. Musen. Prompt: Algorithm and tool for automated ontology merging and alignment. In Proceedings of the Seventeenth National Conference on Artificial Intelligence (AAAI-2000), pages 450–455, Austin, Texas, 2000.
D. Oberle. Semantic Community Web Portals-Personalization, Studienarbeit. Universität Karlsruhe, 2000.
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. In Proceedings of the 7th International World Wide Web Conference, pages 161–172, Brisbane, Australia, 1998.
S. Parent, B. Mobasher, and S. Lytinen. An adaptive agent for web exploration based of concept hierarchies. In Proceedings of the 9th International Conference on Human Computer Interaction, New Orleans, LA, 2001.
Ramana Rao Peter Pirolli, James Pitkow. Silk from a sow’s ear: Extracting usable structures from the web. In Proc. ACM Conf. Human Factors in Computing Systems, CHI, pages 118–125, New York, NY, 1996. ACM Press.
Tobias Scheffer and Stefan Wrobel. A sequential sampling algorithm for a general class of utility criteria. In Knowledge Discovery and Data Mining, pages 330–334, 2000.
M. Spiliopoulou and C. Pohle. Data mining for measuring and improving the success of web sites. Data Mining and Knowledge Discovery, 5:85–14, 2001.
J. Srivastava, R. Cooley, M. Deshpande, and P.-N. Tan. Web usage mining: discovery and application of usage patterns from web data. SIGKDD Explorations, 1(2):12–23, 2000.
G. Stumme and A. Maedche. FCA-Merge: Bottom-Up Merging of Ontologies. In IJCAI-2001-Proceedings of the 17th International Joint Conference on Artificial Intelligence, Seattle, USA, August, 1–6, 2001, pages 225–234, San Francisco, 2001. Morgen Kaufmann.
G. Stumme, R. Taouil, Y. Bastide, N. Pasqier, and L. Lakhal. Computing iceberg concept lattices with titanic. J. on Knowledge and Data Engineering (in print), 2002.
Gerd Stumme. Using ontologies and formal concept analysis for organizing business knowledge. In Proc. Referenzmodellierung 2001 (in print), 2002.
A.B. Williams and C Tsatsoulis. An instance-based approach for identifying candidate ontology relations within a multi-agent system. In Proceedings of the First Workshop on Ontology Learning OL’2000, Berlin, Germany, 2000. Fourteenth European Conference on Artificial Intelligence.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Berendt, B., Hotho, A., Stumme, G. (2002). Towards Semantic Web Mining. In: Horrocks, I., Hendler, J. (eds) The Semantic Web — ISWC 2002. ISWC 2002. Lecture Notes in Computer Science, vol 2342. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48005-6_21
Download citation
DOI: https://doi.org/10.1007/3-540-48005-6_21
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43760-4
Online ISBN: 978-3-540-48005-1
eBook Packages: Springer Book Archive