Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Two-level Clustering of Web Sites Using Self-Organizing Maps

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

Web sites contain an ever increasing amount of information within their pages. As the amount of information increases so does the complexity of the structure of the web site. Consequently it has become difficult for visitors to find the information relevant to their needs. To overcome this problem various clustering methods have been proposed to cluster data in an effort to help visitors find the relevant information. These clustering methods have typically focused either on the content or the context of the web pages. In this paper we are proposing a method based on Kohonen’s self-organizing map (SOM) that utilizes both content and context mining clustering techniques to help visitors identify relevant information quicker. The input of the content mining is the set of web pages of the web site whereas the source of the context mining is the access-logs of the web site. SOM can be used to identify clusters of web sessions with similar context and also clusters of web pages with similar content. It can also provide means of visualizing the outcome of this processing. In this paper we show how this two-level clustering can help visitors identify the relevant information faster. This procedure has been tested to the access-logs and web pages of the Department of Informatics and Telecommunications of the University of Athens.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  • Andrade MA, Chacón P and Merelo-Guervós J (1993). Evaluation of secondary structure of proteins from UV circular dichroism spectra. Protein Eng 6(4): 383–390

    Article  Google Scholar 

  • Chekuri C, Goldwasser M, Raghavan P, Upfal E (1996) Web search using automatic classification. In: Sixth World Wide Web conference, San Jose, CA

  • Kaski S (1997). Computationally efficient approximation of a probabilistic model for document representation in the websom full-text analysis method. Neural Process Lett 5(2): 69–81

    Article  Google Scholar 

  • Kohonen T (2001). Self-organizing maps, 3rd edn. Springer-Verlag, Berlin

    MATH  Google Scholar 

  • Lagus K, Kaski S and Kohonen T (2004). Mining massive document collections by the WEBSOM method. Information Sci 163(1–3): 135–156

    Article  Google Scholar 

  • Merelo JJ et al (2004) Clustering web-based communities using self-organizing maps. In: Proceedings of IADIS conference on web based communities. Lisbon, Portugal

  • Mobasher B, Cooley R, Srivastava J (1999) Creating Adaptive Web Sites through Usage-based Clustering of URLs. In: Proceedings of 1999 workshop on knowledge and data engineering exchange, USA, pp 19–25

  • Quesada J, Merelo-Guervós JJ and Oliveras MJ (2002). Application of artificial aging techniques to samples of rum and comparison with traditionally aged rums by analysis with artificial neural nets. J Agric Food chem 50(6): 1470–1477

    Article  Google Scholar 

  • Romero G, Arenas MG, Castillo PA, Merelo JJ (2003) Visualization of neural network evolution. In: Lecture notes in computer science, LNCS, Nos. 2686–2687, Springer-Verlag, pp 534–541

  • Sammon JW Jr (1969). A nonlinear mapping for data structure analysis. IEEE Trans Comput 18: 401–409

    Article  Google Scholar 

  • Ultsch A (1993). Self-organizing neural networks for visualization and classification. In: Opitz, O, Lausen, B and Klar, R (eds) Information and classification, pp 307–313. Springer, London, UK,

    Google Scholar 

  • Vesanto J, Himberg J, Alhoniemi E, Parhankangas J (1999) Self-Organizing map in Matlab: the SOM Toolbox. In: Proceedings of the Matlab DSP conference. Espoo, Finland, pp 35–40

  • Zhang J, Caragea D, Honavar V (2005) Learning ontology-aware classifiers. In: Proceedings of the eight international conference on discovery science (DS 2005). Springer-Verlag, Berling

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dimitris Petrilis.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Petrilis, D., Halatsis, C. Two-level Clustering of Web Sites Using Self-Organizing Maps. Neural Process Lett 27, 85–95 (2008). https://doi.org/10.1007/s11063-007-9061-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-007-9061-x

Keywords