Abstract
The design and organization of a website reflects the authors intent. Since user perception and understanding of websites may differ from the authors, we propose a means to identify and quantify this difference in perception. In our approach we extract perceived semantic focus by analyzing user behavior in conjunction with keyword similarity.
By combining usage and content data we identify user groups with regard to the subject of the pages they visited. Our real world data shows that these user groups are nicely distinguishable by their content focus. By introducing a distance measure of keyword coincidence between web pages and user groups, we can identify pages of similar perceived interest. A discrepancy between perceived distance and link distance in the web graph indicates an inconsistency in the web site’s design. Determining usage similarity allows the web site author to optimize the content to the users needs.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Berendt, B., Hotho, A., Stumme, G.: Towards Semantic Web Mining. In: Horrocks, I., Hendler, J. (eds.) ISWC 2002. LNCS, vol. 2342, p. 264. Springer, Heidelberg (2002)
Chakrabarti, S.: Mining the Web - Discovering Knowledge from Hypertext Data. Morgan Kaufmann, San Francisco (2002)
Calero, C., Ruiz, J., Piattini, M.: A Web Metrics Survey Using WQM Web Engineering. In: Koch, N., Fraternali, P., Wirsing, M. (eds.) ICWE 2004. LNCS, vol. 3140, pp. 147–160. Springer, Heidelberg (2004)
Cooley, R.: The Use of Web Structure and Content to Identify Subjectively Interesting Web Usage Patterns. ACM Transaction on Internet Technology 3(2), 93–116 (2003)
Dai, H., Mobasher, B.: Using ontologies to discover domain-level web usage profiles. In: Siebes, A., De Raedt, L. (eds.) PKDD 2001. LNCS (LNAI), vol. 2168. Springer, Heidelberg (2001)
Dhyani, D., Keong, N., Bhowmick, S.S.: A Survey ofWebMetrics. ACM Computing Surveys 34(4), 469–503 (2002)
He, X., Zha, H., Ding, C., Simon, H.: Web document clustering using hyperlink structures. Computational Statistics and Data Analysis 41, 19–45 (2002)
Mobasher, B., Dai, H., Luo, T., Sun, Y., Zhu, J.: Integrating Web Usage and Content Mining for More Effective Personalization. In: Proc. of the Int’l. Conf. on E-Commerce and Web Technologies (ECWeb 2000) (2000)
Oberle, D., Berendt, B., Hotho, A., Gonzalez, J.: Conceptual User Tracking. In: Proc. of the Atlantic Web Intelligence Conference, pp. 155–164 (2002)
Porter, M.F.: An algorithm for suffix stripping. Program 14, 130–137 (1980)
Song, R., Liu, H., Wen, J., Ma, W.: Learning important models for web page blocks based on layout and content analysis. SIGKDD Explor. Newsl. 6(2), 14–23 (2004)
Stolz, C., Gedov, V., Yu, K., Neuneier, R., Skubacz, M.: Measuring Semantic Relations of Web Sites by Clustering of Local Context. In: Koch, N., Fraternali, P., Wirsing, M. (eds.) ICWE 2004. LNCS, vol. 3140, pp. 182–186. Springer, Heidelberg (2004)
Sun, A., Lim, E.-P.: Web Unit Mining: Finding and Classifying Subgraphs ofWeb Pages. In: Proceedings 12th Int. Conf. on Information and Knowledge Management, pp. 108–115. ACM Press, New York (2003)
Vadnerdonckt, J., Beirekdar, A., Noirhomme-Fraiture, M.: Automated Evaluation of Web Usability and Accessibility by Guideline Review. In: Koch, N., Fraternali, P., Wirsing, M. (eds.) ICWE 2004. LNCS, vol. 3140, pp. 17–30. Springer, Heidelberg (2004)
Zhu, J., Hong, J., Hughes, J.G.: PageCluster: Mining Conceptual Link Hierarchies from Web Log Files for Adaptive Web Site Navigation. ACM Journal Transaction on Internet Technology 4(2), 185–208 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Stolz, C., Viermetz, M., Skubacz, M., Neuneier, R. (2005). Improving Semantic Consistency of Web Sites by Quantifying User Intent. In: Lowe, D., Gaedke, M. (eds) Web Engineering. ICWE 2005. Lecture Notes in Computer Science, vol 3579. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11531371_42
Download citation
DOI: https://doi.org/10.1007/11531371_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27996-9
Online ISBN: 978-3-540-31484-4
eBook Packages: Computer ScienceComputer Science (R0)