Abstract
Managers of electronic commerce sites need to learn as much as possible about their customers and those browsing their virtual premises, in order to maximise the return on marketing expenditure. The discovery of marketing related navigation patterns requires the development of data mining algorithms capable of the discovery of sequential access patterns from web logs. This paper introduces a new algorithm called M i DAS that extends traditional sequence discovery with a wide range of web-specific features. Domain knowledge is described as flexible navigation templates that can specify generic navigational behaviour of interest, network structures for the capture of web site topologies, concept hierarchies and syntactic constraints. Unlike existing approaches MiDAS supports sequence discovery from multidimensional data, which allows the detection of sequences across monitored attributes, such as URLs and http referrers. Three methods for pruning the sequences, resulting in three different types of navigational behaviour are presented. The experimental evaluation has shown promising results in terms of functionality as well as scalability.
This research has partly been funded by the ESPRIT project No 26749 (MIMIC — Mining the Internet for Marketing IntelligenCe).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agrawal, R., Srikant, R.: Mining Sequential Patterns. Proc. Int’l Conf. on Data Engineering (1995) 3–14
Anand, S.S., Bell, D.A., Hughes, J.G.: The Role of Domain Knowledge in Data Mining. Proc. 4th Int’l ACM Conf. on Information and Knowledge Management (1995) 37–43
Anand, S.S., Büchner, A.G.: Decision Support using Data Mining. FT Pitman Publishers (1998)
Anand, S.S., Büchner, A.G., Mulvenna, M.D., Hughes, J.G.: Discovering Internet Marketing Intelligence through Web Log Mining. Unicom.99
Bhowmick, S.S., Madria, S.K., Ng, W.-K., Lim E.P.: Web Mining in WHOWEDA. Some Issues, Proc. PRICAI98 Workshop on Knowledge Discovery and Data Mining (1998)
Borges, J., Levene, M.: Data Mining of User Navigation Patterns. Proc. WEBKDD99 Workshop on Web Usage Analysis and User Profiling (1999) 31–36 (same volume)
Büchner, A.G., Mulvenna, M.D.: Discovering Internet Marketing Intelligence through Online Analytical Web Usage Mining. ACM SIGMOD Record, 27:4 (1998) 54–61
Chen, M.S., Park, J.S., Yu, P.S.: Data Mining for Path Traversal Patterns in a Web Environment. Proc. 16th Int’l Conf. on Distributed Computing Systems (1996) 385–392
Cooley, R., Mobasher, R., Srivastava, J.: Web Mining: Information and Pattern Discovery on the World Wide Web. Proc. 9th IEEE Int’l Conf. on Tools with Artificial Intelligence (1997)
Cooley, R., Mobasher, R., Srivastava, J.: Data Preparation for Mining World Wide Web Browsing Patterns. Knowledge and Information Systems 1:1 (1999)
Han, J., Fu, Y.: Dynamic Generation and Refinement of Concept Hierarchies for Knowledge Discovery in Databases. Proc. KDD’94 (1994) 157–168
Ling, C.X., Li, C.: Data Mining for Direct Marketing: Problems and Solutions. Proc. KDD’99 (1998) 73–79
Manilla, H., Toivonen, H., Inkeri, A.: Discovery of Frequent Episodes in Event Sequences. Proc. 2nd Int’l Conf. on Knowledge Discovery and Data Mining (1995) 210–215
Manilla, H., Toivonen, H.: Discovering generalized episodes using minimal occurrences. Proc. 2nd Int’l Conf. on Knowledge Discovery and Data Mining (1996) 146–151
Manilla, H., Toivonen, H., Inkeri, A.: Discovery of Frequent Episodes in Event Sequences. Data Mining and Knowledge Discovery, 1:3 (1997) 259–289
Mulvenna, M.D., Norwood, M.T., Büchner, A.G.: Data-driven Marketing. The Int’l Journal of Electronic Commerce and Business Media, 8:3 (1998) 32–35
Spiliopoulou, M.: The laborious way from data mining to web mining. Int’l Journal of Computing Systems, Science & Engineering, March(1999)
Spiliopoulou, M., Faulstich, L.C., Winkler, K.A.: A Data Miner analyzing the Navigational Behaviour of Web Users. Proc. ACAI’99 Workshop on Machine Learning in User Modelling (1999)
Srikant, R., Agrawal, R.: Mining Sequential Patterns: Generalizations and Performance Improvements. Proc. 5th Int’l Conf on Extending Database Technology (1996) 3–17
Žaíane, O.R, Xin, M., Han, J.: Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web Logs. Proc. Advances in Digital Libraries Conf. (1998) 19–29
Zaki, M.J.: Efficient Enumeration of Frequent Sequences. 7th Int’l ACM Conf. on Information and Knowledge Management (1998) 68–75
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Baumgarten, M., Büchner, A.G., Anand, S.S., Mulvenna, M.D., Hughes, J.G. (2000). User-Driven Navigation Pattern Discovery from Internet Data. In: Masand, B., Spiliopoulou, M. (eds) Web Usage Analysis and User Profiling. WebKDD 1999. Lecture Notes in Computer Science(), vol 1836. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44934-5_5
Download citation
DOI: https://doi.org/10.1007/3-540-44934-5_5
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67818-2
Online ISBN: 978-3-540-44934-8
eBook Packages: Springer Book Archive