Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/502512.502584acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Mining web logs for prediction models in WWW caching and prefetching

Published: 26 August 2001 Publication History

Abstract

Web caching and prefetching are well known strategies for improving the performance of Internet systems. When combined with web log mining, these strategies can decide to cache and prefetch web documents with higher accuracy. In this paper, we present an application of web log mining to obtain web-document access patterns and use these patterns to extend the well-known GDSF caching policies and prefetching policies. Using real web logs, we show that this application of data mining can achieve dramatic improvement to web-access performance.

References

[1]
M. Arlitt, R. Friedrich L. Cherkasova, J. DiUey, and T. Jin. Evaluating content management techniques for web proxy caches. In HP Technical report, Palo Alto, Apr. 1999.]]
[2]
R. Agrawal and R. Srikant. Minging Sequential Patterns. Proc. Of Int'l Conference on Data Engineering, Taipei, Taiwan, 1995]]
[3]
C. Aggarwal, J. L. Wolf, and P. S. Yu. Caching on the World Wide Web. In IEEE Transactions on Knowledge and Data Engineering, volume 11, pages 94-107, 1999.]]
[4]
P. Cao and S. Irani. Cost-aware www proxy caching algorithms. In USENIX Symposium on Internet Technologies and Systems, Monterey, CA, Dec. 1997.]]
[5]
Pitknw J. and Pirolli P. Mining longest repeating subsequences to predict www surfing. In Proceedings of the 1999 USENIX Annual Technical Conference, 1999.]]
[6]
T.M. Kroeger and D. D. E. Long. Predicting future filesystem actions from prior events. In USENIX 96, San Diego, Calif., Jan. 1996.]]
[7]
K. Chinen and S. Yamaguchi. An Interactive Prefetching Proxy Server for Improvement of WWW Latency. In Proceedings of the Seventh Annual Conference of the Internet Society (INEt'97), Kuala Lumpur, June 1997.]]
[8]
S. Schechter, M. Krishnan, and M.D. Smith. Using path profiles to predict http requests. In Proceedings of the Seventh International World Wide Web Conference Brisbane, Australia., 1998.]]
[9]
L. Cherkasova. Improving www proxies performance with greedy-dual-size-frequency caching policy. In HP Technical Report, Palo Alto, November 1998.]]
[10]
P. Cao, E. W. Felten, A. R. Karlin, and K. Li. A study of integrated prefetching and caching strategies. In Proceedings of the ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems, May 1995.]]
[11]
K. Chinen and S. Yamaguchi. An interactive prefetching proxy server for improvement of www latency. In Proceedings of the Seventh Annual Conference of the Internet Society (INET '97), Kuala Lumpur, Malaysia, June 1997.]]
[12]
D. Duchamp. Prefetching hyperlinks. In Proceedings of the Second USENIX Symposium on Internet Technologies and Systems (USITS '99), Boulder, CO, October 1999.]]
[13]
Z. Su, Q. Yang, Y. Lu, and H. Zhang. Whatnext: A prediction system for web requests using n-gram sequence models. In Proceedings of the First International Conference on Web Information System and Engineering Conference, pages 200-207, Hong Kong, June 2000.]]
[14]
V. Padmanabhan and J. Mogul. Using predictive prefetching to improve world of the Seventeenth International Conference on very Large Database, pages 255-264, September 1991.]]
[15]
E. Cohen, B. Krishnamurthy, and J. Rexford. Evaluating server-assisted cache replacement in the web. In Proceedings of European Symposium on Algorithms, August 1998]]
[16]
M. Arlitt, R. Friedrich, L. Cherkasova, J. Dilley, and T. Jin. Evaluating content management techniques for web proxy caches. In IIP Technical report, Palo Alto, Apr. 1999.]]

Cited By

View all
  • (2023)Enhancing Accessibility to Data in Data-Intensive Web Applications by Using Intelligent Web Prefetching MethodologiesInternational Journal of Software Engineering and Knowledge Engineering10.1142/S021819402350036533:09(1405-1438)Online publication date: 23-Aug-2023
  • (2022)JEDIProceedings of the 22nd ACM Internet Measurement Conference10.1145/3517745.3561466(679-693)Online publication date: 25-Oct-2022
  • (2022)A Comprehensive Study of Page-Rank AlgorithmEvolution in Computational Intelligence10.1007/978-981-16-6616-2_1(1-10)Online publication date: 24-Apr-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
August 2001
493 pages
ISBN:158113391X
DOI:10.1145/502512
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 August 2001

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Application to Caching and Prefetching on the WWW
  2. Web Log Mining

Qualifiers

  • Article

Conference

KDD01
Sponsor:

Acceptance Rates

KDD '01 Paper Acceptance Rate 31 of 237 submissions, 13%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)13
  • Downloads (Last 6 weeks)1
Reflects downloads up to 14 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Enhancing Accessibility to Data in Data-Intensive Web Applications by Using Intelligent Web Prefetching MethodologiesInternational Journal of Software Engineering and Knowledge Engineering10.1142/S021819402350036533:09(1405-1438)Online publication date: 23-Aug-2023
  • (2022)JEDIProceedings of the 22nd ACM Internet Measurement Conference10.1145/3517745.3561466(679-693)Online publication date: 25-Oct-2022
  • (2022)A Comprehensive Study of Page-Rank AlgorithmEvolution in Computational Intelligence10.1007/978-981-16-6616-2_1(1-10)Online publication date: 24-Apr-2022
  • (2021)TRAGENProceedings of the 21st ACM Internet Measurement Conference10.1145/3487552.3487845(366-379)Online publication date: 2-Nov-2021
  • (2021)Leveraging user access patterns and advanced cyberinfrastructure to accelerate data delivery from shared-use scientific observatoriesFuture Generation Computer Systems10.1016/j.future.2021.03.004122(14-27)Online publication date: Sep-2021
  • (2020)Cache What You Need to CacheACM Transactions on Storage10.1145/339776616:3(1-24)Online publication date: 16-Jul-2020
  • (2020)Connecting Web Event Forecasting with Anomaly Detection: A Case Study on Enterprise Web Applications Using Self-supervised Neural NetworksSecurity and Privacy in Communication Networks10.1007/978-3-030-63086-7_27(481-502)Online publication date: 12-Dec-2020
  • (2019)The Next 700 Policy MinersProceedings of the 2019 ACM SIGSAC Conference on Computer and Communications Security10.1145/3319535.3354196(95-112)Online publication date: 6-Nov-2019
  • (2019)Clustering Web Users Based on K-means Algorithm for Reducing Time Access Cost2019 First International Conference of Intelligent Computing and Engineering (ICOICE)10.1109/ICOICE48418.2019.9035190(1-7)Online publication date: Dec-2019
  • (2019)LR-LRU: A PACS-Oriented Intelligent Cache Replacement PolicyIEEE Access10.1109/ACCESS.2019.29139617(58073-58084)Online publication date: 2019
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media