DOI: 10.1145/1062745.1062913
Article

Understanding the function of web elements for mobile content delivery using random walk models

Published: 10 May 2005

Abstract

In this paper, we describe a method for understanding the function of web elements. It classifies web elements into five functional categories: Content (C), Related Links (R), Navigation and Support (N), Advertisement (A) and Form (F). We construct five graphs for a web page, one per category, each designed so that most of the probability mass of its stationary distribution is concentrated in nodes belonging to the corresponding category. We perform random walks on these graphs until convergence and classify each element based on its rank values in the different graphs. Our experiment shows that the new method performs very well compared to basic machine learning methods.
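
The classification step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the five graphs are built or which convergence criterion is used, so everything specific here is an assumption made for illustration: the damping (teleport) term borrowed from PageRank-style walks, the names `stationary_distribution`, `classify` and `toy_graph`, and the toy edge weights. It shows only the general idea: compute a stationary distribution per category graph by power iteration, then assign each element to the category whose graph ranks it highest.

```python
from typing import Dict, List

CATEGORIES = ["C", "R", "N", "A", "F"]  # category codes from the abstract


def stationary_distribution(weights: List[List[float]], damping: float = 0.85,
                            tol: float = 1e-10, max_iter: int = 1000) -> List[float]:
    """Approximate the stationary distribution of a random walk by power iteration.

    weights[i][j] is a non-negative edge weight from node i to node j.
    The damping term is an assumption (PageRank-style) that guarantees convergence.
    """
    n = len(weights)
    transition = []
    for row in weights:
        total = sum(row)
        # A node with no outgoing weight jumps uniformly to any node.
        transition.append([w / total for w in row] if total > 0 else [1.0 / n] * n)

    rank = [1.0 / n] * n
    for _ in range(max_iter):
        new_rank = [
            (1.0 - damping) / n
            + damping * sum(rank[i] * transition[i][j] for i in range(n))
            for j in range(n)
        ]
        converged = sum(abs(a - b) for a, b in zip(new_rank, rank)) < tol
        rank = new_rank
        if converged:
            break
    return rank


def classify(graphs: Dict[str, List[List[float]]]) -> List[str]:
    """Label each node with the category whose graph gives it the highest rank."""
    ranks = {cat: stationary_distribution(w) for cat, w in graphs.items()}
    n = len(next(iter(ranks.values())))
    return [max(ranks, key=lambda cat: ranks[cat][node]) for node in range(n)]


def toy_graph(favoured: int, n: int) -> List[List[float]]:
    """Hypothetical graph: every node links strongly to `favoured`, weakly elsewhere."""
    return [[1.0 if j == favoured else 0.1 for j in range(n)] for _ in range(n)]


if __name__ == "__main__":
    # Five toy elements; the graph for category k favours element k.
    graphs = {cat: toy_graph(k, 5) for k, cat in enumerate(CATEGORIES)}
    print(classify(graphs))
```

In this toy setup each category graph concentrates its probability mass on a single element, so every element receives the label of the graph that favours it. Real graphs would derive their edge weights from page features, which the abstract does not specify.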




      Published In

      WWW '05: Special interest tracks and posters of the 14th international conference on World Wide Web
      May 2005
      454 pages
      ISBN: 1595930515
      DOI: 10.1145/1062745

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. HTML
      2. WWW (world wide web)
      3. classification

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

      Cited By

      • (2023) Thoughts on Visualization of Numerical Simulation for a Dynamical System. 2023 IEEE/ACIS 23rd International Conference on Computer and Information Science (ICIS), pp. 170-174. DOI: 10.1109/ICIS57766.2023.10210244. Online publication date: 23-Jun-2023
      • (2023) Web Page Segmentation: A DOM-Structural Cohesion Analysis Approach. Web Information Systems Engineering – WISE 2023, pp. 319-333. DOI: 10.1007/978-981-99-7254-8_25. Online publication date: 21-Oct-2023
      • (2020) Model-Driven Web Page Segmentation for Non Visual Access. Computational Linguistics, pp. 191-205. DOI: 10.1007/978-981-15-6168-9_17. Online publication date: 2-Jul-2020
      • (2016) Web Page Segmentation and Its Application for Web Information Crawling. 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 598-605. DOI: 10.1109/ICTAI.2016.0097. Online publication date: Nov-2016
      • (2016) Advertisement Detection, Segmentation, and Classification for Newspaper Images and Website Snapshots. 2016 International Computer Symposium (ICS), pp. 396-401. DOI: 10.1109/ICS.2016.0086. Online publication date: Dec-2016
      • (2016) Specification and discovery of web patterns. Information Sciences: an International Journal, 328:C, pp. 528-545. DOI: 10.1016/j.ins.2015.08.052. Online publication date: 20-Jan-2016
      • (2015) Automated classification and localization of daily deal content from the Web. Applied Soft Computing, 31:C, pp. 241-256. DOI: 10.1016/j.asoc.2015.02.029. Online publication date: 1-Jun-2015
      • (2013) Heuristic role detection of visual elements of web pages. Proceedings of the 13th international conference on Web Engineering, pp. 123-131. DOI: 10.1007/978-3-642-39200-9_12. Online publication date: 8-Jul-2013
      • (2013) Vision Based Page Segmentation Algorithm. Revised Selected Papers of the ICWE 2013 International Workshops on Current Trends in Web Engineering - Volume 8295, pp. 238-252. DOI: 10.1007/978-3-319-04244-2_22. Online publication date: 8-Jul-2013
      • (2007) Infrequent Item Mining in Multiple Data Streams. Proceedings of the Seventh IEEE International Conference on Data Mining Workshops, pp. 569-574. DOI: 10.1109/ICDMW.2007.59. Online publication date: 28-Oct-2007
