DOI: 10.5555/647458.728405

Visual Based Content Understanding towards Web Adaptation

Published: 29 May 2002

Abstract

This paper proposes web content structure as a basis for automatic web page adaptation. By identifying the logical relationships among web content from layout information, web content structure effectively represents the author's presentation intention. We present an automatic, top-down, tag-tree-independent approach to detecting web content structure; it simulates how a user understands the layout structure of a page by looking at it. Compared with other content analysis techniques, our approach is independent of physical realization and works well even when the physical structure differs greatly from the layout structure. In addition, our approach runs in O(n) time, which is much more efficient than other approaches with O(n^2) time complexity. Furthermore, because it is tag-tree independent, it can be applied to web content in arbitrary physical realization formats. Experiments show satisfactory results.
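
The abstract does not spell out the detection algorithm, so the following is only an illustrative sketch of what a top-down, tag-tree-independent layout analysis can look like. It swaps in recursive XY-cut, a standard document-analysis technique, and is not the authors' method. It assumes each visible content item has already been rendered to a bounding box; the names `Box`, `Node`, `split_on_gaps`, `segment`, and the `min_gap` threshold are all hypothetical. Because it sorts at every level, it makes no attempt to match the O(n) bound claimed above.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class Box:
    """Rendered bounding box of one visible content item (assumed input)."""
    name: str
    x0: float
    y0: float
    x1: float
    y1: float


@dataclass
class Node:
    """One region of the layout tree: the boxes it covers and its sub-regions."""
    boxes: List[Box]
    children: List["Node"] = field(default_factory=list)


def split_on_gaps(boxes: List[Box], axis: str, min_gap: float) -> List[List[Box]]:
    """Group boxes that are separated by empty bands >= min_gap along one axis."""
    lo, hi = ("y0", "y1") if axis == "y" else ("x0", "x1")
    groups: List[List[Box]] = []
    end = 0.0  # far edge of the current group along the chosen axis
    for b in sorted(boxes, key=lambda item: getattr(item, lo)):
        if groups and getattr(b, lo) - end < min_gap:
            groups[-1].append(b)  # overlaps or nearly touches: same region
            end = max(end, getattr(b, hi))
        else:
            groups.append([b])  # wide blank band: start a new region
            end = getattr(b, hi)
    return groups


def segment(boxes: List[Box], min_gap: float = 10.0) -> Node:
    """Top-down split: try horizontal blank bands first, then vertical ones."""
    node = Node(boxes=list(boxes))
    if len(boxes) <= 1:
        return node
    for axis in ("y", "x"):
        groups = split_on_gaps(boxes, axis, min_gap)
        if len(groups) > 1:
            node.children = [segment(g, min_gap) for g in groups]
            break
    return node


# Hypothetical page: a header above a navigation column and a body column.
page = [
    Box("header", 0, 0, 800, 100),
    Box("nav", 0, 120, 150, 600),
    Box("body", 170, 120, 800, 600),
]
tree = segment(page)
```

The sketch works only from rendered geometry, never from markup nesting, which is the sense in which such an analysis is independent of the tag tree: two pages with very different markup but the same rendered boxes yield the same layout tree.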

Published In

AH '02: Proceedings of the Second International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems
May 2002
612 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Qualifiers

  • Article

Cited By

  • (2016) Reconstructing User’s Attention on the Web through Mouse Movements and Perception-Based Content Identification. ACM Transactions on Applied Perception 13:3 (1-21). DOI: 10.1145/2912124. Online publication date: 28-May-2016
  • (2011) Transmission reduction between mobile phone applications and RESTful APIs. Proceedings of the 2011 ACM Symposium on Applied Computing (445-450). DOI: 10.1145/1982185.1982280. Online publication date: 21-Mar-2011
  • (2010) Vi-DIFF. Proceedings of the 21st international conference on Database and expert systems applications: Part I (1-15). DOI: 10.5555/1881867.1881869. Online publication date: 30-Aug-2010
  • (2010) Using visual pages analysis for optimizing web archiving. Proceedings of the 2010 EDBT/ICDT Workshops (1-7). DOI: 10.1145/1754239.1754287. Online publication date: 22-Mar-2010
  • (2008) Improving web information indexing and retrieval based on center block duplication detection. International Journal of Innovative Computing and Applications 1:3 (194-204). DOI: 10.1504/IJICA.2008.019687. Online publication date: 1-Jul-2008
  • (2008) Efficient web browsing on small screens. Proceedings of the working conference on Advanced visual interfaces (23-30). DOI: 10.1145/1385569.1385576. Online publication date: 28-May-2008
  • (2008) Perception-oriented online news extraction. Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries (363-366). DOI: 10.1145/1378889.1378952. Online publication date: 16-Jun-2008
  • (2008) Math information retrieval. Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries (187-196). DOI: 10.1145/1378889.1378921. Online publication date: 16-Jun-2008
  • (2007) Automatic document structure detection for data integration. Proceedings of the 10th international conference on Business information systems (391-397). DOI: 10.5555/1759779.1759814. Online publication date: 25-Apr-2007
  • (2007) Web Page Analysis. Proceedings of the 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops (221-225). DOI: 10.5555/1339264.1339695. Online publication date: 2-Nov-2007
