Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1012807.1012829acmconferencesArticle/Chapter ViewAbstractPublication PageshtConference Proceedingsconference-collections
Article

The site browser: catalyzing improvements in hypertext organization

Published: 09 August 2004 Publication History

Abstract

The Site Browser endeavors to build an overview browsing system for the entire Web. Overview browsing represents an alternative to the search-based view of information work, and does so by providing a consistent set of summary views which can be browsed interactively. The views partition and linearize the corpus for ready understanding and exploration. They show a web site's relation to other sites, the broad nature of the information it contains and how it is structured, and how it has changed over time. The design challenge is to generate useful summary information in a process which is fast enough to be updated daily. Our current system maintains a continuously updated archive of 46 million sites representing 2.3 billion web pages.

References

[1]
Alexa. http://www.alexa.com.
[2]
E. Amitay, D. Carmel, A. Darlow, R. Lempel, and A. Soffer. The Connectivity Sonar: detecting site functionality by structural patterns. In Proceedings of ACM Hypertext '03, pages 38--47. ACM Press, 2003.
[3]
Z. Bar-Yossef and S. Rajagopalan. Template detection via data mining and its applications. In Proceedings of the 11th International World Wide Web Conference (WWW 2002), 2002.
[4]
T. Berners-Lee, J. Hendler, and O. Lassila. The Semantic Web. Scientific American, May 2001.
[5]
L. Y. Bing~Liu, Kaidi~Zhao. Visualizing web site comparisons. In Proceedings of the 11th International World Wide Web Conference (WWW 2002), pages 693--703, 2002.
[6]
V. Boyapati, K. Chevrier, A. Finkel, N. Glance, T. Pierce, R. Stockton, and C. Whitmer. ChangeDetector{tm}: a site-level monitoring tool for the WWW. In Proceedings of the 11th International World Wide Web Conference (WWW 2002), pages 570--579, 2002.
[7]
S. Brin, R. Motwani, L. Page, and T. Winograd. What can you do with a web in your pocket? Data Engineering Bulletin, 21(2):37--47, 1998.
[8]
V. Bush. As we may think. The Atlantic Monthly, July 1945.
[9]
J. Cho and S. Roy. Impact of web search engines on page popularity. In Proceedings of the 13th International World Wide Web Conference (WWW2004), 2004.
[10]
P. Dave, U. P. Karadkar, R. Furuta, L. Francisco-Revilla, F. Shipman, S. Dash, and Z. Dalal. Browsing intricately interconnected paths. In Proceedings of ACM Hypertext '03, pages 95--103. ACM Press, 2003.
[11]
S. Dill, N. Eiron, D. Gibson, D. Gruhl, R. Guha, A. Jhingran, T. Kanungo, S. Rajagopalan, A. Tomkins, J. A. Tomlin, and J. Y. Zien. Semtag and seeker: Bootstrapping the semantic web via automated semantic annotation. In Proceedings of the 12th International World Wide Web Conference (WWW2003), May 2003.
[12]
S. Dill, N. Eiron, D. Gibson, D. Gruhl, A. Jhingran, T. Kanungo, K. S. McCurley, S. Rajagopalan, A. Tomkins, J. A. Tomlin, and J. Y. Zien. Seeker: An architecture for web-scale text analytics. Technical Report RJ 10233 (95107), IBM Research, February 2002.
[13]
N. Eiron and K. S. McCurley. Untangling compound documents on the web. In Proceedings of ACM Hypertext '03, 2003.
[14]
T. Haveliwala. Efficient encodings for document ranking vectors. In International Conference on Internet Computing, 2003.
[15]
M. Hearst. User interfaces and visualization. In R. Baeza-Yates and B. Ribeiro-Neto (Eds.) Modern information retrieval. NY: ACM Press., 1999.
[16]
Y. Maarek and I. Shaul. Webcutter: A system for dynamic and tailorable site mapping. In Proceedings of the 6th International World Wide Web Conference, 1997.
[17]
G. Marchionini and B. Brunk. Toward a general relation browser: A GUI for information architects. In Journal of Digital Information, volume 4, 2003.
[18]
K. S. McCurley. Geospatial mapping and navigation of the web. In Proceedings of the 10th International World Wide Web Conference (WWW2001), pages 221--229, Hong Kong, China, 2001.
[19]
D. Nation, C. Plaisant, G. Marchionini, and A. Komlodi. Visualizing websites using a hierarchical table of contents browser: WebTOC. In Designing for the Web: Practices and Reflections, 1997.
[20]
D. Quan and D. Karger. How to make a semantic web browser. In Proceedings of the 13th International World Wide Web Conference (WWW2004), 2004.
[21]
A. J. Sellen, R. Murphy, and K. L. Shaw. How knowledge workers use the web. In Proceedings of the SIGCHI conference on Human factors in computing systems, pages 227--234. ACM Press, 2002.
[22]
J. Teevan, C. Alvarado, M. S. Ackerman, and D. R. Karger. The perfect search engine is not enough: A study of orienteering behavior in directed search. In Proceedings of the SIGCHI conference on Human factors in computing systems. ACM Press, 2004.
[23]
K.-P. Yee, K. Swearingen, K. Li, and M. Hearst. Faceted metadata for image search and browsing. In Proceedings of the SIGCHI conference on Human factors in computing systems, pages 401--408. ACM Press, 2003.

Cited By

View all
  • (2023)Seven HypertextsProceedings of the 34th ACM Conference on Hypertext and Social Media10.1145/3603163.3609048(1-15)Online publication date: 4-Sep-2023
  • (2005)Discovering large dense subgraphs in massive graphsProceedings of the 31st international conference on Very large data bases10.5555/1083592.1083676(721-732)Online publication date: 30-Aug-2005

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
HYPERTEXT '04: Proceedings of the fifteenth ACM conference on Hypertext and hypermedia
August 2004
284 pages
ISBN:1581138482
DOI:10.1145/1012807
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 August 2004

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. aggregation
  2. overview browsing

Qualifiers

  • Article

Conference

HT04
Sponsor:
HT04: 15th Conference on Hypertext and Hypermedia
August 9 - 13, 2004
CA, Santa Cruz, USA

Acceptance Rates

Overall Acceptance Rate 378 of 1,158 submissions, 33%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Seven HypertextsProceedings of the 34th ACM Conference on Hypertext and Social Media10.1145/3603163.3609048(1-15)Online publication date: 4-Sep-2023
  • (2005)Discovering large dense subgraphs in massive graphsProceedings of the 31st international conference on Very large data bases10.5555/1083592.1083676(721-732)Online publication date: 30-Aug-2005

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media