Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1526709.1526716acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Estimating the impressionrank of web pages

Published: 20 April 2009 Publication History

Abstract

The ImpressionRank of a web page (or, more generally, of a web site) is the number of times users viewed the page while browsing search results. ImpressionRank captures the visibility of pages and sites in search engines and is thus an important measure, which is of interest to web site owners, competitors, market analysts, and end users.
All previous approaches to estimating the ImpressionRank of a page rely on privileged access to private data sources, like the search engine's query log. In this paper we present the first external algorithm for estimating the ImpressionRank of a web page. This algorithm relies on access to three public data sources: the search engine, the query suggestion service of the search engine, and the web. In addition, the algorithm is local and uses modest resources. It can therefore be used by almost any party to estimate the ImpressionRank of any page on any search engine.
En route to estimating the ImpressionRank of a page, our algorithm solves a novel variant of the keyword extraction problem: it finds the most popular search keywords that drive impressions of a page.
Empirical analysis of the algorithm on the Google and Yahoo! search engines indicates that it is accurate and provides interesting insights about sites and search queries.

References

[1]
Z. Bar-Yossef and M. Gurevich. Mining search engine query logs via suggestion sampling. In 34th VLDB, 2008.
[2]
comScore. 61 billion searches conducted worldwide in August. www.comscore.com/press/release.asp?press=1802, 2008.
[3]
J. Dean and S. Ghemawat. Mapreduce: simplified data processing on large clusters. Commun. ACM, 51(1):107--113, 2008.
[4]
E. Frank, G. W. Paynter, I. H. Witten, C. Gutwin, and C. G. Nevill-Manning. Domain-specific keyphrase extraction. In 16th IJCAI, pages 668--673, 1999.
[5]
A. Fuxman, P. Tsaparas, K. Achan, and R. Agrawal. Using the wisdom of the crowds for keyword generation. In 17th WWW, pages 61--70, 2008.
[6]
J. Goodman and V. R. Carvalho. Implicit queries for email. In CEAS, July 2005.
[7]
A. Hulth. Improved automatic keyword extraction given more linguistic knowledge. In EMNLP, pages 216--223, 2003.
[8]
T. Joachims, L. Granka, B. Pan, H. Hembrooke, and G. Gay. Accurately interpreting clickthrough data as implicit feedback. In 28th SIGIR, pages 154--161, 2005.
[9]
D. Kelleher and S. Luz. Automatic hypertext keyphrase detection. In 22nd IJCAI, 2005.
[10]
S. J. Russell and P. Norvig. Artificial Intelligence. A Modern Approach. Prentice-Hall, 2nd edition, 2003.
[11]
P. C. Saraiva, E. S. de Moura, R. C. Fonseca, W. M. Jr., B. A. Ribeiro-Neto, and N. Ziviani. Rank-preserving two-level caching for scalable search engines. In 24th SIGIR, pages 51--58, 2001.
[12]
W. tau Yih, J. Goodman, and V. R. Carvalho. Finding advertising keywords on web pages. In 15th WWW, pages 213--222, 2006.
[13]
P. D. Turney. Learning algorithms for keyphrase extraction. Inf. Retr., 2(4):303--336, 2000.
[14]
P. D. Turney. Coherent keyphrase extraction via web mining. In 20th IJCAI, pages 434--439, 2003.
[15]
M. B. Valentine. Google drives 70 percent of traffic to most web sites. http://searchengineoptimism.com/Google_refers_70_percent.html.
[16]
Y. Xie and D. R. O'Hallaron. Locality in search engine queries and its implications for caching. In 21st INFOCOM, 2002.

Cited By

View all

Index Terms

  1. Estimating the impressionrank of web pages

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WWW '09: Proceedings of the 18th international conference on World wide web
    April 2009
    1280 pages
    ISBN:9781605584874
    DOI:10.1145/1526709

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 April 2009

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. auto-completions
    2. data mining
    3. estimation
    4. impressionrank
    5. popular keyword extraction
    6. search engines
    7. suggestions

    Qualifiers

    • Research-article

    Conference

    WWW '09
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 30 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Keyword Extraction for Medium-Sized Documents Using Corpus-Based Contextual Semantic SmoothingComplexity10.1155/2022/70157642022Online publication date: 1-Jan-2022
    • (2020)Interactive Proofs for Social GraphsAdvances in Cryptology – CRYPTO 202010.1007/978-3-030-56877-1_20(574-601)Online publication date: 17-Aug-2020
    • (2015)Estimating Clustering Coefficients and Size of Social Networks via Random WalkACM Transactions on the Web10.1145/27903049:4(1-20)Online publication date: 28-Sep-2015
    • (2013)Mining a search engine's corpus without a query poolProceedings of the 22nd ACM international conference on Information & Knowledge Management10.1145/2505515.2505748(29-38)Online publication date: 27-Oct-2013
    • (2013)Estimating clustering coefficients and size of social networks via random walkProceedings of the 22nd international conference on World Wide Web10.1145/2488388.2488436(539-550)Online publication date: 13-May-2013
    • (2012)Aggregate suppression for enterprise search enginesProceedings of the 2012 ACM SIGMOD International Conference on Management of Data10.1145/2213836.2213890(469-480)Online publication date: 20-May-2012
    • (2012)Web log analysisData Mining and Knowledge Discovery10.1007/s10618-011-0228-824:3(663-696)Online publication date: 1-May-2012
    • (2011)Efficient Search Engine MeasurementsACM Transactions on the Web10.1145/2019643.20196455:4(1-48)Online publication date: 1-Oct-2011
    • (2011)Learning website hierarchies for keyword enrichment in contextual advertisingProceedings of the fourth ACM international conference on Web search and data mining10.1145/1935826.1935892(425-434)Online publication date: 9-Feb-2011
    • (2011)Web log analysis: a review of a decade of studies about information acquisition, inspection and interpretation of user interactionData Mining and Knowledge Discovery10.1007/s10618-011-0236-8Online publication date: 6-Sep-2011
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media