Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2766462.2767704acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Local Ranking Problem on the BrowseGraph

Published: 09 August 2015 Publication History

Abstract

The "Local Ranking Problem" (LRP) is related to the computation of a centrality-like rank on a local graph, where the scores of the nodes could significantly differ from the ones computed on the global graph. Previous work has studied LRP on the hyperlink graph but never on the BrowseGraph, namely a graph where nodes are webpages and edges are browsing transitions. Recently, this graph has received more and more attention in many different tasks such as ranking, prediction and recommendation. However, a web-server has only the browsing traffic performed on its pages (local BrowseGraph) and, as a consequence, the local computation can lead to estimation errors, which hinders the increasing number of applications in the state of the art. Also, although the divergence between the local and global ranks has been measured, the possibility of estimating such divergence using only local knowledge has been mainly overlooked. These aspects are of great interest for online service providers who want to: (i) gauge their ability to correctly assess the importance of their resources only based on their local knowledge, and (ii) take into account real user browsing fluxes that better capture the actual user interest than the static hyperlink network. We study the LRP problem on a BrowseGraph from a large news provider, considering as subgraphs the aggregations of browsing traces of users coming from different domains. We show that the distance between rankings can be accurately predicted based only on structural information of the local graph, being able to achieve an average rank correlation as high as 0.8.

References

[1]
E. Agichtein, E. Brill, and S. Dumais. Improving web search ranking by incorporating user behavior information. In SIGIR, pages 19--26, New York, NY, USA, 2006. ACM.
[2]
R. Andersen, C. Borgs, J. Chayes, J. Hopcraft, V. S. Mirrokni, and S.-H. Teng. Local computation of pagerank contributions. In WAW, pages 150--165, San Diego, CA, USA, 2007. Springer-Verlag.
[3]
Z. Bar-Yossef and L.-T. Mashiach. Local approximation of pagerank and reverse pagerank. In CIKM, pages 279--288, Napa Valley, California, USA, 2008. ACM Press.
[4]
A. Barrat, M. Barthlemy, and A. Vespignani. Dynamical Processes on Complex Networks. Cambridge University Press, New York, NY, USA, 2008.
[5]
K. Bharat, A. Broder, M. Henzinger, P. Kumar, and S. Venkatasubramanian. The connectivity server: fast access to linkage information on the web. In WWW, volume 30, pages 469--477, Brisbane, Australia, 4 1998. Elsevier Science Publ B. V.
[6]
P. Boldi, M. Santini, and S. Vigna. Do your worst to make the best : Paradoxical effects in pagerank incremental computations. In WAW, pages 168--180. Springer, 2004.
[7]
L. Breiman. Random forests. Machine Learning, 45(1):5--32, oct 2001.
[8]
M. Bressan, E. Peserico, U. Padova, and L. Pretto. The power of local information in pagerank. In WWW Companion, pages 179--180, Rio de Janeiro, Brazil, 2013.
[9]
M. Bressan and L. Pretto. Local computation of pagerank: the ranking side. In CIKM, pages 631--640. ACM, 2011.
[10]
Y.-Y. Chen, Q. Gan, and T. Suel. Local methods for estimating pagerank values. In CIKM, pages 381--389, New York, NY, USA, 2004. ACM.
[11]
L. Chiarandini, P. Grabowicz, M. Trevisiol, and A. Jaimes. Leveraging browsing patterns for topic discovery and photostream recommendation. In ICWSM, Cambridge, MA, USA, 2013. AAAI.
[12]
L. Chiarandini, M. Trevisiol, and A. Jaimes. Discovering social photo navigation patterns. In ICME, pages 31--36. IEEE, 2012.
[13]
J. Cho, H. Garcia-Molina, and L. Page. Efficient crawling through url ordering. In WWW, volume 30, pages 161--172, Brisbane, Australia, 4 1998. Elsevier Science Publ B. V.
[14]
J. V. Davis and I. S. Dhillon. Large scale analysis of web revisitation patterns. In KDD, volume 08, pages 116--125, Philadelphia, PA, USA, 2006. ACM Press.
[15]
Z. Gyöngyi, H. Garcia-Molina, and J. Pedersen. Combating web spam with trustrank. In VLDB, pages 576--587, Toronto, ON, Canada, 2004.
[16]
J. Hu, H.-J. Zeng, H. Li, C. Niu, and Z. Chen. Demographic prediction based on user's browsing behavior. In WWW, pages 151--160, New York, NY, USA, 2007. ACM.
[17]
J. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604--632, 1999.
[18]
J. Lehmann, M. Lalmas, and R. Baeza-Yates. Measuring inter-site engagement. In Big Data, 2013 IEEE International Conference on, pages 228--236. IEEE, 2014.
[19]
R. Lempel and S. Moran. Salsa : The stochastic approach for link- structure analysis. Challenge, 19(2):131--160, 2001.
[20]
J. Liu, P. Dolan, and E. R. Pedersen. Personalized news recommendation based on click behavior. In IUI, pages 31--40, New York, NY, USA, 2010. ACM.
[21]
M. Liu, R. Cai, M. Zhang, and L. Zhang. User browsing behavior-driven web crawling. In CIKM, pages 87--92, New York, NY, USA, 2011. ACM.
[22]
Y. Liu, B. Gao, T.-Y. Liu, Y. Zhang, Z. Ma, S. He, and H. Li. Browserank: letting web users vote for page importance. SIGIR, 31:451--458, 2008.
[23]
Y. Liu, T.-Y. Liu, B. Gao, Z. Ma, and H. Li. A framework to compute page importance based on user behaviors. Information Retrieval, 13(1):22--45, 6 2009.
[24]
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. World Wide Web Internet And Web Information Systems, 54(2):1--17, 1998.
[25]
A. J. Smola and B. Schölkopf. A tutorial on support vector regression. Technical report, Statistics and Computing, 2003.
[26]
C. Strobl, A.-L. Boulesteix, T. Kneib, T. Augustin, and A. Zeileis. Conditional variable importance for random forests. BMC Bioinformatics, 9(1):307, 2008.
[27]
M. Trevisiol, L. M. Aiello, R. Schifanella, and A. Jaimes. Cold-start news recommendation with domain-dependent browse graph. In RecSys, Foster City, CA, 2014. ACM.
[28]
M. Trevisiol, L. Chiarandini, L. M. Aiello, and A. Jaimes. Image ranking based on user browsing behavior. In SIGIR, pages 445--454, New York, NY, USA, 2012. ACM.
[29]
M. Tsagkias and R. Blanco. Language intent models for inferring user browsing behavior. In SIGIR, pages 335--344, New York, NY, USA, 2012. ACM.
[30]
S. Wasserman and K. Faust. Social Network Analysis: Methods and Applications. Cambridge Univ. Press, 1994.
[31]
B. Weischedel and E. K. R. E. Huizingh. Website optimization with web metrics: A case study. In ICEC, pages 463--470, New York, NY, USA, 2006. ACM.
[32]
R. W. White. Investigating behavioral variability in web search. In In Proc. WWW, pages 21--30, 2007.
[33]
R. W. White and J. Huang. Assessing the scenic route: measuring the value of search trails in web logs. In SIGIR, pages 587--594, New York, USA, 2010. ACM.

Cited By

View all
  • (2018)A Location-Query-Browse Graph for Contextual RecommendationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.276605930:2(204-218)Online publication date: 1-Feb-2018
  • (2017)Parallel computations of local PageRank problem based on Graphics Processing UnitConcurrency and Computation: Practice and Experience10.1002/cpe.424529:24Online publication date: 24-Aug-2017

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '15: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval
August 2015
1198 pages
ISBN:9781450336215
DOI:10.1145/2766462
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 August 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. browsegraph
  2. centrality algorithms
  3. domain-specific browsing graphs
  4. local ranking problem
  5. pagerank

Qualifiers

  • Research-article

Funding Sources

  • Ministry of Science and Innovation of Spain
  • EU-FET NADINE
  • Yahoo Faculty Research Engagement Program

Conference

SIGIR '15
Sponsor:

Acceptance Rates

SIGIR '15 Paper Acceptance Rate 70 of 351 submissions, 20%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 30 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2018)A Location-Query-Browse Graph for Contextual RecommendationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.276605930:2(204-218)Online publication date: 1-Feb-2018
  • (2017)Parallel computations of local PageRank problem based on Graphics Processing UnitConcurrency and Computation: Practice and Experience10.1002/cpe.424529:24Online publication date: 24-Aug-2017

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media