Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Ranking Techniques for Finding Correlated Webpages

  • Conference paper
  • First Online:
IT Convergence and Security 2012

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 215))

Abstract

In general,when users try to search information, they can have difficulties to express the information as exact queries. Therefore, users consume many times to find useful webpages. Previous techniques could not solve the problem effectively. In this paper, we propose an algorithm, RCW (Ranking technique for finding Correlated Webpages) for improving previous ranking techniques. Our method makes it possible to retrieve not only basic webpages but also correlated webpages. Therefore, RCW algorithm in this paper can help users easily look for meaningful information without using exact queries. To find correlated webpages, the algorithm applies a novel technique for computing correlations among webpages. In performance evaluation, we test precision, recall, and NDCG of our RCW compared with the other popular system. In this result, RCW guarantees that itfinds the number of correlated webpages greater than the other method, and shows high ratios in terms of precision, recall, and NDCG.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Hulth A, Karlren J, Jonsson A, Bostrom H, Asker L (2010) Automatic keyword extraction using domain knowledge. Lect Notes Comput Sci 472–482

    Google Scholar 

  2. Ishii H, Tempo R (2010) Distributed randomized algorithms for the page rank computation. IEEE Control Syst Soc 55(9):1987–2002

    Google Scholar 

  3. Ermelinda O, Massimo R (2011) Towards a spatial instance learning method for deep web pages. In: Industrial conference on data mining (ICDM), pp 270–285

    Google Scholar 

  4. Fu L, Mmeng Y, Xia Y, Yu H (2010) Web content extraction based on webpage layout analysis. In: Information technology and computer science (ITCS), pp 40–43

    Google Scholar 

  5. Baillie M, Carman M, Crestani F (2011) A multi-collection latent topic model for federated search. Inf Retrieval 14(4):390–412

    Google Scholar 

  6. Ricardo Y, Carlos C, Flavio J, Vassilis P, Fabrizio S (2007) Challenges on distributed web retrieval. In: International conference on data engineering, pp 15–20

    Google Scholar 

  7. Flora T (2011) Web-based geographic search engine for location-aware search in Singapore. Expert Syst Appl (ESWA) 38(1):1011–1016

    Google Scholar 

  8. Song G, Yajie M, Liu Y, Chunping L (2009) Topic-based computing model for web page popularity and website influence. In: Australasian conference on artificial intelligence, pp 210–219

    Google Scholar 

  9. Costantinos D, Christos M, Yannis P, Evangelos T, Athanasios T (2010) A web page usage prediction scheme using sequence indexing and clustering techniques. Data Knowl Eng (DKE) 69(4):371–382

    Google Scholar 

  10. Sandeepkumar S, Sahely B, Sundararajan S, Rajeev R, Prithviraj S (2011) Web information extraction using markov logic networks. In: Knowledge discovery and data mining (KDD), pp 1406–1414

    Google Scholar 

  11. Metzler D (2008) Generalized inverse document frequency. In: Conference on information and knowledge management, pp 399–408

    Google Scholar 

  12. CLucene Project web page http://clucene.sourceforge.net/

Download references

Acknowledgments

This research was supported by the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (NRF No. 2012-0003740 and 2012-0000478).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Unil Yun .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer Science+Business Media Dordrecht

About this paper

Cite this paper

Pyun, G., Yun, U. (2013). Ranking Techniques for Finding Correlated Webpages. In: Kim, K., Chung, KY. (eds) IT Convergence and Security 2012. Lecture Notes in Electrical Engineering, vol 215. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-5860-5_130

Download citation

  • DOI: https://doi.org/10.1007/978-94-007-5860-5_130

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-007-5859-9

  • Online ISBN: 978-94-007-5860-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics