Ranking Techniques for Finding Correlated Webpages

Pyun, Gwangbum; Yun, Unil

doi:10.1007/978-94-007-5860-5_130

Gwangbum Pyun⁵ &
Unil Yun⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 215))

936 Accesses
1 Citations

Abstract

In general,when users try to search information, they can have difficulties to express the information as exact queries. Therefore, users consume many times to find useful webpages. Previous techniques could not solve the problem effectively. In this paper, we propose an algorithm, RCW (Ranking technique for finding Correlated Webpages) for improving previous ranking techniques. Our method makes it possible to retrieve not only basic webpages but also correlated webpages. Therefore, RCW algorithm in this paper can help users easily look for meaningful information without using exact queries. To find correlated webpages, the algorithm applies a novel technique for computing correlations among webpages. In performance evaluation, we test precision, recall, and NDCG of our RCW compared with the other popular system. In this result, RCW guarantees that itfinds the number of correlated webpages greater than the other method, and shows high ratios in terms of precision, recall, and NDCG.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Ontology-Based Ranking in Search Engine

A Novel Ranking Technique Based on Page Queries

Ontology-Based Semantic Search for Large-Scale RDF Data

References

Hulth A, Karlren J, Jonsson A, Bostrom H, Asker L (2010) Automatic keyword extraction using domain knowledge. Lect Notes Comput Sci 472–482
Google Scholar
Ishii H, Tempo R (2010) Distributed randomized algorithms for the page rank computation. IEEE Control Syst Soc 55(9):1987–2002
Google Scholar
Ermelinda O, Massimo R (2011) Towards a spatial instance learning method for deep web pages. In: Industrial conference on data mining (ICDM), pp 270–285
Google Scholar
Fu L, Mmeng Y, Xia Y, Yu H (2010) Web content extraction based on webpage layout analysis. In: Information technology and computer science (ITCS), pp 40–43
Google Scholar
Baillie M, Carman M, Crestani F (2011) A multi-collection latent topic model for federated search. Inf Retrieval 14(4):390–412
Google Scholar
Ricardo Y, Carlos C, Flavio J, Vassilis P, Fabrizio S (2007) Challenges on distributed web retrieval. In: International conference on data engineering, pp 15–20
Google Scholar
Flora T (2011) Web-based geographic search engine for location-aware search in Singapore. Expert Syst Appl (ESWA) 38(1):1011–1016
Google Scholar
Song G, Yajie M, Liu Y, Chunping L (2009) Topic-based computing model for web page popularity and website influence. In: Australasian conference on artificial intelligence, pp 210–219
Google Scholar
Costantinos D, Christos M, Yannis P, Evangelos T, Athanasios T (2010) A web page usage prediction scheme using sequence indexing and clustering techniques. Data Knowl Eng (DKE) 69(4):371–382
Google Scholar
Sandeepkumar S, Sahely B, Sundararajan S, Rajeev R, Prithviraj S (2011) Web information extraction using markov logic networks. In: Knowledge discovery and data mining (KDD), pp 1406–1414
Google Scholar
Metzler D (2008) Generalized inverse document frequency. In: Conference on information and knowledge management, pp 399–408
Google Scholar
CLucene Project web page http://clucene.sourceforge.net/

Download references

Acknowledgments

This research was supported by the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (NRF No. 2012-0003740 and 2012-0000478).

Author information

Authors and Affiliations

Department of Computer Science, Chungbuk National University, 410, Gaesin-dong, Heungdeok-gu, Cheongju, Republic of Korea
Gwangbum Pyun & Unil Yun

Authors

Gwangbum Pyun
View author publications
You can also search for this author in PubMed Google Scholar
Unil Yun
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Unil Yun .

Editor information

Editors and Affiliations

Convergence Security, Kyoung-gi University, Suwon, Gyeonggi-do, Korea, Republic of (South Korea)
Kuinam J. Kim
Dept. of Computer Information Engineerin, Sangji University, Wonju-si Gangwon-do, Korea, Republic of (South Korea)
Kyung-Yong Chung

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pyun, G., Yun, U. (2013). Ranking Techniques for Finding Correlated Webpages. In: Kim, K., Chung, KY. (eds) IT Convergence and Security 2012. Lecture Notes in Electrical Engineering, vol 215. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-5860-5_130

Download citation

DOI: https://doi.org/10.1007/978-94-007-5860-5_130
Published: 11 December 2012
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-5859-9
Online ISBN: 978-94-007-5860-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Ranking Techniques for Finding Correlated Webpages

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Ontology-Based Ranking in Search Engine

A Novel Ranking Technique Based on Page Queries

Ontology-Based Semantic Search for Large-Scale RDF Data

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Ranking Techniques for Finding Correlated Webpages

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Ontology-Based Ranking in Search Engine

A Novel Ranking Technique Based on Page Queries

Ontology-Based Semantic Search for Large-Scale RDF Data

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation