article

Automatic keyword prediction using Google similarity distance

Authors:

Shi-Jen LinAuthors Info & Claims

Expert Systems with Applications: An International Journal, Volume 37, Issue 3

Pages 1928 - 1938

https://doi.org/10.1016/j.eswa.2009.07.016

Published: 01 March 2010 Publication History

Abstract

In this paper, we present a new approach to help users using search engines without entering any keywords. What we want to do is to predict what word the users may want to search before they think about it. Most of the studies done in this field focus on how to help users enter keywords or how to re-rank the search results in order to make them more precise. Both of those methods need to establish a user behavior model and a repository in which to save the logs. In our proposed method, we use the Google similarity distance to measure keywords in the Webpage to find the potential keywords for the users. Thus, we do not need any repository. All the executions are on-line and real-time. Then, we extract all the important keywords as the potential search keywords. In this way, we can use these professional keywords to achieve precise search results. We believe that this can be useful in many areas such as e-learning and can also be used in mobile devices.

References

[1]

Learning to find answers to questions on the web. ACM Transactions on Internet Technology. v4 i2. 129-162.

Digital Library

[2]

SearchPad: Explicit capture of search context to support web search. Computer Networks. v33 i1-6. 493-501.

Digital Library

[3]

Evaluating variable-length Markov chain models for analysis of user web navigation sessions. IEEE Transactions on Knowledge and Data Engineering. v19 i4. 441-452.

Digital Library

[4]

Chien, L. F. (1997). PAT-tree-based keyword extraction for Chinese information retrieval. In Proceedings of the 20th annual international ACM SIGIR conference on research and development in information retrieval (pp. 50-59).

Digital Library

[5]

The Google similarity distance. IEEE Transactions on Knowledge and Data Engineering. v19 i3. 370-383.

Digital Library

[6]

How well does the world wide web represent human language?. The Economist.

[7]

Using lexical chains for keyword extraction. Information Processing and Management. v43 i6. 1705-1714.

Digital Library

[8]

Mining text using keywords distributions. Journal of Intelligent Information Systems. v10 i3. 281-300.

Digital Library

[9]

Placing search in context: The concept revisited. ACM Transactions on Information Systems. v20 i1. 116-131.

[10]

Information retrieval and artificial intelligence. Artificial Intelligence. v114 i1-2. 257-281.

Digital Library

[11]

Methods for comparing rankings of search engine results. Computer Networks. v50 i10. 1448-1463.

Digital Library

[12]

Fussy cognitive map approach to web-mining inference amplification. Expert System with Applications. v22. 197-211.

[13]

An information filtering model on the web and its application in jobagent. Knowledge-Based Systems. v13 i5. 285-296.

[14]

Personalized web search for improving retrieval effectiveness. IEEE Transactions on Knowledge and Data Engineering. v16 i1. 28-40.

Digital Library

[15]

Web log mining. Web Intelligence. 174-194.

[16]

Keyword extraction from a single document using word co-ocuurrence statistical information. International Journal on Artificial Intelligence Tools. v13 i1. 157-169.

[17]

On the peninsula phenomenon in web graph and its implications on web search. Computer Networks. v51 i1. 177-189.

Digital Library

[18]

A theory of term importance in automatic text analysis. Journal of the American society for Information Science. v26 i1. 33-44.

[19]

Word length, sentence length and frequency - Zipf revisited. Studia Linguistica. v58 i1. 37-52.

[20]

Machine learning in automated text categorization. ACM Computing Surveys. v34 i1. 1-47.

Digital Library

[21]

Improving the effectiveness of information retrieval with local context analysis. ACM Transactions on Information Systems (TOIS). v18 i1. 79-112.

Digital Library

[22]

Yang, Q., Zhang, H., Tian, I.&Li, Y. (2001). Mining web logs for prediction models in WWW caching and prefetching. In Proceedings of the seventh ACM SIGKDD international conference on knowledge discovery and data mining (pp. 473-478).

Digital Library

[23]

Representation and construction of ontologies for web intelligence. International Journal of Foundation of Computer Science. v13 i4. 555-570.

Cited By

Weng CHuang CChen YHuang Y(2023)New information search model for online reviews with the perspective of user requirementsMultimedia Tools and Applications10.1007/s11042-023-14847-782:18(28165-28185)Online publication date: 20-Feb-2023
https://dl.acm.org/doi/10.1007/s11042-023-14847-7
Bordoloi MBiswas S(2023)Sentiment analysis: A survey on design framework, applications and future scopesArtificial Intelligence Review10.1007/s10462-023-10442-256:11(12505-12560)Online publication date: 20-Mar-2023
https://dl.acm.org/doi/10.1007/s10462-023-10442-2
Wang HYe JYu ZWang JMao C(2020)Unsupervised Keyword Extraction Methods Based on a Word Graph NetworkInternational Journal of Ambient Computing and Intelligence10.4018/IJACI.202004010411:2(68-79)Online publication date: 1-Apr-2020
https://dl.acm.org/doi/10.4018/IJACI.2020040104
Show More Cited By

Automatic keyword prediction using Google similarity distance
1. Information systems

Recommendations

Evaluating Google queries based on language preferences

This paper evaluates the assumption that users expect search engines to retrieve the same results for queries regardless of the language or the location of the originator. The dependency of the Google search engine on the language and location from ...
Using Google latent semantic distance to extract the most relevant information

Research highlights We adapted the Google similarity distance algorithm into a more efficient new algorithm. We used the PLSA to enhance the original 2-gram NGD into a 3-gram algorithm. To extract the most important sequence of keywords to provide the ...
Children's eye-fixations on google search results
ASIST '16: Proceedings of the 79th ASIS&T Annual Meeting: Creating Knowledge, Enhancing Lives through Information & Technology

We investigate how children in grades 6 and 8 (ages 11 and 13, respectively) read search engine results pages (SERPs) in the context of searching Google. We use eye-tracking to detect children's reading of SERPs, and the effect of grade level and task ...

Comments

Information & Contributors

Information

Published In

cover image Expert Systems with Applications: An International Journal

Expert Systems with Applications: An International Journal Volume 37, Issue 3

March, 2010

901 pages

ISSN:0957-4174

Issue’s Table of Contents

Copyright © Elsevier Ltd © 2009.

Publisher

Pergamon Press, Inc.

United States

Publication History

Published: 01 March 2010

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

16
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 09 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Weng CHuang CChen YHuang Y(2023)New information search model for online reviews with the perspective of user requirementsMultimedia Tools and Applications10.1007/s11042-023-14847-782:18(28165-28185)Online publication date: 20-Feb-2023
https://dl.acm.org/doi/10.1007/s11042-023-14847-7
Bordoloi MBiswas S(2023)Sentiment analysis: A survey on design framework, applications and future scopesArtificial Intelligence Review10.1007/s10462-023-10442-256:11(12505-12560)Online publication date: 20-Mar-2023
https://dl.acm.org/doi/10.1007/s10462-023-10442-2
Wang HYe JYu ZWang JMao C(2020)Unsupervised Keyword Extraction Methods Based on a Word Graph NetworkInternational Journal of Ambient Computing and Intelligence10.4018/IJACI.202004010411:2(68-79)Online publication date: 1-Apr-2020
https://dl.acm.org/doi/10.4018/IJACI.2020040104
Younas MJawawi DGhani IShah M(2020)Extraction of non-functional requirement using semantic similarity distanceNeural Computing and Applications10.1007/s00521-019-04226-532:11(7383-7397)Online publication date: 1-Jun-2020
https://dl.acm.org/doi/10.1007/s00521-019-04226-5
Rakhimova DTurganbayeva A(2020)Approach to Extract Keywords and Keyphrases of Text Resources and Documents in the Kazakh LanguageComputational Collective Intelligence10.1007/978-3-030-63007-2_56(719-729)Online publication date: 30-Nov-2020
https://dl.acm.org/doi/10.1007/978-3-030-63007-2_56
Devika RSubramaniyaswamy V(2019)A semantic graph-based keyword extraction model using ranking method on big social dataWireless Networks10.1007/s11276-019-02128-x27:8(5447-5459)Online publication date: 4-Sep-2019
https://dl.acm.org/doi/10.1007/s11276-019-02128-x
Khatiwada STushev MMahmoud A(2018)Just enough semanticsInformation and Software Technology10.1016/j.infsof.2017.08.01293:C(45-57)Online publication date: 1-Jan-2018
https://dl.acm.org/doi/10.1016/j.infsof.2017.08.012
Quirchmayr TPaech BKohl RKarey HKasdepke G(2018)Semi-automatic rule-based domain terminology and software feature-relevant information extraction from natural language user manualsEmpirical Software Engineering10.1007/s10664-018-9597-623:6(3630-3683)Online publication date: 1-Dec-2018
https://dl.acm.org/doi/10.1007/s10664-018-9597-6
Ruas TGrosky W(2017)Keyword Extraction Through Contextual Semantic Analysis of DocumentsProceedings of the 9th International Conference on Management of Digital EcoSystems10.1145/3167020.3167043(150-156)Online publication date: 7-Nov-2017
https://dl.acm.org/doi/10.1145/3167020.3167043
Mahmoud ABradshaw G(2015)Estimating Semantic Relatedness in Source CodeACM Transactions on Software Engineering and Methodology10.1145/282425125:1(1-35)Online publication date: 2-Dec-2015
https://dl.acm.org/doi/10.1145/2824251
Show More Cited By

View Options

View options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Issue’s Table of Contents