Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Automatic keyword prediction using Google similarity distance

Published: 01 March 2010 Publication History

Abstract

In this paper, we present a new approach to help users using search engines without entering any keywords. What we want to do is to predict what word the users may want to search before they think about it. Most of the studies done in this field focus on how to help users enter keywords or how to re-rank the search results in order to make them more precise. Both of those methods need to establish a user behavior model and a repository in which to save the logs. In our proposed method, we use the Google similarity distance to measure keywords in the Webpage to find the potential keywords for the users. Thus, we do not need any repository. All the executions are on-line and real-time. Then, we extract all the important keywords as the potential search keywords. In this way, we can use these professional keywords to achieve precise search results. We believe that this can be useful in many areas such as e-learning and can also be used in mobile devices.

References

[1]
Learning to find answers to questions on the web. ACM Transactions on Internet Technology. v4 i2. 129-162.
[2]
SearchPad: Explicit capture of search context to support web search. Computer Networks. v33 i1-6. 493-501.
[3]
Evaluating variable-length Markov chain models for analysis of user web navigation sessions. IEEE Transactions on Knowledge and Data Engineering. v19 i4. 441-452.
[4]
Chien, L. F. (1997). PAT-tree-based keyword extraction for Chinese information retrieval. In Proceedings of the 20th annual international ACM SIGIR conference on research and development in information retrieval (pp. 50-59).
[5]
The Google similarity distance. IEEE Transactions on Knowledge and Data Engineering. v19 i3. 370-383.
[6]
How well does the world wide web represent human language?. The Economist.
[7]
Using lexical chains for keyword extraction. Information Processing and Management. v43 i6. 1705-1714.
[8]
Mining text using keywords distributions. Journal of Intelligent Information Systems. v10 i3. 281-300.
[9]
Placing search in context: The concept revisited. ACM Transactions on Information Systems. v20 i1. 116-131.
[10]
Information retrieval and artificial intelligence. Artificial Intelligence. v114 i1-2. 257-281.
[11]
Methods for comparing rankings of search engine results. Computer Networks. v50 i10. 1448-1463.
[12]
Fussy cognitive map approach to web-mining inference amplification. Expert System with Applications. v22. 197-211.
[13]
An information filtering model on the web and its application in jobagent. Knowledge-Based Systems. v13 i5. 285-296.
[14]
Personalized web search for improving retrieval effectiveness. IEEE Transactions on Knowledge and Data Engineering. v16 i1. 28-40.
[15]
Web log mining. Web Intelligence. 174-194.
[16]
Keyword extraction from a single document using word co-ocuurrence statistical information. International Journal on Artificial Intelligence Tools. v13 i1. 157-169.
[17]
On the peninsula phenomenon in web graph and its implications on web search. Computer Networks. v51 i1. 177-189.
[18]
A theory of term importance in automatic text analysis. Journal of the American society for Information Science. v26 i1. 33-44.
[19]
Word length, sentence length and frequency - Zipf revisited. Studia Linguistica. v58 i1. 37-52.
[20]
Machine learning in automated text categorization. ACM Computing Surveys. v34 i1. 1-47.
[21]
Improving the effectiveness of information retrieval with local context analysis. ACM Transactions on Information Systems (TOIS). v18 i1. 79-112.
[22]
Yang, Q., Zhang, H., Tian, I.&Li, Y. (2001). Mining web logs for prediction models in WWW caching and prefetching. In Proceedings of the seventh ACM SIGKDD international conference on knowledge discovery and data mining (pp. 473-478).
[23]
Representation and construction of ontologies for web intelligence. International Journal of Foundation of Computer Science. v13 i4. 555-570.

Cited By

View all
  • (2023)New information search model for online reviews with the perspective of user requirementsMultimedia Tools and Applications10.1007/s11042-023-14847-782:18(28165-28185)Online publication date: 20-Feb-2023
  • (2023)Sentiment analysis: A survey on design framework, applications and future scopesArtificial Intelligence Review10.1007/s10462-023-10442-256:11(12505-12560)Online publication date: 20-Mar-2023
  • (2020)Unsupervised Keyword Extraction Methods Based on a Word Graph NetworkInternational Journal of Ambient Computing and Intelligence10.4018/IJACI.202004010411:2(68-79)Online publication date: 1-Apr-2020
  • Show More Cited By
  1. Automatic keyword prediction using Google similarity distance

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Expert Systems with Applications: An International Journal
    Expert Systems with Applications: An International Journal  Volume 37, Issue 3
    March, 2010
    901 pages

    Publisher

    Pergamon Press, Inc.

    United States

    Publication History

    Published: 01 March 2010

    Author Tags

    1. Prediction
    2. Recommendation
    3. Search engines

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 09 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)New information search model for online reviews with the perspective of user requirementsMultimedia Tools and Applications10.1007/s11042-023-14847-782:18(28165-28185)Online publication date: 20-Feb-2023
    • (2023)Sentiment analysis: A survey on design framework, applications and future scopesArtificial Intelligence Review10.1007/s10462-023-10442-256:11(12505-12560)Online publication date: 20-Mar-2023
    • (2020)Unsupervised Keyword Extraction Methods Based on a Word Graph NetworkInternational Journal of Ambient Computing and Intelligence10.4018/IJACI.202004010411:2(68-79)Online publication date: 1-Apr-2020
    • (2020)Extraction of non-functional requirement using semantic similarity distanceNeural Computing and Applications10.1007/s00521-019-04226-532:11(7383-7397)Online publication date: 1-Jun-2020
    • (2020)Approach to Extract Keywords and Keyphrases of Text Resources and Documents in the Kazakh LanguageComputational Collective Intelligence10.1007/978-3-030-63007-2_56(719-729)Online publication date: 30-Nov-2020
    • (2019)A semantic graph-based keyword extraction model using ranking method on big social dataWireless Networks10.1007/s11276-019-02128-x27:8(5447-5459)Online publication date: 4-Sep-2019
    • (2018)Just enough semanticsInformation and Software Technology10.1016/j.infsof.2017.08.01293:C(45-57)Online publication date: 1-Jan-2018
    • (2018)Semi-automatic rule-based domain terminology and software feature-relevant information extraction from natural language user manualsEmpirical Software Engineering10.1007/s10664-018-9597-623:6(3630-3683)Online publication date: 1-Dec-2018
    • (2017)Keyword Extraction Through Contextual Semantic Analysis of DocumentsProceedings of the 9th International Conference on Management of Digital EcoSystems10.1145/3167020.3167043(150-156)Online publication date: 7-Nov-2017
    • (2015)Estimating Semantic Relatedness in Source CodeACM Transactions on Software Engineering and Methodology10.1145/282425125:1(1-35)Online publication date: 2-Dec-2015
    • Show More Cited By

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media