Abstract
This paper drew up a personalized recommender system model combined the text categorization with the pagerank. The document or the page was considered in two sides: the content of the document and the domain it belonged to. The features were extracted in order to form the feature vector, which would be used in computing the difference between the documents or keywords with the user’s interests and the given domain. It set up the structure of four block levels in information management of a website. The link information was downloaded in the domain block level, which is the top level of the structure. In the host block level, the links were divided into two parts, the inter-link and the intra-link. All links were setup with different weights. The stationary eigenvector of the link matrix was calculated. The final order of documents was determined by the vector distance and the eigenvector of the link matrix.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Wang, J.-C., et al.: State of the Art of Information Retrieval on the Web. Journal of Computer Research & Development 38(2), 187–193 (2001)
Kempf, J.: Evolving the Internet Addressing Architecture. In: Proceedings of the 2004 International Symposium on Applications and the Internet (SAINT 2004) (2004)
Rafiei, D., Mendelzon, A.: What is this page known for Computing web page reputations. In: 9th International World Wide Web Conference, Amsterdam, Netherlands (May 2000)
Shaw, N.G., et al.: A comprehensive agent-based architecture for intelligent information retrieval in a distributed heterogeneous environment. Decision Support Systems 32, 401–415 (2002)
Lempel, R., Moran, S.: The stochastic approach for link-structure analysis (SALSA) and the TKC effect. In: 9th International World Wide Web Conference, Amsterdam, Netherlands (May 2000)
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank Citation Ranking: Bringing Order to the Web. Stanford Digital Libraries Working Paper (1998)
Haveliwala, T.: Efficient computation of PageRank. Technical report, Computer Science Department, Stanford University (1999)
Xing, W., Ghorbani, A.: Weighted PageRank Algorithm. In: Second Annual Conference on Communication Networks and Services Research (CNSR 2004), pp. 305–314 (2004)
Soucy, P., Mineau, G.W.: Beyond TFIDF Weighting for Text Categorization in the Vector Space Model. In: Proceeding of IJCAI-2005, pp. 1136–1141 (2005)
Guo, G., Wang, H., Bell, D.A., Bi, Y., Greer, K.: An kNN Model-Based Approach and Its Application in Text Categorization. In: Gelbukh, A. (ed.) CICLing 2004. LNCS, vol. 2945, pp. 559–570. Springer, Heidelberg (2004)
Steinberger, R., Pouliquen, B., Hagman, J.: Cross-Lingual Document Similarity Calculation Using the Multilingual Thesaurus EUROVOC. In: Gelbukh, A. (ed.) CICLing 2002. LNCS, vol. 2276, pp. 415–424. Springer, Heidelberg (2002)
Jing, L.-P., Huang, H.-K.: Improved feature selection approach tfidf in text mining - Machine. In: Proceedings of the First International Conference of Machine learning and Cybernetics, 2002, pp. 944–946 (2002)
Bianchini, M., Gori, M., Scarselli, F.: Inside Pagerank. ACM Transactions on Internet Technology 5(1), 92–128 (2005)
Jiang, X.-M., Xue, G.-R., Song, W.-G., Zeng, H.-J., Chen, Z., Ma, W.-Y.: Exploiting PageRank at Different Block Level. In: Zhou, X., Su, S., Papazoglou, M.P., Orlowska, M.E., Jeffery, K. (eds.) WISE 2004. LNCS, vol. 3306, pp. 241–252. Springer, Heidelberg (2004)
Arslan, B., Ricci, F., Mirzadeh, N., Venturini, A.: A dynamic approach to feature weighting. In: Proceedings of Data Mining 2002 Conference (2002)
Joachims, T.: A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization. In: Proceedings of the Fourteenth International Conference on Machine Learning table of contents (ICML), pp. 143–151 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, Hm., Guo, Y., Feng, Bq. (2006). Optimizing Personalized Retrieval System Based on Web Ranking. In: Grigoriev, D., Harrison, J., Hirsch, E.A. (eds) Computer Science – Theory and Applications. CSR 2006. Lecture Notes in Computer Science, vol 3967. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11753728_63
Download citation
DOI: https://doi.org/10.1007/11753728_63
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34166-6
Online ISBN: 978-3-540-34168-0
eBook Packages: Computer ScienceComputer Science (R0)