DOI:10.1145/2047403.2047409
research-article

Learning semantic content-based profiles for cross-language recommendations

Published: 06 June 2011

Abstract

The exponential growth of the Web is the most influential factor contributing to the increasing importance of cross-lingual text retrieval and filtering systems. Relevant information exists in different languages, so users need to find documents in languages other than the one in which the query is formulated. In this context, an emerging requirement is to sift through the increasing flood of multilingual text, which poses a renewed challenge for designing effective multilingual Information Filtering systems. Content-based filtering systems adapt their behavior to individual users by learning their preferences from documents already deemed relevant. The learning process aims to construct a user profile that can later be exploited to select and recommend relevant items. User profiles are generally represented using keywords in a specific language. For example, if a user likes movies whose plots are written in Italian, a content-based filtering algorithm will learn a profile containing Italian words, and will therefore fail to recommend movies whose plots are written in English, even though they might be of interest. Moreover, keywords suffer from typical Information Retrieval problems such as polysemy and synonymy. In this paper, we propose a language-independent content-based recommender system, called MARS (MultilAnguage Recommender System), that builds cross-language user profiles by shifting from the traditional keyword-based text representation to a more complex language-independent representation based on word meanings. The proposed strategy relies on a knowledge-based word sense disambiguation technique that uses MultiWordNet as its sense inventory. As a consequence, content-based user profiles become language-independent and can be exploited to recommend items represented in a language different from the one used in the profile. Experiments conducted in a movie recommendation scenario show the effectiveness of the approach.
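
The idea of a sense-based, language-independent profile can be illustrated with a minimal sketch. It is not the MARS implementation: the sense inventory below is a hypothetical toy dictionary with made-up sense IDs, and a plain lookup stands in for the word sense disambiguation step that the paper performs against MultiWordNet. The function names (to_senses, build_profile, cosine) are likewise assumptions introduced for illustration.

```python
from collections import Counter
from math import sqrt

# Hypothetical toy sense inventory: (language, word) -> language-independent sense ID.
# The IDs are invented for this sketch; a real system would obtain them by
# disambiguating each word against an aligned resource such as MultiWordNet.
TOY_SENSES = {
    ("it", "guerra"): "s:war",
    ("en", "war"): "s:war",
    ("it", "soldato"): "s:soldier",
    ("en", "soldier"): "s:soldier",
    ("it", "amore"): "s:love",
    ("en", "love"): "s:love",
    ("en", "wedding"): "s:wedding",
}


def to_senses(text, lang):
    """Map a text to a bag of sense IDs, skipping words missing from the toy inventory."""
    words = text.lower().split()
    return Counter(TOY_SENSES[(lang, w)] for w in words if (lang, w) in TOY_SENSES)


def build_profile(liked_items):
    """Sum the sense bags of the (text, language) items the user liked."""
    profile = Counter()
    for text, lang in liked_items:
        profile.update(to_senses(text, lang))
    return profile


def cosine(a, b):
    """Cosine similarity between two sparse sense-count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


# Profile learned only from Italian plots.
profile = build_profile([("guerra soldato guerra", "it"), ("soldato amore", "it")])

# Candidate items described only in English.
war_movie = to_senses("a soldier goes to war", "en")
romance = to_senses("a wedding", "en")

print(cosine(profile, war_movie) > cosine(profile, romance))
```

Because both the profile and the candidate items are expressed as sense IDs rather than words, the Italian-trained profile can rank the English war plot above the unrelated one, which a keyword profile could not do without translation.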

Published In

PMHR '11: Proceedings of the First Workshop on Personalised Multilingual Hypertext Retrieval
June 2011
61 pages
ISBN:9781450308977
DOI:10.1145/2047403

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. content-based recommender system
  2. cross-language recommender system
  3. multiwordnet
  4. word sense disambiguation

Qualifiers

  • Research-article

Conference

HT '11