Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2247596.2247623acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

Dynamic diversification of continuous data

Published: 27 March 2012 Publication History

Abstract

Result diversification has recently attracted considerable attention as a means of increasing user satisfaction in recommender systems, as well as in web and database search. In this paper, we focus on the problem of selecting the k-most diverse items from a result set. Whereas previous research has mainly considered the static version of the problem, in this paper, we exploit the dynamic case in which the result set changes over time, as for example, in the case of notification services. We define the Continuous k-Diversity Problem along with appropriate constraints that enforce continuity requirements on the diversified results. Our proposed approach is based on cover trees and supports dynamic item insertion and deletion. The diversification problem is in general NP-complete; we provide theoretical bounds that characterize the quality of our solution based on cover trees with respect to the optimal solution. Finally, we report experimental results concerning the efficiency and effectiveness of our approach on a variety of real and synthetic datasets.

References

[1]
Faces dataset. http://www.informedia.cs.cmu.edu.
[2]
Flickr. http://www.flickr.com/.
[3]
Flickr dataset. Available at http://www.tagora-project.eu.
[4]
Forest cover dataset. Available at http://kdd.ics.uci.edu.
[5]
Greek cities dataset. Available at http://www.rtreeportal.org.
[6]
Twitter. http://www.twitter.com.
[7]
R. Agrawal, S. Gollapudi, A. Halverson, and S. Ieong. Diversifying search results. In WSDM, 2009.
[8]
A. Beygelzimer, S. Kakade, and J. Langford. Cover trees for nearest neighbor. In ICML, 2006.
[9]
R. Boim, T. Milo, and S. Novgorodov. Diversification and refinement in collaborative filtering recommender. In CIKM, 2011.
[10]
M. Drosou and E. Pitoura. Diversity over continuous data. IEEE Data Eng. Bull., 32(4):49--56, 2009.
[11]
M. Drosou and E. Pitoura. Search result diversification. SIGMOD Record, 39(1):41--47, 2010.
[12]
M. Drosou, K. Stefanidis, and E. Pitoura. Preference-aware publish/subscribe delivery with diversity. In DEBS, 2009.
[13]
E. Erkut. The discrete p-dispersion problem. European Journal of Operational Research, 46(1), 1990.
[14]
E. Erkut, Y. Ülküsal, and O. Yeniçerioglu. A comparison of p-dispersion heuristics. Computers & OR, 21(10), 1994.
[15]
S. Gollapudi and A. Sharma. An axiomatic approach for result diversification. In WWW, 2009.
[16]
J. R. Haritsa. The kndn problem: A quest for unity in diversity. IEEE Data Eng. Bull., 32(4):15--22, 2009.
[17]
T. Kollar. Fast Nearest Neighbors. http://nicksgroup.csail.mit.edu/TK/Technical_Reports/covertrees.pdf.
[18]
B. Liu and H. V. Jagadish. Using trees to depict a forest. PVLDB, 2(1):133--144, 2009.
[19]
Z. Liu, P. Sun, and Y. Chen. Structured search result differentiation. PVLDB, 2(1), 2009.
[20]
E. Minack, W. Siberski, and W. Nejdl. Incremental diversification for very large sets: a streaming-based approach. In SIGIR, 2011.
[21]
E. Minack, W. Siberski, and W. Nejdl. Incremental diversification for very large sets: a streaming-based approach. In SIGIR, 2011.
[22]
A. Tamir. Obnoxious facility location on graphs. SIAM J. Discrete Math., 4(4):550--567, 1991.
[23]
E. Vee, U. Srivastava, J. Shanmugasundaram, P. Bhat, and S. Amer-Yahia. Efficient computation of diverse query results. In ICDE, 2008.
[24]
M. R. Vieira, H. L. Razente, M. C. N. Barioni, M. Hadjieleftheriou, D. Srivastava, C. T. Jr., and V. J. Tsotras. On query result diversification. In ICDE, 2011.
[25]
C. Yu, L. V. S. Lakshmanan, and S. Amer-Yahia. It takes variety to make a world: diversification in recommender systems. In EDBT, 2009.
[26]
C. -N. Ziegler, S. M. McNee, J. A. Konstan, and G. Lausen. Improving recommendation lists through topic diversification. In WWW, 2005.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
EDBT '12: Proceedings of the 15th International Conference on Extending Database Technology
March 2012
643 pages
ISBN:9781450307901
DOI:10.1145/2247596
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 March 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. continuous
  2. diversity
  3. top-k

Qualifiers

  • Research-article

Conference

EDBT '12

Acceptance Rates

Overall Acceptance Rate 7 of 10 submissions, 70%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 26 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2020)Serendipity-based Points-of-Interest NavigationACM Transactions on Internet Technology10.1145/339119720:4(1-32)Online publication date: 1-Oct-2020
  • (2019)Spatial keyword search: a surveyGeoInformatica10.1007/s10707-019-00373-yOnline publication date: 4-Jul-2019
  • (2018)RC-indexProceedings of the VLDB Endowment10.14778/3192965.319296911:7(773-786)Online publication date: 1-Mar-2018
  • (2018)Beyond Greedy SearchProceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval10.1145/3234944.3234967(99-106)Online publication date: 10-Sep-2018
  • (2018)Advisory Search and Security on Data Mining using Clustering Approaches2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT)10.1109/ICICCT.2018.8473252(256-261)Online publication date: Apr-2018
  • (2018)Relevant Filtering in a Distributed Content‐based Publish/Subscribe SystemNoSQL Data Models10.1002/9781119528227.ch7(203-244)Online publication date: 6-Aug-2018
  • (2017)A survey of query result diversificationKnowledge and Information Systems10.1007/s10115-016-0990-451:1(1-36)Online publication date: 1-Apr-2017
  • (2017)Diversity-Aware Continuous Top-k Queries in Social NetworksOn the Move to Meaningful Internet Systems. OTM 2017 Conferences10.1007/978-3-319-69462-7_7(84-92)Online publication date: 20-Oct-2017
  • (2017)Continuous Summarization over Microblog ThreadsDatabase Systems for Advanced Applications10.1007/978-3-319-55699-4_31(511-526)Online publication date: 22-Mar-2017
  • (2016)Diversified set monitoring over distributed data streamsProceedings of the 10th ACM International Conference on Distributed and Event-based Systems10.1145/2933267.2933298(1-12)Online publication date: 13-Jun-2016
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media