research-article

Multiple Choice Questions based Multi-Interest Policy Learning for Conversational Recommendation

Authors:

Qi Shen,

Bo Long,

Jian PeiAuthors Info & Claims

WWW '22: Proceedings of the ACM Web Conference 2022

Pages 2153 - 2162

https://doi.org/10.1145/3485447.3512088

Published: 25 April 2022 Publication History

Get Access

Abstract

Conversational recommendation system (CRS) is able to obtain fine-grained and dynamic user preferences based on interactive dialogue. Previous CRS assumes that the user has a clear target item, which often deviates from the real scenario, that is for many users who resort to CRS, they might not have a clear idea about what they really like. Specifically, the user may have a clear single preference for some attribute types (e.g. brand) of items, while for other attribute types (e.g. color), the user may have multiple preferences or even no clear preferences, which leads to multiple acceptable attribute instances (e.g. black and red) of one attribute type. Therefore, the users could show their preferences over items under multiple combinations of attribute instances rather than a single item with unique combination of all attribute instances. As a result, we first propose a more realistic conversational recommendation learning setting, namely Multi-Interest Multi-round Conversational Recommendation (MIMCR), where users may have multiple interests in attribute instance combinations and accept multiple items with partially overlapped combinations of attribute instances. To effectively cope with the new CRS learning setting, in this paper, we propose a novel learning framework, namely Multiple Choice questions based Multi-Interest Policy Learning (MCMIPL). In order to obtain user preferences more efficiently, the agent generates multiple choice questions rather than binary yes/no ones on specific attribute instance. Furthermore, we propose a union set strategy to select candidate items instead of existing intersection set strategy in order to overcome over-filtering items during the conversation. Finally, we design a Multi-Interest Policy Learning (MIPL) module, which utilizes captured multiple interests of the user to decide next action, either asking attribute instances or recommending items. Extensive experimental results on four datasets demonstrate the superiority of our method for the proposed MIMCR setting.

References

[1]

Richard Bellman and Robert Kalaba. 1957. On the role of dynamic programming in statistical communication theory. IRE Transactions on Information Theory 3, 3 (1957), 197–203.

Abstract

References

Cited By

Index Terms

Recommendations

Unified Conversational Recommendation Policy Learning via Graph-based Reinforcement Learning

Learning to Infer User Implicit Preference in Conversational Recommendation

Multi-view Hypergraph Contrastive Policy Learning for Conversational Recommendation

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Get Access

Login options

Full Access

View options

PDF

eReader

HTML Format

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations