DOI: 10.1145/3340531.3412721
Research article, CIKM Conference Proceedings

Learning to Infer User Hidden States for Online Sequential Advertising

Published: 19 October 2020
Abstract

    To drive purchases in online advertising, advertisers have a strong interest in optimizing the sequential advertising strategy, whose performance and interpretability are both important. The lack of interpretability in existing deep reinforcement learning methods makes the strategy difficult to understand, diagnose, and further optimize. In this paper, we propose our Deep Intents Sequential Advertising (DISA) method to address these issues. The key to interpretability is understanding a consumer's purchase intent, which is, however, unobservable (a hidden state). We model this intent as a latent variable and formulate the problem as a Partially Observable Markov Decision Process (POMDP), in which the underlying intents are inferred from observable behaviors. Large-scale industrial offline and online experiments demonstrate our method's superior performance over several baselines. We analyze the inferred hidden states, and the results support the rationality of our inference.
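    The abstract describes inferring unobservable purchase intents from observable behaviors under a POMDP formulation. As a generic illustration of the belief-update step any such formulation relies on (this is not the paper's DISA model; the state space, transition matrix, observation model, and all numbers below are invented for the sketch):

    ```python
    import numpy as np

    # Hypothetical 3-state purchase-intent space: browsing, considering, intending.
    # T[a][s, s'] = P(next state s' | current state s, ad action a);
    # O[o][s'] = P(observation o | hidden state s'). All values are illustrative.
    T = np.array([[[0.70, 0.25, 0.05],
                   [0.10, 0.70, 0.20],
                   [0.05, 0.15, 0.80]]])     # a single ad action, for brevity
    O = np.array([[0.8, 0.3, 0.1],           # o = 0: "no click"
                  [0.2, 0.7, 0.9]])          # o = 1: "click"

    def belief_update(b, a, o):
        """Standard POMDP belief update: b'(s') ∝ O(o|s') * sum_s T(s'|s,a) b(s)."""
        predicted = b @ T[a]                 # predict the next-state distribution
        unnormalized = predicted * O[o]      # weight by the observation likelihood
        return unnormalized / unnormalized.sum()

    b0 = np.array([1.0, 0.0, 0.0])           # user assumed to start in "browsing"
    b1 = belief_update(b0, a=0, o=1)         # observe a click after showing the ad
    # A click shifts belief mass toward the higher-intent states.
    ```

    In a deep RL setting such as the one the abstract outlines, the hand-specified matrices above would instead be learned from logged behavior, and the belief (or a learned summary of it) would feed the policy.
    
    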

    Supplementary Material

    MP4 File (3340531.3412721.mp4)


    Cited By

    View all
    • (2023) Online restless bandits with unobserved states. Proceedings of the 40th International Conference on Machine Learning, 15041-15066. DOI: 10.5555/3618408.3619021. Online publication date: 23-Jul-2023
    • (2022) KRAF: A Flexible Advertising Framework using Knowledge Graph-Enriched Multi-Agent Reinforcement Learning. Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 47-56. DOI: 10.1145/3511808.3557373. Online publication date: 17-Oct-2022

    Published In

    CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
    October 2020
    3619 pages
    ISBN:9781450368599
    DOI:10.1145/3340531

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. online advertising
    2. partially observable Markov decision process

    Qualifiers

    • Research-article

    Conference

    CIKM '20

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Article Metrics

    • Downloads (last 12 months): 28
    • Downloads (last 6 weeks): 3
    Reflects downloads up to 10 Aug 2024
