Multidimensional Dynamic Pricing for Welfare Maximization

Authors: Aleksandrs Slivkins, Jonathan Ullman, Zhiwei Steven Wu

ACM Transactions on Economics and Computation (TEAC), Volume 8, Issue 1

Article No.: 6, Pages 1 - 35

https://doi.org/10.1145/3381527

Published: 17 April 2020

Abstract

We study the problem of a seller dynamically pricing d distinct types of indivisible goods when faced with the online arrival of unit-demand buyers drawn independently from an unknown distribution. The goods are not in limited supply, but they can be produced only at a limited rate and are costly to produce. Each day, the seller observes only the bundle of goods purchased and nothing else about the buyer’s valuation function. Our main result is a dynamic pricing algorithm for optimizing welfare (including the seller’s cost of production) whose running time and number of rounds are both polynomial in d and the approximation parameter. We achieve this even though (i) the price-response function is not continuous, and even its fractional relaxation is a non-concave function of the prices, and (ii) the welfare is not observable to the seller.
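To make the feedback model concrete, the following minimal Python sketch shows how a unit-demand buyer might respond to posted prices and how the resulting welfare could be accounted for. It assumes quasi-linear utility (value minus price), ties broken toward buying nothing or the lower-indexed good, and a constant per-unit production cost. None of these details are stated in the abstract, and the function names are hypothetical.

```python
from typing import Optional, Sequence


def unit_demand_purchase(values: Sequence[float],
                         prices: Sequence[float]) -> Optional[int]:
    """Return the index of the single good the buyer purchases, or None.

    Assumption: the buyer maximizes value minus price and buys nothing
    unless some good gives strictly positive utility.
    """
    best_good: Optional[int] = None
    best_utility = 0.0  # utility of the outside option (buy nothing)
    for good, (value, price) in enumerate(zip(values, prices)):
        utility = value - price
        if utility > best_utility:
            best_good, best_utility = good, utility
    return best_good


def realized_welfare(values: Sequence[float],
                     costs: Sequence[float],
                     purchase: Optional[int]) -> float:
    """Welfare of one sale: buyer's value minus the seller's production cost.

    Assumption: a constant per-unit cost for each good. Prices cancel out
    because they are a transfer from buyer to seller.
    """
    if purchase is None:
        return 0.0
    return values[purchase] - costs[purchase]
```

In a toy instance with values [1.0, 0.9] and the first price fixed at 0.5, lowering the second price from 0.41 to 0.39 flips the purchase from good 0 to good 1. This is the kind of jump in the price response that item (i) refers to. The seller would see only the returned index, never the value that enters the welfare, which is the difficulty named in item (ii).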

We derive this result as an application of a general technique for optimizing welfare over divisible goods, which is of independent interest. When buyers have strongly concave, Hölder continuous valuation functions over d divisible goods, we give a general polynomial-time dynamic pricing technique. We apply this technique to the setting of unit-demand buyers even though, in that setting, the goods are not divisible and the natural fractional relaxation of a unit-demand valuation is not strongly concave. To do so, we introduce a novel price randomization procedure that has the effect of implicitly inducing buyers to “regularize” their valuations with a strongly concave function. Finally, we extend our results to a limited-supply setting in which the supply of each good cannot be replenished.
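As an illustration of how randomizing prices can smooth a discontinuous price response, the sketch below perturbs each posted price with independent Gumbel noise of scale eta. By the standard Gumbel-max identity, the buyer's choice frequencies then match a softmax of the utilities, and the softmax is the maximizer of expected utility plus eta times the Shannon entropy over the simplex, that is, an entropy-regularized choice. The abstract does not say which noise distribution or regularizer the paper uses, so the Gumbel choice, the restriction to buyers who always purchase some good, and all names here are assumptions.

```python
import math
import random
from typing import List, Sequence


def smoothed_choice_probs(values: Sequence[float],
                          prices: Sequence[float],
                          eta: float) -> List[float]:
    """Softmax of utilities (v_i - p_i) / eta: the entropy-regularized choice."""
    scores = [(v - p) / eta for v, p in zip(values, prices)]
    top = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


def gumbel(rng: random.Random) -> float:
    """Draw one standard Gumbel sample by inverse-CDF sampling."""
    u = rng.random()
    while u == 0.0:  # avoid log(0); random() lies in [0, 1)
        u = rng.random()
    return -math.log(-math.log(u))


def simulate_choice_freqs(values: Sequence[float],
                          prices: Sequence[float],
                          eta: float,
                          rounds: int,
                          seed: int = 0) -> List[float]:
    """Empirical purchase frequencies under Gumbel-perturbed prices.

    Assumption: the buyer always buys the good with the highest utility
    (the option of buying nothing is ignored in this sketch).
    """
    rng = random.Random(seed)
    counts = [0] * len(values)
    for _ in range(rounds):
        # Posting price p_i - eta * G_i gives utility v_i - p_i + eta * G_i.
        utilities = [v - p + eta * gumbel(rng)
                     for v, p in zip(values, prices)]
        counts[utilities.index(max(utilities))] += 1
    return [c / rounds for c in counts]
```

With deterministic prices, a small price change can flip the purchase outright; under this perturbation the purchase probabilities move continuously with the prices, and a larger eta gives a smoother but less price-sensitive response.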

Cited By

D. Rokhlin. 2024. On the Dual Gradient Descent Method for the Resource Allocation Problem in Multiagent Systems. Journal of Applied and Industrial Mathematics 18, 2 (2024), 316-332. Online publication date: 15-Aug-2024.
https://doi.org/10.1134/S1990478924020133
K. Harris, H. Heidari, and Z. Wu. 2021. Stateful Strategic Regression. In Proceedings of the 35th International Conference on Neural Information Processing Systems, M. Ranzato, A. Beygelzimer, Y. Dauphin, P. Liang, and J. Vaughan (Eds.). 28728-28741. Online publication date: 6-Dec-2021.
https://dl.acm.org/doi/10.5555/3540261.3542462
I. Ashlagi, J. Leshno, P. Qian, and A. Saberi. Price Discovery in Waiting Lists: A Connection to Stochastic Gradient Descent. SSRN Electronic Journal.
https://doi.org/10.2139/ssrn.4192003
A. Nikzad and P. Strack. Equity in Dynamic Matching: Extreme Waitlist Policies. SSRN Electronic Journal.
https://doi.org/10.2139/ssrn.4007681

Index Terms

Multidimensional Dynamic Pricing for Welfare Maximization
1. Theory of computation
  1. Theory and algorithms for application domains
    1. Algorithmic game theory and mechanism design
      1. Algorithmic mechanism design
    2. Machine learning theory
      1. Online learning theory

Published In

ACM Transactions on Economics and Computation Volume 8, Issue 1

Special Issue on EC'17

February 2020

150 pages

ISSN:2167-8375

EISSN:2167-8383

DOI:10.1145/3387139

Editors: David Pennock (Microsoft), Ilya Segal (Stanford University)

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 April 2020

Accepted: 01 March 2020

Revised: 01 December 2018

Received: 01 May 2017

Published in TEAC Volume 8, Issue 1

Qualifiers

Research-article
Research
Refereed

Funding Sources

Sloan Foundation Fellowship
NSF
DARPA
