Reinforcement Learning for Inventory Management | SpringerLink

Siddharth Singi⁹,
Siddharth Gopal⁹,
Shashikant Auti⁹ &
…
Rohit Chaurasia⁹

Part of the book series: Lecture Notes in Mechanical Engineering ((LNME))

2033 Accesses

Abstract

A comparison between four common reinforcement learning algorithms, namely deep Q network (DQN), double deep Q network (DDQN), prioritized experience reply (DQN + PER) and double DQN + PER; and discussion on the methodology with the limitations and advantages of each algorithm are included in this paper. In order to provide these insights, OpenAI environments that demonstrate the working of these algorithms was used. Mountain car environment was used to generalize our results and prove the consistency of our insights. Insights were derived by evaluating basic parameters like, episode length, minimum rewards, maximum rewards and average rewards. This study discusses strategies for including reinforcement learning in supply chain management by using it for inventory management.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Reinforcement Learning for Inventory Management

Chapter © 2020

Solving Inventory Management Problems through Deep Reinforcement Learning

Article 10 December 2022

Hybrid algorithm based on reinforcement learning for smart inventory management

Article Open access 03 August 2022

References

Christopher M (1992) Logistic and supply chain management. Pitman Publishing, London
Google Scholar
Watkins CJCH, Dayan P (1992) Q-learning. Mach Learn 8(3–4):279–292
MATH Google Scholar
Lin L-J (1993) Reinforcement learning for robots using neural networks. Technical report, DTIC Document
Google Scholar
Baird L (1995) Residual algorithms: reinforcement learning with function approximation. In: Machine learning: proceedings of the twelfth international conference, pp 30–37
Google Scholar
Kaelbling LP, Littman ML, Moore AW (1996) Reinforcement learning: a survey. J Artif Intell Res 4:237–285
Article Google Scholar
Barbuceanu M, Fox MS (1996) Coordinating multiple agents in the supply chain. In: Proceedings of the fifth workshop on enabling technology for collaborative enterprises (WET ICE’96), Stanford University, CA, pp 134–141
Google Scholar
Sutton R, Barto A (1998) Reinforcement learning: an introduction. MIT Press, Cambridge
MATH Google Scholar
Diuk C, Cohen A, Littman ML (2008) An object-oriented representation for efficient reinforcement learning. In: Proceedings of the 25th international conference on machine learning, pp 240–247
Google Scholar
van Hasselt, H (2011) Insights in reinforcement learning. Ph.D. thesis, Utrecht University
Google Scholar
Sutton RS, Mahmood AR, White M (2015) An emphatic approach to the problem of off-policy temporal-difference learning. arXiv preprint arXiv:1503.04269
Van Hasselt H, Guez A, Silver D (2015) Deep reinforcement learning with double Q-learning
Google Scholar
Schaul T, Quan J, Antonoglou I, Silver D (2016) Prioritized experience replay
Google Scholar

Download references

Author information

Authors and Affiliations

Mechanical Engineering Department, Dwarkadas J. Sanghvi College of Engineering, Vile Parle, Mumbai, India
Siddharth Singi, Siddharth Gopal, Shashikant Auti & Rohit Chaurasia

Authors

Siddharth Singi
View author publications
You can also search for this author in PubMed Google Scholar
Siddharth Gopal
View author publications
You can also search for this author in PubMed Google Scholar
Shashikant Auti
View author publications
You can also search for this author in PubMed Google Scholar
Rohit Chaurasia
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Siddharth Singi .

Editor information

Editors and Affiliations

Dwarkadas J. Sanghvi College of Engineering, Mumbai, India
Hari Vasudevan
Dwarkadas J. Sanghvi College of Engineering, Mumbai, India
Vijaya Kumar N. Kottur
RWTH Aachen University, Aachen, Germany
Amool A. Raina

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Singi, S., Gopal, S., Auti, S., Chaurasia, R. (2020). Reinforcement Learning for Inventory Management. In: Vasudevan, H., Kottur, V., Raina, A. (eds) Proceedings of International Conference on Intelligent Manufacturing and Automation. Lecture Notes in Mechanical Engineering. Springer, Singapore. https://doi.org/10.1007/978-981-15-4485-9_33

Download citation

DOI: https://doi.org/10.1007/978-981-15-4485-9_33
Published: 01 July 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4484-2
Online ISBN: 978-981-15-4485-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions