
Part of the book series: Lecture Notes in Mechanical Engineering ((LNME))

Abstract

This paper compares four common reinforcement learning algorithms: deep Q-network (DQN), double deep Q-network (DDQN), DQN with prioritized experience replay (DQN + PER), and double DQN with prioritized experience replay (double DQN + PER). It discusses the methodology together with the limitations and advantages of each algorithm. To provide these insights, OpenAI environments that demonstrate the working of these algorithms were used. The Mountain Car environment was used to generalize our results and confirm the consistency of our insights. Insights were derived by evaluating basic metrics such as episode length, minimum reward, maximum reward, and average reward. The study also discusses strategies for bringing reinforcement learning into supply chain management by applying it to inventory management.
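For readers unfamiliar with how the four variants differ, the following is a minimal sketch of the standard DQN and double DQN bootstrap targets and of proportional prioritized sampling. It is not the authors' implementation: the function names, the discount factor `gamma`, and the exponent `alpha` are assumptions chosen for illustration only.

```python
from typing import List, Sequence


def dqn_target(reward: float, done: bool,
               next_q_target: Sequence[float], gamma: float = 0.99) -> float:
    """DQN target: the target network both selects and evaluates the next action."""
    if done:
        return reward  # no bootstrapping from a terminal state
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward: float, done: bool,
                      next_q_online: Sequence[float],
                      next_q_target: Sequence[float],
                      gamma: float = 0.99) -> float:
    """Double DQN target: the online network selects the action,
    the target network evaluates it."""
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


def per_probabilities(td_errors: Sequence[float],
                      alpha: float = 0.6, eps: float = 1e-6) -> List[float]:
    """Proportional prioritization: sampling probability grows with |TD error|.
    alpha and eps are illustrative values (assumptions), not taken from the paper."""
    priorities = [(abs(e) + eps) ** alpha for e in td_errors]
    total = sum(priorities)
    return [p / total for p in priorities]


if __name__ == "__main__":
    # Hypothetical Q-value estimates for the three Mountain Car actions.
    q_online = [1.0, 2.0, 0.5]
    q_target = [3.0, 1.0, 0.0]
    print(dqn_target(-1.0, False, q_target, gamma=0.9))
    print(double_dqn_target(-1.0, False, q_online, q_target, gamma=0.9))
    print(per_probabilities([0.1, 2.0, 0.5]))
```

In this toy input the two networks disagree about the best next action, so the DQN target takes the largest target-network value while the double DQN target uses the value of the action preferred by the online network. That decoupling is what reduces overestimation in double DQN. The DQN + PER and double DQN + PER variants keep the same targets and change only how transitions are sampled from the replay buffer.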

Author information

Corresponding author

Correspondence to Siddharth Singi.

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Singi, S., Gopal, S., Auti, S., Chaurasia, R. (2020). Reinforcement Learning for Inventory Management. In: Vasudevan, H., Kottur, V., Raina, A. (eds) Proceedings of International Conference on Intelligent Manufacturing and Automation. Lecture Notes in Mechanical Engineering. Springer, Singapore. https://doi.org/10.1007/978-981-15-4485-9_33

  • DOI: https://doi.org/10.1007/978-981-15-4485-9_33

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-4484-2

  • Online ISBN: 978-981-15-4485-9

  • eBook Packages: Engineering, Engineering (R0)
