
Markov-Renewal Programming. II: Infinite Return Models, Example

Published: 01 December 1963

Abstract

This paper is a continuation of a previous one, which investigated programming over a Markov-renewal process---one in which the intervals between transitions of a system from state i to state j are independent samples from a distribution that may depend upon both i and j. Given a reward structure, and a decision mechanism that influences both the rewards and the Markov-renewal process, the problem is to select alternatives at each transition so as to maximize total expected reward. The first portion of the paper investigated various finite-return models. In this part of the paper, we investigate the infinite-return models, where it becomes necessary to consider only stationary policies that maximize the dominant term in the reward. It is then important to specify whether the limiting experiment is (I) undiscounted, with the number of transitions n → ∞; (II) undiscounted, with a time horizon t → ∞; or (III) discounted, with infinite n or t and discount factor a → 0. In each case, a limiting form for the total expected reward is shown, and an algorithm is developed to maximize the rate of return. The problem of finding the optimal or near-optimal policies in the case of ties is still computationally unresolved. Extensions to nonergodic processes are indicated, and special results for the two-state process are presented. Finally, an example of machine maintenance and repair is used to illustrate the generality of the models and the special problems that may arise.
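The rate-of-return maximization described in the abstract is, in the undiscounted ergodic case, a semi-Markov generalization of Howard's policy-iteration cycle: evaluate a stationary policy to find its gain g (reward per unit time) and relative values, then improve the policy state by state. The sketch below is an illustrative reconstruction under that interpretation, not the paper's own algorithm; all names and the two-state machine-maintenance numbers are invented for the example.

```python
# Policy-iteration sketch for a semi-Markov (Markov-renewal) decision
# process with a single ergodic class, maximizing the long-run reward
# rate g.  Data layout: for action a and state i,
#   P[a][i][j] = transition probability to state j,
#   r[a][i]    = expected reward earned per transition,
#   tau[a][i]  = expected holding time until the next transition.
import numpy as np

def policy_iteration(P, r, tau, max_iter=100):
    n_actions, n_states, _ = P.shape
    policy = np.zeros(n_states, dtype=int)
    for _ in range(max_iter):
        # Evaluation: solve  v_i + g*tau_i = r_i + sum_j P_ij v_j
        # with the normalization v[n-1] = 0, so the unknowns are
        # (v_0, ..., v_{n-2}, g).
        Pp = P[policy, np.arange(n_states)]
        rp = r[policy, np.arange(n_states)]
        tp = tau[policy, np.arange(n_states)]
        A = np.eye(n_states) - Pp
        A[:, -1] = tp                    # column for v[n-1] becomes g's column
        sol = np.linalg.solve(A, rp)
        v = np.append(sol[:-1], 0.0)
        g = sol[-1]
        # Improvement: in each state pick the action maximizing the
        # test quantity  r(i,a) - g*tau(i,a) + sum_j P(j|i,a) v_j.
        test = r - g * tau + P @ v
        new_policy = test.argmax(axis=0)
        if np.array_equal(new_policy, policy):
            return policy, g
        policy = new_policy
    return policy, g

# Invented two-state example (0 = machine working, 1 = machine failed;
# action 0 = cheap upkeep/repair, action 1 = thorough upkeep/repair).
P = np.array([[[0.80, 0.20], [1.0, 0.0]],
              [[0.95, 0.05], [1.0, 0.0]]])
r = np.array([[100.0, -50.0], [90.0, -30.0]])   # reward per transition
tau = np.array([[1.0, 2.0], [1.0, 1.0]])        # mean holding times
policy, g = policy_iteration(P, r, tau)
```

On this made-up data the thorough-upkeep action wins in both states, and g is the corresponding stationary reward per unit time. Note that the algorithm compares policies only through the dominant (rate) term, which is exactly why the abstract's tie cases remain delicate: distinct policies can share the same g.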

References

[1]
R. E. BARLOW, "Applications of Semi-Markov Processes to Counter Problems," Studies in Applied Probability and Management Science, Chapter 3 (K. J. ARROW, S. KARLIN, AND H. SCARF, eds.), Stanford University Press, Stanford, California, 1962.
[2]
R. BELLMAN, Dynamic Programming, Princeton University Press, Princeton, New Jersey, 1957.
[3]
R. BELLMAN, "A Markovian Decision Process," J. Math. and Mech. 6, 679-684 (1957).
[4]
R. BELLMAN AND S. E. DREYFUS, Applied Dynamic Programming, Princeton University Press, Princeton, New Jersey, 1962.
[5]
D. BLACKWELL, "On the Functional Equation of Dynamic Programming," J. Math. Analysis and Appl. 2 (1961).
[6]
D. BLACKWELL, "Discrete Dynamic Programming," Ann. Math. Stat. 33, 719-726 (1962).
[7]
J. S. DECANI, "On the Transient and Asymptotic Behavior of a Markov Chain 'Embedded' in Continuous Time," and "A Dynamic Programming Algorithm for Embedded Markov Chains," mimeographed, Aeronautical Computer Laboratory, U. S. Naval Air Development Center, Johnsville, Pennsylvania, 22 April 1963.
[8]
F. D'EPENOUX, "Sur un problème de Production et de Stockage dans l'Aléatoire," Revue Française de Recherche Opérationnelle, No. 14, 3-16 (1960).
[9]
C. DERMAN, "On Sequential Decisions and Markov Chains," Manag. Sci. 9, 16-24 (1962).
[10]
R. A. HOWARD, Dynamic Programming and Markov Processes, Technology Press and John Wiley &amp; Sons, New York, 1960.
[11]
W. S. JEWELL, "The Properties of Recurrent-Event Processes," Opns. Res. 8, 446-472 (1960).
[12]
W. S. JEWELL, "Markov Renewal Programming," Research Report 37, Operations Research Center, Institute of Engineering Research, University of California, Berkeley, 23 October 1962.
[13]
J. G. KEMENY AND J. L. SNELL, Finite Markov Chains, D. Van Nostrand, Princeton, New Jersey, 1960.
[14]
M. KLEIN, "Inspection-Maintenance-Replacement Schedules under Markovian Deterioration," Manag. Sci. 9, 25-32 (1962).
[15]
A. S. MANNE, "Linear Programming and Sequential Decisions," Manag. Sci. 6, 259-267 (1960).
[16]
R. M. OLIVER, "A Linear Programming Formulation of Some Markov Decision Processes," presented at a meeting of the Institute of Management Sciences and the Operations Research Society of America, Monterey, California, April, 1960.
[17]
R. PYKE, "Markov Renewal Processes: Definitions and Preliminary Properties," Ann. Math. Stat. 32, 1231-1242 (1961).
[18]
R. PYKE, "Markov Renewal Processes with Finitely Many States," Ann. Math. Stat. 32, 1243-1259 (1961).
[19]
R. PYKE, "Markov Renewal Processes of Zero Order and Their Applications to Counter Theory," Studies in Applied Probability and Management Science, Chapter 11 (K. J. ARROW, S. KARLIN, AND H. SCARF, eds.), Stanford University Press, Stanford, California, 1962.
[20]
R. PYKE, "Limit Theorems for Markov Renewal Processes," Technical Report No. 24, Columbia University (1961).
[21]
P. SCHWEITZER, private communication, March, 1963.
[22]
W. L. SMITH, "Asymptotic Renewal Theorems," Proc. Roy. Soc. Edinburgh, Series A 64, 9-48 (1954).
[23]
P. WOLFE AND G. B. DANTZIG, "Linear Programming in a Markov Chain," Opns. Res. 10, 702-710 (1962).

Published In

cover image Operations Research
Operations Research  Volume 11, Issue 6
December 1963
176 pages

Publisher

INFORMS

Linthicum, MD, United States
