Subject: total discounted cost - LINDAT/CLARIAH-CZ Catalog Search Results

Start Over Subject total discounted cost

Creator:: Flores-Hernández, Rosa M.
Format:: bez média and svazek
Type:: model:article and TEXT
Subject:: Markov decision process, total discounted cost, total discounted reward, increasing optimal policy, decreasing optimal policy, and policy iteration algorithm
Language:: English
Description:: In this paper there are considered Markov decision processes (MDPs) that have the discounted cost as the objective function, state and decision spaces that are subsets of the real line but are not necessarily finite or denumerable. The considered MDPs have a cost function that is possibly unbounded, and dynamic independent of the current state. The considered decision sets are possibly non-compact. In the context described, conditions to obtain either an increasing or decreasing optimal stationary policy are provided; these conditions do not require assumptions of convexity. Versions of the policy iteration algorithm (PIA) to approximate increasing or decreasing optimal stationary policies are detailed. An illustrative example is presented. Finally, comments on the monotonicity conditions and the monotone versions of the PIA that are applied to discounted MDPs with rewards are given.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

Creator:: Flores, Rosa M. and Montes-de-Oca, Raúl
Format:: bez média and svazek
Type:: model:article and TEXT
Subject:: monotone minimizer in an optimization problem, Markov control process, total discounted cost, average cost, and monotone optimal policy
Language:: English
Description:: Firstly, in this paper there is considered a certain class of possibly unbounded optimization problems on Euclidean spaces, for which conditions that permit to obtain monotone minimizers are given. Secondly, the theory developed in the first part of the paper is applied to Markov control processes (MCPs) on real spaces with possibly unbounded cost function, and with possibly noncompact control sets, considering both the discounted and the average cost as optimality criterion. In the context described, conditions to obtain monotone optimal policies are provided. For the conditions of MCPs presented in the article, several controlled models including, in particular, two inventory/production systems and the linear regulator problem are supplied.
Rights:: http://creativecommons.org/publicdomain/mark/1.0/ and policy:public

Search