Dynamic programming for stochastic target problems and geometric flows
We solve an optimal cost problem for a stochastic Navier-Stokes equation in space dimension 2 by proving existence and uniqueness of a smooth solution of the corresponding Hamilton-Jacobi-Bellman equation.
In this paper, we study one kind of stochastic recursive optimal control problem for the systems described by stochastic differential equations with delay (SDDE). In our framework, not only the dynamics of the systems but also the recursive utility depend on the past path segment of the state process in a general form. We give the dynamic programming principle for this kind of optimal control problems and show that the value function is the viscosity solution of the corresponding infinite dimensional...
In many western economies, the phenomenon of ageing population implies that the large Pay-As-You-Go (PAYGO) social security system will run into several severe financial difficulties. In that direction, this paper constructs a discrete-time stochastic model for a quasi PAYGO social security system to allow the potential accumulation of a special (contingency) fund, which can oscillate so as to absorb fluctuations in the various system parameters involved. The basic difference equation is analytically...
In the wine AOC system, the regulation of quantities performed by the professional organizations is aimed to smooth the variations of the quality of the wine due to the variations in the climate that affect the quality of the grapes. Nevertheless, this regulation could be damaging to the consumers due to the price increase resulting from the reduction of the quantities sold on the market. We propose a stochastic control model and a simulation tool able to measure the effects of this mechanism...
In many markets, especially in energy markets, electricity markets for instance, the detention of the physical asset is quite difficult. This is also the case for crude oil as treated by Davis (2000). So one can identify a good proxy which is an asset (financial or physical) (one)whose the spot price is significantly correlated with the spot price of the underlying (e.g. electicity or crude oil). Generally, the market could become incomplete. We explicit exact hedging strategies for exponential...
This paper is concerned with the stochastic linear quadratic optimal control problems (LQ problems, for short) for which the coefficients are allowed to be random and the cost functionals are allowed to have negative weights on the square of control variables. We propose a new method, the equivalent cost functional method, to deal with the LQ problems. Comparing to the classical methods, the new method is simple, flexible and non-abstract. The new method can also be applied to deal with nonlinear...
A linear-quadratic control problem with an infinite time horizon for some infinite dimensional controlled stochastic differential equations driven by a fractional Brownian motion is formulated and solved. The feedback form of the optimal control and the optimal cost are given explicitly. The optimal control is the sum of the well known linear feedback control for the associated infinite dimensional deterministic linear-quadratic control problem and a suitable prediction of the adjoint optimal system...
Optimal control with long run average cost functional of a partially observed Markov process is considered. Under the assumption that the transition probabilities are equivalent, the existence of the solution to the Bellman equation is shown, with the use of which optimal strategies are constructed.
This paper deals with Markov decision processes (MDPs) with real state space for which its minimum is attained, and that are upper bounded by (uncontrolled) stochastically ordered (SO) Markov chains. We consider MDPs with (possibly) unbounded costs, and to evaluate the quality of each policy, we use the objective function known as the average cost. For this objective function we consider two Markov control models and . and have the same components except for the transition laws. The transition...
For a discrete-time Markov control process with the transition probability , we compare the total discounted costs ...
This work concerns Markov decision chains with finite state and action sets. The transition law satisfies the simultaneous Doeblin condition but is unknown to the controller, and the problem of determining an optimal adaptive policy with respect to the average reward criterion is addressed. A subset of policies is identified so that, when the system evolves under a policy in that class, the frequency estimators of the transition law are consistent on an essential set of admissible state-action pairs,...
This paper provides a summary of our recent work on the problem of combined estimation and control of systems described by finite state, hidden Markov models. We establish the stochastic framework for the problem, formulate a separated control policy with risk-sensitive cost functional, describe an estimation scheme for the parameters of the hidden Markov model that describes the plant, and finally indicate how the combined estimation and control problem can be re-formulated in a framework that...