Mean-Field Optimal Control

Massimo Fornasier; Francesco Solombrino

Mean-Field Optimal Control

Massimo Fornasier; Francesco Solombrino

ESAIM: Control, Optimisation and Calculus of Variations (2014)

Volume: 20, Issue: 4, page 1123-1152
ISSN: 1292-8119

Access Full Article

top

Access to full text

Abstract

top

We introduce the concept of mean-field optimal control which is the rigorous limit process connecting finite dimensional optimal control problems with ODE constraints modeling multi-agent interactions to an infinite dimensional optimal control problem with a constraint given by a PDE of Vlasov-type, governing the dynamics of the probability distribution of interacting agents. While in the classical mean-field theory one studies the behavior of a large number of small individuals freely interacting with each other, by simplifying the effect of all the other individuals on any given individual by a single averaged effect, we address the situation where the individuals are actually influenced also by an external policy maker, and we propagate its effect for the number N of individuals going to infinity. On the one hand, from a modeling point of view, we take into account also that the policy maker is constrained to act according to optimal strategies promoting its most parsimonious interaction with the group of individuals. This will be realized by considering cost functionals including L1-norm terms penalizing a broadly distributed control of the group, while promoting its sparsity. On the other hand, from the analysis point of view, and for the sake of generality, we consider broader classes of convex control penalizations. In order to develop this new concept of limit rigorously, we need to carefully combine the classical concept of mean-field limit, connecting the finite dimensional system of ODE describing the dynamics of each individual of the group to the PDE describing the dynamics of the respective probability distribution, with the well-known concept of Γ-convergence to show that optimal strategies for the finite dimensional problems converge to optimal strategies of the infinite dimensional problem.

How to cite

top

MLA
BibTeX
RIS

Fornasier, Massimo, and Solombrino, Francesco. "Mean-Field Optimal Control." ESAIM: Control, Optimisation and Calculus of Variations 20.4 (2014): 1123-1152. <http://eudml.org/doc/272875>.

@article{Fornasier2014,
abstract = {We introduce the concept of mean-field optimal control which is the rigorous limit process connecting finite dimensional optimal control problems with ODE constraints modeling multi-agent interactions to an infinite dimensional optimal control problem with a constraint given by a PDE of Vlasov-type, governing the dynamics of the probability distribution of interacting agents. While in the classical mean-field theory one studies the behavior of a large number of small individuals freely interacting with each other, by simplifying the effect of all the other individuals on any given individual by a single averaged effect, we address the situation where the individuals are actually influenced also by an external policy maker, and we propagate its effect for the number N of individuals going to infinity. On the one hand, from a modeling point of view, we take into account also that the policy maker is constrained to act according to optimal strategies promoting its most parsimonious interaction with the group of individuals. This will be realized by considering cost functionals including L1-norm terms penalizing a broadly distributed control of the group, while promoting its sparsity. On the other hand, from the analysis point of view, and for the sake of generality, we consider broader classes of convex control penalizations. In order to develop this new concept of limit rigorously, we need to carefully combine the classical concept of mean-field limit, connecting the finite dimensional system of ODE describing the dynamics of each individual of the group to the PDE describing the dynamics of the respective probability distribution, with the well-known concept of Γ-convergence to show that optimal strategies for the finite dimensional problems converge to optimal strategies of the infinite dimensional problem.},
author = {Fornasier, Massimo, Solombrino, Francesco},
journal = {ESAIM: Control, Optimisation and Calculus of Variations},
keywords = {Sparse optimal control; mean-field limit; Γ-limit; optimal control with ODE constraints; optimal control with PDE constraints; sparse optimal control; -limit; ODE constraints; PDE constraints; cost functional},
language = {eng},
number = {4},
pages = {1123-1152},
publisher = {EDP-Sciences},
title = {Mean-Field Optimal Control},
url = {http://eudml.org/doc/272875},
volume = {20},
year = {2014},
}

TY - JOUR
AU - Fornasier, Massimo
AU - Solombrino, Francesco
TI - Mean-Field Optimal Control
JO - ESAIM: Control, Optimisation and Calculus of Variations
PY - 2014
PB - EDP-Sciences
VL - 20
IS - 4
SP - 1123
EP - 1152
AB - We introduce the concept of mean-field optimal control which is the rigorous limit process connecting finite dimensional optimal control problems with ODE constraints modeling multi-agent interactions to an infinite dimensional optimal control problem with a constraint given by a PDE of Vlasov-type, governing the dynamics of the probability distribution of interacting agents. While in the classical mean-field theory one studies the behavior of a large number of small individuals freely interacting with each other, by simplifying the effect of all the other individuals on any given individual by a single averaged effect, we address the situation where the individuals are actually influenced also by an external policy maker, and we propagate its effect for the number N of individuals going to infinity. On the one hand, from a modeling point of view, we take into account also that the policy maker is constrained to act according to optimal strategies promoting its most parsimonious interaction with the group of individuals. This will be realized by considering cost functionals including L1-norm terms penalizing a broadly distributed control of the group, while promoting its sparsity. On the other hand, from the analysis point of view, and for the sake of generality, we consider broader classes of convex control penalizations. In order to develop this new concept of limit rigorously, we need to carefully combine the classical concept of mean-field limit, connecting the finite dimensional system of ODE describing the dynamics of each individual of the group to the PDE describing the dynamics of the respective probability distribution, with the well-known concept of Γ-convergence to show that optimal strategies for the finite dimensional problems converge to optimal strategies of the infinite dimensional problem.
LA - eng
KW - Sparse optimal control; mean-field limit; Γ-limit; optimal control with ODE constraints; optimal control with PDE constraints; sparse optimal control; -limit; ODE constraints; PDE constraints; cost functional
UR - http://eudml.org/doc/272875
ER -

References

top

[1] N. Ahmed and X. Ding, Controlled McKean-Vlasov equations. Commun. Appl. Anal.5 (2001) 183–206. Zbl1084.49506 MR1844190
[2] L. Ambrosio, N. Fusco and D. Pallara, Functions of Bounded Variation and Free Discontinuity Problems. Oxford, Clarendon Press (2000). Zbl0957.49001 MR1857292
[3] L. Ambrosio, N. Gigli and G. Savaré, Gradient Flows in Metric Spaces and in the Space of Probability Measures. Lect. Math. ETH Zürich 2nd, edition. Birkhäuser Verlag, Basel (2008). Zbl1145.35001 MR2401600
[4] D. Andersson and B. Djehiche, A maximum principle for SDEs of mean-field type. Appl. Math. Opt.63 (2011) 341–356. Zbl1215.49034 MR2784835
[5] M. Ballerini, N. Cabibbo, R. Candelier, A. Cavagna, E. Cisbani, L. Giardina, L. Lecomte, A. Orlandi, G. Parisi, A. Procaccini, M. Viale and V. Zdravkovic, Interaction ruling animal collective behavior depends on topological rather than metric distance: evidence from a field study. Proc. National Academy of Sci.105 (2008) 1232–1237.
[6] A. Bensoussan, J. Frehse and P. Yam, Mean field games and mean field type control theory. Springer, New York (2013). Zbl1287.93002 MR3134900
[7] A. Bressan and B. Piccoli, Introduction to the Mathematical Theory of Control, vol. 2 of AIMS Ser. Appl. Math.. American Institute of Mathematical Sciences (AIMS), Springfield, MO (2007). Zbl1127.93002 MR2347697
[8] R. Buckdahn, B. Djehiche and J. Li, A general stochastic maximum principle for sdes of mean-field type. Appl. Math. Opt.64 (2011) 197–216. Zbl1245.49036 MR2822408
[9] S. Camazine, J. Deneubourg, N. Franks, J. Sneyd, G. Theraulaz and E. Bonabeau, Self-Organization in Biological Systems. Princeton University Press (2003). Zbl1130.92009 MR2343706
[10] J.A. Cañizo, J.A. Carrillo and J. Rosado, A well-posedness theory in measures for some kinetic models of collective motion. Math. Model. Meth. Appl. Sci.21 (2011) 515–539. Zbl1218.35005
[11] M. Caponigro, M. Fornasier, B. Piccoli and E. Trélat, Sparse stabilization and control of the Cucker−Smale model. Preprint: arXiv:1210.5739 (2012). Zbl1275.49003
[12] J.A. Carrillo, Y.-P. Choi and M. Hauray, The derivation of swarming models: mean-field limit and Wasserstein distances. Preprint: arXiv:1304.5776 (2013).
[13] J.A. Carrillo, M.R. D’Orsogna and V. Panferov, Double milling in self-propelled swarms from kinetic theory. Kinet. Relat. Models2 (2009) 363–378. Zbl1195.92069 MR2507454
[14] J. A. Carrillo, M. Fornasier, G. Toscani and F. Vecil, Particle, kinetic, and hydrodynamic models of swarming, in Math. Modeling of Collective Behavior in Socio-Economic and Life Sci., edited by G. Naldi, L. Pareschi, G. Toscani and N. Bellomo. Model. Simul. Sci. Engrg. Technol. Birkhäuser, Boston (2010) 297–336. Zbl1211.91213 MR2744704
[15] E. Casas, C. Clason and K. Kunisch, Approximation of elliptic control problems in measure spaces with sparse solutions. SIAM J. Control Optim.50 (2012) 1735–1752. Zbl06111073 MR2974716
[16] Y. Chuang, M. D’Orsogna, D. Marthaler, A. Bertozzi and L. Chayes, State transition and the continuum limit for the 2D interacting, self-propelled particle system. Physica D232 (2007) 33–47. Zbl05182405 MR2369988
[17] Y. Chuang, Y. Huang, M. D’Orsogna and A. Bertozzi, Multi-vehicle flocking: scalability of cooperative control algorithms using pairwise potentials. IEEE Int. Conference on Robotics and Automation (2007) 2292–2299.
[18] C. Clason and K. Kunisch, A duality-based approach to elliptic control problems in non-reflexive Banach spaces. ESAIM: COCV 17 (2011) 243–266. Zbl1213.49041 MR2775195
[19] C. Clason and K. Kunisch, A measure space approach to optimal source placement. Comput. Optim. Appl.53 (2012) 155–171. Zbl1258.49041 MR2964839
[20] I. Couzin and N. Franks, Self-organized lane formation and optimized traffic flow in army ants. Proc. R. Soc. London B270 (2002) 139–146.
[21] I. Couzin, J. Krause, N. Franks and S. Levin, Effective leadership and decision making in animal groups on the move. Nature433 (2005) 513–516.
[22] A. J. Craig and I. Flügge-Lotz, Investigation of optimal control with a minimum-fuel consumption criterion for a fourth-order plant with two control inputs; synthesis of an efficient suboptimal control. J. Basic Engrg.87 (1965) 39–58.
[23] E. Cristiani, B. Piccoli and A. Tosin, Modeling self-organization in pedestrians and animal groups from macroscopic and microscopic viewpoints, in Mathematical Modeling of Collective Behavior in Socio-Economic and Life Sciences, edited by G. Naldi, L. Pareschi, G. Toscani and N. Bellomo. Model. Simul. Sci. Engrg. Technol. Birkhäuser, Boston (2010). Zbl1211.91214 MR2744705
[24] E. Cristiani, B. Piccoli and A. Tosin, Multiscale modeling of granular flows with application to crowd dynamics. Multiscale Model. Simul.9 (2011) 155–182. Zbl1221.35232 MR2769993
[25] F. Cucker and J.-G. Dong, A general collision-avoiding flocking framework. IEEE Trans. Automat. Control56 (2011) 1124–1129. MR2815917
[26] F. Cucker and E. Mordecki, Flocking in noisy environments. J. Math. Pures Appl.89 (2008) 278–296. Zbl1273.91404 MR2401690
[27] F. Cucker and S. Smale, Emergent behavior in flocks. IEEE Trans. Automat. Control 52 (2007) 852–862,. MR2324245
[28] F. Cucker and S. Smale, On the mathematics of emergence. Japan J. Math.2 (2007) 197–227. Zbl1166.92323 MR2295620
[29] F. Cucker, S. Smale and D. Zhou, Modeling language evolution. Found. Comput. Math.4 (2004) 315–343. Zbl1083.68131 MR2078666
[30] G. Dal Maso, An Introduction to Γ-Convergence. Progress in Nonlinear Differ. Eqs. Appl., vol. 8. Birkhäuser Boston Inc., Boston, MA (1993). Zbl0816.49001 MR1201152
[31] H. Federer, Geometric Measure Theory. Die Grundlehren der mathematischen Wissenschaften in Einzeldarstellungen, vol. 153. Springer-Verlag, Berlin, Heidelberg, New York (1969). Zbl0176.00801 MR257325
[32] A.F. Filippov, Differential equations with Discontinuous Righthand Sides. Vol. 18 of Math. Appl. (Soviet Series). Translated from the Russian. Kluwer Academic Publishers Group, Dordrecht (1988). Zbl0664.34001 MR1028776
[33] R. Glowinski, Numerical Methods for Nonlinear Variational Problems. Scientific Comput. Springer-Verlag, Berlin (2008). Reprint of the 1984 original. Zbl1139.65050 MR2423313
[34] G. Grégoire and H. Chaté, Onset of collective and cohesive motion. Phys. Rev. Lett. 92 (2004).
[35] R. Herzog, G. Stadler and G. Wachsmuth, Directional sparsity in optimal control of partial differential equations. SIAM J. Control Optim.50 (2012) 943–963. Zbl1244.49038 MR2914236
[36] M. Huang, P. Caines and R. Malhamé, Individual and mass behaviour in large population stochastic wireless power control problems: centralized and Nash equilibrium solutions. Proc. of the 42nd IEEE Conference on Decision and Control Maui, Hawaii USA (2003) 98–103.
[37] A. Jadbabaie, J. Lin and A.S. Morse, Correction to: “Coordination of groups of mobile autonomous agents using nearest neighbor rules” [48 (2003) 988–1001; MR 1986266]. IEEE Trans. Automat. Control 48 (2003) 1675. MR1986266
[38] J. Ke, J. Minett, C.-P. Au and W.-Y. Wang, Self-organization and selection in the emergence of vocabulary. Complexity7 (2002) 41–54. MR1911875
[39] E. F. Keller and L.A. Segel, Initiation of slime mold aggregation viewed as an instability. J. Theoret. Biol.26 (1970) 399–415. Zbl1170.92306
[40] A. Koch and D. White, The social lifestyle of myxobacteria. Bioessays20 (1998) 1030–1038.
[41] J.-M. Lasry and P.-L. Lions, Mean field games. Japan J. Math.2 (2007) 229–260. Zbl1156.91321 MR2295621
[42] N. Leonard and E. Fiorelli, Virtual leaders, artificial potentials and coordinated control of groups. Proc. of 40th IEEE Conf. Decision Contr. (2001) 2968–2973.
[43] H. Niwa, Self-organizing dynamic model of fish schooling. J. Theoret. Biol.171 (1994) 123–136.
[44] M. Nuorian, P. Caines and R. Malhamé, Synthesis of Cucker−Smale type flocking via mean field stochastic control theory: Nash equilibria. Proc. of 48th Allerton Conf. Comm., Cont. Comp., Monticello, Illinois (2010) 814–815.
[45] M. Nuorian, P. Caines and R. Malhamé, Mean field analysis of controlled Cucker−Smale type flocking: Linear analysis and perturbation equations. Proc. of 18th IFAC World Congress Milano, Italy (2011) 4471–4476.
[46] J. Parrish and L. Edelstein-Keshet, Complexity, pattern and evolutionary trade-offs in animal aggregation. Science294 (1999) 99–101.
[47] J. Parrish, S. Viscido and D. Gruenbaum, Self-organized fish schools: An examination of emergent properties. Biol. Bull.202 (2002) 296–305.
[48] L. Perea, G. Gómez and P. Elosegui, Extension of the Cucker–Smale control law to space flight formations. AIAA J. Guidance, Control, and Dynamics 32 2009527–537.
[49] B. Perthame, Mathematical tools for kinetic equations. Bull. Am. Math. Soc., New Ser. 41 (2004) 205–244. Zbl1151.82351 MR2043752
[50] B. Perthame, Transport Equations in Biology. Basel, Birkhäuser (2007). Zbl1185.92006 MR2270822
[51] K. Pieper and B. Vexler, A priori error analysis for discretization of sparse elliptic optimal control problems in measure space. SIAM J. Control Optim.51 (2013) 2788–2808. Zbl06227446 MR3072225
[52] Y. Privat, E. Trélat and E. Zuazua, Complexity and regularity of maximal energy domains for the wave equation with fixed initial data. Discrete Contin. Dyn. Syst. Ser. A. Zbl1332.93069
[53] R. Rannacher and B. Vexler, Adaptive finite element discretization in PDE-based optimization. GAMM-Mitt. 33 (2010) 177–193. Zbl1207.49023 MR2843958
[54] W. Romey, Individual differences make a difference in the trajectories of simulated schools of fish. Ecol. Model.92 (1996) 65–77.
[55] M.B. Short, M. R. D’Orsogna, V.B. Pasour, G.E. Tita, P.J. Brantingham, A.L. Bertozzi and L.B. Chayes, A statistical model of criminal behavior. Math. Models Methods Appl. Sci.18 (2008) 1249–1267. Zbl1180.35530 MR2438215
[56] G. Stadler, Elliptic optimal control problems with L1-control cost and applications for the placement of control devices. Comput. Optim. Appl.44 (2009) 159–181. Zbl1185.49031 MR2556849
[57] K. Sugawara and M. Sano, Cooperative acceleration of task performance: Foraging behavior of interacting multi-robots system. Phys. D100 (1997) 343–354. Zbl0908.92036
[58] J. Toner and Y. Tu, Long-range order in a two-dimensional dynamical xy model: How birds fly together. Phys. Rev. Lett.75 (1995) 4326–4329.
[59] T. Vicsek, A. Czirok, E. Ben-Jacob, I. Cohen and O. Shochet, Novel type of phase transition in a system of self-driven particles. Phys. Rev. Lett.75 (1995) 1226–1229.
[60] T. Vicsek and A. Zafeiris, Collective motion. Phys. Rep.517 (2012) 71–140.
[61] C. Villani, Optimal Transport, vol. 338. Grundlehren der Math. Wissenschaften, [Fundamental Principles of Mathematical Science]. Springer-Verlag, Berlin (2009). Old and new. Zbl1156.53003 MR2459454
[62] G. Vossen and H. Maurer, L1 minimization in optimal control and applications to robotics. Optim. Control Appl. Methods27 (2006) 301–321. MR2283487
[63] G. Wachsmuth and D. Wachsmuth, Convergence and regularization results for optimal control problems with sparsity functional. ESAIM: COCV 17 (2011) 858–886. Zbl1228.49032 MR2826983
[64] C. Yates, R. Erban, C. Escudero, L. Couzin, J. Buhl, L. Kevrekidis, P. Maini and D. Sumpter, Inherent noise can facilitate coherence in collective swarm motion. Proc. Natl. Acad. Sci.106 (2009) 5464–5469.

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Language to use for this widget.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Number of notes per page

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.