Thomas Chaffre PhD
Learning Stochastic Adaptive Control using A Bio-Inspired Experience Replay with an AUV
This project is part of the Machine Learning and Control project.
Funding is from Region Bretagne, South Austalia and Naval Group.
Supervisors : Estelle CHAUVEAU, Gilles LE CHENADEC, Paulo SANTOS, Karl SAMMUT and Benoit CLEMENT
Defense (link)
Board of Examiners
M. FILLIAT David, Professeur, ENSTA Paris - reviewer - (rapport)
M. BANDYOPADHYAY Tirthankar, Senior Researcher, CSIRO, Brisbane - reviewer - (rapport)
M. BUCHE Cédric, Professeur des Universités, IRL CROSSING / ENIB, Adelaide
M. CREUZE Vincent, Professeur des Universités, Université de Montpellier
Mme FANTONI Isabelle, Directrice de Recherche, LS2N, Nantes
M. SAMMUT Karl, Professeur, Flinders University, Adelaide, co-directeur de thèse
M. CLEMENT Benoît, Professeur, ENSTA Bretagne, IRL CROSSING, Adelaide, co-directeur de thèse
M. LE CHENADEC Gilles, Enseignant Chercheur, ENSTA Bretagne, Lab-STICC, Brest
M. SANTOS Paulo, Associate Professor, Flinders University, Adelaide
Mme CHAUVEAU Estelle, Docteur, Ingénieur de recherche, Naval Group, Ollioules
Abstract
Sea disturbances vary unrelentingly making it necessary for autonomous underwater vehicles (AUVs) to
continuously adapt their control systems to accommodate such changes. The complexity and dynamics of underwater processes are,
however, difficult to be described analytically. While adaptive control systems could potentially be used to handle such changes, the
field of adaptive control faces three main challenges: the dependency on the availability of a priori knowledge of the underlying
processes; the need for well defined governing equations; and the implementation of these equations on physical systems. In the context
of the underwater environment, these descriptions are not readily available. The objective of this thesis was to formalize a novel learning-
based adaptive control using deep reinforcement learning and adaptive pole placement control to compensate for the known part of the
process and to extract information on the unknown part directly from sensors feedback so as to compensate for the unobservable current
disturbance. In addition, we proposed a novel experience replay mechanism that considers the characteristic of the biological replay
mechanism. The methods were validated in simulation and in real life, demonstrating the benefits of combining both theories against
using them separately. Mots clés: Adaptive Control, Deep Reinforcement Learning, Robotics, Underwater Vehicles
Publications
Thomas Chaffre, Jonathan Wheare, Andrew Lammas, Paulo Santos, Gilles Le Chenadec, Karl Sammut, Estelle Chauveau, and Benoit Clement (2022). Simtoreal transfer of adaptive control parameters for improved robustness to sea current variations, under review.
Thomas Chaffre, Paulo E. Santos, Gilles Le Chenadec, Estelle Chauveau, Karl Sammut, and Benoit Clement
(2022). Learning Stochastic Adaptive Control using a BioInspired Experience Replay Experience Replay.
In TechRxiv.
Hector Kohler, Benoit Clement, Thomas Chaffre, and Gilles Le Chenadec (2022). PID Tuning using CrossEntropy DeepLearning: a Lyapunov Stability Analysis. In Proceedings of the 14th IFAC CAMS.
Thomas Chaffre, Julien Moras, Adrien ChanHonTong, Julien Marzat, Karl Sammut, Gilles Le Chenadec, and Benoit Clement (2022). Learningbased vs Modelfree Adaptive Control of a MAV under Wind Gust. In Informatics in Control, Automation and Robotics pp 362–385, LNEE, SPRINGER.
Thomas Chaffre, Gilles Le Chenadec, Karl Sammut, Estelle Chauveau, and Benoit Clement (2021). Direct Adaptive PolePlacement Controller using Deep Reinforcement Learning: Application to AUV Control. In Proceedings of the 13th IFAC Conference on Control Applications in Marine Systems, Robotics and Vehicles
(CAMS).
Thomas Chaffre, Julien Moras, Adrien Chan Hon Tong, and Julien Marzat (2020). SimtoReal Transfer with Incremental Environment Complexity for Reinforcement Learning of DepthBased Robot Navigation., in Proceedings of the 16th ICINCO.
Sola Yoann, Chaffre Thomas, Le Chenadec Gilles, Sammut Karl, and Clement Benoit (2020). Evaluation of a DeepReinforcementLearningbased Controller for the Control of an Autonomous Underwater Vehicle. In Proceedings of Global Oceans 2020.
Seminar presentation
Thomas Chaffre. Deep reinforcement learning and transfer of adaptive control parameters for improved robustness to current disturbance. At Centre de Recherche en Automatique de Nancy (CRAN) (2022).
Thomas Chaffre. Learningbased adaptive control of AUVs. At Sirehna at Technocampus Ocean (2022).
Thomas Chaffre, Paulo E. Santos, Gilles Le Chenadec, Estelle Chauveau, Karl Sammut, and Benoit Clement. Learning Stochastic Adaptive Control using a BioInspired Experience Replay Experience Replay. Poster presentation during the visit of Antoine Petit, President and CEO of CNRS, at IRL CROSSING (2022).
Thomas Chaffre. Learning based adaptive control of AUVs. At IRL CROSSING (2021).
|