1 code implementation • 18 May 2024 • Luke Bhan, Yuexin Bian, Miroslav Krstic, Yuanyuan Shi
With this gym, we then present the first set of model-free, reinforcement learning algorithms for solving this series of benchmark problems, achieving stability, although at a higher cost compared to model-based PDE backstepping.
1 code implementation • 15 Jan 2024 • Maxence Lamarque, Luke Bhan, Yuanyuan Shi, Miroslav Krstic
This requires an adaptive approach to PDE control, i. e., an estimation of the plant coefficients conducted concurrently with control, where a separate PDE for the gain kernel must be solved at each timestep upon the update in the plant coefficient function estimate.
no code implementations • 4 Jan 2024 • Luke Bhan, Yuanyuan Shi, Iasson Karafyllis, Miroslav Krstic, James B. Rawlings
In the paper we provide explicit formulae for MHEs for both hyperbolic and parabolic PDEs, as well as simulation results that illustrate theoretically guaranteed convergence of the MHEs.
1 code implementation • 4 Jan 2024 • Maxence Lamarque, Luke Bhan, Rafael Vazquez, Miroslav Krstic
The recently introduced neural operators (NO) can be trained to produce the gain functions, rapidly in real time, for each state value, without requiring a PDE solution.
1 code implementation • 18 Mar 2023 • Miroslav Krstic, Luke Bhan, Yuanyuan Shi
The designs of gains for controllers and observers for PDEs, such as PDE backstepping, are mappings of system model functions into gain functions.
2 code implementations • 28 Feb 2023 • Luke Bhan, Yuanyuan Shi, Miroslav Krstic
While, in the existing PDE backstepping, finding the gain kernel requires (one offline) solution to an integral equation, the neural operator (NO) approach we propose learns the mapping from the functional coefficients of the plant PDE to the kernel function by employing a sufficiently high number of offline numerical solutions to the kernel integral equation, for a large enough number of the PDE model's different functional coefficients.
1 code implementation • 19 May 2022 • Luke Bhan, Marcos Quinones-Grueiro, Gautam Biswas
In this work, we address the problem of solving complex collaborative robotic tasks subject to multiple varying parameters.