no code implementations • 2 Dec 2023 • Cyrus Neary, Christian Ellis, Aryaman Singh Samyal, Craig Lennon, Ufuk Topcu
We propose and demonstrate a compositional framework for training and verifying reinforcement learning (RL) systems within a multifidelity sim-to-real pipeline, in order to deploy reliable and adaptable RL policies on physical hardware.
1 code implementation • 30 Dec 2022 • Franck Djeumou, Christian Ellis, Murat Cubuktepe, Craig Lennon, Ufuk Topcu
First, they require an excessive amount of data due to the information asymmetry between the expert and the learner.
1 code implementation • 20 Jul 2022 • Luke Boegner, Manbir Gulati, Garrett Vanhoy, Phillip Vallance, Bradley Comar, Silvija Kokalj-Filipovic, Craig Lennon, Robert D. Miller
We also introduce TorchSig, a signals processing machine learning toolkit that can be used to generate this dataset.
no code implementations • 28 May 2021 • Franck Djeumou, Murat Cubuktepe, Craig Lennon, Ufuk Topcu
Nevertheless, the resulting formulation is still nonconvex due to the intrinsic nonconvexity of the so-called forward problem, i.e., computing an optimal policy given a reward function, in POMDPs.