no code implementations • 21 Oct 2023 • Carmel Fiscko, Aayushya Agarwal, Yihan Ruan, Soummya Kar, Larry Pileggi, Bruno Sinopoli
We present ECCO-DNN, a stochastic first-order optimization method specialized for deep neural networks (DNNs).
no code implementations • 24 Apr 2023 • Carmel Fiscko, Soummya Kar, Bruno Sinopoli
The controller's objective is to find an optimal policy that maximizes the value of the expected system given a priori knowledge of the agents' dropout probabilities.
no code implementations • 5 Feb 2023 • Carmel Fiscko, Soummya Kar, Bruno Sinopoli
In this work we investigate an importance sampling approach for evaluating policies for a structurally time-varying factored Markov decision process (MDP), i.e., the policy's value is estimated with a high-probability confidence interval.
no code implementations • 15 Nov 2022 • Aayushya Agarwal, Carmel Fiscko, Soummya Kar, Larry Pileggi, Bruno Sinopoli
To find the value of the critical point, we propose a time step search routine for Forward Euler discretization that controls the local truncation error, a method adapted from circuit simulation ideas.
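To illustrate the general idea of a time step search that bounds the local truncation error (LTE) of Forward Euler, here is a minimal sketch. It is not the authors' routine: it assumes a one-dimensional gradient flow x' = -grad(x), estimates the LTE as the gap between a Forward Euler step and a Heun step, and uses hypothetical names and parameters (`forward_euler_lte`, `h0`, `tol`, the halving and doubling rule).

```python
def forward_euler_lte(grad, x0, h0=1.0, tol=1e-2, max_steps=10_000, grad_tol=1e-8):
    """Integrate x' = -grad(x) with Forward Euler and an LTE-controlled step.

    Illustrative only: the step is halved until the estimated local
    truncation error is at most `tol`, and doubled after an easy step.
    """
    x, h = float(x0), float(h0)
    for _ in range(max_steps):
        g = grad(x)
        if abs(g) < grad_tol:  # (approximate) critical point reached
            break
        while True:
            x_new = x - h * g  # Forward Euler step
            # LTE estimate: difference between the Forward Euler step and a Heun step
            lte = 0.5 * h * abs(grad(x_new) - g)
            if lte <= tol or h < 1e-12:
                break
            h *= 0.5  # step too aggressive, shrink and retry
        x = x_new
        if lte < 0.25 * tol:
            h *= 2.0  # error well under budget, try a larger step next time
    return x


# Usage: critical point of f(x) = (x - 3)^2, whose gradient is 2 (x - 3)
x_star = forward_euler_lte(lambda x: 2.0 * (x - 3.0), x0=0.0)
```

The accept-or-halve loop is the part that mirrors the step-size control used in circuit simulators; the specific LTE estimator and growth factor here are assumptions chosen for brevity.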
no code implementations • 11 Jul 2022 • Carmel Fiscko, Soummya Kar, Bruno Sinopoli
To efficiently find a policy in this rapidly scaling space, we propose a clustered Bellman operator that optimizes over the action space for one cluster at any evaluation.