Search Results for author: Homayoon Farrahi

Found 2 papers, 2 papers with code

Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization

1 code implementation • 9 May 2023 • Homayoon Farrahi, A. Rupam Mahmood

In this work, we investigate the widely-used baseline hyper-parameter values of two policy gradient algorithms -- PPO and SAC -- across different cycle times.

Paper
Code

Model-free Policy Learning with Reward Gradients

1 code implementation • 9 Mar 2021 • Qingfeng Lan, Samuele Tosatto, Homayoon Farrahi, A. Rupam Mahmood

As a key component in reinforcement learning, the reward function is usually devised carefully to guide the agent.

Continuous Control Policy Gradient Methods

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.