Search Results for author: André Correia

Found 6 papers, 2 papers with code

Hierarchical Decision Mamba

1 code implementation13 May 2024 André Correia, Luís A. Alexandre

In this work, we introduce two novel methods, Decision Mamba (DM) and Hierarchical Decision Mamba (HDM), aimed at enhancing the performance of the Transformer models.

D4RL Imitation Learning +1

Music to Dance as Language Translation using Sequence Models

1 code implementation22 Mar 2024 André Correia, Luís A. Alexandre

We train our method on AIST++ and PhantomDance data sets to teach a robotic arm to dance, but our method can be applied to a full humanoid robot.

Translation

DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL Safety

no code implementations8 May 2023 André Correia, Luís Alexandre

We propose a task-agnostic method that leverages small sets of safe and unsafe demonstrations to improve the safety of RL agents during learning.

A Survey of Demonstration Learning

no code implementations20 Mar 2023 André Correia, Luís A. Alexandre

This paper provides a survey of demonstration learning, where we formally introduce the demonstration problem along with its main challenges and provide a comprehensive overview of the process of learning from demonstrations from the creation of the demonstration data set, to learning methods from demonstrations, and optimization by combining demonstration learning with different machine learning methods.

Reinforcement Learning (RL)

Hierarchical Decision Transformer

no code implementations21 Sep 2022 André Correia, Luís A. Alexandre

Our method outperforms the baselines in eight out of ten tasks of varied horizons and reward frequencies without prior task knowledge, showing the advantages of the hierarchical model approach for learning from demonstrations using a sequence model.

D4RL reinforcement-learning +1

Contrastive Learning from Demonstrations

no code implementations30 Jan 2022 André Correia, Luís A. Alexandre

This paper presents a framework for learning visual representations from unlabeled video demonstrations captured from multiple viewpoints.

Contrastive Learning reinforcement-learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.