19 May 2024 • Suorong Yang, Peijia Li, Xin Xiong, Furao Shen, Jian Zhao
The policy network optimizes the variability within the augmented data, while the target network utilizes the adaptively augmented samples for training.
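The alternating structure described above can be illustrated with a toy sketch. This is not the authors' implementation: the linear models, the Gaussian-noise augmentation, the policy's update signal, and all names (`policy_magnitude`, `train`, `lr_target`, `lr_policy`) are assumptions chosen only to show one model choosing per-sample augmentation strength while the other trains on the augmented samples.

```python
import math
import random

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

def policy_magnitude(theta, x):
    # Hypothetical policy: maps a sample to a noise magnitude in (0, 1).
    return sigmoid(dot(theta, x))

def train(samples, labels, steps=200, lr_target=0.1, lr_policy=0.05, seed=0):
    rng = random.Random(seed)
    dim = len(samples[0])
    n = len(samples)
    w = [0.0] * dim      # target model: linear classifier, labels in {-1, +1}
    theta = [0.0] * dim  # policy model: linear map to a magnitude logit
    for _ in range(steps):
        grad_w = [0.0] * dim
        grad_theta = [0.0] * dim
        for x, y in zip(samples, labels):
            m = policy_magnitude(theta, x)
            # Stand-in augmentation: Gaussian noise scaled by the policy output.
            x_aug = [xi + m * rng.gauss(0.0, 1.0) for xi in x]
            margin = y * dot(w, x_aug)
            # Target step: gradient of the logistic loss log(1 + exp(-margin)).
            coeff = -y * sigmoid(-margin)
            # Policy signal (assumed heuristic, not from the paper): positive
            # when the target finds the augmented sample easy, negative when hard.
            signal = sigmoid(margin) - 0.5
            for j in range(dim):
                grad_w[j] += coeff * x_aug[j] / n
                grad_theta[j] += signal * m * (1.0 - m) * x[j] / n
        # Target descends on its loss; policy ascends on signal * magnitude.
        w = [wj - lr_target * g for wj, g in zip(w, grad_w)]
        theta = [tj + lr_policy * g for tj, g in zip(theta, grad_theta)]
    return w, theta
```

In this sketch the policy pushes augmentation strength up for samples the target already classifies confidently and down for samples it struggles with; the paper's actual objective for the policy network may differ.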