Framework Employed for Training and Evaluation: On the left, we depict the agent’s interaction during both training and evaluation phases with the simulation environments, highlighting the incorporation of disturbances in the loop. On the right, we illustrate the deployment of the trained policy, while performing open-loop control on the real FP system.
Simulation performed using Omniverse Isaac Sim, showcasing the platform's ability to quickly (a model is trained in parallel across multiple environments) and converges in less than 10 minutes.
Direct transfer of simulation-trained policies to the physical floating platform demonstrates successful sim-to-real capabilities. The system maintains stable control despite real-world uncertainties including air currents, sensor noise, and mechanical variations.
Matteo El-Hariry, Antoine Richard, Vivek Muralidharan, Matthieu Geist, Miguel Olivares-Mendez
IROS '24 (oral presentation) & MASSpace'24 (International Workshop on Autonomous Agents and Multi-Agent Systems for Space Applications)