Multi-rotor Aerial Vehicle Control

In this independent study, I applied deep reinforcement learning (deep RL) techniques to develop controllers for ModQuad, a special type of multi-rotor aerial vehicle consisting of modular flying structures. I used model-free Proximal Policy Optimization (PPO) to train controllers for a zoo of different ModQuad structures (shown in the left figure). The trained controller robustly stabilizes the ModQuad at fixed points from various starting conditions and in the presence of external force disturbances. Additionally, building on the trained RL controller, I devised an artificial-potential-field-based approach to online path planning, which enables the ModQuad to follow a series of waypoints without pre-computing the entire trajectory offline.
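For readers unfamiliar with PPO, the following is a minimal sketch of its standard clipped surrogate loss, written in plain Python. It is illustrative only and not the training code used in this study; the function name `ppo_clip_loss` and the clipping value `clip_eps=0.2` (a common default) are assumptions.

```python
def ppo_clip_loss(ratios, advantages, clip_eps=0.2):
    """Clipped surrogate loss used by PPO, averaged over a batch.

    ratios:     new-policy / old-policy probability ratios, one per sample
    advantages: advantage estimates, one per sample
    clip_eps:   clipping range (assumed value, not taken from the study)
    """
    terms = []
    for r, a in zip(ratios, advantages):
        # Restrict the ratio to [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(r, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped objectives.
        terms.append(min(r * a, clipped * a))
    # Negate because optimizers minimize, while PPO maximizes the surrogate.
    return -sum(terms) / len(terms)
```

Clipping removes the incentive to move the new policy far from the one that collected the data, which helps keep training updates stable.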

Deep RL policy controls a ModQuad to hover at a fixed point.

Artificial-potential-field-based waypoint following for a 2x2 ModQuad. Waypoints are indicated by blue dots.
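The waypoint-following idea can be sketched as follows: an attractive potential pulls a hover setpoint toward the current waypoint, and the fixed-point controller is asked to stabilize at that setpoint. This is a simplified sketch under assumptions, not the implementation from the study. The function names, the gain, the step limit, and the tolerance are all hypothetical, and the trained RL controller is replaced by a stand-in that reaches each setpoint exactly.

```python
import math

def apf_setpoint(position, waypoint, gain=1.0, max_step=0.5):
    """Next hover setpoint: a bounded step along the attractive force.

    The attractive potential is 0.5 * gain * dist**2, so the force has
    magnitude gain * dist and points at the waypoint. The step is capped
    at max_step (hypothetical value) to keep setpoints close to the vehicle.
    """
    delta = [w - p for p, w in zip(position, waypoint)]
    dist = math.sqrt(sum(d * d for d in delta))
    if dist == 0.0:
        return list(position)
    step = min(gain * dist, max_step)
    return [p + step * d / dist for p, d in zip(position, delta)]

def follow_waypoints(start, waypoints, tol=0.05, max_iters=1000):
    """Visit waypoints in order; return the path and the number reached."""
    position = list(start)
    path = [tuple(position)]
    idx = 0
    for _ in range(max_iters):
        if idx == len(waypoints):
            break
        target = waypoints[idx]
        if math.dist(position, target) <= tol:
            idx += 1  # close enough: switch to the next waypoint
            continue
        # Stand-in for the trained controller (assumption): the vehicle
        # is taken to reach the commanded setpoint exactly.
        position = apf_setpoint(position, target)
        path.append(tuple(position))
    return path, idx

path, reached = follow_waypoints((0.0, 0.0, 1.0),
                                 [(1.0, 0.0, 1.0), (1.0, 1.0, 1.0)])
```

Because each setpoint depends only on the current position and the active waypoint, nothing about the full trajectory has to be computed ahead of time.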