Abstract: This article presents an online reinforcement learning (RL) algorithm to learn the distributed optimal containment control solution for underactuated surface vehicles subject to modeling ...