Multi-agent reinforcement learning behavioral control for nonlinear second-order systems