Formation control of a mono-operated UAV fleet through ad-hoc communications: a Q-learning approach | IEEE Conference Publication | IEEE Xplore