Reinforcement learning for dynamic aerial base station positioning