Secure Reinforcement Learning And The Detection Of Man-In-The-Middle-Attacks