Scalable Bayesian Reinforcement Learning