Enhancing Performance Of Reinforcement Learning Models In The Presence Of Noisy Rewards