Dopamine Research framework for fast prototyping of reinforcement learning algorithms