reinforcement learning scheduler