Teaching Machines to Learn: Unlocking Reinforcement Learning in AI