RL Convergence and the Shifting Axis of LLM Reasoning Competition | AI Insight Note