Search for a command to run...
Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation