Engineering
RLVR trains a 4B model to nail Atlassian API calls
RLVR for tool-use agents trains a 4B model to hit 1.00 reward on Atlassian API tasks, up from a 0.35 baseline on Confluence page creation. Synthetic environments and verifiable rewards close the schema gap.