Tag

#tool-use-agents

1 story tagged tool-use-agents.

Bar chart showing RLVR training lifting Atlassian tool-use reward from 4B baselines of 0.35, 0.52, 0.68, and 0.92 to 1.00, 0.95, 1.00, and 1.00 across four workflow scenarios.

RLVR trains a 4B model to nail Atlassian API calls

RLVR for tool-use agents trains a 4B model to hit 1.00 reward on Atlassian API tasks, up from a 0.35 baseline on Confluence page creation. Synthetic environments and verifiable rewards close the schema gap.

Lars Cornelissen · Jul 3, 2026