Multimodal reinforcement learning with agentic verifier for AI agents
• Argos trains multimodal RL agents to reward answers grounded in visual and temporal evidence, not just plausibility. • Automated verification selects specialized tools per answer