Microsoftresearch

Multimodal reinforcement learning with agentic verifier for AI agents

• Argos trains multimodal RL agents to reward answers grounded in visual and temporal evidence, not just plausibility. • Automated verification selects specialized tools per answer

Online math tutoring service uses AI to help boost students' skills and confidence

• Eedi offers AI‑driven online math tutoring to address learning gaps post‑COVID. • Students start with a 10‑question dynamic quiz that maps their strengths and weaknesses. • Micro