TPRU: Advancing Temporal and Procedural Understanding in Large Multimodal Models
• The TPRU dataset addresses temporal and procedural gaps in multimodal LLMs, enabling richer embodied AI.
• It comprises robotic manipulation and GUI navigation scenes with 3 tasks: T