TPRU: Advancing Temporal and Procedural Understanding in Large Multimodal Models

TPRU: Advancing Temporal and Procedural Understanding in Large Multimodal Models

• TPRU dataset addresses temporal and procedural gaps in multimodal LLMs, enabling richer embodied AI. • Comprised of robotic manipulation and GUI navigation scenes with 3 tasks: T

Research & Labs · February 24, 2026 (updated February 24, 2026) · 1 min · 187 words
TemporalBench: A Benchmark for Evaluating LLM-Based Agents on Contextual and Event-Informed Time Series Tasks

TemporalBench: A Benchmark for Evaluating LLM-Based Agents on Contextual and Event-Informed Time Series Tasks

• TemporalBench offers a multi-domain benchmark for temporal reasoning in LLM agents. • Four-tier taxonomy tests historical structure, context-free, contextual, and event-condition

Research & Labs · February 17, 2026 (updated February 24, 2026) · 1 min · 154 words