Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark!
• Humans can sift through thousands of images, spotting subtle patterns-a skill AI still struggles to match. • Traditional VQA systems answer questions about single images, missing