Controllable Exploration in Hybrid-Policy RLVR for Multi-Modal Reasoning

Controllable Exploration in Hybrid-Policy RLVR for Multi-Modal Reasoning

• Computer Science > Machine Learning [Submitted on 22 Feb 2026] Title:Controllable Exploration in Hybrid-Policy RLVR for Multi-Modal Reasoning View PDF HTML (experimental)Abstract

Research & Labs · February 25, 2026 (updated February 25, 2026) · 2 min · 246 words
Task-Aware Exploration via a Predictive Bisimulation Metric

Task-Aware Exploration via a Predictive Bisimulation Metric

• TEB introduces task-aware exploration for visual RL with sparse rewards. • Uses predictive bisimulation metric to learn behaviorally grounded task representations. • Adds predict

Research & Labs · February 24, 2026 (updated February 24, 2026) · 1 min · 164 words

Deep sea landscapes are a new frontier of human exploration-here's what we may find

• When we dream of landscapes, we might imagine rolling valleys or rugged mountains. • But there is a whole landscape hidden from human view: the secret world of the seafloor.

Science · February 22, 2026 (updated February 24, 2026) · 1 min · 133 words
APEX-SQL: Talking to the data via Agentic Exploration for Text-to-SQL

APEX-SQL: Talking to the data via Agentic Exploration for Text-to-SQL

• Computer Science > Databases [Submitted on 11 Feb 2026] Title:APEX-SQL: Talking to the data via Agentic Exploration for Text-to-SQL View PDF HTML (experimental)Abstract:Text-to-S

Research & Labs · February 20, 2026 (updated February 24, 2026) · 2 min · 253 words
Guided Exploration of Sequential Rules

Guided Exploration of Sequential Rules

• Computer Science > Databases [Submitted on 6 Feb 2026] Title:Guided Exploration of Sequential Rules View PDF HTML (experimental)Abstract:In pattern mining, sequential rules provi

Research & Labs · February 20, 2026 (updated February 24, 2026) · 1 min · 203 words