Curriculum Learning for Efficient Chain-of-Thought Distillation via Structure-Aware Masking and GRPO

Curriculum Learning for Efficient Chain-of-Thought Distillation via Structure-Aware Masking and GRPO

• Computer Science > Machine Learning [Submitted on 5 Feb 2026] Title:Curriculum Learning for Efficient Chain-of-Thought Distillation via Structure-Aware Masking and GRPO View PDF

Research & Labs · February 23, 2026 (updated February 24, 2026) · 2 min · 334 words