Curriculum Learning for Efficient Chain-of-Thought Distillation via Structure-Aware Masking and GRPO
Computer Science > Machine Learning [Submitted on 5 Feb 2026]