KD4MT: A Survey of Knowledge Distillation for Machine Translation
• KD used for compression and knowledge transfer in MT, shaping supervision and translation quality. • Survey covers 105 papers up to Oct 2025, providing comprehensive landscape of
• KD used for compression and knowledge transfer in MT, shaping supervision and translation quality. • Survey covers 105 papers up to Oct 2025, providing comprehensive landscape of
• Uses trace rewriting to deter unauthorized knowledge distillation from large language models. • Introduces anti-distillation techniques that degrade training usefulness while kee