BiScale: Energy-Efficient Disaggregated LLM Serving via Phase-Aware Placement and DVFS

BiScale: Energy-Efficient Disaggregated LLM Serving via Phase-Aware Placement and DVFS

• Prefill/decode disaggregation improves latency-throughput tradeoff for large language model serving. • Energy consumption remains high; autoscaling is too coarse-grained for rapi

FineRef: Fine-Grained Error Reflection and Correction for Long-Form Generation with Citations

FineRef: Fine-Grained Error Reflection and Correction for Long-Form Generation with Citations

• FineRef introduces fine-grained error reflection for citation mismatch and irrelevance in long‑form LLM generation. • Two‑stage training: supervised fine‑tuning with attempt‑refl

Research & Labs · February 24, 2026 (updated February 24, 2026) · 1 min · 189 words
Quantifying construct validity in large language model evaluations

Quantifying construct validity in large language model evaluations

• LLM benchmarks often misrepresent true model capabilities due to contamination and annotator errors. • Construct validity is essential to ensure benchmarks truly measure desired

Research & Labs · February 18, 2026 (updated February 24, 2026) · 1 min · 161 words