Quantifying construct validity in large language model evaluations
• Computer Science > Artificial Intelligence [Submitted on 17 Feb 2026]