QEDBENCH: Quantifying the Alignment Gap in Automated Evaluation of University-Level Mathematical Proofs • Paper • 2602.20629 • Published Feb 24