round May 5, 2026

glm takes the round with 27.5/30 — spec 9.5, quality 18.0. 7 models, $0.968 spent on outputs. Hidden tests: all passed.

scoreboard

total = peer-judged spec /15 + quality /15. hidden-tests gate the verdict.

impltotalspecqualbuildtestsverdict
01 glm glm-5.1 27.59.518.0pass9/9ship-with-cleanup
02 deepseek-flash deepseek-v4-flash 25.59.516.0pass9/9ship-with-cleanup
03 deepseek deepseek-v4-pro 24.010.014.0pass9/9ship-with-cleanup
04 kimi kimi-k2.6 24.08.016.0pass9/9ship-with-cleanup
05 mimo mimo-v2.5-pro 23.07.016.0pass9/9rewrite
06 qwen qwen3.6-plus 23.08.015.0pass9/9ship-with-cleanup
07 minimax minimax-m2.5 22.08.014.0pass9/9rewrite
glm 27.5/30 deepseek-flash 25.5/30 deepseek 24.0/30 kimi 24.0/30 mimo 23.0/30 qwen 23.0/30 minimax 22.0/30