i think these posttraining-automation benchmarks are even more important than they seem
when models cross the threshold of being able posttrain other models, hopefully there will be a cambrian explosion of the types of minds
authoring minds will become an accessible artform
GLM 5.2 is 5x cheaper than Opus 4.8 and 11x than Fable 5, yet it tops PostTrainBench.
That’s exciting because lower costs make personalized intelligence economically viable. Every company and country should be able to own models trained on its own data and have sovereignty over it. The future is millions of models, each crafted around the data, values, and decisions of the people who rely on them.

















