MMLU-Pro holds steady at 85.0 and AIME 2025 improves slightly to 89.3, while GPQA-Diamond dips from 80.7 to 79.9. Coding and agent benchmarks tell a similar story, with Codeforces ratings rising from ...
Archi Padilla, vice president of New Mexico BARC (Border Animal Rescue Coalition), had asked to address the council about changes the organization would like to see made to the Hurley animal control ...