Buuuuuuuuuuuullllsssss********

05. November 2025

The French Government Launches an LLM Leaderboard Comparable to LMArena, Emphasizing European Languages and Energy Efficiency

Mistral Medium number one. Very medium, very energy efficient.

Ha.

Here: https://comparia.beta.gouv.fr/ranking

It measures something called the “BT score of satisfaction”. BT stands for British Telecom, or…?
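Joking aside, "BT" on an arena-style leaderboard most likely means Bradley-Terry, the usual model for turning pairwise "which answer was better" votes into a ranking. That reading is my assumption, and I have not checked how the site actually computes its score. As a rough sketch only: the function names, the toy votes, and the simple iterative fit below are all made up for illustration.

```python
from collections import defaultdict


def bradley_terry(votes, iterations=200):
    """Fit Bradley-Terry strengths from (winner, loser) pairs.

    Uses the classic iterative update: strength = wins / sum over opponents
    of comparisons / (own strength + opponent strength).
    Assumes winner and loser are always two different models.
    """
    wins = defaultdict(int)          # total wins per model
    pair_counts = defaultdict(int)   # comparisons per unordered pair
    models = set()
    for winner, loser in votes:
        wins[winner] += 1
        pair_counts[frozenset((winner, loser))] += 1
        models.update((winner, loser))

    strength = {m: 1.0 for m in models}
    for _ in range(iterations):
        updated = {}
        for m in models:
            denom = 0.0
            for pair, n in pair_counts.items():
                if m in pair:
                    (other,) = pair - {m}
                    denom += n / (strength[m] + strength[other])
            updated[m] = wins[m] / denom if denom else strength[m]
        # Rescale so the strengths average to 1 (the scale is arbitrary).
        total = sum(updated.values())
        strength = {m: s * len(models) / total for m, s in updated.items()}
    return strength


def win_probability(strength, a, b):
    """Probability that model a is preferred over model b under the fit."""
    return strength[a] / (strength[a] + strength[b])


# Hypothetical toy votes: model "A" preferred over model "B" three times out of four.
toy_votes = [("A", "B"), ("A", "B"), ("A", "B"), ("B", "A")]
toy_strength = bradley_terry(toy_votes)
toy_p = win_probability(toy_strength, "A", "B")
```

With only two models the fitted win probability simply reproduces the observed win rate. The model gets interesting once many models are compared unevenly, and that is also where too few votes for a model make its score unreliable.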

DeepSeek Chat v3.2 missing, MiniMax M2 missing, GLM missing, Kimi K2 missing, qwen3-32b the highest-ranked Qwen model, grok-3-mini-beta beating grok-4-fast and the highest-ranking Grok model, Gemini 2.5 Flash the highest-ranking Google model, Nemotron with a great top-20 score, gpt-oss-120b beating out gpt-5.

edit2: GLM 4.6 and Kimi K2 are selectable, but GLM had too few votes.

HA, yeah. Yeah.

edit: It does have an upside. Free Claude Sonnet 4.5 and Gemini Flash use without login. It remembers the model selection. 🙂

Fine by me, if the French government is paying for it.. 😉








