News
It achieved an 8.0% higher win rate over DeepSeek R1, suggesting that its strengths generalize beyond just logic or math-heavy challenges.
Apparently it was huge on TikTok like a year ago, but now that I'm 40 I am no longer expected to know/care about anything ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results