原文整理页

swyx 在 AI Engineer Europe 大会上分享 SWE-rebench 排行榜,探讨评估体系构建及模型作弊现象

来源作者:swyx 🇬🇧 @aidotengineer (@swyx)原始来源:https://x.com/swyx/status/2042217493666623496

中文导读

swyx 在 AI Engineer Europe 大会上分享 SWE-rebench 排行榜,探讨评估体系构建及模型作弊现象。

正文 Markdown

At 15:10 today, I’ll be speaking about our SWE-rebench leaderboard at AI Engineer Europe. I'll cover how we build evals and how models cheat! Come listen and let's chat! So far, this is the coolest applied AI event in London. Respect and thanks to @aiDotEngineer and @swyx See you there! 👋