🏆 Leaderboard 🏆

Leaderboard shows the pass@1 and ranking of LLMs on EvoEval benchmarks ordered by EvoEval score (average of all benchmarks). We label instruction-following models with .