#autocodebench
1 total
- AutoCodeBench: When LLMs Generate Code Benchmarks
Why the Elixir column stands out in AutoCodeBench, and how it opens up a discussion of difficulty equivalence in automatically generated multilingual code benchmarks
1 total
Why the Elixir column stands out in AutoCodeBench, and how it opens up a discussion of difficulty equivalence in automatically generated multilingual code benchmarks