#autocodebench

1 total

2026-04-19
AutoCodeBench: When LLMs Generate Code Benchmarks
Why the Elixir column stands out in AutoCodeBench, and how it opens up a discussion of difficulty equivalence in automatically generated multilingual code benchmarks

#paper-reading #autocodebench #elixir #code-generation #benchmark #LLM