[11/18/2025] SWE-bench Verified and Multilingual now only accepts submissions from academic teams and research institutions with open source methods and peer-reviewed publications. Click to read full ...
Code and data for our ICLR 2024 paper SWE-bench: Can Language Models Resolve Real-World GitHub Issues? Please refer our website for the public leaderboard and the change log for information on the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results