by QuantaAlpha
Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with a cost-aware α metric.
2025.09.19 🎉 Excited to announce that our papers have been accepted to <u>NeurIPS 2025</u> — RepoMaster as a Spotlight (≈3.2%) and SE-Agent as a Poster (≈24.52%)!
2025.08.28 🎉 We open-sourced RepoMaster — an AI agent that leverages GitHub repos to solve complex real-world tasks.
2025.08.26 🎉 We open-sourced GitTaskBench — a repo-level benchmark & tooling suite for real-world tasks.
2025.08.10 🎉 We open-sourced SE-Agent — a self-evolution trajectory framework for multi-step reasoning.
🔗 Ecosystem: RepoMaster · GitTaskBench · SE-Agent · Team Homepage
The ultimate vision for AI agents is to enable users to accomplish real-world tasks simply by describing their needs in natural language—leaving all planning and execution to the agent, which delivers the final results autonomously.
⚠️ While existing benchmarks evaluate various agent capabilities, few focus on tasks that reflect genuine real-world practicality, especially those requiring comprehensive understanding and use of full-scale project repositories.
👋 To address this gap, we introduce GitTaskBench. Our benchmark focuses on tasks whose complexity and practical value demand leveraging repository-level code, mirroring how developers solve real problems using...