BigCodeBench
🧠A benchmark for evaluating the code generation capabilities of large language models, featuring 1,140 software-engineering-oriented programming tasks with two modes (Complete and Instruct) to test models on complex instructions and diverse function call scenarios.
Docs, Tutorials & ResourcesPythonPyTorch
how-to-build-a-coding-agent
✨A step-by-step workshop that teaches you how to build your own AI-powered coding assistant, starting from a basic chatbot and progressively adding powerful tools like file reading, shell command execution, and code search.
Docs, Tutorials & ResourcesGoAI Agents
MiroThinker
✨An open-source deep research agent optimized for research and prediction tasks, achieving 80.8% Avg@8 score on the challenging GAIA benchmark, featuring 256K context window support and up to 600 tool calls per task.
Agent & ToolingPythonAI Agents
learn-claude-code
✨An educational project that teaches you how to build modern AI coding agents from scratch through progressive tutorials, featuring 5 versions from simple bash tools to a complete skills system.
Docs, Tutorials & ResourcesPythonAI Agents