Agent Park - Agent Project Navigator

All Projects

1 project

BigCodeBench

A benchmark for evaluating the code generation capabilities of large language models. It features 1,140 software-engineering-oriented programming tasks in two modes (Complete and Instruct), testing models on complex instructions and diverse function calls.

Python, PyTorch, Large Language Models
