Agent Park - Agent Project Navigator

All Projects

6 projects

ART (Agent Reinforcement Trainer)

🧠

An open-source reinforcement learning framework that trains multi-step agents for real-world tasks using GRPO, supporting Qwen2.5, Qwen3, Llama, and other large language models.

Model & Inference Framework · Python · vLLM


BigCodeBench

🧠

A benchmark for evaluating the code generation capabilities of large language models, featuring 1,140 software-engineering-oriented programming tasks with two modes (Complete and Instruct) to test models on complex instructions and diverse function call scenarios.

Docs, Tutorials & Resources · Python · PyTorch


StateSpace

✨

A modern framework for probabilistic programming and Bayesian analysis designed for research and data analytics, featuring intuitive APIs and flexible model definition capabilities.

Model & Inference Framework · Python · Machine Learning


AgentGym

✨

A comprehensive platform for training, evaluating, and evolving LLM-based agents across diverse environments with standardized benchmarks.

Agent & Tooling · Python · PyTorch


mario-ai

✨

A reinforcement learning environment for Mario AI that provides trainable agents for playing Super Mario games.

Agent & Tooling · Python · PyTorch


DeepResearch

✨

DeepResearch is an open-source deep research agent developed by Alibaba, designed for long-horizon, deep information-seeking tasks. The model has 30.5 billion total parameters, of which only 3.3 billion are activated per token, and it reports state-of-the-art performance on agentic search benchmarks such as Humanity's Last Exam, BrowseComp, and WebWalkerQA.

Model & Inference Framework · Python · PyTorch




