DISCOVER THE FUTURE OF AI AGENTSarrow_forward

All Projects

6 projects

ART (Agent Reinforcement Trainer)

🧠

An open-source reinforcement learning framework that trains multi-step agents for real-world tasks using GRPO, supporting Qwen2.5, Qwen3, Llama, and other large language models.

Model & Inference FrameworkPythonvLLM

BigCodeBench

🧠

A benchmark for evaluating the code generation capabilities of large language models, featuring 1,140 software-engineering-oriented programming tasks with two modes (Complete and Instruct) to test models on complex instructions and diverse function call scenarios.

Docs, Tutorials & ResourcesPythonPyTorch

StateSpace

A modern framework for probabilistic programming and Bayesian analysis designed for research and data analytics, featuring intuitive APIs and flexible model definition capabilities。

Model & Inference FrameworkPythonMachine Learning

AgentGym

A comprehensive platform for training, evaluating, and evolving LLM-based agents across diverse environments with standardized benchmarks.

Agent & ToolingPythonPyTorch

mario-ai

A reinforcement learning environment for Mario AI, offering trainable agents to play Super Mario games.

Agent & ToolingPythonPyTorch

DeepResearch

DeepResearch is an open-source deep research agent developed by Alibaba, designed for long-horizon, deep information-seeking tasks. With 30.5 billion total parameters but only 3.3 billion activated per token, it demonstrates state-of-the-art performance across various agentic search benchmarks like Humanity's Last Exam, BrowseComp, and WebWalkerQA.

Model & Inference FrameworkPythonPyTorch
Per page

Page 1 / 1 · 6 total

STAY UPDATED

Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.

rocket_launch