ART (Agent Reinforcement Trainer)
🧠An open-source reinforcement learning framework that trains multi-step agents for real-world tasks using GRPO, supporting Qwen2.5, Qwen3, Llama, and other large language models.
An open-source reinforcement learning framework that trains multi-step agents for real-world tasks using GRPO, supporting Qwen2.5, Qwen3, Llama, and other large language models.
A benchmark for evaluating the code generation capabilities of large language models, featuring 1,140 software-engineering-oriented programming tasks with two modes (Complete and Instruct) to test models on complex instructions and diverse function call scenarios.
A modern framework for probabilistic programming and Bayesian analysis designed for research and data analytics, featuring intuitive APIs and flexible model definition capabilities。
A comprehensive platform for training, evaluating, and evolving LLM-based agents across diverse environments with standardized benchmarks.
A reinforcement learning environment for Mario AI, offering trainable agents to play Super Mario games.
DeepResearch is an open-source deep research agent developed by Alibaba, designed for long-horizon, deep information-seeking tasks. With 30.5 billion total parameters but only 3.3 billion activated per token, it demonstrates state-of-the-art performance across various agentic search benchmarks like Humanity's Last Exam, BrowseComp, and WebWalkerQA.
Page 1 / 1 · 6 total
Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.