slime
An LLM post-training framework for RL scaling from Tsinghua's THUDM group. It deeply integrates Megatron-LM training with the SGLang inference engine for distributed reinforcement learning on large models such as GLM, Qwen, DeepSeek, and Llama.
An open-source framework by Stream for building vision AI agents that work with any model or video provider, leveraging Stream's edge network for ultra-low latency video experiences.
An AI agent that runs in your terminal, helping you complete software development tasks and terminal operations. It can read and edit code, execute shell commands, search and fetch web pages, and autonomously plan and adjust actions during execution.
A high-throughput, low-latency inference framework designed for serving generative AI and reasoning models in multi-node distributed environments.
A curated, daily-updated collection of research papers on LLM-based autonomous agents, giving researchers and developers the latest findings.