slime
✨An LLM post-training framework for RL scaling by Tsinghua THUDM, deeply integrating Megatron-LM training with SGLang inference engine for distributed reinforcement learning on large models like GLM, Qwen, DeepSeek, and Llama.
Vision-Agents
✨An open-source framework by Stream for building vision AI agents that work with any model or video provider, leveraging Stream's edge network for ultra-low latency video experiences.
Agent & ToolingPythonPyTorch
NVIDIA Dynamo
🧠A high-throughput, low-latency inference framework designed for serving generative AI and reasoning models in multi-node distributed environments.
Model & Inference FrameworkRustPython