llmfit
A Rust-based cross-platform CLI tool that right-sizes LLM models to your system's RAM, CPU, and GPU by detecting specs and recommending optimal models and quantization strategies. Covers 206 models from 57 providers.
A minimal, hackable experimental harness for training LLMs on a single GPU node, covering all stages from pretraining to a ChatGPT-like UI.
An open-source framework by Stream for building vision AI agents that work with any model or video provider, leveraging Stream's edge network for ultra-low latency video experiences.
AirLLM optimizes inference memory usage, enabling 70B large language models to run on a single 4GB GPU card without quantization, distillation, or pruning. It now also supports running 405B Llama3.1 models on 8GB VRAM.
A modern AI gateway that provides a unified API compatible with OpenAI, Anthropic, Gemini, and the AI SDK, enabling integration across multiple AI providers with automatic request translation and comprehensive tracing.
Microsoft's family of open-source frontier voice AI models including both Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) models, designed for long-form audio processing with multilingual support.
Trinity-RFT is a general-purpose, flexible and user-friendly framework for LLM reinforcement fine-tuning (RFT). It decouples RFT into three coordinated components: Explorer, Trainer, and Buffer, enabling users with different backgrounds to train LLM-powered agents for specific domains.
A unified, easy-to-extend framework based on verl for training tool-using agents, supporting diverse tool-use scenarios and training tool-calling capabilities via reinforcement learning.
A tool that gracefully solves hCaptcha challenges using multimodal large language models, without relying on browser extensions or third-party captcha services.
A benchmark for evaluating the code generation capabilities of large language models, featuring 1,140 software-engineering-oriented programming tasks with two modes (Complete and Instruct) to test models on complex instructions and diverse function call scenarios.