AWorld
✨An open-source framework for building, evaluating, and training general multi-agent systems. Features natural language agent creation, distributed reinforcement learning training pipeline, and complex environment interactions. Ranks top on authoritative benchmarks including GAIA, OSWorld, and VisualWebArena.
Rankify
✨A modular Python toolkit developed by the University of Innsbruck that integrates information retrieval, re-ranking, and RAG generation, featuring 40+ pre-processed datasets and single-line pipeline construction.
Model & Inference FrameworkNatural Language ProcessingSDK
Roboflow Trackers
✨A plug-and-play multi-object tracking (MOT) Python library offering modular implementations of classic algorithms like SORT and ByteTrack. Features a detector-agnostic design compatible with any object detection model (YOLO, DETR, etc.), supporting video files, cameras, RTSP streams, and more. Provides unified CLI tools and Python API with built-in evaluation metrics (CLEAR, HOTA, Identity).
MultimodalDeep LearningSDK
Machine Learning Systems (CS249r Book)
✨An interactive open-access textbook on Machine Learning Systems engineering from Harvard University, integrating the TinyTorch framework with hands-on edge deployment labs, covering the full spectrum from ML fundamentals to system optimization.
OtherDeep LearningMachine Learning
Hands-On Large Language Models
✨Official code repository for the O'Reilly book "Hands-On Large Language Models". Features 12 core chapters and bonus content covering Tokens, Transformers, RAG, and Fine-tuning. Includes 300+ illustrations and runnable Jupyter Notebooks optimized for Colab and local environments.
Natural Language ProcessingRAG大语言模型
AI Engineering Hub
✨A comprehensive AI engineering hub featuring 93+ production-ready projects with in-depth tutorials and implementations for LLMs, RAGs, AI Agents, and MCP, covering beginner to advanced skill levels.
Other大语言模型Model Context Protocol
MiniCPM-o
✨An end-to-side omnimodal LLM by Tsinghua THUNLP supporting vision, speech, and full-duplex multimodal live streaming, optimized for mobile deployment with performance rivaling Gemini 2.5 Flash.
大语言模型MultimodalTransformers
Heretic
✨Fully automatic censorship removal tool for language models using directional ablation with TPE parameter optimization to remove safety alignment while minimizing refusal behaviors and preserving original model capabilities. Supports dense, multimodal, and MoE architectures.
Multimodal大语言模型Transformers
CogAgent
✨An open-sourced end-to-end VLM-based GUI Agent developed by Tsinghua University and Zhipu AI, built on GLM-4V-9B bilingual VLM, enabling cross-platform GUI automation and reasoning via screenshots and natural language instructions.
Model & Inference Framework大语言模型Multimodal
Awesome Long Chain-of-Thought Reasoning
✨A curated collection of resources for Long Chain-of-Thought (Long-CoT) reasoning in LLMs, featuring papers, implementations, and datasets to track the latest advancements in the field.
OtherNatural Language ProcessingAI Agents