DISCOVER THE FUTURE OF AI AGENTSarrow_forward

All Projects

85 projects

Familiar

Open-source, local-first macOS menu bar app that automatically captures screen context (screenshot OCR, clipboard mirroring) to keep AI tools continuously informed about your work environment.

RAGMultimodalModel Context Protocol

Clawd Cursor

AI desktop agent that sees your screen, controls your cursor, and completes tasks autonomously. Features a 5-layer intelligent fallback pipeline, multiple AI providers (Anthropic/OpenAI/Ollama/Kimi), with Web Dashboard and REST API.

MultimodalAI AgentsAgent Framework

SmartCall-Agent

A modular voice AI platform based on LiveKit and OpenAI Realtime API, integrating RAG knowledge retrieval, JWT authentication, and MongoDB persistence for real-time outbound calling and domain-specific conversations.

Model & Inference Framework大语言模型Multimodal

Edge-Veda

On-device full-stack AI SDK for Flutter with LLM, Vision, Speech, Image Gen, and RAG; features compute budget contracts and adaptive QoS with zero cloud dependency.

大语言模型MultimodalSDK

NagaAgent

A four-service collaborative AI desktop assistant framework with streaming tool calling, GRAG knowledge graph memory, Live2D avatar, and voice interaction

RAGMultimodalAI Agents

AWorld

An open-source framework for building, evaluating, and training general multi-agent systems. Features natural language agent creation, distributed reinforcement learning training pipeline, and complex environment interactions. Ranks top on authoritative benchmarks including GAIA, OSWorld, and VisualWebArena.

OtherMultimodal大语言模型

Tandem

A local-first AI workspace built on Rust and Tauri, acting as an AI coworker for secure, supervised automation on any folder. Supports multiple LLM backends, MCP protocol extension, and multimodal file processing.

Model & Inference Framework大语言模型Multimodal

SimpleLLMFunc

A lightweight yet complete LLM/Agent application development framework. Uses decorators to transform function signatures and docstrings into prompts, enabling type-safe LLM capabilities without function body implementation. Features multi-provider support, multimodal I/O, tool calling, streaming, API key load balancing, and Langfuse observability integration.

Model & Inference Framework大语言模型Multimodal

Open Computer Use

An open-source full-stack framework for autonomous computer agents, enabling control of browsers, terminals, and desktop apps via natural language in Docker VMs. Maintained by coasty-ai under Apache 2.0 license, achieving 82% on OSWorld Benchmark.

Model & Inference FrameworkNatural Language ProcessingMultimodal

Headroom

The Context Optimization Layer for LLM Applications, delivering 40-90% token reduction through deterministic compression and intelligent caching with multi-modal support and reversible CCR mechanism

Model & Inference FrameworkSDK大语言模型
Per page
...

Page 1 / 9 · 85 total

STAY UPDATED

Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.

rocket_launch