DISCOVER THE FUTURE OF AI AGENTSarrow_forward

AlphaAvatar

calendar_todayAdded Feb 22, 2026
categoryAgent & Tooling
codeOpen Source
PythonMultimodalRAGAI AgentsAgent FrameworkWeb ApplicationAgent & ToolingDocs, Tutorials & ResourcesKnowledge Management, Retrieval & RAGProtocol, API & IntegrationComputer Vision & Multimodal

A learnable, configurable, and pluggable Omni-Avatar Assistant framework built on LiveKit, featuring real-time interaction, multimodal memory, user persona, and external tool integration.

Project Overview#

AlphaAvatar is an Apache-2.0 open-source project aiming to build a universal virtual assistant. It is a Python-based Agent framework that solves the challenge of integrating real-time audio/video interaction (LiveKit), LLM inference, long-term memory, user persona, and virtual characters, providing a digital assistant solution with "self-evolution" capabilities.

Core Value: Lowering the barrier to building real-time voice/video AI Agents with long-term memory and personalized interaction capabilities.

Use Cases:

  • Real-time voice/video virtual companionship and assistance
  • Intelligent customer service or educational tutoring Agents with long-term memory
  • Multimodal interaction research (speech recognition, speaker diarization, Live2D character driving)
  • Intelligent Q&A systems with external knowledge base (RAG) and deep web search

Core Capabilities & Plugins#

The project features a plugin-based design with two main categories: AlphaAvatar core plugins and tool plugins.

AlphaAvatar Core Plugins#

PluginStatusDescription
🧠 MemoryImplementedSelf-improving memory module supporting Assistant–User/Assistant–Tools/Assistant's self-memory capture and retrieval
🧬 PersonaImplementedAutomatic full modality user persona with speaker ID verification and real-time profile matching
😊 Virtual CharacterImplementedReal-time generated virtual character, integrated with AIRI live2d
💡 ReflectionPlannedOptimizer for automatic internal knowledge base construction
🗺️ PlanningPlannedLong-term planning for sequential and reliable actions
🤖 BehaviorPlannedBehavior logic and process flow controller

Tool Plugins#

PluginStatusDescription
🔍 DeepResearchImplementedNetwork access and deep search via Tavily API, supporting quick search/deep search/web-to-PDF
📖 RAGImplementedDocument knowledge access via RAG Anything with DeepResearch page indexing

Installation & Quick Start#

Requirements#

  • Python 3.11+
  • Dependencies: LiveKit Server, OpenAI API Key, Qdrant (cloud or self-hosted), Tavily API Key (optional)

Install from PyPI (Stable)#

uv venv .my-env --python 3.11
source .my-env/bin/activate
pip install alpha-avatar-agents

Install from GitHub (Latest)#

git clone --recurse-submodules https://github.com/AlphaAvatar/AlphaAvatar.git
cd AlphaAvatar
uv venv .venv --python 3.11
source .venv/bin/activate
uv sync --all-packages

Environment Variables#

export LIVEKIT_API_KEY=<your API Key>
export LIVEKIT_API_SECRET=<your API Secret>
export LIVEKIT_URL=<your LiveKit server URL>
export OPENAI_API_KEY=<your OpenAI API Key>
export QDRANT_URL='https://xxxxxx.us-east.aws.cloud.qdrant.io:6333'
export QDRANT_API_KEY=<your QDRANT API Key>
export TAVILY_API_KEY=<your TAVILY API Key>  # Optional

Run in Development Mode#

alphaavatar download-files
alphaavatar dev examples/pipline_openai_airi.yaml
# or
alphaavatar dev examples/pipline_openai_tools.yaml

Architecture Design#

  • Core Framework: Built on LiveKit Agents for real-time interaction streams
  • Modular Design: avatar-agents (core Agent logic & orchestration) + avatar-plugins (plugin implementations)
  • Context Manager: Core routing component distributing real-time interaction data to plugin models
  • Data Storage: Qdrant vector database for Memory and Persona Embedding storage
  • Multimodal Pipeline: LiveKit AV stream → STT → Context Manager (Persona/Memory) → LLM → Tools (DeepResearch/RAG) → TTS → Audio stream + Live2D drive

CLI Commands#

  • alphaavatar download-files: Initialize and download required resource files
  • alphaavatar dev <config_path>: Start Agent in development mode with specified YAML config

Version History#

DateVersionKey Updates
2026-01v0.3.1ADD tool calls during user-Assistant interactions to the Memory module
2026-01v0.3.0Support DeepResearch by tavily API
2025-12v0.2.0Support AIRI live2d-based virtual character display
2025-11v0.1.0Support automatic memory extraction, automatic user persona extraction and matching

Project Vision#

Build a universal assistant capable of recognizing users through multimodal streaming input. It should possess self-memory, autonomous reflection, and iterative self-evolution for real-time interaction. The assistant will seamlessly integrate with mainstream external tools to solve practical problems efficiently.

Related Projects

View All arrow_forward

STAY UPDATED

Get the latest AI tools and trends delivered straight to your inbox. No spam, just intelligence.

rocket_launch