
🎭 AvaTar - Getting Started Guide

Interactive AI Avatar System

Overview

AvaTar is a real-time interactive AI avatar system that provides HeyGen-like functionality for creating conversational AI avatars with lip-sync, voice synthesis, and streaming video capabilities.

🚀 Quick Start

Prerequisites

Local Development Setup

# Clone the repository
git clone https://github.com/yourusername/AvaTar.git
cd AvaTar

# Copy environment template
cp env.example .env

# Edit .env and add your API keys
nano .env

# Start the system
docker-compose -f docker-compose-simple.yml up -d

# Check health
curl http://localhost:8000/health
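
Containers started with `up -d` may need a few seconds before the API answers, so a single `curl` right after startup can fail even when nothing is wrong. The sketch below polls the same `/health` URL until it returns HTTP 200. The helper names `check_health` and `wait_for_health`, the retry counts, and the assumption that a healthy service answers with status 200 are illustrative and not taken from the AvaTar codebase.

```python
import time
import urllib.error
import urllib.request


def check_health(url: str, timeout: float = 2.0) -> bool:
    """Return True if a GET on `url` answers with HTTP 200 (assumed healthy)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or a non-2xx status: not ready yet.
        return False


def wait_for_health(url, attempts=30, delay=1.0, probe=check_health, sleep=time.sleep):
    """Poll `url` up to `attempts` times, pausing `delay` seconds between tries.

    `probe` and `sleep` are injectable so the loop can be exercised
    without a running server.
    """
    for attempt in range(attempts):
        if probe(url):
            return True
        if attempt < attempts - 1:
            sleep(delay)
    return False
```

Calling `wait_for_health("http://localhost:8000/health")` after `docker-compose ... up -d` returns `True` once the endpoint responds, or `False` if it never does within the attempt budget.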

Access the System

🏗️ Architecture

┌─────────────────┐     ┌─────────────────┐     ┌─────────────────┐
│   Web Client    │────▶│   FastAPI       │────▶│  Redis Queue    │
│  (JavaScript)   │◀────│   Backend       │◀────│  & Pub/Sub      │
└─────────────────┘     └─────────────────┘     └─────────────────┘
         │                       │                       │
         │                       ▼                       ▼
         │              ┌─────────────────┐     ┌─────────────────┐
         └─────────────▶│  WebSocket      │     │  Linly-Talker   │
                        │   Server        │     │  (Lip Sync)     │
                        └─────────────────┘     └─────────────────┘
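
To make the data flow in the diagram concrete, here is a minimal in-process sketch of the queue-then-publish pattern it suggests. It is not AvaTar code: `asyncio.Queue` stands in for the Redis queue and pub/sub channel, `lip_sync_worker` stands in for Linly-Talker, `websocket_forwarder` stands in for the WebSocket server, and the "one frame per word" rule is a placeholder chosen only so the example produces visible output.

```python
import asyncio


async def lip_sync_worker(jobs: asyncio.Queue, frames: asyncio.Queue) -> None:
    """Stand-in for the lip-sync service: consume jobs, publish frames."""
    while True:
        job = await jobs.get()
        if job is None:
            # Sentinel: tell the downstream stage to stop, then exit.
            await frames.put(None)
            return
        session_id, text = job
        # Hypothetical placeholder: emit one "frame" per word of the reply.
        for index, word in enumerate(text.split()):
            await frames.put((session_id, index, word))


async def websocket_forwarder(frames: asyncio.Queue, delivered: list) -> None:
    """Stand-in for the WebSocket server: forward frames to the client."""
    while True:
        frame = await frames.get()
        if frame is None:
            return
        delivered.append(frame)  # a real server would send over a socket


async def run_pipeline(session_id: str, text: str) -> list:
    """Play the backend role: enqueue one job and collect delivered frames."""
    jobs = asyncio.Queue()    # stand-in for the Redis job queue
    frames = asyncio.Queue()  # stand-in for the Redis pub/sub channel
    delivered = []
    worker = asyncio.create_task(lip_sync_worker(jobs, frames))
    forwarder = asyncio.create_task(websocket_forwarder(frames, delivered))
    await jobs.put((session_id, text))
    await jobs.put(None)
    await asyncio.gather(worker, forwarder)
    return delivered
```

The point of the pattern is that the backend returns as soon as the job is queued, while frame generation and delivery proceed independently in their own stages.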

🎯 Key Features

Real-time Streaming

WebSocket-based video and audio streaming

Voice Synthesis

ElevenLabs integration for natural voice generation

Lip Sync

Automated lip synchronization with speech

Conversational AI

Integration with OpenAI/Anthropic

Scalable Architecture

Microservices design with Redis queue

HeyGen Compatibility

Compatible with HeyGen Avatar API

🔧 Configuration

Environment Variables

# API Keys
ELEVENLABS_API_KEY=your_elevenlabs_key
OPENAI_API_KEY=your_openai_key

# Service URLs
REDIS_URL=redis://redis:6379
API_BASE_URL=http://localhost:8000

# Performance
FRAME_BUFFER_SIZE=100
MAX_CONCURRENT_SESSIONS=10
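
One way a service could read and validate these variables at startup is sketched below. The `Settings` class and `load_settings` function are hypothetical names, not part of AvaTar. Treating both API keys as required, and using the example values above as defaults for the remaining variables, are assumptions made for illustration.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class Settings:
    elevenlabs_api_key: str
    openai_api_key: str
    redis_url: str
    api_base_url: str
    frame_buffer_size: int
    max_concurrent_sessions: int


def _positive_int(env, name: str, default: int) -> int:
    """Parse `name` from `env` as a positive integer, with a fallback."""
    raw = env.get(name, str(default))
    try:
        value = int(raw)
    except ValueError:
        raise ValueError(f"{name} must be an integer, got {raw!r}") from None
    if value <= 0:
        raise ValueError(f"{name} must be positive, got {value}")
    return value


def load_settings(env=None) -> Settings:
    """Build Settings from a mapping (defaults to the process environment)."""
    env = os.environ if env is None else env
    # Assumption: both API keys are mandatory for the system to work.
    required = ("ELEVENLABS_API_KEY", "OPENAI_API_KEY")
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise ValueError("missing required variables: " + ", ".join(missing))
    return Settings(
        elevenlabs_api_key=env["ELEVENLABS_API_KEY"],
        openai_api_key=env["OPENAI_API_KEY"],
        redis_url=env.get("REDIS_URL", "redis://redis:6379"),
        api_base_url=env.get("API_BASE_URL", "http://localhost:8000"),
        frame_buffer_size=_positive_int(env, "FRAME_BUFFER_SIZE", 100),
        max_concurrent_sessions=_positive_int(env, "MAX_CONCURRENT_SESSIONS", 10),
    )
```

Failing fast on a missing key or a non-numeric `FRAME_BUFFER_SIZE` surfaces configuration mistakes at startup instead of during the first avatar session.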

📊 Performance & Scaling

Recommended EC2 Instance Types

Capacity Planning

🛠️ Troubleshooting

Common Issues

1. Black screen / No video

2. No audio playback

3. High latency

📈 Monitoring