Infrastructure
Compute Infrastructure
The computational backbone powering 10,000 Faculty agents, real-time dialogue, RAG pipelines, voice synthesis, and the entire Inquiry Institute platform.
Estimated Monthly Compute Costs
$4,500–$16,150
Varies based on usage, Faculty activity, and course enrollment
LLM Inference
~$2,000–8,000/mo
Large language model inference for Faculty conversations, content generation, and interactive dialogue with 10,000 AI personas.
OpenRouter Gateway
OpenRouter
Multi-model routing to Claude, GPT-4, Llama, Mixtral, and other models
- Dynamic model selection per Faculty
- Automatic fallback on rate limits
- Usage tracking per Faculty agent
- Cost optimization via model routing
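The per-Faculty model selection, rate-limit fallback, and usage tracking listed above can be sketched roughly as follows. This is a minimal illustration and not the platform's gateway code: `FacultyRouter`, `RateLimitError`, the `call_model` callable, and the model names in the usage note are all hypothetical, and no real OpenRouter API call is made.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


class RateLimitError(Exception):
    """Hypothetical error a model client raises when a provider rate-limits a call."""


@dataclass
class FacultyRouter:
    # Ordered model preferences per Faculty id (assumed configuration shape).
    preferences: Dict[str, List[str]]
    # Chain used when a Faculty has no explicit preference.
    default_chain: List[str]
    # Running token count per Faculty, for usage tracking.
    usage: Dict[str, int] = field(default_factory=dict)

    def complete(
        self,
        faculty_id: str,
        prompt: str,
        call_model: Callable[[str, str], Tuple[str, int]],
    ) -> Tuple[str, str]:
        """Try each model in order; return (model_used, text) from the first success."""
        chain = self.preferences.get(faculty_id, self.default_chain)
        for model in chain:
            try:
                text, tokens = call_model(model, prompt)
            except RateLimitError:
                continue  # fall back to the next model in the chain
            self.usage[faculty_id] = self.usage.get(faculty_id, 0) + tokens
            return model, text
        raise RuntimeError(f"all models rate-limited for {faculty_id}")
```

In this sketch, cost optimization comes down to how each chain is ordered: putting a cheaper model first for a Faculty with simple workloads means the expensive model is only reached on fallback.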
Context Window Management
Managing long conversations with historical Faculty personas
- 128K+ token contexts for deep dialogue
- Sliding window for extended sessions
- RAG augmentation for corpus access
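One way to picture the sliding window for extended sessions is a function that keeps the persona's system prompt and as many of the most recent turns as fit a token budget. The sketch below makes several assumptions: messages are dicts with `role` and `content` keys, and `estimate_tokens` is a crude four-characters-per-token guess standing in for a real tokenizer.

```python
from typing import Callable, Dict, List

Message = Dict[str, str]


def estimate_tokens(text: str) -> int:
    """Crude estimate of about 4 characters per token (an assumption, not a tokenizer)."""
    return max(1, len(text) // 4)


def sliding_window(
    messages: List[Message],
    budget: int,
    count: Callable[[str], int] = estimate_tokens,
) -> List[Message]:
    """Keep all system messages plus the newest turns that fit within the budget."""
    system = [m for m in messages if m["role"] == "system"]
    turns = [m for m in messages if m["role"] != "system"]
    remaining = budget - sum(count(m["content"]) for m in system)

    kept: List[Message] = []
    for message in reversed(turns):  # walk from newest to oldest
        cost = count(message["content"])
        if cost > remaining:
            break  # older turns fall out of the window
        kept.append(message)
        remaining -= cost

    # System messages are placed first; kept turns return to chronological order.
    return system + list(reversed(kept))
```

Turns that fall out of the window are simply dropped here; a fuller version might summarize them or rely on retrieval to bring relevant passages back.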
RAG & Embeddings
~$500–1,500/mo
Retrieval-Augmented Generation pipelines that give Faculty agents access to their historical corpora and institutional knowledge.
OpenAI Embeddings
OpenAI
Text embeddings for semantic search across Faculty corpora
- text-embedding-3-small for efficiency
- 1536-dimensional vectors
- Batch processing for corpus ingestion
pgvector (Supabase)
Supabase
Vector similarity search integrated with PostgreSQL
- HNSW indexing for fast retrieval
- Hybrid search (vector + full-text)
- Per-Faculty corpus isolation
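The page does not say how the vector and full-text results are combined in hybrid search. One common approach, shown here purely as an assumption, is reciprocal rank fusion: each result list contributes `1 / (k + rank)` to a chunk's score, and the chunks are re-sorted by the sum. The chunk ids and the constant `k = 60` are illustrative.

```python
from typing import Dict, List


def reciprocal_rank_fusion(rankings: List[List[str]], k: int = 60) -> List[str]:
    """Merge ranked id lists; an id scores 1 / (k + rank) in every list it appears in."""
    scores: Dict[str, float] = {}
    for ranking in rankings:
        for rank, chunk_id in enumerate(ranking, start=1):
            scores[chunk_id] = scores.get(chunk_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties are broken by id for a stable order.
    return sorted(scores, key=lambda chunk_id: (-scores[chunk_id], chunk_id))


# Hypothetical results for one Faculty's corpus.
vector_hits = ["c1", "c2", "c3"]    # nearest neighbours by embedding distance
fulltext_hits = ["c3", "c1", "c4"]  # matches from full-text search
fused = reciprocal_rank_fusion([vector_hits, fulltext_hits])
```

Chunks that appear in both lists rise to the top, which is the main appeal of fusing ranks instead of comparing raw scores from two different scales.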
LangChain Pipelines
Self-hosted
RAG orchestration and chain management
- Document loaders for PDF, EPUB, and HTML
- Chunking strategies per corpus type
- Citation tracking and source attribution
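As a rough illustration of chunking with source attribution, the sketch below splits text into overlapping fixed-size windows and records where each chunk came from, so a retrieved passage can be cited. It is written in plain Python instead of LangChain, and the `Chunk` type, the default sizes, and the character-offset citation scheme are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Chunk:
    text: str
    source: str  # e.g. a file name or work title, carried along for citation
    start: int   # character offset where the chunk begins in the source text
    end: int     # character offset just past the end of the chunk


def chunk_text(text: str, source: str, size: int = 800, overlap: int = 100) -> List[Chunk]:
    """Split text into windows of `size` characters that overlap by `overlap`."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("size must be positive and overlap must be in [0, size)")
    chunks: List[Chunk] = []
    step = size - overlap
    for start in range(0, len(text), step):
        end = min(start + size, len(text))
        chunks.append(Chunk(text[start:end], source, start, end))
        if end == len(text):
            break  # the final window already reaches the end of the text
    return chunks
```

A per-corpus strategy could then be as simple as choosing different `size` and `overlap` values for, say, dialogues and treatises.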
Audio & Speech
~$1,000–3,000/mo
Text-to-speech and speech-to-text for voice interactions with Faculty agents, including the Speaking Busts feature.
Microsoft Edge TTS
Microsoft
300+ neural voices for Faculty speech synthesis
- High-quality voices on the free tier
- 40+ languages supported
- SSML for prosody control
- Real-time streaming via WebSocket
Free
Azure Speech Services
Microsoft Azure
Premium TTS with viseme support for Speaking Busts
- 22 viseme IDs for lip-sync
- Neural voice cloning (future)
- Custom voice training
- Real-time synthesis (<200 ms)
ElevenLabs
ElevenLabs
High-fidelity voice synthesis for premium Faculty voices
- Voice cloning from samples
- Emotional expression control
- Multi-language support
Whisper STT
OpenAI / Self-hosted
Speech-to-text for voice input to Faculty
- Multi-language transcription
- Real-time streaming mode
- Speaker diarization
Audio2Face Animation
~$200–500/mo
Real-time facial animation for Speaking Busts, driven by audio and viseme data.
Viseme Processing
Mapping Azure viseme events to mouth shapes
- 22 Azure visemes → 9 mouth shapes
- 60 ms easing between frames
- Canvas2D compositing
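The reduction from 22 viseme IDs to 9 mouth shapes, with 60 ms easing between them, might look something like the sketch below. The grouping in `VISEME_TO_SHAPE` and the shape names are placeholders invented for illustration, since the actual mapping is not given here; the smoothstep easing curve and the `(offset_ms, viseme_id)` event format are likewise assumptions.

```python
from typing import Dict, List, Tuple

# Hypothetical grouping of viseme IDs 0-21 into 9 mouth shapes.
# Both the shape names and the assignments are illustrative placeholders.
VISEME_TO_SHAPE: Dict[int, str] = {
    0: "rest",
    1: "open", 2: "open", 9: "open", 11: "open",
    3: "round", 8: "round", 10: "round",
    6: "wide",
    7: "pucker",
    4: "relaxed", 5: "relaxed", 12: "relaxed", 13: "relaxed",
    14: "relaxed", 19: "relaxed", 20: "relaxed",
    15: "teeth", 16: "teeth", 17: "teeth",
    18: "lip_bite",
    21: "closed",
}

EASE_MS = 60.0  # easing duration between mouth shapes, from the list above


def ease(elapsed_ms: float, duration_ms: float = EASE_MS) -> float:
    """Smoothstep blend weight in [0, 1]; the choice of curve is an assumption."""
    t = min(max(elapsed_ms / duration_ms, 0.0), 1.0)
    return t * t * (3.0 - 2.0 * t)


def mouth_at(events: List[Tuple[float, int]], t_ms: float) -> Tuple[str, str, float]:
    """Return (from_shape, to_shape, blend) at t_ms for time-sorted viseme events."""
    previous, current, started = "rest", "rest", 0.0
    for offset_ms, viseme_id in events:
        if offset_ms > t_ms:
            break
        previous, current = current, VISEME_TO_SHAPE[viseme_id]
        started = offset_ms
    return previous, current, ease(t_ms - started)
```

A renderer would draw the `from_shape` sprite, then composite the `to_shape` sprite on top with the returned blend weight as its opacity.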
Bust Animation Pipeline
Real-time 2.5D portrait animation
- Micro-motion (2 px amplitude)
- Random blinks (3.5–7 s intervals)
- Sprite-based mouth atlas
- WebGL voxel rendering
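The idle behaviour described above (random blinks every 3.5 to 7 seconds and a 2 px micro-motion) can be sketched with two small functions. The uniform distribution for blink gaps, the sine-wave drift, and the 4-second `period_s` are assumptions chosen for the example; only the interval range and the amplitude come from the list.

```python
import math
import random
from typing import List


def blink_times(
    duration_s: float,
    rng: random.Random,
    min_gap_s: float = 3.5,
    max_gap_s: float = 7.0,
) -> List[float]:
    """Blink timestamps within duration_s, with uniformly random gaps between blinks."""
    times: List[float] = []
    t = rng.uniform(min_gap_s, max_gap_s)
    while t < duration_s:
        times.append(t)
        t += rng.uniform(min_gap_s, max_gap_s)
    return times


def micro_motion(t_s: float, amplitude_px: float = 2.0, period_s: float = 4.0) -> float:
    """Vertical offset in pixels at time t_s: a slow sine drift bounded by the amplitude."""
    return amplitude_px * math.sin(2.0 * math.pi * t_s / period_s)
```

Passing in a seeded `random.Random` keeps the blink schedule reproducible, which is handy when testing the animation loop.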
RunPod GPU (Future)
RunPod
Serverless GPU for advanced animation models
- LivePortrait model inference
- Real-time video generation
- A10G / A100 instances
Image & Video Generation
~$500–2,000/mo
AI image generation for Faculty portraits, article illustrations, and iNQ publications.
Replicate
Replicate
Serverless AI model hosting for image generation
- Stable Diffusion XL
- FLUX models
- ControlNet for style transfer
- Pay-per-inference pricing
HeyGen Avatars
HeyGen
Video avatar generation for Faculty presentations
- Talking head videos
- Custom avatar creation
- Lip-sync from audio
ComfyUI Workflows
Self-hosted / RunPod
Custom image generation pipelines for iNQ
- William Blake style LoRA
- Mucha Art Nouveau LoRA
- Batch illustration generation
Database & Storage
~$100–500/mo
PostgreSQL database, file storage, and real-time subscriptions for the platform.
Supabase Database
Supabase
PostgreSQL with extensions for the entire platform
- Row Level Security (RLS)
- pgvector for embeddings
- Real-time subscriptions
- 8 GB database (Pro plan)
Supabase Storage
Supabase
File storage for Faculty assets and user uploads
- Faculty portraits and avatars
- Audio files and transcripts
- PDF publications
- 100 GB included
AWS S3
AWS
Bulk storage for Faculty corpora and archives
- Historical text corpora
- Training data archives
- Backup storage
Edge Functions
~$50–200/mo
30+ serverless functions handling Faculty agents, webhooks, and real-time operations.
Supabase Edge Functions
Supabase
Deno-based serverless functions at the edge
- 30+ deployed functions
- faculty, ask-faculty, inquire
- matrix-webhook, social-media-post
- issue-microcredential, llm-gateway
Application Hosting
~$100–300/mo
Web hosting, CDN, and container orchestration for the platform.
GitHub Pages
GitHub
Static hosting for the Next.js frontend
- Automatic deploys via GitHub Actions
- Global CDN
- Custom domain (inquiry.institute)
Free
Fly.io Containers
Fly.io
Container hosting for Matrix, WorkAdventure, and services
- Matrix Synapse homeserver
- Element Web client
- WorkAdventure campus
- Global edge deployment
Vercel (Backup)
Vercel
Alternative hosting for preview deployments
- Preview URLs for PRs
- Edge functions
- Analytics
Communication Infrastructure
~$50–150/mo
Matrix chat server, authentication, and real-time communication.
Matrix Synapse
Self-hosted on Fly.io
Decentralized chat for Faculty Club and salons
- Federation-ready homeserver
- End-to-end encryption
- Bot integration for Faculty
- Persistent room history
Matrix & Supabase Auth
Fly.io / Supabase
Identity and chat
- Matrix Synapse (password auth)
- Supabase Auth for app and bots
- Faculty bots via Application Service
Infrastructure Architecture
┌─────────────────────────────────────────────────────────────┐
│                      INQUIRY INSTITUTE                      │
│                   Compute Infrastructure                    │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│  ┌──────────────┐    ┌──────────────┐    ┌──────────────┐   │
│  │   Frontend   │    │   Supabase   │    │    Fly.io    │   │
│  │   (GitHub    │───►│  (Database,  │───►│   (Matrix,   │   │
│  │    Pages)    │    │  Auth, Edge) │    │   WorkAdv.)  │   │
│  └──────────────┘    └──────────────┘    └──────────────┘   │
│         │                   │                   │           │
│         ▼                   ▼                   ▼           │
│  ┌──────────────────────────────────────────────────────┐   │
│  │                    Edge Functions                    │   │
│  │ faculty │ ask-faculty │ llm-gateway │ matrix-webhook │   │
│  └──────────────────────────────────────────────────────┘   │
│         │                   │                   │           │
│         ▼                   ▼                   ▼           │
│  ┌──────────────┐    ┌──────────────┐    ┌──────────────┐   │
│  │  OpenRouter  │    │   pgvector   │    │  Azure TTS   │   │
│  │  (LLM API)   │    │    (RAG)     │    │   (Speech)   │   │
│  └──────────────┘    └──────────────┘    └──────────────┘   │
│         │                   │                   │           │
│         ▼                   ▼                   ▼           │
│  ┌──────────────────────────────────────────────────────┐   │
│  │        10,000 Faculty Agents with RAG Corpora        │   │
│  └──────────────────────────────────────────────────────┘   │
│                                                             │
└─────────────────────────────────────────────────────────────┘
Monthly Cost Breakdown
| Category | Primary Provider | Low Est. | High Est. |
|---|---|---|---|
| LLM Inference | OpenRouter | $2,000 | $8,000 |
| RAG & Embeddings | OpenAI | $500 | $1,500 |
| Audio & Speech | Microsoft | $1,000 | $3,000 |
| Audio2Face Animation | Various | $200 | $500 |
| Image & Video Generation | Replicate | $500 | $2,000 |
| Database & Storage | Supabase | $100 | $500 |
| Edge Functions | Supabase | $50 | $200 |
| Application Hosting | GitHub | $100 | $300 |
| Communication Infrastructure | Self-hosted on Fly.io | $50 | $150 |
| Total Monthly Estimate | | $4,500 | $16,150 |
Estimates based on moderate usage. Actual costs vary with traffic and Faculty activity.
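As a quick check on the arithmetic, the category estimates can be summed to reproduce the totals. The figures below are copied from the table; the helper names `totals` and `share` are just for this sketch.

```python
from typing import Dict, Tuple

# (low, high) monthly estimates in USD, copied from the breakdown table.
ESTIMATES: Dict[str, Tuple[int, int]] = {
    "LLM Inference": (2_000, 8_000),
    "RAG & Embeddings": (500, 1_500),
    "Audio & Speech": (1_000, 3_000),
    "Audio2Face Animation": (200, 500),
    "Image & Video Generation": (500, 2_000),
    "Database & Storage": (100, 500),
    "Edge Functions": (50, 200),
    "Application Hosting": (100, 300),
    "Communication Infrastructure": (50, 150),
}


def totals(estimates: Dict[str, Tuple[int, int]]) -> Tuple[int, int]:
    """Sum the low and high columns separately."""
    low = sum(lo for lo, _ in estimates.values())
    high = sum(hi for _, hi in estimates.values())
    return low, high


def share(estimates: Dict[str, Tuple[int, int]], category: str) -> Tuple[float, float]:
    """Fraction of the low and high totals that one category accounts for."""
    low, high = totals(estimates)
    cat_low, cat_high = estimates[category]
    return cat_low / low, cat_high / high


low, high = totals(ESTIMATES)
print(f"${low:,} to ${high:,} per month")  # $4,500 to $16,150 per month
```

By the same calculation, LLM inference is the largest line item, at roughly 44% of the low estimate and just under 50% of the high one.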
Scaling Considerations
Cost Optimizations
- Response caching for common queries
- Model routing (smaller models for simple tasks)
- Edge TTS (free tier) for most voices
- Batch processing for corpus updates
- Reserved capacity discounts
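Response caching for common queries could be as simple as a keyed store with an expiry time, so that a repeated question to the same Faculty skips the LLM call. The sketch below is one possible shape and makes several assumptions: an in-memory dict instead of a shared store, a one-hour default TTL, and a cache key built from the Faculty id plus the lower-cased, whitespace-normalized question.

```python
import hashlib
import time
from typing import Callable, Dict, Optional, Tuple


class ResponseCache:
    """In-memory answer cache with a time-to-live (illustrative only)."""

    def __init__(self, ttl_s: float = 3600.0, clock: Callable[[], float] = time.monotonic) -> None:
        self._ttl_s = ttl_s
        self._clock = clock  # injectable so expiry can be tested without sleeping
        self._entries: Dict[str, Tuple[float, str]] = {}

    @staticmethod
    def _key(faculty_id: str, question: str) -> str:
        # Normalize case and whitespace so trivially different phrasings share a key.
        normalized = " ".join(question.lower().split())
        return hashlib.sha256(f"{faculty_id}\n{normalized}".encode("utf-8")).hexdigest()

    def get(self, faculty_id: str, question: str) -> Optional[str]:
        """Return a cached answer, or None if it is missing or expired."""
        key = self._key(faculty_id, question)
        entry = self._entries.get(key)
        if entry is None:
            return None
        stored_at, answer = entry
        if self._clock() - stored_at > self._ttl_s:
            del self._entries[key]  # drop stale entries lazily on read
            return None
        return answer

    def put(self, faculty_id: str, question: str, answer: str) -> None:
        """Store an answer under the normalized (faculty, question) key."""
        self._entries[self._key(faculty_id, question)] = (self._clock(), answer)
```

Including the Faculty id in the key matters: the same question should not return one persona's answer on behalf of another.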
Scaling Triggers
- Course enrollment increases
- Simultaneous salon sessions
- Journal publication cycles
- Marketing campaigns
- Video generation requests
Support Our Infrastructure
Help fund the computational resources that power Faculty agents, enable real-time dialogue, and keep the Institute running.