Your AI.
Your Data.
Your Infrastructure.
The open-source RAG platform that puts you in control. Multi-tenant architecture. Provider-agnostic LLMs. Real-time streaming. Deploy in 5 minutes.
$ docker compose up -d
✓ postgres ready
✓ qdrant ready
✓ redis ready
✓ minirag-api ready on :8000
$ curl -X POST localhost:8000/v1/tenants \
-H "Authorization: Bearer $ADMIN_TOKEN" \
-H "Content-Type: application/json" \
-d '{"name":"my-org"}'
{"id":"t_9k3...","name":"my-org","status":"active"}
$ curl localhost:8000/v1/chat \
-H "Authorization: Bearer $BOT_TOKEN" \
-H "Content-Type: application/json" \
-d '{"message":"How does ingestion work?"}'
{"answer":"Documents are chunked, embedded, and...","sources":[...]}
Everything you need for RAG
A complete platform for building, deploying, and managing retrieval-augmented generation chatbots.
Multi-Tenant Isolation
Complete data separation. tenant_id on every query, dedicated API tokens, and role-based access control.
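Tenant scoping is simple to sketch: every row carries a tenant_id, and every query binds the tenant resolved from the caller's API token. The table and column names below are illustrative, not MiniRAG's actual schema.

```python
import sqlite3

# Illustrative multi-tenant schema: every row carries a tenant_id,
# and every query filters on it. Names are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sources (id TEXT, tenant_id TEXT, name TEXT)")
conn.executemany(
    "INSERT INTO sources VALUES (?, ?, ?)",
    [("s1", "t_acme", "handbook.pdf"), ("s2", "t_other", "faq.md")],
)

def list_sources(tenant_id: str) -> list[str]:
    # tenant_id comes from the caller's token, never from user input,
    # so one tenant can never read another tenant's rows.
    rows = conn.execute(
        "SELECT name FROM sources WHERE tenant_id = ?", (tenant_id,)
    ).fetchall()
    return [name for (name,) in rows]
```

Because the filter is applied server-side on every query, a leaked document ID from another tenant still returns nothing.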
RAG Pipeline
Ingest text, URLs, PDFs, and DOCX files. Auto-chunk, embed with any model, and vector-store in Qdrant.
Provider-Agnostic LLMs
OpenAI, Anthropic, Google, Ollama — switch providers per bot profile. Powered by LiteLLM.
Real-Time Streaming
Server-Sent Events with structured protocol: sources → content deltas → completion. Sub-second first token.
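A client consuming that stream just splits on blank lines and dispatches on the event name. The event names and payload shapes below ("sources", "delta", "done") are assumptions for illustration; check the API reference for the exact protocol.

```python
import json

# Hypothetical sample of the SSE stream: sources first, then
# content deltas, then a completion event.
raw_stream = (
    'event: sources\ndata: {"sources": ["doc-1"]}\n\n'
    'event: delta\ndata: {"text": "Documents are "}\n\n'
    'event: delta\ndata: {"text": "chunked..."}\n\n'
    'event: done\ndata: {}\n\n'
)

def parse_sse(stream: str):
    """Yield (event, payload) pairs from a raw SSE stream."""
    for block in stream.strip().split("\n\n"):
        event, data = None, None
        for line in block.splitlines():
            if line.startswith("event: "):
                event = line[len("event: "):]
            elif line.startswith("data: "):
                data = json.loads(line[len("data: "):])
        yield event, data

# Concatenate the deltas to rebuild the streamed answer.
answer = "".join(
    payload["text"] for event, payload in parse_sse(raw_stream) if event == "delta"
)
```

Rendering sources before the first delta is what lets a UI show citations immediately, before the answer finishes streaming.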
Embeddable Widget
One script tag. Shadow DOM isolation. CSS custom properties for theming. Any website, 30 seconds.
Admin Dashboard
Manage bot profiles, sources, chat history, and analytics. Built-in chat testing with streaming preview.
Webhooks & Events
Real-time notifications for source.ingested, source.failed, chat.message. HMAC-SHA256 signed payloads.
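Verifying a signed delivery is a few lines with the standard library. The header name and hex encoding below are assumptions; adapt them to the actual delivery format.

```python
import hashlib
import hmac

def sign(secret: bytes, payload: bytes) -> str:
    # HMAC-SHA256 over the raw request body, hex-encoded (assumed format)
    return hmac.new(secret, payload, hashlib.sha256).hexdigest()

def verify(secret: bytes, payload: bytes, signature: str) -> bool:
    # compare_digest avoids leaking the match position via timing
    return hmac.compare_digest(sign(secret, payload), signature)

secret = b"whsec_example"  # hypothetical shared webhook secret
payload = b'{"event": "source.ingested", "id": "src_123"}'
sig = sign(secret, payload)
```

Always verify against the raw bytes of the request body; re-serializing parsed JSON can reorder keys and break the signature.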
Auto-Refresh
Scheduled URL re-ingestion — hourly, daily, or weekly. Keep your knowledge base current automatically via ARQ cron.
Usage Analytics
Cost tracking per model, token usage breakdown, user feedback analytics, and CSV export.
How the pipeline works
Every query flows through a battle-tested retrieval-augmented generation pipeline. Embeddings, vector search, and LLM completion in one seamless request.
User queries hit the FastAPI gateway, which embeds the question, performs a similarity search against Qdrant, retrieves the top-k chunks, and streams an LLM completion back via Server-Sent Events. PostgreSQL stores metadata; Redis handles rate limiting and caching.
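The retrieval step above can be sketched end to end with a toy embedder. Real embeddings come from a model and the search runs in Qdrant; this bag-of-words stand-in only illustrates the data flow from query to top-k context.

```python
import math

def embed(text: str) -> dict[str, float]:
    # Toy bag-of-words "embedding" standing in for a real model
    vec: dict[str, float] = {}
    for word in text.lower().split():
        vec[word] = vec.get(word, 0.0) + 1.0
    return vec

def cosine(a: dict[str, float], b: dict[str, float]) -> float:
    dot = sum(a[w] * b.get(w, 0.0) for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

chunks = [
    "documents are chunked and embedded during ingestion",
    "webhooks notify you when ingestion finishes",
    "the widget embeds the chat on any website",
]

def retrieve(query: str, k: int = 2) -> list[str]:
    # Rank stored chunks by similarity to the query, keep top-k
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]

top = retrieve("how does ingestion work")
# the retrieved chunks are then packed into the LLM prompt as context
```

Everything after this step is prompt assembly: the top-k chunks become the context the LLM grounds its streamed answer in.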
Three steps to production
From zero to a fully functional RAG chatbot in minutes, not weeks.
Deploy
One command. Five minutes. You’re live. Clone, configure, docker compose up. PostgreSQL, Qdrant, Redis, and FastAPI all orchestrated. Bootstrap your first tenant with a single API call.
docker compose up -d
Ingest
Feed your knowledge. Text, URLs, PDFs — auto-chunked and embedded. Upload documents through the API or admin dashboard. MiniRAG chunks content intelligently, generates embeddings, and stores vectors in Qdrant. Set up auto-refresh for URLs that change.
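The chunking step is the usual fixed-size-with-overlap baseline. The sizes below are illustrative, and production chunkers typically split on sentence or token boundaries rather than raw characters.

```python
def chunk_text(text: str, size: int = 40, overlap: int = 10) -> list[str]:
    """Split text into fixed-size chunks that overlap by `overlap` chars."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than chunk size")
    step = size - overlap
    # Each chunk starts `step` characters after the previous one,
    # so consecutive chunks share `overlap` characters.
    return [text[i : i + size] for i in range(0, max(len(text) - overlap, 1), step)]

doc = (
    "MiniRAG splits long documents into overlapping chunks so that "
    "context near chunk boundaries is not lost during retrieval."
)
chunks = chunk_text(doc)
```

The overlap is what keeps a sentence straddling a boundary retrievable from at least one chunk.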
POST /v1/sources
Chat
Ask anything. Get grounded answers with source citations. Your users ask questions. RAG retrieves relevant context. The LLM generates answers with source citations, streamed in real-time via SSE.
POST /v1/chat
Everything you need in one dashboard
Manage your entire RAG platform from a single interface. No CLI required.
Bot Profiles
Configure LLM provider, model, system prompt, and temperature per bot. Test conversations in real-time.
Source Management
Upload text, URLs, PDFs, DOCX. Monitor ingestion status, chunk counts, and embedding progress.
Chat History
Browse all conversations. View source citations, token usage, and user feedback per message.
Webhook Configuration
Set delivery URLs, select events, view delivery logs. HMAC-SHA256 signature verification built in.
Usage Analytics
Track costs per model, token consumption over time, and export detailed reports as CSV.
User & Role Management
Invite users, assign roles (admin/member), manage API tokens per tenant.
Embed anywhere in seconds
Drop one script tag into any website. Shadow DOM keeps your styles clean.
<script
src="https://your-host/dashboard/widget/minirag-widget.js"
data-bot-id="YOUR_BOT_ID"
data-api-url="https://your-host"
data-api-token="YOUR_TOKEN">
</script>
Shadow DOM isolation means no CSS conflicts. Style it with CSS custom properties to match your brand.
Security at every layer
MiniRAG doesn't compromise on security. Four layers of protection for your data and credentials.
Password Hashing
Memory-hard algorithm that resists GPU and ASIC attacks. Industry-recommended for credential storage.
Encryption at Rest
LLM API keys and sensitive credentials encrypted before database storage. Keys never stored in plaintext.
Webhook Signatures
Every webhook delivery is signed. Verify payload integrity and authenticity before processing events.
Session Tokens
Stateless authentication with signed tokens. No server-side session storage required.
From zero to production in 5 minutes
Choose your deployment method. Every path gets you a fully functional RAG platform with multi-tenant isolation, vector search, and streaming chat.
git clone https://github.com/mrwind-up-bird/mini-chat-rag.git
cd mini-chat-rag
cp .env.example .env
docker compose up -d
Frequently Asked Questions
Everything you need to know about MiniRAG.
Who owns my data and the code?
You own everything. MiniRAG runs on your infrastructure — your data never leaves your servers. No vendor lock-in, no per-query pricing, no usage limits. Full source code under MIT.
Is MiniRAG production-ready?
Yes. MiniRAG is battle-tested with 129 automated tests (pytest + Newman), async FastAPI for high concurrency, connection pooling, and proper error handling. It runs PostgreSQL, Qdrant, and Redis — all production-grade infrastructure.
Which LLM providers are supported?
Any provider compatible with the OpenAI API format via LiteLLM: OpenAI, Anthropic, Google Gemini, Ollama (local models), Azure OpenAI, and more. Switch providers per bot profile without code changes.
How do I embed the chat widget?
Add one <script> tag to any website. The widget loads in a Shadow DOM for complete style isolation — no CSS conflicts. Customize colors, position, and behavior with CSS custom properties and data attributes.
What can I manage from the admin dashboard?
Bot profile management, document source ingestion, chat history with feedback tracking, webhook configuration, usage analytics with cost breakdowns, and user role management. All behind a glassmorphism UI with built-in chat testing.
How is MiniRAG secured?
Four layers: Argon2id for password hashing, Fernet (AES-128-CBC) for encrypting LLM API keys at rest, HMAC-SHA256 for signed webhook deliveries, and JWT (HS256) for stateless session tokens. Multi-tenant isolation ensures complete data separation.
What do I need to run it?
Docker and Docker Compose. The stack includes PostgreSQL (structured data), Qdrant (vector storage), Redis (caching and task queues), and the FastAPI application. Minimum 2GB RAM recommended. All services are containerized.
Can I develop locally without running everything in Docker?
Yes. Use the manual setup: create a Python virtualenv, install dependencies, run the supporting services with Docker Compose, and start the FastAPI server with hot-reload. Full development docs in the README.
Deploy your RAG chatbot today
Open-source. MIT licensed. Production-ready. Join developers building intelligent chatbots with MiniRAG.