Complyra

Complyra = Comply + RA(G) — Enterprise compliance knowledge assistant powered by Retrieval-Augmented Generation.

Complyra is a production-ready, multi-tenant enterprise RAG system with human-in-the-loop approval workflow, RBAC, full audit logging, knowledge base management, and cloud deployment automation — built for compliance-sensitive environments.

Why Complyra?

In compliance-critical industries (finance, healthcare, legal), AI-generated answers cannot be blindly trusted. Complyra solves this by adding:

Approval gates — Human reviewers approve/reject AI answers before release
Output policy guards — Automatic detection of leaked secrets, credentials, and sensitive patterns
Per-document sensitivity controls — Fine-grained approval rules at the document level
Complete audit trail — Every question, answer, approval, and action is logged and exportable

Architecture

graph LR
    subgraph Frontend
        UI[React SPA<br/>TypeScript + Vite]
    end

    subgraph API["API Layer"]
        GW[FastAPI Gateway]
        Auth[JWT + RBAC]
    end

    subgraph WF["LangGraph Workflow"]
        RW[Query Rewrite]
        R[Retrieve]
        J[Judge<br/>ReAct Loop]
        D[Draft Answer]
        P[Policy Gate]
        A[Approval]
    end

    subgraph Data["Data Stores"]
        QD[(Qdrant<br/>Vector DB)]
        PG[(PostgreSQL)]
        RD[(Redis Queue)]
    end

    subgraph LLM["LLM Layer"]
        OL[Ollama / OpenAI / Gemini]
        EMB[Embeddings<br/>BGE / OpenAI / Gemini]
    end

    subgraph Obs["Observability"]
        PR[Prometheus]
        GR[Grafana]
        LS[LangSmith]
        SE[Sentry]
    end

    UI -->|HTTP / SSE| GW
    GW --> Auth --> RW
    RW --> R --> J
    J -->|sub-questions| R
    J --> D --> P
    P -->|sensitive| A --> PG
    P -->|safe| GW
    R --> QD
    R --> EMB
    D --> OL
    GW -->|Audit| PG
    GW -->|Ingest Jobs| RD
    GW -->|/metrics| PR
    PR --> GR
    RW -.->|Traces| LS
    GW -.->|Errors| SE

See docs/workflow-design.md for the complete workflow state machine and sequence diagrams.

Features

Category	Feature	Description
RAG	Multi-tenant retrieval	Tenant-scoped document ingestion and vector search via `X-Tenant-ID`
RAG	ReAct retrieval loop	Judge → sub-question → re-retrieve for complex queries
RAG	Query rewriting	LLM-powered query rewriting for better retrieval
RAG	Pluggable embeddings	SentenceTransformer (BGE), OpenAI, or Gemini — switchable via config
RAG	Hybrid search	Dense + sparse vector search for improved recall
Workflow	Human-in-the-loop approval	LangGraph workflow with configurable approval gates
Workflow	Approval policy chain	Document override → tenant policy → global setting
Workflow	Output policy guard	Regex-based detection of secrets, API keys, credentials
Workflow	SSE streaming	Real-time token-by-token chat via `POST /chat/stream`
KB	Document management	Upload, sensitivity levels, bulk operations, preview
KB	Per-document approval override	`always` / `never` / `inherit` per document
KB	Async ingestion	Redis + RQ worker for background document processing
KB	OCR support	Tesseract-based OCR for scanned documents and images
Security	RBAC	Three roles: `admin`, `auditor`, `user`
Security	Tenant isolation	Row-level and vector-level data isolation
Ops	Audit trail	Full event logging with search and CSV export
Ops	LangSmith tracing	Optional LLM observability with zero code overhead
Ops	Prometheus + Grafana	Custom metrics: query latency, embedding throughput, queue depth
Ops	Sentry integration	Error tracking and alerting
Infra	Terraform IaC	AWS infrastructure with OPA policy gate
Infra	Docker ARM64	Multi-arch container builds for ECS Fargate

Quick Start

Docker Compose (recommended)

git clone https://github.com/weiguangli-io/complyra.git
cd complyra
cp .env.example .env
docker compose up --build -d

Service	URL
Web UI	http://localhost:5173
API Docs	http://localhost:8000/docs (Swagger UI)
Health	http://localhost:8000/api/health/live
Prometheus	http://localhost:9090
Grafana	http://localhost:3000

Default credentials: demo / demo123

See docs/getting-started.md for a step-by-step tutorial with screenshots.

Local Development

# Backend
python3 -m venv .venv && source .venv/bin/activate
pip install -r requirements-dev.txt
cp .env.example .env
alembic upgrade head
uvicorn app.main:app --host 0.0.0.0 --port 8000 --reload

# Frontend
cd web && npm install && npm run dev

# Worker (in a separate terminal)
rq worker ingest --url redis://localhost:6379/0

Project Structure

complyra/
├── app/
│   ├── api/routes/          # REST endpoints (auth, chat, documents, approvals, audit, ...)
│   ├── core/                # Config, security, logging, metrics, middleware
│   ├── db/                  # SQLAlchemy models, session, CRUD operations
│   ├── models/              # Pydantic request/response schemas
│   ├── services/            # Business logic (workflow, LLM, retrieval, policy, ...)
│   └── workers/             # Background job processors (ingestion)
├── web/                     # React + TypeScript frontend (Vite)
├── tests/                   # 500+ unit and integration tests
├── terraform/               # AWS infrastructure as code
├── scripts/aws/             # Deployment automation scripts
├── docs/                    # Comprehensive documentation
├── docker-compose.yml       # Local development stack
├── Dockerfile               # Multi-stage API container (non-root, healthcheck)
└── .github/workflows/       # CI pipeline (lint, test, build)

API Endpoints

Method	Path	Auth	Description
`POST`	`/api/auth/login`	—	Authenticate and get JWT
`POST`	`/api/auth/logout`	—	Clear session
`POST`	`/api/chat/`	user+	Synchronous chat (JSON)
`POST`	`/api/chat/stream`	user+	Streaming chat (SSE)
`POST`	`/api/ingest/file`	admin	Upload document for ingestion
`GET`	`/api/ingest/jobs/{id}`	admin	Check ingestion job status
`GET`	`/api/documents/`	admin/auditor	List documents (paginated, filterable)
`GET`	`/api/documents/{id}`	admin/auditor	Document details
`PATCH`	`/api/documents/{id}`	admin	Update sensitivity / approval override
`DELETE`	`/api/documents/{id}`	admin	Soft-delete document
`POST`	`/api/documents/bulk`	admin	Bulk delete / update sensitivity
`GET`	`/api/documents/{id}/preview`	user+	Preview original file
`GET`	`/api/approvals/`	admin/auditor	List pending approvals
`POST`	`/api/approvals/{id}/decision`	admin/auditor	Approve / reject answer
`GET`	`/api/approvals/{id}/result`	user+	Get approval result
`GET`	`/api/audit/`	admin/auditor	Query audit logs
`GET`	`/api/audit/export`	admin	Export audit logs (CSV)
`GET`	`/api/tenants/`	admin	List tenants
`POST`	`/api/tenants/`	admin	Create tenant
`GET`	`/api/tenants/{id}/policy`	admin	Get tenant approval policy
`PUT`	`/api/tenants/{id}/policy`	admin	Update approval policy
`GET`	`/api/users/`	admin	List users
`POST`	`/api/users/`	admin	Create user
`GET`	`/api/health/live`	—	Liveness probe
`GET`	`/api/health/ready`	—	Readiness probe (DB + Qdrant + LLM)

See docs/api-reference.md for complete request/response schemas, error codes, and curl examples.

Configuration

All settings use the APP_ prefix. See .env.example for the full list.

Variable	Default	Description
`APP_EMBEDDING_PROVIDER`	`sentence-transformers`	`sentence-transformers`, `openai`, or `gemini`
`APP_EMBEDDING_MODEL`	`BAAI/bge-small-en-v1.5`	Local SentenceTransformer model name
`APP_OPENAI_API_KEY`	(empty)	Required when `embedding_provider=openai`
`APP_GEMINI_API_KEY`	(empty)	Required when `embedding_provider=gemini`
`APP_EMBEDDING_DIMENSION`	`384`	Vector dimension (384 BGE / 1536 OpenAI / 768 Gemini)
`APP_LLM_PROVIDER`	`ollama`	`ollama`, `openai`, or `gemini`
`APP_OLLAMA_MODEL`	`qwen2.5:3b-instruct`	Ollama LLM model
`APP_REQUIRE_APPROVAL`	`true`	Global approval gate default
`APP_OUTPUT_POLICY_ENABLED`	`true`	Enable output policy checks
`APP_QUERY_REWRITE_ENABLED`	`true`	Enable LLM query rewriting
`APP_REACT_RETRIEVAL_ENABLED`	`true`	Enable ReAct retrieval loop
`APP_LANGSMITH_TRACING`	`false`	Enable LangSmith tracing
`APP_DATABASE_URL`	`sqlite:///./data/app.db`	Database connection string
`APP_HYBRID_SEARCH_ENABLED`	`true`	Enable hybrid (dense + sparse) search

See docs/configuration.md for the complete reference with all 60+ settings.

Testing

pip install -r requirements-dev.txt
PYTHONPATH=. pytest tests/ -v --cov=app --cov-report=term-missing

The test suite includes 500+ tests covering:

Unit tests for all service modules
Route-level API tests
Integration tests for the full workflow
Approval policy resolution tests
Document lifecycle tests

Linting & Formatting

black --check app/
isort --check app/
ruff check app/

Deployment (AWS)

graph TB
    subgraph VPC["AWS VPC"]
        subgraph Public["Public Subnet"]
            ALB[Application Load Balancer]
        end
        subgraph Private["Private Subnet"]
            API[ECS: complyra-api]
            WEB[ECS: complyra-web]
            WKR[ECS: complyra-worker]
            RDS[(RDS PostgreSQL)]
            EC[(ElastiCache Redis)]
            QD[Qdrant EC2]
        end
    end
    R53[Route 53] --> ALB
    ALB --> API
    ALB --> WEB
    API --> RDS
    API --> EC
    API --> QD
    WKR --> RDS
    WKR --> EC
    WKR --> QD

Prepare AWS account — docs/aws-account-onboarding.md
Terraform plan + OPA policy gate — ./scripts/aws/07_terraform_plan.sh
Build & push ARM64 images — ./scripts/aws/03_build_and_push.sh
Deploy ECS services — ./scripts/aws/09_deploy_services_from_release.sh
Run smoke tests — ./scripts/aws/05_smoke_test.sh

Full runbook: docs/aws-deployment.md | Architecture: docs/deployment-architecture.md

Tech Stack

Component	Technology	Version
Backend	FastAPI + Uvicorn	0.115.8
Workflow	LangGraph	0.2.55
Database	PostgreSQL + SQLAlchemy	16 / 2.0
Vector DB	Qdrant	1.12.6
Queue	Redis + RQ	7 / 1.16
LLM	Ollama / OpenAI / Gemini	multi-provider
Embeddings	SentenceTransformers / OpenAI / Gemini	pluggable
Frontend	React + TypeScript + Vite	18 / 5.x
Observability	Prometheus + Grafana + LangSmith + Sentry	-
IaC	Terraform + OPA/Conftest	1.9.x
CI/CD	GitHub Actions	-

Documentation

Document	Description
Getting Started	Step-by-step tutorial for first-time users
Architecture	System design, layered backend, data isolation
Workflow Design	LangGraph state machine, ReAct loop, approval chain
API Reference	Complete endpoint docs with schemas and examples
Database Schema	ER diagrams, table descriptions, indexes
Configuration	All 60+ environment variables explained
Streaming API	SSE protocol, event types, client examples
Deployment Architecture	AWS infrastructure, scaling, monitoring
AWS Deployment	Step-by-step production deployment
Operations Runbook	Health checks, SLOs, incident response
Release & Rollback	Versioning, blue-green deploy, rollback
Frontend Contributing	React code standards, i18n, accessibility
UI Design Tokens	Typography, colors, spacing, animations

Contributing

See CONTRIBUTING.md for development setup, code style, and PR process.

Security

See SECURITY.md for our security policy and vulnerability reporting.

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Complyra

Why Complyra?

Architecture

Features

Quick Start

Docker Compose (recommended)

Local Development

Project Structure

API Endpoints

Configuration

Testing

Linting & Formatting

Deployment (AWS)

Tech Stack

Documentation

Contributing

Security

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.github		.github
alembic		alembic
app		app
docs		docs
infra		infra
ops		ops
scripts		scripts
tests		tests
web		web
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
alembic.ini		alembic.ini
docker-compose.yml		docker-compose.yml
entrypoint.sh		entrypoint.sh
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Complyra

Why Complyra?

Architecture

Features

Quick Start

Docker Compose (recommended)

Local Development

Project Structure

API Endpoints

Configuration

Testing

Linting & Formatting

Deployment (AWS)

Tech Stack

Documentation

Contributing

Security

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages