Contributing to vLLM Semantic Router

Thank you for your interest in contributing to the vLLM Semantic Router project! This guide will help you get started with development and contributing to the project.

Development Setup
Prerequisites
Building the Project
Running Tests
Development Workflow
Code Style and Standards
- Code Quality Checks
Submitting Changes
Project Structure

Development Setup

Prerequisites

Before you begin, ensure you have the following installed:

Rust (latest stable version)
Go 1.24.1 or later
Hugging Face CLI (pip install huggingface_hub)
Make (for build automation)
Python 3.8+ (Optional: for training and testing)

Initial Setup

Clone the repository:

git clone https://github.com/vllm-project/semantic-router.git
cd semantic-router

Download required models:
```
make download-models
```
This downloads the pre-trained classification models from Hugging Face.

Install Python dependencies(Optional):

# For training and development
pip install -r requirements.txt

# For end-to-end testing
pip install -r e2e-tests/requirements.txt

Building the Project

The project consists of multiple components that need to be built in order:

Build Everything

make build

Build Individual Components

Rust library (Candle binding):
```
make rust
```
Go router:
```
make build-router
```

Running the System

Start Envoy proxy (in one terminal):
```
make run-envoy
```
Start the semantic router (in another terminal):
```
make run-router
```

Running Tests

Unit Tests

Test Rust bindings:
```
make test-binding
```
Test Go semantic router:
```
make test-semantic-router
```

Test individual classifiers:

make test-category-classifier
make test-pii-classifier
make test-jailbreak-classifier

Manual Testing

Test different routing scenarios:

# Test model auto-selection
make test-prompt

# Test PII detection
make test-pii

# Test prompt guard (jailbreak detection)
make test-prompt-guard

# Test tools auto-selection
make test-tools

End-to-End Tests

Ensure both Envoy and the router are running, then:

# Run all e2e tests
python e2e-tests/run_all_tests.py

# Run specific test
python e2e-tests/00-client-request-test.py

# Run tests matching a pattern
python e2e-tests/run_all_tests.py --pattern "0*-*.py"

# Check if services are running
python e2e-tests/run_all_tests.py --check-only

The test suite includes:

Basic client request tests
Envoy ExtProc interaction tests
Router classification tests
Semantic cache tests
Category-specific tests
Metrics validation tests

Development Workflow

Making Changes

Create a feature branch:

git checkout -b feature/your-feature-name

Make your changes following the project structure and coding standards.
Build and test:
```
make clean
make build
make test
```

Run end-to-end tests:

# Start services
make run-envoy &
make run-router &

# Run tests
python e2e-tests/run_all_tests.py

Commit your changes:

Commit your changes with a clear message, making sure to sign off on your work using the -s flag. This is required by the project's Developer Certificate of Origin (DCO).
```
git add .
git commit -s -m "feat: add your feature description"
```

Debugging

Envoy logs: Check the terminal running make run-envoy for detailed request/response logs
Router logs: Check the terminal running make run-router for classification and routing decisions
Rust library: Use RUST_LOG=debug environment variable for detailed Rust logs
Go library: Use SR_LOG_LEVEL=debug environment variable for detailed Go logs

Code Style and Standards

Code Quality Checks

Before submitting a PR, please run the pre-commit hooks to ensure code quality and consistency. These checks are mandatory and will be automatically run on every commit once installed.

Step 1: Install pre-commit tool

# Using pip (recommended)
pip install pre-commit

# Or using conda
conda install -c conda-forge pre-commit

# Or using homebrew (macOS)
brew install pre-commit

Step 2: Install pre-commit hooks for this repository

# Install pre-commit hooks
pre-commit install

# Run all checks
pre-commit run --all-files

Go Code

Follow standard Go formatting (gofmt)
Use meaningful variable and function names
Add comments for exported functions and types
Write unit tests for new functionality
Keep Go modules tidy: Run go mod tidy in the appropriate directory after adding or removing dependencies
- For candle-binding: cd candle-binding && go mod tidy
- For src/semantic-router: cd src/semantic-router && go mod tidy
- The CI will automatically check that go.mod and go.sum files are tidy using make check-go-mod-tidy

Rust Code

Follow Rust formatting (cargo fmt)
Use cargo clippy for linting
Handle errors appropriately with Result types
Document public APIs

Python Code

Follow PEP 8 style guidelines
Use type hints where appropriate
Write docstrings for functions and classes

Submitting Changes

Ensure all tests pass:
```
make test
python e2e-tests/run_all_tests.py
```
The make test command includes:
- go vet for static analysis
- check-go-mod-tidy for Go module dependency verification
- Unit tests for all components
Create a pull request with:
- Clear description of changes
- Reference to any related issues
- Test results and validation steps
Address review feedback promptly

Project Structure

├── candle-binding/          # Rust library for BERT classification
├── src/semantic-router/     # Go implementation of the router
├── src/training/           # Model training scripts
├── e2e-tests/              # End-to-end test suite
├── config/                 # Configuration files
├── docs/                   # Documentation
├── deploy/                 # Deployment configurations
├── Makefile               # Build automation
└── requirements.txt       # Python dependencies

Key Components

Candle Binding: Rust library providing BERT-based classification
Semantic Router: Go service implementing the Envoy ExtProc interface
Training Scripts: Python scripts for fine-tuning classification models
Configuration: YAML files defining routing rules and model endpoints

Getting Help

Check the documentation
Review existing issues and pull requests
Ask questions in discussions or create a new issue

License

By contributing to this project, you agree that your contributions will be licensed under the same license as the project (Apache 2.0).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Contributing to vLLM Semantic Router

Table of Contents

Development Setup

Prerequisites

Initial Setup

Building the Project

Build Everything

Build Individual Components

Running the System

Running Tests

Unit Tests

Manual Testing

End-to-End Tests

Development Workflow

Making Changes

Debugging

Code Style and Standards

Code Quality Checks

Go Code

Rust Code

Python Code

Submitting Changes

Project Structure

Key Components

Getting Help

License

FilesExpand file tree

CONTRIBUTING.md

Latest commit

History

CONTRIBUTING.md

File metadata and controls

Contributing to vLLM Semantic Router

Table of Contents

Development Setup

Prerequisites

Initial Setup

Building the Project

Build Everything

Build Individual Components

Running the System

Running Tests

Unit Tests

Manual Testing

End-to-End Tests

Development Workflow

Making Changes

Debugging

Code Style and Standards

Code Quality Checks

Go Code

Rust Code

Python Code

Submitting Changes

Project Structure

Key Components

Getting Help

License