CodeBoarding/awesome-architecture-mds

Awesome Architecture MDs

Architecture diagrams for popular open-source repos. Auto-generated, markdown, drop-in-ready for your coding agent.

graph LR
  subgraph 1["Orchestration & Lifecycle"]
    1__1_1["CLI & Execution Orchestrator"]
    1__1_2["Environment & Tooling Manager"]
    1__1_3["Project Discovery & Context Builder"]
    1__1_4["Agentic Analysis Pipeline"]
    1__1_5["System Registry & Persistence"]
  end
  subgraph 2["Static Analysis Core"]
    2__2_1["Analysis Orchestrator & Cache Manager"]
    2__2_2["LSP Client & Language Adapters"]
    2__2_3["Semantic Graph Engine"]
    2__2_4["Architectural Clustering & Validation"]
  end
  subgraph 3["Change Analysis"]
    3__3_1["Change Detection & Context Manager"]
    3__3_2["Incremental State Orchestrator"]
    3__3_3["Analysis Serialization Layer"]
  end
  subgraph 4["AI Reasoning Layer"]
    4__4_1["Agentic Orchestration & Execution"]
    4__4_2["Structural Analysis & Health Engine"]
    4__4_3["Prompt Factory & Provider Abstraction"]
    4__4_4["Context Extraction & Persistence"]
  end
  subgraph 5["Health & Documentation"]
    5__5_1["Health Metrics Evaluator"]
    5__5_2["Documentation & Visualization Generator"]
    5__5_3["Unified Data Serializer"]
    5__5_4["Structural Analysis Engine"]
    5__5_5["Incremental Update Manager"]
    5__5_6["CLI & Repository Interface"]
  end
  subgraph 6["Persistence & Tooling"]
    6__6_1["Local Persistence Engine"]
    6__6_2["Repository Interaction Toolkit"]
    6__6_3["Execution Monitoring & Telemetry"]
  end
  1 -->|"Triggers change detection to determine the scope of the current run"| 3
  1 -->|"Initiates the full project scan and graph construction"| 2
  3 -->|"Provides diff context to enable partial updates of the call graph"| 2
  2 -->|"Provides structural graph data for semantic interpretation"| 4
  4 -->|"Uses the toolkit to read specific source code files for deep analysis"| 6
  2 -->|"Provides the call graph for metric calculation and diagram generation"| 5
  4 -->|"Provides semantic descriptions and component purposes for the final reports"| 5
  6 -->|"Provides access to cached analysis data and the local database"| 2
  1 -->|"calls"| 4
  1 -->|"calls"| 5
  1 -->|"calls"| 6
  2 -->|"calls"| 1
  2 -->|"calls"| 3
  3 -->|"calls"| 1
  3 -->|"calls"| 4
  4 -->|"calls"| 1
  4 -->|"calls"| 2
  5 -->|"calls"| 2
  5 -->|"calls"| 4
  6 -->|"calls"| 1
  3__3_1 -->|"Provides the `ChangeSet` and method-level diffs along with the cached `StaticAnalysisResults` from previous runs to initiate the delta computation."| 3__3_2
  3__3_2 -->|"Supplies the updated and validated `AnalysisInsights` to be transformed into structured LLM prompts and response models."| 3__3_3
  3__3_2 -->|"calls"| 3__3_1
  4__4_1 -->|"Requests formatted system and user messages tailored to the active LLM provider."| 4__4_3
  4__4_1 -->|"Invokes repository tools to ground LLM reasoning in actual source code and metadata."| 4__4_4
  4__4_1 -->|"Consumes code clusters and health insights to provide high-level architectural summaries."| 4__4_2
  4__4_2 -->|"Reads raw static analysis data and persists graph-based clustering results for future incremental runs."| 4__4_4
  4__4_2 -->|"calls"| 4__4_1
  4__4_4 -->|"calls"| 4__4_2
  6__6_3 -->|"Persists aggregated run statistics, token usage, and job status updates to the DuckDB/SQLite backend"| 6__6_1
  6__6_2 -->|"Queries the cache using file content hashes to determine if previous analysis results can be reused"| 6__6_1
  6__6_3 -->|"Instruments tool execution to track latency and frequency of filesystem interactions during an analysis run"| 6__6_2
  2__2_1 -->|"Orchestrates the startup and shutdown of language servers required for the current project"| 2__2_2
  2__2_1 -->|"Triggers the graph construction process and provides cached symbol data to minimize redundant LSP queries"| 2__2_3
  2__2_2 -->|"Supplies raw symbol locations and reference batches used to build graph edges"| 2__2_3
  2__2_3 -->|"Provides the completed call graph and hierarchy as input for community detection and logical grouping"| 2__2_4
  2__2_4 -->|"Returns the final clustered results and change metrics to be persisted in the analysis cache"| 2__2_1
  2__2_1 -->|"calls"| 2__2_4
  2__2_2 -->|"calls"| 2__2_1
  2__2_3 -->|"calls"| 2__2_2
  5__5_6 -->|"triggers"| 5__5_5
  5__5_5 -->|"coordinates with"| 5__5_4
  5__5_4 -->|"provides component boundaries to"| 5__5_1
  5__5_1 -->|"sends findings to"| 5__5_3
  5__5_4 -->|"feeds final architectural map into"| 5__5_3
  5__5_3 -->|"provides structured JSON to"| 5__5_2
  5__5_1 -->|"calls"| 5__5_4
  5__5_1 -->|"calls"| 5__5_5
  5__5_2 -->|"calls"| 5__5_3
  5__5_2 -->|"calls"| 5__5_6
  5__5_3 -->|"calls"| 5__5_4
  5__5_3 -->|"calls"| 5__5_5
  5__5_4 -->|"calls"| 5__5_5
  5__5_5 -->|"calls"| 5__5_3
  5__5_6 -->|"calls"| 5__5_2
  1__1_1 -->|"Verifies and prepares the local environment (LSP servers, Node.js) before any analysis begins."| 1__1_2
  1__1_1 -->|"Initiates the repository scanning process to identify the codebase structure and languages."| 1__1_3
  1__1_1 -->|"Triggers the high-level analysis workflow once the environment and project context are established."| 1__1_4
  1__1_3 -->|"Provides the structured project context and file-to-language mappings required for agentic reasoning."| 1__1_4
  1__1_4 -->|"Persists analysis results and retrieves cached signatures to support incremental analysis and reduce LLM token usage."| 1__1_5
  1__1_1 -->|"Uses the registry to dynamically load available tools and plugins during initialization."| 1__1_5
  1__1_3 -->|"calls"| 1__1_1
  1__1_3 -->|"calls"| 1__1_2
  1__1_3 -->|"calls"| 1__1_5
  1__1_4 -->|"calls"| 1__1_1
  1__1_4 -->|"calls"| 1__1_3
  1__1_5 -->|"calls"| 1__1_3

Above: CodeBoarding diagrammed by CodeBoarding. Every repo in the atlas below is mapped the same way.
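
If an agent or script needs the diagram as data instead of a picture, the edge lines can be read with a small parser. The following is a minimal sketch, not part of CodeBoarding itself: the function names `parse_edges` and `dependency_map` are hypothetical, and it assumes every edge follows the `SRC -->|"label"| DST` form used in the diagram above.

```python
import re
from collections import defaultdict

# Matches edges written as:  SRC -->|"label"| DST
EDGE_RE = re.compile(r'^\s*(\S+)\s*-->\|"(.*)"\|\s*(\S+)\s*$')


def parse_edges(mermaid_text):
    """Return (source, label, target) tuples for every edge line found."""
    edges = []
    for line in mermaid_text.splitlines():
        match = EDGE_RE.match(line)
        if match:
            edges.append(match.groups())
    return edges


def dependency_map(edges):
    """Map each source node to the sorted list of nodes it points to."""
    deps = defaultdict(set)
    for source, _label, target in edges:
        deps[source].add(target)
    return {node: sorted(targets) for node, targets in deps.items()}


# Three edges copied from the diagram above (node ids only, no subgraphs).
sample = '''graph LR
  1 -->|"Initiates the full project scan and graph construction"| 2
  2 -->|"Provides structural graph data for semantic interpretation"| 4
  1 -->|"calls"| 4
'''

edges = parse_edges(sample)
deps = dependency_map(edges)
print(deps)
```

Lines that are not edges, such as `graph LR`, `subgraph`, and `end`, are skipped because they do not match the pattern.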

Drop into your agent

Onboard your AI agent. Paste the markdown into your ARCHITECTURE.md so the agent onboards in fewer tokens.

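# Cursor / Claude Code / Aider: load a repo's architecture as context
# (pbcopy copies to the macOS clipboard; on other systems, pipe to your clipboard tool or redirect to a file)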
curl -sL https://raw.githubusercontent.com/<org>/awesome-architecture-mds/main/vllm/architecture.md \
  | pbcopy

Or reference it directly in your prompt:

@https://github.com/codeboarding/awesome-architecture-mds/blob/main/vllm/on_boarding.md
Using the architecture above, implement X without breaking module boundaries.
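
The same download can be scripted when the file should land in the working tree instead of the clipboard. This is a rough sketch under stated assumptions: `raw_url` and `save_architecture` are hypothetical helper names, the URL layout simply mirrors the curl example above, and the `org` argument is left for you to supply because the source does not fill it in.

```python
from pathlib import Path
from urllib.request import urlopen

RAW_BASE = "https://raw.githubusercontent.com"


def raw_url(org, project, filename="architecture.md", branch="main"):
    """Build the raw-file URL using the layout shown in the curl example."""
    return f"{RAW_BASE}/{org}/awesome-architecture-mds/{branch}/{project}/{filename}"


def _http_fetch(url):
    """Download a URL and return its body as text."""
    with urlopen(url, timeout=30) as response:
        return response.read().decode("utf-8")


def save_architecture(org, project, dest="ARCHITECTURE.md", fetch=_http_fetch):
    """Fetch a project's architecture markdown and write it to dest.

    `fetch` is injectable so the function can be exercised without network access.
    """
    text = fetch(raw_url(org, project))
    path = Path(dest)
    path.write_text(text, encoding="utf-8")
    return path
```

Calling `save_architecture(org, "vllm")` with your organization name would write `ARCHITECTURE.md` into the current directory, where most coding agents can pick it up as context.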

Contents

AI & machine learning

LLM serving & inference

  • bella-openapi - AI gateway routing and orchestrating requests to multiple large language models. (Java)
  • BitNet - Efficient 1-bit LLM inference runtime with Python orchestration and C++/CUDA kernels. (Python)
  • FastChat - Distributed platform for serving and evaluating chat-based large language models. (Python)
  • gpt4free - Proxy client library routing requests across free LLM providers. (Python)
  • koila - PyTorch wrapper providing lazy evaluation to prevent GPU out-of-memory errors. (Python)
  • lightning-thunder - Source-to-source compiler for optimizing PyTorch model execution. (Python)
  • litellm - Unified LLM client and proxy supporting 100+ providers via OpenAI format. (Python)
  • mlc-llm - Universal LLM compilation and deployment across hardware and platforms. (Python)
  • mlx-lm - Apple MLX framework toolkit for running and fine-tuning language models. (Python)
  • mlx-vlm - Apple MLX toolkit for serving and interacting with vision-language models. (Python)
  • ollama-python - Official Python client library for the Ollama local LLM runtime API. (Python)
  • openai-python - Official Python client library for the OpenAI API and models. (Python)
  • tinygrad - Minimalist deep learning framework between PyTorch and karpathy's micrograd. (Python)
  • transformer-deploy - Efficient transformer model deployment on ONNX Runtime, TensorRT, and Triton. (Python)
  • tvm - Apache TVM compiler stack for deploying deep learning models across hardware. (Python)
  • vllm - High-throughput memory-efficient inference and serving engine for LLMs. (Python)

Agent frameworks & orchestration

  • ABCs-of-control - Conversational agent that parses queries, calls LLMs, and optionally invokes external tools. (Python)
  • AdalFlow - Framework for building, training, and evaluating LLM-powered applications and pipelines. (Python)
  • ag2 - Multi-agent LLM orchestration framework derived from AutoGen with tool and code execution. (Python)
  • agno - Lightweight framework for building multimodal AI agents with memory and tool use. (Python)
  • arcade-ai - Platform for developing and serving authenticated tools for AI agents. (Python)
  • Archon - Agent orchestration system that builds and refines other AI agents automatically. (Python)
  • AutoAgent - Fully autonomous LLM agent framework with CLI, REPL, and tool management. (Python)
  • autogen - Microsoft framework for orchestrating multi-agent LLM conversations and task automation. (Python)
  • Bard - Conversational AI client for interacting with the Google Bard language model. (Python)
  • browser-use - AI agent framework giving LLMs control of a web browser for automation. (Python)
  • ChatGPT - Unofficial CLI and API wrappers for interacting with ChatGPT conversational models. (Python)
  • ChatterBot - Machine learning conversational bot engine with pluggable storage and logic adapters. (Python)
  • cognee - Memory and knowledge-graph layer for AI agents with ingestion and retrieval APIs. (Python)
  • composio - Platform giving AI agents authenticated access to hundreds of external tools. (Python)
  • contextgem - LLM-powered framework for structured information extraction from documents. (Python)
  • crawl4ai - LLM-friendly web crawler producing clean markdown and structured data for RAG. (Python)
  • crewAI - Framework for orchestrating role-playing autonomous AI agent crews. (Python)
  • dataproduct-mcp - MCP server exposing data products to LLMs with governance and guardrails. (Python)
  • dspy - Framework for programming with LLMs via modular signatures and automatic prompt optimization. (Python)
  • Flowise - Drag-and-drop UI for building customized LLM flows and agent pipelines. (TypeScript)
  • genai-processors - Asynchronous pipeline library for building generative AI data processing workflows. (Python)
  • gpt-researcher - Autonomous agent that conducts comprehensive research with LLMs and citations. (Python)
  • graphrag - Microsoft knowledge-graph-enhanced retrieval-augmented generation pipeline. (Python)
  • haystack - Modular framework for building LLM-powered search and RAG applications. (Python)
  • instructor - Library for extracting structured Pydantic outputs from large language models. (Python)
  • kor - Schema-driven LLM library for structured information extraction. (Python)
  • kotaemon - Open-source RAG chatbot with a clean UI for document question answering. (Python)
  • langchain - Framework for building applications powered by language model chains and agents. (Python)
  • langextract - LLM-agnostic structured information extraction pipeline library. (Python)
  • langflow - Visual low-code UI for experimenting with LangChain workflows. (Python)
  • langgraph - Library for building stateful, multi-actor LLM agent applications as graphs. (Python)
  • llama_index - Data framework for indexing and querying data with LLMs for RAG. (Python)
  • llm-graph-builder - LLM-powered pipeline that builds Neo4j knowledge graphs from unstructured data. (Python)
  • lmql - Query language for large language models with constrained generation. (Python)
  • MaiBot - Modular AI conversational agent with pluggable NLP and communication adapters. (Python)
  • mcp-agent - Framework for building AI agents with MCP integration and workflow patterns. (Python)
  • mcp-agent-discussion - Multi-agent system orchestrating complex AI workflows with MCP integration. (Python)
  • mcp-context-forge - API gateway for managing MCP servers, tools, and resources across protocols. (Python)
  • mcp-use - Library for building LLM agents that use MCP tools and servers. (Python)
  • mem0 - Memory layer for AI agents integrating LLMs, embeddings, and vector stores. (Python)
  • MetaGPT - Multi-agent framework assigning software roles to collaborative LLM agents. (Python)
  • mongodb-mcp-server - MCP server exposing MongoDB operations as tools for LLM agents. (TypeScript)
  • Open-Assistant - Open-source conversational AI assistant with human feedback training pipeline. (Python)
  • openai-agents-python - OpenAI's lightweight SDK for building multi-agent workflows with tools and handoffs. (Python)
  • OpenCopilot-PikoAi - Terminal-based AI copilot CLI for task automation via natural language. (Python)
  • openr - LLM reasoning framework combining reinforcement learning and process reward models. (Python)
  • OS-Copilot - Self-improving AI agent framework for autonomous OS task execution. (Python)
  • paper-search-mcp - MCP server aggregating academic paper search across multiple research databases. (Python)
  • pipecat - Open-source framework for building voice and multimodal conversational AI agents. (Python)
  • podcastfy - Converts content into AI-generated multilingual conversational podcasts. (Python)
  • poml - Prompt-oriented markup language engine for structured LLM prompt authoring. (Python)
  • pywit - Python SDK for the Wit.ai natural language understanding service. (Python)
  • quivr - Retrieval-augmented generation framework for building second brain chatbots. (Python)
  • ragbits - Framework for building and evaluating retrieval-augmented generation systems. (Python)
  • redis-vl-python - Redis vector library for building LLM-powered semantic search and RAG applications. (Python)
  • sample-app-aoai-chatGPT - Microsoft sample chat app integrating Azure OpenAI with enterprise data sources. (Python)
  • Scrapegraph-ai - LLM-powered web scraping library generating extraction pipelines from prompts. (Python)
  • SuperAGI - Open-source autonomous AI agent framework for building and deploying agents. (Python)
  • symbolicai - Neuro-symbolic framework combining LLMs with symbolic reasoning operations. (Python)
  • thinkgpt - LLM orchestrator adding memory, long-term context, and reasoning capabilities. (Python)
  • trt-llm-rag-linux - Linux reference RAG application using NVIDIA TensorRT-LLM for inference. (Python)
  • txtai - All-in-one embeddings database for semantic search and LLM workflows. (Python)
  • whatsapp-chatgpt - ChatGPT-powered WhatsApp bot integration. (?)

AI coding tools

  • aideml - Autonomous ML engineer that iteratively writes and improves data science code. (Python)
  • auto-code-rover - Autonomous program-repair agent that navigates repos and patches buggy code. (Python)
  • AutoGPT - Autonomous GPT-powered agent platform for chaining goals into executable tasks. (Python)
  • cocode - CLI tool combining code analysis with AI for summarization and vulnerability detection. (Python)
  • crush - Charm's terminal AI coding assistant for interactive code generation. (Go)
  • DeepCode - AI-powered code analysis and generation agent for document and code processing. (Python)
  • deepwiki-open - FastAPI+Next.js tool auto-generating interactive wikis and chat for GitHub repos. (TypeScript)
  • dyad - Local AI app builder with IPC-based UI for code generation workflows. (TypeScript)
  • gpt-engineer - CLI tool that generates entire codebases from natural-language prompts. (Python)
  • gpt_engineer - AI agent that writes and iterates on full software projects from prompts. (Python)
  • hai-build - AI-driven software development lifecycle platform with orchestration services. (Python)
  • llama.vim - Vim/Neovim plugin providing LLM-powered local code completion. (VimScript)
  • open-interpreter - LLM-powered interpreter that runs code locally via natural language instructions. (Python)
  • python_code_generator - Multi-agent AI code generation platform with web interface. (Python)
  • ScreenCoder - Tool converting UI screenshots into front-end code using vision-language models. (Python)
  • stagehand - AI framework for browser automation combining deterministic code and natural language. (TypeScript)
  • SWE-agent - Autonomous software engineering agent that fixes bugs using language models. (Python)
  • SWE-ReX - Remote execution framework for SWE-agent running code in sandboxed environments. (Python)

ML research & models

  • addons - TensorFlow Addons extending core TensorFlow with custom ops, layers, and utilities. (Python)
  • aitextgen - Python tool for training and generating text with GPT-2 transformer models. (Python)
  • albumentations - Fast and flexible image augmentation library for computer vision training pipelines. (Python)
  • AlphaPy - Machine learning framework for quantitative finance and sports analytics workflows. (Python)
  • antialiased-cnns - PyTorch implementations of antialiased convolutional networks using blur-pooling layers. (Python)
  • AutoDL-Projects - Research toolkit for neural architecture search and hyperparameter optimization experiments. (Python)
  • avalanche - PyTorch-based continual learning research library with strategies and benchmarks. (Python)
  • BasicTS - PyTorch benchmark and toolbox for time series forecasting deep learning models. (Python)
  • bayesian_meta_learning - Research code exploring Bayesian meta-learning strategies for few-shot tasks. (Python)
  • BCEmbedding - Bilingual and crosslingual embedding and reranking models for retrieval and RAG. (Python)
  • benchmark_VAE - Unified implementation and benchmarking of variational autoencoder generative models. (Python)
  • bert-extractive-summarizer - BERT-based extractive text summarization library with embedding-based sentence selection. (Python)
  • BERTopic - Topic modeling library combining transformer embeddings with c-TF-IDF clustering. (Python)
  • bindsnet - Spiking neural network simulation library built on PyTorch for biological modeling. (Python)
  • boruta_py - Python implementation of the Boruta all-relevant feature selection algorithm. (Python)
  • causal-learn - Python package for causal discovery algorithms and structure learning. (Python)
  • CBAM.PyTorch - PyTorch implementation of the Convolutional Block Attention Module for CNNs. (Python)
  • ChatGLM-6B - Open bilingual chat model based on the GLM architecture with fine-tuning scripts. (Python)
  • chatterbox - Open-source text-to-speech and voice conversion deep learning model. (Python)
  • ChatTTS - Generative text-to-speech model optimized for expressive dialogue synthesis. (Python)
  • ckiptagger - Chinese NLP toolkit for word segmentation, POS tagging, and named entity recognition. (Python)
  • ClassyVision - Facebook's modular end-to-end PyTorch image classification training framework. (Python)
  • clip-rt - Real-time CLIP-based visual understanding for robotics and perception. (Python)
  • clothes-virtual-try-on - Deep learning virtual clothing try-on system with Gradio interface. (Python)
  • CogDL - Extensive graph deep learning research toolkit with many benchmarks and models. (Python)
  • compare_gan - Research framework for training and benchmarking GAN variants at scale. (Python)
  • CompressAI - PyTorch library for learned neural image and video compression models. (Python)
  • ConvNeXt-V2 - PyTorch implementation of ConvNeXt V2 vision backbone models. (Python)
  • CosyVoice - Multilingual zero-shot text-to-speech synthesis model with voice cloning. (Python)
  • CTGAN - Conditional GAN for generating synthetic tabular data resembling real datasets. (Python)
  • datasets - Hugging Face library for loading, processing, and sharing ML datasets. (Python)
  • Deep-Live-Cam - Real-time deepfake face-swap pipeline for live webcam video streams. (Python)
  • deepface - Lightweight face recognition and attribute analysis framework wrapping many models. (Python)
  • deepgaze - Computer vision library for head pose, gaze, and saliency estimation. (Python)
  • denoiser - Real-time audio speech denoising using an encoder-decoder deep neural network. (Python)
  • Dense_OpticalFlow_and_CNN_based_Motion_Segmentation_and_Object_Recognition - Video pipeline combining dense optical flow with CNNs for motion segmentation. (Python)
  • detectron2 - Facebook's PyTorch object detection and segmentation research platform. (Python)
  • DeTikZify - Generative model converting sketches and images into editable TikZ vector code. (Python)
  • dgl - Deep Graph Library providing graph neural network primitives for PyTorch and others. (Python)
  • diffusers - Hugging Face library of state-of-the-art diffusion models for image and audio generation. (Python)
  • DocLayout-YOLO - YOLO-based deep learning model for document layout analysis. (Python)
  • DPR - Facebook Dense Passage Retrieval bi-encoder for open-domain question answering. (Python)
  • EasyOCR - Ready-to-use OCR toolkit supporting 80+ languages with deep learning models. (Python)
  • EasyRec - Configurable deep learning recommendation model training and serving framework. (Python)
  • efficientnet - Keras/TensorFlow implementation of EfficientNet convolutional image classification models. (Python)
  • elephas - Distributed deep learning with Keras on Apache Spark clusters. (Python)
  • ERNIE - Large-scale pretrained NLP model from Baidu with knowledge integration. (Python)
  • face_recognition - Simple Python library for face recognition with CLI tools. (Python)
  • fairseq - PyTorch sequence modeling toolkit for translation, summarization, and language modeling. (Python)
  • fast-bert - Easy-to-use library for BERT-based NLP classification and NER. (Python)
  • fastdup - Tool for finding duplicates and anomalies in large image datasets. (Python)
  • FastVideo - High-performance toolkit for training and inference of video generation models. (Python)
  • flair - PyTorch-based NLP library for state-of-the-art text embeddings and tagging. (Python)
  • flow_matching - Reference implementation of flow matching generative models for text and images. (Python)
  • fold - TensorFlow library for processing dynamically-structured deep learning inputs. (Python)
  • FoolNLTK - Chinese natural language processing toolkit for segmentation, POS, and NER. (Python)
  • gae - Graph autoencoder reference implementation for learning graph representations. (Python)
  • geatpy - Python evolutionary algorithms framework with population-based optimization. (Python)
  • gemma_pytorch - Official PyTorch implementation of Google Gemma open language models. (Python)
  • gflownet - Framework for GFlowNet generative flow networks on structured data. (Python)
  • GFPGAN - GAN-based algorithm for real-world blind face restoration. (Python)
  • gplearn - Scikit-learn compatible genetic programming for symbolic regression. (Python)
  • gradslam - Differentiable SLAM library for 3D reconstruction in PyTorch. (Python)
  • graph4nlp - Library for NLP tasks using graph neural networks. (Python)
  • GraphGym - Platform for systematic design and evaluation of graph neural networks. (Python)
  • gym - OpenAI toolkit providing reinforcement learning environments and the Env API. (Python)
  • gym-pybullet-drones - PyBullet-based Gym environments for quadcopter reinforcement learning. (Python)
  • HanLP - Multilingual NLP toolkit supporting tokenization, parsing, and named entity recognition. (Python)
  • hiddenlayer - Neural network graph visualization library for PyTorch and TensorFlow. (Python)
  • hls4ml - Translates deep learning models to FPGA high-level synthesis code. (Python)
  • hopfield-layers - PyTorch implementation of modern Hopfield networks as attention layers. (Python)
  • Hunyuan3D-2.1 - Tencent model pipeline for generating 3D shapes and textures from inputs. (Python)
  • jax - Composable transformations of NumPy programs with autodiff and JIT for accelerators. (Python)
  • jieba - Popular Chinese text segmentation library with dictionary-based tokenization. (Python)
  • keras - High-level deep learning API running on TensorFlow, JAX, or PyTorch. (Python)
  • keras-resnet - Keras implementation of ResNet architectures for image classification. (Python)
  • Keras-TextClassification - Keras toolkit with many text classification model implementations. (Python)
  • khaiii - Kakao Korean morphological analyzer using a CNN-based model. (Python)
  • KoBERT - Korean BERT pretrained language model with tokenizer and utilities. (Python)
  • kornia - Differentiable computer vision library for PyTorch with classical CV operators. (Python)
  • labelme - Qt-based image polygonal annotation tool for computer vision datasets. (Python)
  • langdetect - Python port of Google's language-detection library. (Python)
  • LaTeX-OCR - Neural OCR model that converts images of math formulas to LaTeX. (Python)
  • libra - High-level facade library that automates end-to-end ML workflows. (Python)
  • Lidar_AI_Solution - NVIDIA LiDAR and multi-modal perception pipeline for autonomous driving. (Python)
  • lightning - PyTorch Lightning framework for organized, scalable deep learning training. (Python)
  • lightweight-gan - Minimal implementation of lightweight GAN for one-GPU image generation. (Python)
  • LightZero - MCTS-based reinforcement learning framework with AlphaZero and MuZero. (Python)
  • llama - Inference code for Meta's Llama family of open foundation language models. (Python)
  • llama3 - Reference inference pipeline for Meta's Llama 3 language models. (Python)
  • LLaVA - Multimodal LLM combining vision and language for visual instruction following. (Python)
  • ManiSkill - GPU-parallelized robotic manipulation simulation and benchmark suite. (Python)
  • metric-learn - Scikit-learn compatible Python library for supervised metric learning. (Python)
  • micro_diffusion - Compact latent diffusion transformer implementation with training pipeline. (Python)
  • MinerU - PDF and document extraction toolkit converting content to machine-readable formats. (Python)
  • ml-cvnets - Apple computer vision network training library for mobile models. (Python)
  • ml-fastvit - Apple FastViT hybrid vision transformer reference implementation. (Python)
  • ml4a - Machine learning for artists library bundling creative deep learning models. (Python)
  • MLBox - Automated machine learning library for preprocessing and model stacking. (Python)
  • mmdetection - Modular PyTorch toolbox for object detection and instance segmentation research. (Python)
  • model2vec - Toolkit for distilling static embedding models from transformer encoders. (Python)
  • models - Collection of reference neural network model implementations and training pipelines. (Python)
  • nano-llama31 - Minimal from-scratch reference implementation of the Llama 3.1 model. (Python)
  • neat-python - Python implementation of the NeuroEvolution of Augmenting Topologies (NEAT) algorithm. (Python)
  • nerfacc - Accelerated neural radiance fields library with efficient volumetric ray marching. (Python)
  • nltk - Comprehensive natural language processing toolkit with corpora and algorithms. (Python)
  • nsfw_model - Pretrained deep learning model for classifying NSFW images and video. (Python)
  • omnizart - Automatic music transcription toolkit with deep learning models. (Python)
  • once-for-all - Train one elastic supernet that specializes into efficient subnets for deployment. (Python)
  • Open3D-ML - Open3D extension for 3D machine learning tasks on point clouds and meshes. (Python)
  • OpenNE - Toolkit of graph network embedding algorithms for representation learning. (Python)
  • openWakeWord-cpp - Real-time wake word detection engine with streaming audio processing. (C++)
  • PaddleOCR - PaddlePaddle toolkit for multilingual optical character recognition and document understanding. (Python)
  • penzai - JAX research toolkit treating neural networks as manipulable pytrees. (Python)
  • Personae - Reinforcement and supervised learning experiments for financial market trading. (Python)
  • PGL - PaddlePaddle graph learning framework for GNN development. (Python)
  • PGPortfolio - Deep reinforcement learning framework for cryptocurrency portfolio management. (Python)
  • phonemizer - Multilingual text-to-phoneme conversion library for speech applications. (Python)
  • pke - Python keyphrase extraction toolkit with unsupervised and supervised methods. (Python)
  • Pointnet2_PyTorch - PyTorch implementation of PointNet++ for 3D point cloud deep learning. (Python)
  • poker_ai - Poker AI research codebase with counterfactual regret minimization training. (Python)
  • pseudo - Deep learning prototype of graph neural networks for scientific machine learning. (Python)
  • PVT - Pyramid Vision Transformer implementation for various computer vision tasks. (Python)
  • pygod - Graph anomaly detection library built on PyTorch Geometric. (Python)
  • pymarl - PyTorch framework for multi-agent reinforcement learning research. (Python)
  • pyserini - Python toolkit for reproducible information retrieval research with dense/sparse retrieval. (Python)
  • pytorch - Deep learning framework with dynamic computation graphs and GPU acceleration. (Python)
  • PyTorch-Encoding - PyTorch toolkit for semantic segmentation with synchronized batch normalization. (Python)
  • pytorch3d - PyTorch library for 3D deep learning on meshes, point clouds and volumes. (Python)
  • pytorch_geometric - PyTorch library for deep learning on graphs and irregular structures. (Python)
  • PyTorch_YOLOv4 - PyTorch implementation of the YOLOv4 object detection model. (Python)
  • qlib - AI-oriented quantitative investment research platform with reinforcement learning. (Python)
  • Real-ESRGAN - Practical image and video super-resolution using enhanced GAN models. (Python)
  • Real-Time-Voice-Cloning - Real-time voice cloning toolkit using speaker encoder and neural vocoder. (Python)
  • RecLearn - Modular recommender systems research library for deep learning models. (Python)
  • recommenders - Microsoft toolkit with examples and best practices for building recommender systems. (Python)
  • rembg - Tool to remove backgrounds from images using deep learning segmentation models. (Python)
  • rerankers - Unified Python interface for document reranking models used in retrieval pipelines. (Python)
  • rf-detr - Real-time DETR-based object detection framework with modular components. (Python)
  • RLBench - Robot learning benchmark and environment for reinforcement learning research. (Python)
  • ROMP - Deep learning pipeline for 3D multi-person pose and shape estimation from images. (Python)
  • rsl_rl - Fast PyTorch reinforcement learning library focused on robotics training loops. (Python)
  • sam - PyTorch implementation of Sharpness-Aware Minimization optimizer for better generalization. (Python)
  • Sapiens - Meta foundation model for human-centric vision tasks like pose and segmentation. (Python)
  • sdfstudio - Unified framework for neural implicit surface reconstruction and rendering. (Python)
  • Semi-supervised-learning - Unified PyTorch codebase for semi-supervised and imbalanced learning algorithms. (Python)
  • ShuffleNet-Series - Reference implementations of ShuffleNet and related efficient neural architectures. (Python)
  • skip-thoughts - Reference implementation of skip-thought sentence embedding vectors. (Python)
  • skrub - Data cleaning and feature engineering library for tabular machine learning. (Python)
  • sktime - Unified Python framework for machine learning with time series data. (Python)
  • snntorch - PyTorch-based framework for training and simulating spiking neural networks. (Python)
  • solo-learn - Self-supervised learning methods library in PyTorch Lightning. (Python)
  • spaCy - Industrial-strength natural language processing library in Python and Cython. (Python)
  • sparse_attention - OpenAI reference implementation of sparse attention mechanisms for transformers. (Python)
  • spikingjelly - PyTorch-based deep learning framework for spiking neural networks. (Python)
  • sru - Simple recurrent unit PyTorch implementation with CUDA acceleration. (Python)
  • stable-diffusion-tensorflow - TensorFlow and Keras implementation of the Stable Diffusion image generation model. (Python)
  • stable-ts - Whisper wrapper providing stable word-level timestamps for audio transcription. (Python)
  • super-resolution - Reference PyTorch implementations of image super-resolution deep learning models. (Python)
  • supervision - Reusable computer vision utilities for detection, tracking, and visualization. (Python)
  • Synchronized-BatchNorm-PyTorch - Synchronized batch normalization PyTorch module for multi-GPU distributed training. (Python)
  • tensorflow - End-to-end open-source machine learning platform for research and production. (Python)
  • tensorflow-DeepFM - TensorFlow implementation of the DeepFM factorization machine recommendation model. (Python)
  • TensorFlowTTS - Real-time state-of-the-art speech synthesis toolkit built on TensorFlow. (Python)
  • TextGrapher - Tool for converting raw text into interactive knowledge graphs. (Python)
  • tf_unet - TensorFlow implementation of the U-Net convolutional network for image segmentation. (Python)
  • theseus - Differentiable nonlinear optimization library for robotics and computer vision. (Python)
  • THULAC-Python - Chinese lexical analyzer for segmentation and part-of-speech tagging. (Python)
  • TimeMixer - Time series forecasting model using decomposable multiscale mixing architecture. (Python)
  • tods - Automated machine learning system for outlier detection on time series data. (Python)
  • torchgan - PyTorch-based framework for designing and training generative adversarial networks. (Python)
  • torchgfn - PyTorch library for generative flow network research and implementation. (Python)
  • torchsde - Differentiable stochastic differential equation solvers for PyTorch. (Python)
  • torchstat - Lightweight neural network analyzer reporting FLOPs, parameters, and memory usage. (Python)
  • torchsurv - PyTorch library for deep learning survival analysis. (Python)
  • trackers - Object tracking library implementing SORT and DeepSORT algorithms. (Python)
  • TradingGym - Reinforcement learning environment for backtesting and developing trading strategies. (Python)
  • transformers - State-of-the-art pretrained models for NLP, vision, and audio from Hugging Face. (Python)
  • TTS - Deep learning toolkit for text-to-speech with pretrained voice models. (Python)
  • uis-rnn - Unbounded interleaved-state recurrent neural network for speaker diarization. (Python)
  • ultralytics - YOLO computer vision framework for object detection, segmentation, and classification. (Python)
  • unidiffuser - Unified multi-modal diffusion model for generating images, text, and joint samples. (Python)
  • vit-pytorch - PyTorch implementations of vision transformer variants and related models. (Python)
  • vits - Conditional variational autoencoder with adversarial learning for end-to-end speech synthesis. (Python)
  • VLA-OS - Research framework for vision-language-action foundation models in robotics. (Python)
  • webdataset - High-performance I/O library for PyTorch using tar archives for training data. (Python)
  • whisper - OpenAI speech recognition model for multilingual audio transcription and translation. (Python)
  • WildGS-SLAM - Gaussian splatting-based SLAM system for 3D mapping and camera pose tracking. (Python)
  • xlstm - Extended LSTM architecture research toolkit for language modeling. (Python)
  • YOLO_tensorflow - TensorFlow implementation of the YOLO real-time object detection network. (Python)
  • yolov5 - PyTorch implementation of YOLOv5 real-time object detection model. (Python)
  • Yolov5-deepsort-inference - Real-time object detection and tracking pipeline combining YOLOv5 with DeepSORT. (Python)

Training, evaluation & guardrails

  • agentdojo - Benchmark suite for evaluating LLM agent robustness against prompt-injection attacks. (Python)
  • AIX360 - IBM toolkit providing algorithms and metrics for AI explainability and interpretability. (Python)
  • Alien - Active learning toolkit for iteratively selecting informative samples and training models. (Python)
  • async_rlhf - Asynchronous reinforcement learning from human feedback training with DPO and PPO. (Python)
  • baxbench - Benchmark system for evaluating security of code generated by large language models. (Python)
  • beir - Heterogeneous benchmark for zero-shot evaluation of information retrieval models. (Python)
  • chatarena - Multi-agent language game environment for evaluating LLMs in interactive settings. (Python)
  • deepeval - Unit-testing framework for evaluating and benchmarking LLM outputs with metrics. (Python)
  • DeepSpeed - Microsoft library optimizing large-scale distributed deep learning training and inference. (Python)
  • DiCE - Microsoft library generating diverse counterfactual explanations for classifiers. (Python)
  • dlrover - Distributed deep learning training orchestrator with elastic scheduling and fault tolerance. (Python)
  • finetuner - Jina AI Cloud library for fine-tuning deep learning embedding models. (Python)
  • fitlog - ML experiment logging tool with Git integration and web dashboard. (Python)
  • GaLore - Memory-efficient LLM training via gradient low-rank projection. (Python)
  • ImageReward - Reward model for evaluating and fine-tuning text-to-image generation. (Python)
  • inspect_ai - LLM evaluation framework for building and running safety and capability evals. (Python)
  • invariant - Policy engine and language for analyzing and enforcing LLM agent behavior. (Python)
  • invariant-gateway - LLM proxy gateway with guardrails, monitoring, and multi-provider routing. (Python)
  • jiant - NLP experiment framework for multitask and transfer learning benchmarks. (Python)
  • LAMA - Framework for probing language models for factual and commonsense knowledge. (Python)
  • lighteval - Hugging Face LLM evaluation suite with configurable task registry. (Python)
  • llm-guard - Content-scanning toolkit providing input/output safety scanners for LLMs. (Python)
  • matharena - Evaluation framework for testing LLM performance on mathematical problems. (Python)
  • Megatron-LM - NVIDIA framework for large-scale distributed language model training. (Python)
  • mlflow - Open-source platform for managing the end-to-end machine learning lifecycle. (Python)
  • Olive - Hardware-aware AI model optimization toolkit with pluggable compression techniques. (Python)
  • optuna - Hyperparameter optimization framework with define-by-run API and pruning. (Python)
  • PaddleSlim - PaddlePaddle model compression toolkit with pruning, quantization, and NAS. (Python)
  • piq - PyTorch image quality assessment metrics collection for model evaluation. (Python)
  • pyreft - Library for fine-tuning LLMs with representation finetuning interventions. (Python)
  • pytorch-lightning - Lightweight PyTorch wrapper organizing training code for scale and reproducibility. (Python)
  • RagaAI-Catalyst - Platform for LLM observability, evaluation, and experiment tracking. (Python)
  • rexmex - Recommender system evaluation metrics library for machine learning researchers. (Python)
  • RL - RLHF training framework for large language models with distributed computing. (Python)
  • ROLL - Distributed RLHF training framework built on Ray for LLM post-training. (Python)
  • Ruli - Research toolkit for machine unlearning and privacy attack experiments. (Python)
  • safe-rlhf - Safe reinforcement learning from human feedback framework for language model alignment. (Python)
  • smollm3_finetune - Fine-tuning scripts and utilities for the SmolLM3 small language model. (Python)
  • SWE-bench - Benchmark evaluating language models on real-world GitHub issue resolution. (Python)
  • SWEBench - Benchmark for evaluating software engineering tasks with code generation models. (Python)
  • TextBrewer - Knowledge distillation toolkit for compressing NLP models from teacher to student. (Python)
  • ToolFuzz - Fuzzing framework for testing tools used by AI agents like LangChain and AutoGen. (Python)
  • trojai-submission-all - Repository aggregating TrojAI challenge submissions for AI trojan detection. (Python)
  • uncertainty-toolbox - Toolbox for predictive uncertainty quantification, calibration, and visualization. (Python)
  • unsloth - Fast LLM fine-tuning library with optimized kernels and memory efficiency. (Python)
  • verl - Volcano Engine reinforcement learning library for post-training large language models. (Python)
  • vizier - Google's scalable black-box optimization service for hyperparameter tuning. (Python)
  • zenml - Extensible MLOps framework for creating production-ready machine learning pipelines. (Python)

Data & analytics

ETL & workflow orchestration

  • airflow - Platform to programmatically author, schedule, and monitor DAG-based data workflows. (Python)
  • BayerCLAW - AWS-based workflow orchestrator for running containerized bioinformatics pipelines. (Python)
  • bonobo - Lightweight Python ETL framework for building functional data transformation graphs. (Python)
  • bytewax - Python stream processing framework built on top of Rust's Timely Dataflow. (Python)
  • celery - Distributed task queue for running asynchronous background jobs with message brokers. (Python)
  • conductor - Netflix distributed workflow orchestration engine for microservices. (Java)
  • django-celery-beat - Database-backed periodic task scheduler for Celery managed through Django admin. (Python)
  • faust - Python stream processing library for Kafka with declarative agents. (Python)
  • fugue - Unified distributed computing interface for Spark, Dask, and Ray workflows. (Python)
  • lea - Minimalist SQL-based data transformation tool that orchestrates DAGs. (Python)
  • metaflow - Netflix human-centric framework for building and deploying data science workflows. (Python)
  • prefect - Modern Python workflow orchestration engine for data pipelines. (Python)
  • quix-streams - Python streaming framework for Kafka-based real-time data pipelines. (Python)
  • redbeat - Redis-backed scheduler for Celery enabling dynamic periodic task management and persistence. (Python)
  • redun - Expressive workflow framework using functional reactive programming for task orchestration. (Python)
  • rq-scheduler - Job scheduler extension for Redis Queue enabling periodic and future-dated tasks. (Python)
  • saga - Saga pattern implementation for distributed transaction orchestration across services. (Python)
  • snakemake - Python-based workflow management system for scientific reproducible pipelines. (Python)
  • SpiffWorkflow - Python-based workflow engine implementing BPMN business process management. (Python)
  • streamparse - Python tools for running and managing Apache Storm topologies. (Python)
  • submitit - Python tool for submitting and managing jobs on SLURM clusters. (Python)
  • taskiq - Python asynchronous distributed task queue inspired by Celery. (Python)

Databases & storage

  • aiomysql - Asynchronous MySQL driver for Python asyncio applications with connection pooling. (Python)
  • aiosqlite - Asynchronous wrapper around Python's sqlite3 for use in asyncio code. (Python)
  • btree - In-memory B-tree data structure implementation for ordered key storage. (Python)
  • djongo - Django ORM connector translating relational queries into MongoDB operations. (Python)
  • godror - Go database/sql driver for Oracle Database using ODPI-C. (Go)
  • influxdb-python - Official Python client library for InfluxDB time-series database. (Python)
  • mongo-python-driver - Official Python driver for connecting to and querying MongoDB databases. (Python)
  • orator - ActiveRecord-style ORM for Python inspired by Laravel's Eloquent. (Python)
  • orm - Async ORM for Python built on SQLAlchemy Core with typed models. (Python)
  • piccolo - Async Python ORM and query builder supporting multiple database backends. (Python)
  • pokedex - Relational database of Pokemon data with CLI for export and queries. (Python)
  • psycopg2 - PostgreSQL database adapter for Python with full DB-API 2.0 compliance. (Python)
  • PyHive - Python DBAPI and SQLAlchemy dialect for Hive, Presto and Trino. (Python)
  • pymemcache - Comprehensive pure-Python memcached client library. (Python)
  • pymodm - Object-document mapper for MongoDB providing declarative model definitions. (Python)
  • python-driver - Official Python driver for Apache Cassandra and DataStax Enterprise clusters. (Python)
  • python-irodsclient - Python client library for the iRODS data management system. (Python)
  • python-oracledb - Python driver for Oracle Database with async support and connection pooling. (Python)
  • redis-py - Official Python client library for the Redis key-value store. (Python)
  • sqlalchemy - Python SQL toolkit and Object Relational Mapper for database abstraction. (Python)
  • sqlalchemy-crud-plus - Enhanced CRUD operations helper library built on top of SQLAlchemy. (Python)
  • theine - High-performance Python caching library with adaptive eviction policies. (Python)
  • tidb - Distributed SQL database compatible with MySQL protocol and horizontally scalable. (Go)
  • valkey-py - Python client library for the Valkey key-value store with cluster support. (Python)
  • valkey-timeseries - Time series data extension module for the Valkey key-value store. (Rust)

Data processing & analysis

  • abu - Modular quantitative trading and backtesting platform for developing and optimizing strategies. (Python)
  • academic-keyword-occurrence - Command-line web scraper that tracks academic keyword occurrence trends over time. (Python)
  • akshare - Python library fetching Chinese financial and economic market data from many sources. (Python)
  • asammdf - Parser and editor for ASAM MDF automotive measurement data files. (Python)
  • AutoViz - Automated exploratory data visualization library generating charts from any dataset. (Python)
  • backtrader - Python framework for backtesting and live trading of algorithmic investment strategies. (Python)
  • bt - Flexible backtesting framework for Python with tree-structured trading strategies. (Python)
  • cartopy - Cartographic Python library for geospatial data processing and map projection plotting. (Python)
  • common-pile - Tooling for ingesting and processing large-scale openly-licensed text corpora. (Python)
  • d3py - Python library generating interactive D3.js and Vega visualizations from dataframes. (Python)
  • dask - Parallel computing library scaling NumPy and pandas workflows across clusters. (Python)
  • DataProfiler - Library for profiling, labeling, and generating reports on diverse datasets. (Python)
  • docling - IBM library converting documents in many formats into structured data. (Python)
  • dtale - Web-based interactive visualizer and editor for pandas DataFrames. (Python)
  • explorer - Dataset management and trace exploration tool with policy enforcement. (Python)
  • FinQuant - Python library for portfolio optimization and financial quantitative analysis. (Python)
  • freqtrade - Open-source cryptocurrency algorithmic trading bot with backtesting. (Python)
  • glom - Declarative Python library for restructuring and transforming nested data. (Python)
  • h3-py - Python bindings for Uber's H3 hexagonal geospatial indexing system. (Python)
  • hummingbot - Open-source framework for building high-frequency crypto market making bots. (Python)
  • ijson - Iterative JSON parser for Python handling large files without full loading. (Python)
  • matplotlib - Comprehensive Python library for creating static and interactive visualizations. (Python)
  • numpy - Fundamental N-dimensional array package for scientific computing in Python. (Python)
  • OpenBB - Open-source financial investment research and data analysis platform. (Python)
  • optimus - Unified data cleaning and transformation API over Pandas, Spark and Dask. (Python)
  • order_book_server - Real-time order book server consuming Hyperliquid market data streams. (Rust)
  • pandas - High-performance DataFrame library for data analysis and manipulation in Python. (Python)
  • pathway - High-performance real-time streaming data processing framework with Rust engine. (Python)
  • pingouin - Statistical package for Python built on top of Pandas and NumPy. (Python)
  • plotly.py - Interactive graphing library for Python creating publication-quality charts. (Python)
  • polars - Fast DataFrame library with Rust engine and Python bindings. (Rust)
  • prettyplotlib - Matplotlib wrapper producing publication-ready plots with sensible defaults. (Python)
  • prince - Python multivariate exploratory data analysis library for PCA and related methods. (Python)
  • pypeln - Concurrent data pipeline library with thread, process, and asyncio backends. (Python)
  • pyreadstat - Python interface reading SPSS, SAS, and Stata statistical files via C library. (Python)
  • python-benedict - Python dictionary subclass with keylist/keypath and serialization helpers. (Python)
  • qstock - Quantitative stock analysis toolkit with data acquisition and backtesting. (Python)
  • riko - Stream processing library for creating modular data pipelines and feed aggregation. (Python)
  • scipy - Fundamental scientific computing library providing mathematics, science, and engineering algorithms. (Python)
  • scrapy - High-level Python web crawling and scraping framework for structured data extraction. (Python)
  • scrapy-proxies - Random proxy middleware for rotating IPs in Scrapy crawlers. (Python)
  • seaborn - Statistical data visualization library built on matplotlib with attractive defaults. (Python)
  • skfolio - Portfolio optimization library building on scikit-learn for quantitative finance. (Python)
  • splink - Python record linkage library for deduplication across large datasets. (Python)
  • sqllineage - SQL lineage analysis tool tracing column and table dependencies. (Python)
  • superset - Modern data exploration and visualization platform with rich dashboards. (Python)
  • textfilter - Pipeline-based text content filtering tool for keyword-based moderation. (Python)
  • tika-python - Python binding to Apache Tika REST services for content extraction from documents. (Python)
  • tushare - Python library for retrieving Chinese financial market data. (Python)
  • usaddress - Python library using CRFs for parsing unstructured US address strings. (Python)
  • vnpy - Python-based quantitative trading platform with event-driven architecture. (Python)
  • webscraping - Web scraping pipeline for extracting job listings from online portals. (Python)
  • zipline - Pythonic algorithmic trading library for backtesting quantitative strategies. (Python)

Web & UI

Frontend frameworks

  • angular - Google's TypeScript framework for building scalable single-page web applications. (TypeScript)
  • babel - Internationalization library providing locale data, translations, and formatting utilities. (Python)
  • dominate - Python library generating HTML documents programmatically using context managers. (Python)
  • fastapi - Modern Python web framework for building fast APIs with type hints. (Python)
  • fastapi-pagination - Pagination extension library for FastAPI applications. (Python)
  • flask - Lightweight Python WSGI web framework with routing and templating. (Python)
  • flask-ask - Flask extension for rapidly building Amazon Alexa skills. (Python)
  • flask-jwt-extended - Flask extension for JWT-based authentication and authorization. (Python)
  • flask-wtf - Flask integration for WTForms form handling with CSRF protection. (Python)
  • gpfjs - Angular frontend for GPF genomic and phenotypic data exploration. (TypeScript)
  • kivy - Open-source Python framework for cross-platform multi-touch GUI applications. (Python)
  • mangum - Adapter for running ASGI Python applications on AWS Lambda. (Python)
  • microdot - Minimalistic Python web framework for MicroPython and CPython servers. (Python)
  • nicegui - Python framework for creating web-based user interfaces with minimal boilerplate. (Python)
  • paypal-js - React wrapper components for loading the PayPal JavaScript SDK. (JavaScript)
  • pyramid - Flexible Python web framework scaling from small apps to large applications. (Python)
  • react - Declarative JavaScript library for building user interfaces with components. (JavaScript)
  • reflex - Pure-Python framework for building full-stack reactive web applications. (Python)
  • sanic - High-performance async Python web framework built for speed and extensibility. (Python)
  • spring-boot - Framework for building standalone production-grade Spring-based Java applications. (Java)
  • streamlit - Python framework for quickly building interactive data apps and dashboards. (Python)
  • tornado - Python web framework and asynchronous networking library for high concurrency. (Python)
  • tsoa - TypeScript OpenAPI framework generating routes and specs from decorators. (TypeScript)
  • vue - Progressive JavaScript framework for building reactive user interfaces. (TypeScript)
  • XgagSPA - React single-page application displaying user posts and statistics. (JavaScript)

UI libraries & no-code

  • ant-design - Enterprise-class React UI component library with consistent design language. (TypeScript)
  • carbon - IBM's Carbon design system React and web component library for enterprise UIs. (JavaScript)
  • CopilotKit - React SDK for embedding in-app AI copilots, chatbots, and agents. (TypeScript)
  • CustomTkinter - Modern customizable widget library extending Python's tkinter GUI toolkit. (Python)
  • dash - Plotly framework for building analytical web applications in pure Python. (Python)
  • deep-chat - Customizable chat component embedding AI conversations into any web application. (TypeScript)
  • formio - API-driven platform for dynamic form building and data management. (JavaScript)
  • gradio - Python library to build web demos for machine learning models. (Python)
  • NodeGraphQt - PyQt-based node graph framework for building visual programming interfaces. (Python)
  • PySimpleGUI - Simplified wrapper creating GUIs across Tkinter, Qt and web frameworks. (Python)
  • Tkinter-Designer - Tool for converting Figma designs into Python Tkinter GUI code. (Python)
  • vuestic-admin - Open-source Vue 3 admin dashboard template with Vuestic UI components. (TypeScript)

Apps & platforms

  • appsmith - Low-code platform for building internal tools, dashboards, and admin panels. (TypeScript)
  • arxiv-base - Core Flask application infrastructure powering the arXiv preprint platform. (Python)
  • asgiref - ASGI reference utilities bridging synchronous and asynchronous Python web code. (Python)
  • atlassian-connect-example-app-node - Example Atlassian Connect app with React frontend and Node.js Express backend. (JavaScript)
  • Caly - Avalonia-based cross-platform calendar desktop application using MVVM patterns. (C#)
  • cherrypy - Minimalist object-oriented Python web framework with built-in HTTP server. (Python)
  • claude-code-testing - Web application rendering markdown onboarding guides as interactive CodeBoarding docs. (TypeScript)
  • codeforlife-portal - Django-based educational portal delivering coding lessons to students and teachers. (Python)
  • copyparty - Self-hosted multi-protocol file server supporting HTTP, FTP, SMB, and more. (Python)
  • core - Home Assistant core home-automation platform integrating smart-home devices. (Python)
  • cvimprover-api - Django application that improves resumes using AI with Stripe-based payments. (Python)
  • dispatch - Netflix's open-source security incident and case management platform. (Python)
  • dj-rest-auth - Django REST framework package providing authentication and registration endpoints. (Python)
  • dj-stripe - Django package syncing Stripe subscription and payment objects into local models. (Python)
  • django - High-level Python web framework encouraging rapid development and clean design. (Python)
  • django-anymail - Django email backend sending mail via multiple transactional ESP providers. (Python)
  • django-crm - Modular Django customer relationship management application for leads and deals. (Python)
  • Django-facebook - Django extension for Facebook OAuth login and user profile synchronization. (Python)
  • django-filer - Django digital asset management app handling files, folders, and image metadata. (Python)
  • django-lifecycle - Django mixin adding declarative save/delete lifecycle hooks to ORM models. (Python)
  • django-modeltranslation - Django extension enabling translation of model field content into multiple languages. (Python)
  • django-ninja - Fast Django web framework for building type-hinted REST APIs. (Python)
  • django-notifications - Django app providing a reusable GitHub-style user notifications framework. (Python)
  • django-postgres-docker-skeleton - Boilerplate Django project skeleton with PostgreSQL and Docker configuration. (Python)
  • django-react-typescript - Full-stack starter combining Django backend with React TypeScript frontend. (Python)
  • django-rest-framework - Powerful and flexible toolkit for building Web APIs with Django. (Python)
  • django-rules - Tiny Django authorization library based on predicate rule composition. (Python)
  • django-tenant-schemas - Django app providing PostgreSQL-schema-based multi-tenancy for SaaS applications. (Python)
  • erpnext - Open-source enterprise resource planning platform for managing businesses. (Python)
  • eShop - Reference microservices-based e-commerce sample application on .NET. (C#)
  • EShopMicroservices - Microservices e-commerce reference with API gateway and event-driven services. (C#)
  • eShopOnAzure - Azure-deployed microservices e-commerce reference application. (C#)
  • eShopOnWeb - Monolithic ASP.NET Core e-commerce sample demonstrating clean architecture. (C#)
  • Fashion-Brand - Static frontend website for a fashion brand with HTML, CSS, and JavaScript. (HTML/JS)
  • Flyerscord-Bot - Discord bot application for the Philadelphia Flyers community. (JavaScript)
  • foodgram-project - Dockerized web application for sharing recipes with Django backend and NGINX. (Python)
  • ForestBlog - Java-based Spring MVC blog system with admin panel. (Java)
  • HR-SM - HR management web app with React frontend and Node.js/MongoDB backend. (JavaScript)
  • hrms - Open-source Frappe-based human resources management system. (Python)
  • jellyfin - Free open-source media server for streaming and managing personal media. (C#)
  • jovvix - Polyglot interactive quiz application with web frontend and services. (JavaScript)
  • laudspeaker - Open-source multichannel customer messaging and marketing automation platform. (TypeScript)
  • MayarDataScienceLab - Static informational website for Mayar data science lab. (HTML/JS)
  • nest - Progressive Node.js framework for building scalable server-side applications. (TypeScript)
  • obsidian-spaced-repetition - Obsidian plugin for spaced repetition flashcard review over notes. (TypeScript)
  • OctoPrint - Web interface for controlling, monitoring, and managing 3D printers. (Python)
  • odoo - Open-source ERP and business application suite covering CRM, sales, and inventory. (Python)
  • OpenUpgrade - Framework for upgrading Odoo ERP deployments across major versions. (Python)
  • payload - TypeScript-first headless CMS and application framework. (TypeScript)
  • pelican-plugins - Collection of plugins extending the Pelican static site generator. (Python)
  • PharmaSage - Next.js pharma-themed web application with component-based architecture. (TypeScript)
  • quick-notes-extension - Browser extension for taking and managing quick notes. (JavaScript)
  • RuoYi - Spring Boot admin dashboard system with role-based access control. (Java)
  • saleor - GraphQL-first headless e-commerce platform built with Django. (Python)
  • sdos-orchestration-flow-designer - Visual flow designer for the SDOS orchestration platform with React frontend. (TypeScript)
  • snappass - Flask-based web application for sharing passwords and secrets with expiring URLs. (Python)
  • social-app-django - Django integration for python-social-auth providing social authentication flows. (Python)
  • StructureBlock - Minecraft server management application with CLI and web GUI. (?)
  • synapse - Matrix homeserver reference implementation for decentralized real-time communication. (Python)
  • timetagger - Tag-based time tracking web application for individuals and small teams. (Python)
  • warehouse - Codebase powering the Python Package Index at pypi.org. (Python)
  • winboat - Electron desktop app running Windows VMs on Linux hosts. (TypeScript)
  • zulip - Open-source team chat application with threaded conversations and integrations. (Python)

Infrastructure & DevOps

Configuration & automation

  • ansible - Agentless IT automation platform for configuration management and application deployment. (Python)
  • archinstall - Guided installer automating Arch Linux setup with interactive and scripted modes. (Python)
  • AYABInterface - Python library controlling AYAB-modified knitting machines over serial communication. (Python)
  • charmcraft - CLI tool for building, packaging, and publishing Juju charms and bundles. (Python)
  • dellemc-openmanage-ansible-modules - Ansible collection automating Dell EMC server and iDRAC management tasks. (Python)
  • fabric - High-level Python library for SSH-based remote execution and deployment. (Python)
  • foxops - DevOps/GitOps automation tool for templating and managing infrastructure repositories. (Python)
  • irods_capability_automated_ingest - Distributed asynchronous framework for automated iRODS data ingest. (Python)
  • kalico - 3D printer firmware host (Klipper fork) for microcontroller motion control. (Python)
  • nginxconfig.io - Web tool for generating optimized NGINX and Docker Compose configurations. (JavaScript)
  • nornir - Pluggable Python network automation framework for multi-device orchestration. (Python)
  • sceptre - CLI tool for managing and orchestrating AWS CloudFormation stacks. (Python)
  • stackstorm_pack_ansible - StackStorm integration pack exposing Ansible operations as automation actions. (Python)
  • turborepo-remote-cache-cloudflare - Turborepo remote cache implementation running on Cloudflare Workers. (TypeScript)
  • VertFlow - Airflow operator for running Cloud Run jobs across regions optimizing for green compute. (Python)

Observability & telemetry

  • datadogpy - Official Python client for Datadog metrics, events, and API interactions. (Python)
  • dd-agent - Datadog host agent collecting system and application metrics for monitoring. (Python)
  • dd-trace-py - Datadog APM Python tracer instrumenting applications with distributed traces. (Python)
  • django-prometheus - Django library exporting application metrics in Prometheus format. (Python)
  • grafanalib - Python library for defining Grafana dashboards as code. (Python)
  • insights-core - Red Hat framework for collecting and analyzing system data for insights. (Python)
  • logparser - Toolkit of automated log parsing algorithms for log analytics research. (Python)
  • logster - Utility for reading log files and generating metrics for monitoring systems. (Python)
  • newrelic-python-agent - New Relic APM agent for instrumenting Python applications and collecting telemetry. (Python)
  • opentelemetry-go - OpenTelemetry SDK and API for instrumenting Go applications with traces and metrics. (Go)
  • opentelemetry-python - OpenTelemetry API and SDK for Python application observability instrumentation. (Python)
  • raven-python - Legacy Sentry SDK for capturing Python application errors and events. (Python)
  • sentry-python - Official Sentry SDK for Python error tracking and performance monitoring. (Python)
  • simplemonitor - Simple Python-based monitoring framework checking hosts and services with alerting. (Python)

Media & real-time infra

  • livekit - Open-source WebRTC SFU for scalable real-time audio, video, and data. (Go)

Platform SDKs

  • cluster-api - Kubernetes project providing declarative APIs and controllers for cluster lifecycle management. (Go)
  • docker-py - Official Python SDK for interacting with the Docker Engine API. (Python)
  • localstack - Fully functional local AWS cloud stack for testing and development. (Python)
  • moby - Container toolkit and runtime powering Docker and related projects. (Go)
  • nginx - High-performance HTTP server and reverse proxy with modular architecture. (C)
  • python-lambda - CLI tool for developing and deploying AWS Lambda functions. (Python)

Developer tools

Package & environment management

  • cibuildwheel - CI tool building Python wheels for CPython and PyPy across many platforms. (Python)
  • pipenv - Python dependency management tool combining pip and virtualenv workflows. (Python)
  • poetry - Modern Python packaging and dependency management tool. (Python)
  • poetry-multiproject-plugin - Poetry plugin enabling monorepo-style multi-project workflows. (Python)
  • proxpi - Caching proxy server for PyPI reducing package download latency. (Python)
  • pybuilder - Continuous build tool for Python projects with declarative configuration. (Python)

Language tooling (lint / types / format)

  • cloudpickle - Extended pickle module serializing closures, lambdas, and dynamically defined Python objects. (Python)
  • codon - High-performance Python compiler using JIT and AOT to generate native code. (C++)
  • cuda-python - NVIDIA's official Python bindings for the CUDA driver and runtime APIs. (Python)
  • dacite - Small utility converting nested dictionaries into strongly-typed Python dataclasses. (Python)
  • deco - Decorator-based automatic parallelization of Python functions via AST rewriting. (Python)
  • injector - Python dependency injection framework inspired by Google Guice. (Python)
  • mypy - Static type checker for Python with a compiler for optimizing type-checked code. (Python)
  • pre-commit - Framework for managing and running multi-language pre-commit git hooks. (Python)
  • pyannotate - Auto-generates PEP 484 type annotations by observing Python runtime behavior. (Python)
  • pydantic - Data validation library using Python type hints for runtime checking. (Python)
  • pyflakes - Lightweight passive Python source static analyzer for error detection. (Python)
  • python-betterproto - Protobuf code generator producing modern idiomatic Python dataclasses and gRPC. (Python)
  • rope - Python refactoring library providing AST-based code analysis and transformation. (Python)
  • ruff-lsp - Language server protocol wrapper for the Ruff Python linter and formatter. (Python)
  • shopify_python - Shopify Python style guide enforcement tools extending Pylint and AST utilities. (Python)
  • stm32-rs - Generator for Rust peripheral access crates for STM32 microcontrollers. (Rust)
  • torchtyping - Runtime type annotations for PyTorch tensor shapes and dtypes. (Python)
  • typeguard - Runtime type checker for Python function annotations using AST transformation. (Python)
  • TypeScript - Strongly typed programming language that builds on JavaScript with compile-time types. (TypeScript)
  • typescript-language-server - Language server protocol implementation for TypeScript and JavaScript. (TypeScript)
  • typeshed - Collection of library stubs for Python used by type checkers like mypy. (Python)
  • voluptuous - Python data validation library with schema definition and compilation. (Python)
  • vscode-xslt-tokenizer - VS Code extension for XSLT and XPath syntax highlighting and language features. (TypeScript)
  • xed - Intel x86 encoder decoder library and disassembler. (C)
  • zls - Zig language server providing autocompletion and IDE features. (Zig)

CLIs, docs & DX

  • aiomultiprocess - Asynchronous multiprocessing pool for running async tasks across worker processes. (Python)
  • amazing-qr - Generator for artistic QR codes with custom images, GIFs, and colors. (Python)
  • argcomplete - Bash/zsh tab completion provider for Python argparse command-line programs. (Python)
  • bashplotlib - Library for plotting basic histograms and scatterplots directly in the terminal. (Python)
  • blinker - Fast in-process signal/event dispatching library for decoupled Python applications. (Python)
  • cli - Snyk's command-line interface for scanning code and dependencies for vulnerabilities. (TypeScript)
  • cobra - Go library for building modern CLI applications with nested commands and flags. (Go)
  • CodeBoarding-vscode - VS Code extension visualizing code architecture with control flow graph diagrams. (TypeScript)
  • cookiecutter - CLI utility creating new projects from templated cookiecutter directories. (Python)
  • craft-application - Framework for Canonical's craft-family CLI tools providing shared lifecycle scaffolding. (Python)
  • delegator.py - Simplified subprocess wrapper providing a friendly API for shell command execution. (Python)
  • delorean - Python library making timezone-aware datetime manipulation simpler and safer. (Python)
  • DirLink - Command-line utility for creating and managing directory symlinks. (Python)
  • dive - CLI tool exploring Docker image layers and analyzing wasted space. (Go)
  • drgn - Programmable debugger for introspecting the Linux kernel and user processes. (Python)
  • ebooklib - Python library for reading and writing EPUB ebook files. (Python)
  • eventsourcing - Python library implementing the event sourcing pattern for domain-driven design. (Python)
  • ganda - Go CLI utility for making parallel HTTP requests from a URL list. (Go)
  • gaphor - Simple yet powerful UML and SysML modeling application. (Python)
  • git-stacktrace - Tool that parses Python stack traces and maps them to git blame information. (Python)
  • gitdiagram - Tool for generating interactive architecture diagrams from GitHub repositories. (TypeScript)
  • GitHubPoster - CLI tool to generate visual posters from GitHub and activity data. (Python)
  • Gooey - Library that turns Python command-line programs into full GUI applications. (Python)
  • guietta - Simple Python library for declaratively creating PyQt GUIs in a few lines. (Python)
  • gydnc - CLI for managing guidance entities with storage and service layers. (Go)
  • hashids-python - Python library to generate short, unique, non-sequential IDs from numbers. (Python)
  • heartrate - Real-time visualization of Python program execution in the browser. (Python)
  • holidays - Python library generating holiday dates for many countries and regions. (Python)
  • honcho - Python port of Foreman process manager for Procfile-based applications. (Python)
  • humanize - Python library for converting numbers, dates, and sizes to human-readable strings. (Python)
  • ipdb - IPython-powered interactive Python debugger. (Python)
  • ipython - Enhanced interactive Python shell with rich display and magic commands. (Python)
  • itermplot - Matplotlib backend that renders plots inline in iTerm2 terminals. (Python)
  • jupyterlab - Next-generation web-based user interface for Project Jupyter notebooks. (Python)
  • KiKit - KiCad automation tool for PCB panelization and manufacturing outputs. (Python)
  • knittingpattern - Python library for parsing and converting knitting pattern DSL files. (Python)
  • loguru - Python logging library designed to be stupidly simple to use. (Python)
  • markitdown - Microsoft CLI tool that converts documents and files to Markdown. (Python)
  • mkdocs - Static site generator geared towards building project documentation from Markdown. (Python)
  • mkdocstrings - MkDocs plugin for auto-generating API documentation from docstrings. (Python)
  • mtkclient - CLI and GUI client for flashing and interacting with MediaTek devices. (Python)
  • napi - CLI framework built on yargs with middleware and manifest generation. (TypeScript)
  • notebook - Jupyter Notebook web application for interactive computing and data analysis. (Python)
  • Pipe - Small Python library enabling fluent function composition through pipe operators. (Python)
  • poco - Project management CLI for bootstrapping and managing multi-repo projects. (Python)
  • poethepoet - Task runner for Python projects configured via pyproject.toml. (Python)
  • posting - Terminal-based HTTP client for API development and testing. (Python)
  • pyautogui - Cross-platform Python library for GUI automation of mouse and keyboard. (Python)
  • pydash - Functional utility library for Python inspired by lodash. (Python)
  • python-adb - Pure Python ADB and Fastboot implementation for Android device communication. (Python)
  • python-fire - Google library for automatically generating CLIs from Python objects. (Python)
  • python-nubia - Facebook framework for building interactive CLIs with REPL and autocomplete. (Python)
  • PyWhatKit - Python helper library for WhatsApp messaging and desktop automation tasks. (Python)
  • questionary - Python library for building interactive command line prompts and forms. (Python)
  • railroad-diagrams - Library for generating SVG railroad diagrams from grammar definitions. (Python)
  • retrying - Python decorator library for adding configurable retry logic to functions. (Python)
  • rich - Library for rich text and beautiful formatting in the terminal. (Python)
  • simplejson - Simple, fast, extensible JSON encoder and decoder for Python. (Python)
  • terminal_markdown_viewer - Command-line tool for rendering markdown files beautifully in the terminal. (Python)
  • textual - Python framework for building rich interactive terminal user interfaces. (Python)
  • tools - Nextflow tooling for managing pipelines, modules, and subworkflows. (Python)
  • tqdm - Fast, extensible progress bar library for Python and the command line. (Python)
  • tuna - Visualization tool for Python profile and import time data. (Python)
  • vscode - Microsoft Visual Studio Code source editor for modern web and cloud applications. (TypeScript)
  • wdb - Web-based remote Python debugger with real-time breakpoint interface. (Python)
  • XKCD-password-generator - CLI tool generating memorable passwords following the XKCD passphrase method. (Python)

Testing & load

  • appium - Cross-platform mobile automation framework for testing native and web apps. (JavaScript)
  • AsyncFlow - Discrete-event simulation framework for modeling asynchronous request-response systems. (Python)
  • atheris - Coverage-guided fuzzing engine for Python with native extension instrumentation. (Python)
  • behave - Behavior-driven development framework executing Gherkin feature files as tests. (Python)
  • chromeless - Serverless headless Chrome automation running browser sessions on AWS Lambda. (JavaScript)
  • facebook-wda - Python client for WebDriverAgent to automate iOS device UI testing. (Python)
  • hypothesis-torch - Property-based testing extension for PyTorch models via Hypothesis. (Python)
  • IOS13-SimulateTouch - iOS automation framework with Python client and on-device tweak for touch simulation. (Python)
  • locust - Python load testing framework where users write scenarios as code. (Python)
  • openhtf - Python framework for building scalable manufacturing hardware test automation. (Python)
  • optics-framework - Automated testing framework for building and running test suites. (Python)
  • perfplot - Python micro-benchmarking library with automatic runtime plotting. (Python)
  • pytest - Plugin-driven Python testing framework for writing expressive tests. (Python)
  • pytest-xdist - Pytest plugin for distributed parallel test execution across workers. (Python)
  • python-client - Appium Python client for mobile application test automation via WebDriver. (Python)
  • requestium - Library merging Requests, Selenium, and Parsel for robust web scraping automation. (Python)
  • RIDE - Robot Framework test data editor with a graphical user interface. (Python)
  • robotframework - Generic open-source automation framework for acceptance testing and RPA. (Python)
  • scanapi - Automated integration and contract testing tool for REST APIs using YAML specs. (Python)
  • selenium-wire - Selenium extension adding request inspection and interception to browser automation. (Python)
  • testcontainers-python - Python library for running throwaway Docker containers during integration tests. (Python)
  • uiautomator2 - Python wrapper for Android UIAutomator2 enabling mobile UI automation and testing. (Python)
  • undetected-chromedriver - Patched Selenium ChromeDriver bypassing Cloudflare, Distil, and other bot detection. (Python)

Scientific & research computing

Genomics & bioinformatics

  • aestetik - Spatial transcriptomics analysis library using deep learning embeddings on tissue data. (Python)
  • AfterQC - Quality control and preprocessing tool for next-generation sequencing FASTQ data. (Python)
  • alphafold - DeepMind protein structure prediction pipeline producing accurate 3D protein models. (Python)
  • alphagenome - Python SDK applying deep learning to genomics sequence and regulatory analysis. (Python)
  • anndata - Annotated data container for storing large single-cell gene expression datasets. (Python)
  • annofilter-junctions - Bioinformatics pipeline annotating and filtering RNA-seq splice junction calls. (Python)
  • augur - Bioinformatics pipeline toolkit for phylogenetic analysis of pathogen sequences. (Python)
  • BeadArrayFiles - Parser library for Illumina BeadArray BPM, cluster, and GTC genotyping files. (Python)
  • bigwig-loader - GPU-accelerated data loader for streaming BigWig genomic signal tracks into deep learning. (Python)
  • BioPhi - Web platform for antibody design, humanization, and humanness evaluation. (Python)
  • biopython - Comprehensive toolkit for computational biology, sequence analysis, and biological I/O. (Python)
  • CoSpred - Deep learning predictor of mass spectrometry spectra for peptide identification. (Python)
  • decima - Neural network for analyzing gene expression and biological sequence data. (Python)
  • DeepSpot - Spatial transcriptomics deep learning predictor from histology images. (Python)
  • deeptools - Command-line suite for analyzing and visualizing high-throughput sequencing data. (Python)
  • digest - High-performance ntHash bioinformatics sequence hashing library with Python bindings. (C++)
  • EFAAR_benchmarking - Benchmarking toolkit for evaluating biological perturbation representation methods. (Python)
  • enact-pipeline - Pipeline for processing and analyzing spatial transcriptomics data. (Python)
  • equifold - Machine learning pipeline for protein structure prediction and computational biology. (Python)
  • esm - Protein language models for structure prediction and sequence embedding. (Python)
  • GearNet - Deep learning framework for protein structure representation learning. (Python)
  • GeneAgent - Self-verifying LLM agent for bioinformatics gene set analysis. (Python)
  • genie - Deep learning framework for generative protein structure design. (Python)
  • genome-manager - Bioinformatics data management system with Nextflow workflow automation. (Python)
  • gnomAD_DB - SQLite-backed database for querying gnomAD genomic variant data. (Python)
  • gpf - Genomic data management and analysis platform for genetics research. (Python)
  • gpf_flash - Layered bioinformatics data platform for genomic and phenotypic data. (Python)
  • gpf_old - Legacy genomic data management platform with web interface. (Python)
  • gpf_small - Scalable framework for managing and querying large-scale genomic variants. (Python)
  • gReLU - Deep learning framework for biological sequence analysis and interpretation. (Python)
  • HLAIIPred - HLA class II peptide binding prediction tool for immunology research. (Python)
  • jump_hub - Biological data processing and analysis hub for JUMP Cell Painting. (Python)
  • KAZU - Biomedical NLP framework for named entity recognition and ontology linking. (Python)
  • kipoiseq - Standard dataloaders for genomic sequence and variant machine learning models. (Python)
  • LISTT - CLI bioinformatics pipeline orchestrating external tools for genomic analysis. (Python)
  • MultiQC - Aggregates bioinformatics tool outputs into a single summary report. (Python)
  • Muon - Multi-omics data analysis framework built around the MuData object. (Python)
  • neusomatic - Deep learning-based somatic variant caller for long-read sequencing data. (Python)
  • nf-crispr-primer-design - Nextflow bioinformatics pipeline for designing primers for CRISPR experiments. (Python)
  • omnibenchmark - Framework for automating and standardizing benchmarking of bioinformatics tools. (Python)
  • openfold - Open-source PyTorch reimplementation of DeepMind's AlphaFold 2 protein prediction. (Python)
  • ProteinFlow - Pipeline for processing protein structure data for machine learning tasks. (Python)
  • PyDESeq2 - Python implementation of DESeq2 for RNA-seq differential expression analysis. (Python)
  • pysam - Python interface to SAMtools for manipulating genomic sequencing files. (Python)
  • railroadtracks - Framework for defining and executing bioinformatics workflows with provenance. (Python)
  • scanpy - Scalable Python toolkit for single-cell gene expression data analysis. (Python)
  • scikit-bio - Bioinformatics data structures, algorithms, and educational resources in Python. (Python)
  • scimilarity - Metric learning framework for single-cell RNA-seq similarity search and annotation. (Python)
  • scirpy - Scanpy-based toolkit for single-cell T-cell and B-cell receptor repertoire analysis. (Python)
  • scispacy - spaCy NLP models and pipelines for biomedical and scientific text processing. (Python)
  • scvi-tools - Deep probabilistic models for single-cell omics data analysis. (Python)
  • seqlike - Unified Python interface for biological sequence manipulation and analysis. (Python)
  • somaticseq - Ensemble pipeline combining somatic variant callers with machine learning classification. (Python)
  • spatialone-pipeline - Spatial transcriptomics pipeline for Visium platform data processing. (Python)
  • SRAgent - AI agent system for curating and retrieving Sequence Read Archive bioinformatics data. (Python)
  • starfish - Pipeline toolkit for image-based spatial transcriptomics analysis. (Python)

Imaging, neuroscience & health

  • AllenSDK - Allen Institute SDK for accessing and analyzing neuroscience brain observatory data. (Python)
  • arviz - Exploratory analysis and visualization library for Bayesian statistical models. (Python)
  • bagel-cli - CLI for annotating neuroimaging datasets with harmonized BIDS-compatible metadata. (Python)
  • BIC-megaplots - Neuroimaging visualization tool producing composite megaplots from brain scan data. (Python)
  • cellpose - Generalist deep learning model for segmenting cells in biomedical images. (Python)
  • DeepLabCut - Markerless animal pose estimation tool for neuroscience and behavioral research. (Python)
  • dipy - Python library for diffusion MRI analysis, tractography, and neuroimaging workflows. (Python)
  • fmriprep - Robust preprocessing pipeline for fMRI neuroimaging data following BIDS standards. (Python)
  • h5bench - HDF5 I/O performance benchmarking suite for scientific computing. (C)
  • improver - UK Met Office modular post-processing toolkit for meteorological forecasts. (Python)
  • iris - Python library for analyzing and visualizing Earth science data. (Python)
  • lyse - Labscript suite analysis component for scientific experiment data. (Python)
  • MEA-NAP - MATLAB pipeline for microelectrode array neural activity analysis. (MATLAB)
  • medconb - GraphQL service for managing medical concepts, codelists, and phenotypes. (Python)
  • MedicalNet - Pretrained 3D CNN models for medical image transfer learning. (Python)
  • mne-python - Neuroscience library for MEG, EEG and neurophysiological signal analysis. (Python)
  • mouse-echo-neural-net - Desktop deep learning app for biomedical mouse echocardiography image analysis. (Python)
  • MS-lesion-segmentation - Deep learning pipeline for multiple sclerosis lesion segmentation in MRI. (Python)
  • neuro-forestwalk - Machine learning pipeline for behavioral phenotyping on HPC environments. (Python)
  • neuro-green - Deep learning research library for computational neuroscience applications. (Python)
  • neuro-meeglet-paper - Research code accompanying a neuroscience MEG/EEG wavelet analysis paper. (Python)
  • NiftyNet - Deep learning platform for medical image analysis and research. (Python)
  • nilearn - Scientific Python library for statistical learning on neuroimaging data. (Python)
  • nipype - Neuroimaging pipeline framework uniting heterogeneous analysis packages under one API. (Python)
  • ome-zarr-py - Python library for reading and writing OME-Zarr bioimaging data format. (Python)
  • PhenEx - Defines and computes patient phenotypes and cohorts from clinical databases. (Python)
  • pvlib-python - Scientific library for simulating photovoltaic energy system performance. (Python)
  • pybalance - Causal inference matching library for balancing treatment and control groups. (Python)
  • PyHealth - Deep learning toolkit for healthcare predictive modeling on EHR data. (Python)
  • pymedphys - Medical physics toolkit for radiotherapy and diagnostic imaging calculations. (Python)
  • scikit-image - Collection of image processing algorithms for scientific Python applications. (Python)
  • sit2standpy - Python library for detecting sit-to-stand transitions from wearable sensor data. (Python)
  • TotalSegmentator - Deep learning tool for automatic segmentation of anatomical structures in CT scans. (Python)
  • WUCSS - Scientific pipeline for sleep state classification from physiological signals. (Python)

Molecular dynamics & chemistry

  • aizynthfinder - Retrosynthetic planning tool predicting synthesis routes for target molecules using AI. (Python)
  • beignet - Scientific computing library for molecular modeling, polynomials, and 3D rotations. (Python)
  • boltz - Diffusion-based molecular structure and binding affinity prediction model. (Python)
  • ccdutils - PDB Chemical Component Dictionary utilities for processing and analyzing small molecules. (Python)
  • ChEMBL_Structure_Pipeline - Chemical structure standardization and validation pipeline used by ChEMBL. (Python)
  • chemicalx - PyTorch library for drug-drug interaction prediction using graph neural networks. (Python)
  • ConfGF - Score-based generative model for 3D molecular conformation generation. (Python)
  • ddi-designer - Tool for designing drug-drug interaction studies in pharmacology research. (Python)
  • deep-molecular-optimization - Deep generative models for optimizing molecular structures against target properties. (Python)
  • deepchem - Machine learning library for drug discovery, materials, and quantum chemistry. (Python)
  • DeeplyTough - Deep learning comparison of protein binding pockets for drug discovery. (Python)
  • diffrax - JAX library for numerical differential equation solvers with autodiff support. (Python)
  • drcHelper - R toolkit for dose-response analyses in toxicology and pharmacology. (R)
  • geomstats - Python library for computations and statistics on Riemannian manifolds. (Python)
  • insitro-research - Research code for molecular docking and predictive drug modeling. (Python)
  • mlr-xai-selfies - Explainable AI for SELFIES-based molecular property prediction models. (Python)
  • mmpdb - Matched molecular pair database tool for cheminformatics and drug discovery. (Python)
  • MOCCA - Chromatographic data analysis framework for peak detection and deconvolution. (Python)
  • NonadditivityAnalysis - Tool for detecting non-additive structure-activity relationships in chemistry data. (Python)
  • openmc - Monte Carlo neutron and photon transport simulation code for nuclear physics. (Python)
  • psych.js - JavaScript library for psychrometric calculations with interactive chart viewer. (JavaScript)
  • pyemma - Molecular dynamics analysis library for Markov model estimation. (Python)
  • pymatgen - Materials science library for structure analysis and high-throughput computation. (Python)
  • PyPSA - Power system analysis toolbox for energy network optimization. (Python)
  • python-skyfield - Astronomy library computing positions of stars, planets, and satellites. (Python)
  • rdkit - Open-source cheminformatics library for molecular operations and drug discovery. (C++)
  • synflownet-boltz - GFlowNet-based generative machine learning framework for chemistry design. (Python)
  • torsional-strain - Computational chemistry workflow for calculating torsional strain in molecules. (Python)

Security & privacy

App & supply-chain security

  • bandit - Static security linter finding common vulnerabilities in Python code via AST analysis. (Python)
  • bcrypt - Python bindings for the bcrypt password hashing algorithm with native speedups. (Python)
  • chainguard - Lakera guardrails library adding prompt and response safety checks to LLMs. (Python)
  • credential-digger - Scanner that hunts for secrets and credentials in source code repositories. (Python)
  • django-axes - Django middleware tracking failed login attempts and locking out attackers. (Python)
  • django-two-factor-auth - Pluggable two-factor authentication system for Django web applications. (Python)
  • fail2ban - Daemon that bans IPs after repeated authentication failures via log analysis. (Python)
  • garak - LLM vulnerability scanner probing for prompt injection and content issues. (Python)
  • hpn-ssh - High-performance patched OpenSSH for high-bandwidth network transfers. (C)
  • jumpserver - Open-source bastion host for managing SSH, RDP, and database access. (Python)
  • lemur - TLS certificate management and lifecycle automation service from Netflix. (Python)
  • mcp-scan - CLI that scans MCP server configurations and verifies entities against whitelists. (Python)
  • microsoft-authentication-library-for-python - MSAL Python library for acquiring tokens from Microsoft identity platform. (Python)
  • Name-That-Hash - CLI tool to identify hash types from unknown hash strings. (Python)
  • PMapper - Graph-based tool for mapping and analyzing AWS IAM privilege escalation paths. (Python)
  • policy_sentry - CLI for generating least-privilege AWS IAM policies from templates. (Python)
  • prowler - Cloud security assessment tool auditing AWS, Azure, GCP and Kubernetes. (Python)
  • pycasbin - Policy-based access control library supporting RBAC, ABAC and custom models. (Python)
  • python-tuf - Reference implementation of The Update Framework for secure software updates. (Python)
  • safe-chain - Security gateway for JavaScript and Python package managers blocking malicious dependencies. (Python)
  • safety - Python dependency vulnerability scanner checking packages against security databases. (Python)
  • text_blind_watermark - Text steganography library embedding invisible watermarks in strings. (Python)

Offensive / red team

  • Bashfuscator - Modular Bash script obfuscator for offensive security and red-team operations. (Python)
  • CobaltStrikeBeaconCppSource - Leaked Cobalt Strike beacon C++ source code used for offensive security. (C++)
  • GhostTrack - CLI tool for open-source intelligence gathering and tracking. (Python)
  • MHDDoS - Python DDoS attack tool with many protocol methods. (Python)
  • pyrdp - Man-in-the-middle RDP proxy for session interception and analysis. (Python)
  • QBDI - Dynamic binary instrumentation framework for reverse engineering and analysis. (C++)
  • rdpy - Python RDP and VNC implementation for remote desktop protocol tools. (Python)
  • sulley - Fuzzing framework for discovering vulnerabilities in network protocols. (Python)
  • WhatWaf - Tool detecting and identifying web application firewalls via payload fingerprinting. (Python)

Games, graphics & media

Game engines & 3D

  • evennia - Python framework for building text-based multiplayer online games (MUDs). (Python)
  • g3m - Multi-platform 3D/4D geographic rendering and visualization engine. (C++)
  • openage - Open-source clone of the Age of Empires II real-time strategy engine. (C++)
  • panda3d - 3D game engine with Python bindings for interactive application development. (C++)
  • pygame - Python library for writing 2D games and multimedia applications. (Python)
  • pygame-menu - Menu creation and management library for Pygame applications. (Python)
  • pyglet - Cross-platform windowing and multimedia library for Python games. (Python)
  • pyrender - Physically-based OpenGL renderer for 3D scenes and meshes in Python. (Python)
  • pyunity - Python-based Unity-style 2D/3D game engine with scene management. (Python)
  • Quake-2 - Classic Quake 2 game engine source code. (C)
  • sdf - Library for generating 3D meshes from signed distance functions. (Python)
  • Solitaire - Solitaire card game implementation. (?)
  • Sudoku - Sudoku puzzle generator and solver implementation. (?)
  • TA - Game project component abstractions. (?)

Video, audio & downloaders

  • anime-downloader - CLI tool for searching and downloading anime from various streaming sites. (Python)
  • moviepy - Python library for video editing, compositing, and FFmpeg-based media processing. (Python)
  • pyo - Real-time digital signal processing and audio synthesis library. (Python)
  • resemble-enhance - Deep learning toolkit for audio denoising and speech enhancement. (Python)
  • sms-tools - Audio analysis and synthesis tools for sound and music computing. (Python)
  • soundconverter - GNOME application for converting audio files between formats. (Python)
  • spleeter - Deezer source separation library for splitting audio into stems. (Python)
  • syncedlyrics - Command-line utility for fetching time-synced lyrics from multiple providers. (Python)
  • versatile_audio_super_resolution - Deep learning pipeline for audio super-resolution and quality enhancement. (Python)
  • ykdl - Command-line video downloader supporting multiple Chinese video platforms. (Python)
  • youtube-dl - Command-line program for downloading videos from YouTube and other sites. (Python)
  • yt-dlp - Feature-rich youtube-dl fork with additional patches and extractors. (Python)

Creative tooling

  • AliceLG - Blender add-on for rendering 3D scenes to Looking Glass holographic displays. (Python)
  • AmberLG - Blender add-on integrating holographic displays for Looking Glass light-field rendering. (Python)
  • blender-mcp - MCP server letting Claude AI drive Blender for 3D model generation and manipulation. (Python)
  • ComfyUI - Node-based graphical interface for designing and executing Stable Diffusion workflows. (Python)
  • ComfyUI-Easy-Use - ComfyUI extension packaging generative AI workflows into simpler user-friendly nodes. (Python)
  • GrooveScribe - Web-based single-page application for writing and practicing drum grooves. (JavaScript)
  • manim - Programmatic animation engine for creating precise mathematical visualizations. (Python)
  • OCRmyPDF - Adds an OCR text layer to scanned PDFs for searchability and indexing. (Python)
  • opencv - Comprehensive computer vision and image processing library with ML utilities. (C++)
  • opencv-python - Python bindings for the OpenCV computer vision and image processing library. (Python)
  • PCV - Python computer vision toolkit with feature extraction and image processing pipelines. (Python)
  • PDFMathTranslate - Translates math-heavy PDF documents while preserving formulas and layout. (Python)
  • Pillow - Friendly fork of PIL providing image processing capabilities for Python. (Python)
  • PyPDF2 - Pure Python library for reading, writing, and manipulating PDF files. (Python)
  • pytheory - Python music theory library for tones, chords, and temperament systems. (Python)
  • word_cloud - Python library for generating word cloud visualizations from text. (Python)

Networking, APIs & protocols

Platform SDKs & API clients

  • analytics-python - Segment analytics client library for tracking events from Python applications. (Python)
  • atlassian-python-api - Python client covering Jira, Confluence, Bitbucket and other Atlassian product APIs. (Python)
  • besu - Enterprise Ethereum blockchain client implementing execution and consensus layers. (Java)
  • bioblend - Python client for the Galaxy and ToolShed bioinformatics platform APIs. (Python)
  • boto3 - Official AWS SDK for Python providing high-level resource and client interfaces. (Python)
  • botocore - Low-level foundation library for boto3 handling AWS request signing and transport. (Python)
  • check-if-email-exists - Service verifying whether an email address actually exists without sending mail. (Rust)
  • chembl_webresource_client - Python client for the ChEMBL bioactivity database REST web services. (Python)
  • cian-protocol - Decentralized finance automation protocol for scheduled and conditional on-chain actions. (Solidity)
  • cloudant-python-sdk - IBM Cloudant Python SDK for managing documents, databases, and change feeds. (Python)
  • clubhouse-py - Unofficial Python client for the Clubhouse social audio application API. (Python)
  • coinbase-python - Official Coinbase Python API client for cryptocurrency exchange operations. (Python)
  • datahowlab-sdk-python - Python SDK for DataHowLab's bioprocess experiments, products, and models. (Python)
  • deep-translator - Unified Python wrapper around multiple online translation service APIs. (Python)
  • discord - Python library for interacting with the Discord chat platform's API. (Python)
  • discord.py - Full-featured async Python wrapper around the Discord bot API. (Python)
  • dropbox-sdk-python - Official Dropbox Python SDK for file, sharing, and team operations. (Python)
  • facebook-python-business-sdk - Python SDK for Facebook Marketing API ad campaign management. (Python)
  • foxops-client-python - Python SDK client library for interacting with the Foxops API. (Python)
  • friendli-python - Python SDK for interacting with the Friendli LLM inference API. (Python)
  • g2papi - CLI client for the Gene to Protein API retrieving biological data. (Python)
  • gkeepapi - Unofficial Python client library for interacting with Google Keep. (Python)
  • gmail - Python IMAP-based library for reading and managing Gmail messages. (Python)
  • go-micro - Go framework for building distributed microservices with pluggable components. (Go)
  • hangups - Third-party Python client for the Google Hangouts chat protocol. (Python)
  • inbox.py - Lightweight asynchronous Python SMTP server library for email processing. (Python)
  • invariant-sdk - Python SDK for interacting with the Invariant trace and dataset API. (Python)
  • itchatmp - Python framework for building WeChat public/enterprise platform bots. (Python)
  • jira - Python client library for the Atlassian Jira REST API. (Python)
  • JobSpy - Job scraping library for LinkedIn, Indeed, Glassdoor, and similar boards. (Python)
  • magic-wormhole - Secure peer-to-peer file transfer CLI and library using short codes. (Python)
  • mailin - Event-driven SMTP server that parses incoming mail into structured data. (JavaScript)
  • mixpanel-python - Official Python client for sending events to the Mixpanel analytics API. (Python)
  • paysafe_sdk_python - Python SDK for the Paysafe payment processing APIs. (Python)
  • pinterest-python-sdk - Official Python SDK for the Pinterest API. (Python)
  • pixivpy - Python API wrapper for the Pixiv illustration sharing service. (Python)
  • praw - Python Reddit API Wrapper for interacting with the Reddit platform. (Python)
  • PSpider - Concurrent multithreaded web crawling framework for scraping content. (Python)
  • py-trello - Python library for interacting with the Trello REST API. (Python)
  • pycoin - Python toolkit for Bitcoin and cryptocurrency key, wallet, and transaction operations. (Python)
  • pycord - Modern Python API wrapper for interacting with the Discord platform. (Python)
  • PyGithub - Python library for accessing the GitHub REST API. (Python)
  • Pyrebase - Python wrapper for Firebase Auth, Realtime Database and Storage APIs. (Python)
  • pyrh - Unofficial Python interface to the Robinhood trading API. (Python)
  • pysnowball - Python wrapper for the Xueqiu (Snowball) stock market API. (Python)
  • python-asana - Official Python client library for the Asana REST API. (Python)
  • python-bitcoinlib - Python library for the Bitcoin protocol, data structures, and RPC. (Python)
  • python-connector-api - Python connector for the Meteomatics weather data API. (Python)
  • python-digitalocean - Python library for managing DigitalOcean cloud resources via the API. (Python)
  • python-facebook - Python client for Facebook Graph, Instagram and Threads APIs. (Python)
  • python-gitlab - Python library for interacting with the GitLab REST API. (Python)
  • python-linode-api - Python client library for the Linode cloud hosting API. (Python)
  • python-nomad - Python client library for interacting with HashiCorp Nomad clusters. (Python)
  • python-o365 - Python library for interacting with Microsoft Graph and Office 365 APIs. (Python)
  • python-sdk-core - Foundational SDK core for IBM Cloud service Python libraries. (Python)
  • python-slack-sdk - Official Python SDK for building Slack apps and integrations. (Python)
  • python-telegram-bot - Python wrapper for the Telegram Bot API with a dispatcher framework. (Python)
  • python-twitch-client - Python interface to the Twitch APIs (v5 and Helix). (Python)
  • python-user-agents - Python library for parsing user agent strings into structured data. (Python)
  • python-zeep - Modern Python SOAP client for interacting with web services. (Python)
  • quandl-python - Python client library for retrieving financial and economic data from Quandl. (Python)
  • rauth - Python library for OAuth 1.0/2.0 authentication with service providers. (Python)
  • requests - Elegant and simple HTTP library for making web requests in Python. (Python)
  • requests-cache - Transparent persistent HTTP caching layer for the Python requests library. (Python)
  • requests-ip-rotator - Rotates IPs via AWS API Gateway to bypass rate limits on scraping. (Python)
  • Riot-Watcher - Python wrapper for Riot Games APIs covering League of Legends and other titles. (Python)
  • robin_stocks - Unified Python interface for Robinhood, Gemini, and TD Ameritrade trading APIs. (Python)
  • scholarly - Python module retrieving author and publication information from Google Scholar. (Python)
  • sendgrid-python - Official Python client library for the SendGrid email delivery service. (Python)
  • shopify_django_app - Django starter template for building Shopify embedded applications. (Python)
  • shopify_python_api - Python library for authenticating and interacting with the Shopify admin API. (Python)
  • simple-salesforce - Simple REST client library for Salesforce REST, Bulk, and Metadata APIs. (Python)
  • slacker - Full-featured Python interface for the Slack API. (Python)
  • slackminion - Modular plugin-based Python bot framework for Slack. (Python)
  • solana-py - Python client library for interacting with the Solana blockchain. (Python)
  • spotify - Python client library interfacing with the Spotify Web API. (Python)
  • spotipy - Lightweight Python library for interacting with the Spotify Web API. (Python)
  • square-python-sdk - Official Python SDK for integrating with Square payment APIs. (Python)
  • stripe-python - Official Stripe Python library for payment processing API integration. (Python)
  • susi_api_wrapper - Python wrapper client for the SUSI AI conversational assistant API. (Python)
  • target-python-sdk - Python SDK for Adobe Target personalization and A/B testing platform. (Python)
  • twarc - Command-line tool and Python library for archiving Twitter JSON data. (Python)
  • tweepy - Python library for easy access to the Twitter API. (Python)
  • twilio-python - Official Twilio Python library for communications APIs. (Python)
  • twitter-api-client - Python implementation of the Twitter GraphQL and REST APIs. (Python)
  • vsphere-automation-sdk-python - Python SDK samples for VMware vSphere and VMware Cloud automation APIs. (Python)
  • web3.py - Python library for interacting with Ethereum blockchain and smart contracts. (Python)
  • WebWhatsapp-Wrapper - Pythonic automation wrapper for WhatsApp Web using Selenium. (Python)
  • xhs - Python library for interacting with the Xiaohongshu (Little Red Book) platform. (Python)
  • yarl - Fast URL parsing and manipulation library for Python. (Python)
  • yfinance - Python library for downloading historical market data from Yahoo Finance. (Python)
  • yfinance_depth_2 - Extended Yahoo Finance data library with caching and multi-asset support. (Python)

Industrial & IoT protocols

  • cantools - CAN bus database tools for parsing DBC files and encoding automotive signals. (Python)
  • DJITelloPy - Python interface for controlling DJI Tello drones and coordinating swarms. (Python)
  • mavlink - Lightweight drone communication protocol with multi-language code generation. (C/Python)
  • pymodbus - Full Modbus protocol implementation for industrial control systems. (Python)
  • pyserial - Cross-platform Python library for serial port communication. (Python)
  • python-can - Python library for controller area network (CAN) bus communication. (Python)
  • python-opcua - Pure Python OPC UA client and server implementation for industrial automation. (Python)
  • scapy - Interactive packet manipulation library for network protocol forgery and analysis. (Python)

Learning codebases

  • 30-Days-Of-Python - Progressive 30-day curriculum teaching foundational Python concepts step-by-step with exercises. (Python)
  • algo - Collection of algorithm implementations and learning resources for practicing programming. (Python)
  • algorithms - Minimal examples of classic data structures and algorithms implemented for learning. (Python)
  • annotated_deep_learning_paper_implementations - Collection of annotated PyTorch implementations of deep learning research papers. (Python)
  • awesome-llm-apps - Curated collection of example LLM applications demonstrating agents and RAG patterns. (Python)
  • awesome-python - Curated awesome list of Python libraries, frameworks, and software resources. (Python)
  • basic_verilog - Collection of basic Verilog HDL modules and templates for FPGA designers. (Verilog)
  • BayesERbook - Tutorial book on Bayesian exposure-response modeling for pharmacometrics. (R)
  • BookWorm - Book-reading and annotation study project exploring text analysis techniques. (Python)
  • CleanArchitecture - Reference .NET solution template demonstrating clean architecture patterns. (C#)
  • d2l-zh - Chinese-language interactive deep learning textbook with runnable framework examples. (Python)
  • deep-learning-for-image-processing - Tutorial repo of deep learning image classification and detection implementations. (Python)
  • funNLP - Chinese NLP resource collection with dictionaries, corpora, and datasets. (Python)
  • hacking-tutorial - Educational scripts demonstrating network hacking techniques like ARP and WEP. (Python)
  • Hello-Python - Beginner Python course demonstrating basic syntax and arithmetic. (Python)
  • Java - Collection of Java sample projects including JspChat, SpringBoot-Shiro, and eStore. (Java)
  • Jetpack-Compose-Tutorials - Android tutorial project demonstrating Jetpack Compose UI fundamentals. (Kotlin)
  • learn-python - Progressive Python tutorial repository covering syntax to OOP. (Python)
  • Learning-Journal - Django tutorial app for creating topics and entries with authentication. (Python)
  • micrograd - Tiny autograd engine and neural net library for educational purposes. (Python)
  • minGPT - Minimal PyTorch re-implementation of GPT for educational clarity. (Python)
  • mlcourse.ai - Open machine learning course with Jupyter notebook lessons. (Python)
  • public-apis - Curated list of free public APIs organized for developer reference. (?)
  • python-archetype - Python clean architecture template repository with CQRS and DDD patterns. (Python)
  • python-guide - Opinionated best-practices guide for Python developers. (Python)
  • python-mastery - David Beazley's advanced Python programming course materials. (Python)
  • python-mini-projects - Collection of small independent Python project scripts and applications. (Python)
  • python-template-repository - Template scaffold for structuring new Python projects. (Python)
  • python-weekly - Weekly curated content pipeline aggregating Python news and articles. (Python)
  • pytorch-tutorial - Beginner-friendly PyTorch tutorial series covering deep learning fundamentals. (Python)
  • spinningup - Educational resource for learning deep reinforcement learning algorithms from scratch. (Python)
  • stanford-tensorflow-tutorials - Collection of TensorFlow tutorials and examples from Stanford CS20 course. (Python)
  • system-design-primer - Educational resource teaching system design and object-oriented design patterns. (Python)

Uncategorized

  • mole_public - Undetermined repository with minimal architectural description. (?)
  • newton - Undetermined repository with minimal architectural description. (?)
  • OhunIslam - Undetermined repository with no architectural description available. (?)
  • OtakuWorld - Undetermined Android anime-related application with minimal description. (?)
  • pdc - Undetermined repository with no architectural description available. (?)
  • PerforatedAI - Undetermined repository with no architectural description available. (?)
  • phase1b - Undetermined repository with no architectural description available. (?)
  • product-attribute - Undetermined repository, likely Odoo product attribute addons. (?)
  • products - Likely .NET identity server sample rather than Python project. (?)
  • python_testbench - Undetermined repository with no architectural description available. (?)
  • snd-textmod - Unclassified repository with insufficient description to place in taxonomy. (?)
  • snipsnap - Unclassified repository with insufficient description to place in taxonomy. (?)
  • sparrow - Unclassified repository with insufficient description to place in taxonomy. (?)
  • the-matrix - Unclassified repository with insufficient description to place in taxonomy. (?)
  • trailarr - Unclassified repository with insufficient description to place in taxonomy. (?)
  • tsh - Unclassified repository with insufficient description to place in taxonomy. (?)
  • unit-converter - Unit conversion utility library. (?)
  • unleash - Unclassified repository with insufficient description to place in taxonomy. (?)
  • website - Unclassified repository with insufficient description to place in taxonomy. (?)

How diagrams are generated

flowchart LR
    Code[Source Code] --> CB
    subgraph CB [CodeBoarding]
        direction LR
        Clone[Clone repo] --> Static[Static analysis] --> LLM[LLM reasoning]
    end
    CB --> MD[architecture.md]

Every diagram is produced by running CodeBoarding, a local static-analysis and LLM-reasoning engine, over the repo at --depth-level 2. The engine parses real imports, call graphs, and module boundaries; the LLM only names and summarizes. No diagram is hand-drawn.
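
To give a feel for what "parses real imports" can mean in practice, here is a minimal sketch using only Python's standard `ast` module. It is not CodeBoarding's implementation: the function names `imported_modules` and `to_mermaid` and the toy `sources` mapping are hypothetical, and the real engine also works with call graphs and module boundaries, which this sketch ignores.

```python
from __future__ import annotations

import ast


def imported_modules(source: str) -> set[str]:
    """Return the top-level module names imported by a piece of Python source."""
    found: set[str] = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            # "import a.b" contributes the top-level name "a"
            found.update(alias.name.split(".")[0] for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            # absolute "from a.b import c" contributes "a"; relative imports are skipped
            found.add(node.module.split(".")[0])
    return found


def to_mermaid(sources: dict[str, str]) -> str:
    """Render import edges between the given modules as a Mermaid graph."""
    lines = ["graph LR"]
    for name in sorted(sources):
        for dep in sorted(imported_modules(sources[name])):
            # keep only edges between modules of the project itself
            if dep in sources and dep != name:
                lines.append(f"    {name} --> {dep}")
    return "\n".join(lines)


# Hypothetical three-module project, given as module name -> source text.
sources = {
    "cli": "import engine\nimport os\n",
    "engine": "from graph import build\n",
    "graph": "import json\n",
}
print(to_mermaid(sources))
```

Every edge in the output comes from an import statement in the source, so a diagram built this way cannot contain a dependency that the code does not declare. In this sketch the standard-library imports (`os`, `json`) are dropped because they are not part of the project mapping.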

Contribute

Found a mistake? Static analysis + LLMs aren't perfect. If a module is misnamed or a dependency is invented, open a PR on the .md.

Want your repo in here? Open an issue with:

  • the GitHub URL
  • one sentence on what the repo does
  • primary language

We prioritize repos that are (a) actively maintained, (b) widely depended on, or (c) architecturally interesting.

Running this on your own code? CodeBoarding runs locally. Point it at any repo, public or private.

About

A collection of system designs of popular open-source projects
