Last Updated: 4/8/2026
Pie Project Documentation
Welcome to the Pie project documentation! Pie is a high-performance, programmable LLM serving system that empowers you to design and deploy custom inference logic and optimization strategies.
Note 🧪
This software is in a pre-release stage and under active development. It’s recommended for testing and research purposes only.
Getting Started
Installation
Pie can be installed via PyPI, Docker, or built from source.
Option 1: Docker (Recommended)
docker pull pie-project/pie:latest
docker run -it pie-project/pie:latestOption 2: PyPI
pip install "pie-server[cuda]" # Linux/Windows
pip install "pie-server[metal]" # macOSOption 2: Build from Source (Recommended)
git clone https://github.com/pie-project/pie.git && cd pie/pie
# Recommended: use uv to sync (options: cu126, cu128, cu130, metal)
uv sync --extra cu128Quick Start
Run a test prompt (you will be prompted for configuration and model download if this is your first time):
pie run text-completion -- --prompt "Hello world!"
pie run beam-search -- --prompt "What is the capital of France?" --beam-size 2Note: The first run may take longer due to JIT compilation.
If built from source, prefix commands withuv run(e.g.,uv run pie config init).
Community
Issues & Bugs: Please report bugs on GitHub Issues .
Discussions: Have a question or feedback? Join us on GitHub Discussions .
Discord: Join our Discord server for real-time chat and support: discord.gg/pie-project
Resources
For more detailed information, check out the official documentation at https://pie-project.org/docs .
License
Pie is licensed under the Apache License 2.0 .