Previous Releases

For latest (...), use uv pip install vllm

Loading releases...

Installation Notes

  • • Commands use uv for faster installation. Replace uv pip with pip if needed
  • • Available platforms are fetched from the wheels index. Not all CUDA versions are available for all releases
  • vLLM < v0.10.0: Recommended to use Python 3.12 (uv venv --python 3.12)
  • vLLM v0.9.x: These versions require transformers < 4.54.0
  • vLLM < v0.6.0: These versions have additional dependency issues that require manual fixes
  • CPU: Requires glibc ≥ 2.35 (Ubuntu 22.04+, Debian 12+, RHEL 9+)
  • • For the latest version, use uv pip install vllm
  • • For more installation options, see installation docs