# pascal-pkgs-ci
**Repository Path**: wangbaochao/pascal-pkgs-ci
## Basic Information
- **Project Name**: pascal-pkgs-ci
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: MIT
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 1
- **Created**: 2026-05-16
- **Last Updated**: 2026-05-16
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
# pascal-pkgs-ci
The main repository for building Pascal-compatible versions of ML applications and libraries.
1. vLLM `0.5.5`, `0.6.0`, `0.6.1`, `0.6.1.post1`, `0.6.1.post`, `0.6.2`, `0.6.3`, `0.6.3.post1`, `0.6.4`, `0.6.4.post1`, `0.6.5`, `0.6.6`, `0.6.6.post1`, `0.7.0`, `0.7.1`, `0.7.2`, `0.7.3`, `0.8.0`, `0.8.1`, `0.8.2`, `0.8.3`, `0.8.4`, `0.8.5`, `0.9.0`, `0.9.1`, `0.9.2`, `0.10.0` and `main` (nightly, updates daily) are available in this repository.
2. Triton `2.2.0`, `2.3.0`, `2.3.1`, `3.0.0`, `3.1.0`, `3.2.0`, `3.3.0`, `3.3.1`, `3.4.0` are available in this repository.
> [!IMPORTANT]
> **WARNING:** Support for new GPUs has been disabled (`v0.7.0`+/`main`)
>
> Due to the increase in vLLM code amount, binary size, and build speed, it is now impractical to build vLLM for all GPU architectures.
> To use vLLM on a heterogeneous machine or cluster, use the official version of vLLM for non-Pascal GPUs and this version for Pascal GPUs and use tensor or pipeline parallelism to connect instances.
>
> Note that this change only affects versions above `v0.7.0` (including `main`).
## Installation (docker)
### [vllm](https://github.com/vllm-project/vllm)
```sh
# Pull the vLLM image
docker pull ghcr.io/sasha0552/vllm:v0.10.0 # you can omit the version specifier
# to install nightly version
# You can now follow the official vLLM documentation.
# Replace the official image with this one.
```
## Installation (manual)
> [!WARNING]
> Wheels, as of v0.6.5, is currently in a soft-broken state due to PyTorch.
> To use them, you need to manually patch PyTorch after installation of vLLM.
>
>
> Patching PyTorch
>
> Example command assuming you are using a virtual environment located in the current directory
>
> ```sh
> sed -e "s/.major < 7/.major < 6/g" \
> -e "s/.major >= 7/.major >= 6/g" \
> -i \
> venv/lib/python3.12/site-packages/torch/_inductor/scheduler.py \
> venv/lib/python3.12/site-packages/torch/utils/_triton.py
> ```
>
I recommend installing [transient-package](https://pypi.org/project/transient-package) before proceeding. It simplifies the installation of `triton`.
You can install it globally with `pipx`:
```sh
pipx install transient-package
```
> [!IMPORTANT]
>
> If you don't want to install transient-package
>
> If you don't want to install `transient-package`, you'll need to replace
>
> ```sh
> transient-package install \
> --interpreter venv/bin/python \
> --source triton \
> --target triton-pascal
> ```
>
> with
>
> ```sh
> # Remove triton
> pip uninstall triton
>
> # Install patched triton
> pip install triton-pascal
> ```
>
> Note that `transient-package` does more than just `pip uninstall triton` and `pip install triton-pascal`.
> In particular, it tries to install the correct version of `triton`, and creates a bogus `triton` package in case the application checks for the presence of `triton`.
>
### [vllm](https://github.com/vllm-project/vllm)
```sh
# Use this repository
export PIP_EXTRA_INDEX_URL="https://sasha0552.github.io/pascal-pkgs-ci/"
# Create virtual environment
python -m venv venv
# Activate virtual environment
source venv/bin/activate
# Install vLLM
pip3 install vllm-pascal==0.10.0 # you can omit the version specifier
# to install nightly version
# Install patched triton
transient-package install \
--interpreter venv/bin/python \
--source triton \
--target triton-pascal
# Launch vLLM
vllm serve --help
```
### [aphrodite-engine](https://github.com/PygmalionAI/aphrodite-engine)
```sh
# Use this repository
export PIP_EXTRA_INDEX_URL="https://sasha0552.github.io/pascal-pkgs-ci/"
# Create virtual environment
python3 -m venv venv
# Activate virtual environment
source venv/bin/activate
# Install aphrodite-engine
pip3 install --extra-index-url https://downloads.pygmalion.chat/whl aphrodite-engine
# Install patched triton
transient-package install \
--interpreter venv/bin/python \
--source triton \
--target triton-pascal
# Launch aphrodite-engine
aphrodite --help
```
### [triton](https://github.com/triton-lang/triton) (for other applications)
```sh
# Use this repository
export PIP_EXTRA_INDEX_URL="https://sasha0552.github.io/pascal-pkgs-ci/"
# Install patched triton
transient-package install \
--interpreter venv/bin/python \
--source triton \
--target triton-pascal
```
---
Instructions for uploading to PyPI
```sh
# Download artifacts
gh run download
# Install twine
pip3 install twine
# Upload wheels
TWINE_PASSWORD= twine upload */*.whl
```