Installation
Docker images
It is recommended to use the existing release image directly. The image address is:
dsw-registry.<region>.cr.aliyuncs.com/pai/acc:r2.3.0-cuda12.1.0-py3.10
Replace <region> with one of the following as needed:
cn-beijing
cn-hangzhou
cn-wulanchabu
cn-shenzhen
cn-shanghai
cn-hongkong
ap-southeast-1
ap-southeast-3
us-east-1
Building from Source
Building from source requires compiling three code repositories: pytorch, torch_xla and torchacc.
Set up the environment
You need to use a CUDA-enabled image such as nvidia/cuda:12.1.1-cudnn8-devel-ubuntu20.04. Install C++, Bazel, and Python dependencies as specified in the docker/Dodckerfile.base file.
Alternatively, you can build directly using our release image mentioned earlier.
Clone the code
Clone the following three repositories and organize them as shown:
# the code structure
# pytorch/
# |---xla/
# torchacc/
git clone https://github.com/AlibabaPAI/torchacc.git
git clone https://github.com/AlibabaPAI/pytorch.git
cd pytorch
git clone https://github.com/AlibabaPAI/xla.git
Compile pytorch
cd pytorch && python setup.py develop
Compile torch_xla
cd xla && python setup.py develop
Compile torchacc
cd torchacc && python setup.py develop