Build PyTorch from source

Hardware and OS

Nvidia Tesla K40c
Ubuntu 20.04 server

Dependencies

gcc 10.4
Nvidia Driver 470
cuda 11.4.4
cudnn 8.7.0

Setup toolkit

The steps below assume a freshly installed Ubuntu 20.04 server.

Install gcc-10:

sudo apt install gcc-10 g++-10
# Shell aliases are not visible to CMake, so export the compilers instead.
export CC=gcc-10
export CXX=g++-10

Install Nvidia Driver:

sudo apt install nvidia-driver-470-server # or install the driver another way

Install CUDA Toolkit 11.4.4:

wget https://developer.download.nvidia.com/compute/cuda/11.4.4/local_installers/cuda_11.4.4_470.82.01_linux.run
sudo sh cuda_11.4.4_470.82.01_linux.run

In the installer menu, deselect the Driver component, because the driver was already installed through apt.

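To verify the CUDA installation, build and run the deviceQuery sample. The sample installer script lives in /usr/local/cuda/bin, so add that directory to PATH first if the command is not found: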

cuda-install-samples-11.4.sh ~
cd ~/NVIDIA_CUDA-11.4_Samples
make
./bin/x86_64/linux/release/deviceQuery

The output should end with "Result = PASS".
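As an extra check on the toolkit version, the release reported by nvcc can be read from a script. This is a minimal sketch: the helper names `parse_nvcc_release` and `installed_nvcc_release` are made up for this example, and the fallback path assumes the default /usr/local/cuda install location.

```python
import re
import shutil
import subprocess


def parse_nvcc_release(text):
    """Extract (major, minor) from the output of `nvcc --version`."""
    match = re.search(r"release (\d+)\.(\d+)", text)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def installed_nvcc_release():
    """Run nvcc and return its release tuple, or None if nvcc cannot be run."""
    # Assumption: the toolkit was installed to the default /usr/local/cuda.
    nvcc = shutil.which("nvcc") or "/usr/local/cuda/bin/nvcc"
    try:
        result = subprocess.run(
            [nvcc, "--version"], capture_output=True, text=True, check=True
        )
    except (OSError, subprocess.CalledProcessError):
        return None
    return parse_nvcc_release(result.stdout)


if __name__ == "__main__":
    print("nvcc release:", installed_nvcc_release())
```

For the toolkit installed above, the reported release should be (11, 4).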

Install cudnn 8.7.0:

Download the archive from the NVIDIA website, then extract it and copy the files into the CUDA directory:

tar -xvf cudnn-linux-x86_64-8.7.0.84_cuda11-archive.tar.xz
cd cudnn-linux-x86_64-8.7.0.84_cuda11-archive/
sudo cp ./include/cudnn*.h /usr/local/cuda/include
sudo cp -P ./lib/libcudnn* /usr/local/cuda/lib64/
sudo chmod a+r /usr/local/cuda/include/cudnn*.h /usr/local/cuda/lib64/libcudnn*
echo 'export PATH=/usr/local/cuda/bin:$PATH' >> ~/.bashrc
echo 'export LD_LIBRARY_PATH=/usr/local/cuda/lib64:$LD_LIBRARY_PATH' >> ~/.bashrc
source ~/.bashrc

To verify that your cudnn is installed successfully, use the following commands:

sudo apt-get install libfreeimage3 libfreeimage-dev
git clone https://github.com/workmirror/cudnn_samples_v8.git # NVIDIA ships the samples only in the deb package, so this mirror is used instead
cd cudnn_samples_v8/mnistCUDNN/
sudo make clean
sudo make
./mnistCUDNN

The output should end with "Test passed!".
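To confirm which cuDNN headers were copied, the version macros in cudnn_version.h can be read directly. This is a minimal sketch: the helper name `parse_cudnn_version` is made up for this example, and the header path assumes the copy destination used above.

```python
import re
from pathlib import Path

# Assumption: the headers were copied to /usr/local/cuda/include as shown above.
HEADER = Path("/usr/local/cuda/include/cudnn_version.h")


def parse_cudnn_version(header_text):
    """Return (major, minor, patch) from cudnn_version.h text, or None."""
    parts = []
    for name in ("CUDNN_MAJOR", "CUDNN_MINOR", "CUDNN_PATCHLEVEL"):
        match = re.search(
            rf"^\s*#define\s+{name}\s+(\d+)", header_text, re.MULTILINE
        )
        if match is None:
            return None
        parts.append(int(match.group(1)))
    return tuple(parts)


if __name__ == "__main__":
    if HEADER.exists():
        print("cuDNN headers:", parse_cudnn_version(HEADER.read_text()))
    else:
        print("cudnn_version.h not found at", HEADER)
```

With the archive used in this guide, the result should be (8, 7, 0).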

Setup requirements

Install miniconda:

wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
bash ./Miniconda3-latest-Linux-x86_64.sh

Create and activate a conda environment:

conda create -n build python=3.9 # or any other version >= 3.8
conda activate build

Get the PyTorch source code:

git clone --recursive https://github.com/pytorch/pytorch
cd pytorch
git checkout v2.1.0 # or another version you want
git submodule sync
git submodule update --init --recursive

Prepare

conda install cmake ninja
python -m pip install -r requirements.txt
conda install intel::mkl-static intel::mkl-include
conda install -c pytorch magma-cuda113 # there is no magma-cuda114 package, so use magma-cuda113 instead

Build

export CMAKE_PREFIX_PATH=${CONDA_PREFIX:-"$(dirname $(which conda))/../"}
export MAX_JOBS=12 # raise this value if you have more than 16 GB of memory
export TORCH_CUDA_ARCH_LIST="3.5" # compute capability 3.5; 3.0 is not supported by nvcc in CUDA 11.4.4
python setup.py develop # start the build
python setup.py bdist_wheel # build a wheel package

To build PyTorch for devices with a different CUDA compute capability, install the versions of the CUDA Toolkit and cuDNN that support that capability (see the reference). For example, supporting older Kepler (compute capability 3.0) devices requires CUDA 10.x (not tested).
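After the build finishes, it can help to check that the installed PyTorch was compiled for the GPU in the machine. This is a minimal sketch: `describe_torch_build` and `arch_supported` are helper names made up for this example, and the capability check is a simplification that only looks for an exact `sm_XY` entry and ignores PTX forward compatibility.

```python
def arch_supported(capability, arch_list):
    """Check whether a (major, minor) capability has a matching sm_XY entry."""
    major, minor = capability
    return f"sm_{major}{minor}" in arch_list


def describe_torch_build():
    """Summarize the installed PyTorch build, or return None if torch is missing."""
    try:
        import torch
    except ImportError:
        return None
    info = {
        "torch": torch.__version__,
        "cuda_toolkit": torch.version.cuda,
        "cuda_available": torch.cuda.is_available(),
        # Architectures the binary was compiled for, e.g. ["sm_35"].
        "arch_list": torch.cuda.get_arch_list(),
    }
    if info["cuda_available"]:
        capability = torch.cuda.get_device_capability(0)
        info["device"] = torch.cuda.get_device_name(0)
        info["capability"] = capability
        info["arch_supported"] = arch_supported(capability, info["arch_list"])
    return info


if __name__ == "__main__":
    print(describe_torch_build())
```

On the Tesla K40c used in this guide, the capability should be (3, 5), and a build made with TORCH_CUDA_ARCH_LIST="3.5" should list sm_35.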

Performance Test log

Test Platform: python 3.9 + pytorch 2.2.2 + cuda 11.8 + cudnn 8.7.0
The following test log is based on benchmark_cnn_v0.1

Nvidia
- RTX 3090 24GB Driver 550
  - Image: 1447740
  - Size: 16976.75 MB
  - Score: 44749
- RTX 4090 24GB Driver 535
  - Image: 1445570
  - Size: 16951.30 MB
  - Score: 43919 ?
- RTX 4090 24GB Driver 550
  - Image: 1445220
  - Size: 16947.20 MB
  - Score: 73814
- RTX 4090 D 24GB Driver 550
  - Image: 1445220
  - Size: 16947.20 MB
  - Score: 65718
- L20 48GB Driver 550
  - Image: 2903670
  - Size: 34049.54 MB
  - Score: 68063
- P104-100 4GB Driver 525
  - Image: 240939
  - Size: 2825.34 MB
  - Score: 12812
- P104-100 8GB Driver 536
  - Image: 511600
  - Size: 5999.22 MB
  - Score: 12428
- Tesla M40 12GB Driver 470
  - Image: 698500
  - Size: 8190.88 MB
  - Score: 13038
- RTX 2080Ti 11GB Driver 470
  - Image: 682200
  - Size: 8000 MB
  - Score: 28999
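
The scores above are easier to compare when expressed relative to one card. The sketch below copies a few scores from the log and divides them by a baseline. The helper name `relative_scores` and the choice of the RTX 3090 as baseline are only for illustration.

```python
# Scores copied from the log above (a subset, for brevity).
SCORES = {
    "RTX 3090 24GB (Driver 550)": 44749,
    "RTX 4090 24GB (Driver 550)": 73814,
    "RTX 4090 D 24GB (Driver 550)": 65718,
    "L20 48GB (Driver 550)": 68063,
    "Tesla M40 12GB (Driver 470)": 13038,
    "RTX 2080Ti 11GB (Driver 470)": 28999,
}


def relative_scores(scores, baseline):
    """Return each score divided by the baseline card's score."""
    base = scores[baseline]
    return {name: score / base for name, score in scores.items()}


if __name__ == "__main__":
    ratios = relative_scores(SCORES, "RTX 3090 24GB (Driver 550)")
    for name, ratio in sorted(ratios.items(), key=lambda item: -item[1]):
        print(f"{name}: {ratio:.2f}x")
```

With these numbers, the RTX 4090 on Driver 550 comes out at about 1.65 times the RTX 3090 score.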

xiaoran007/Old-GPUs-DL
