How do I set up a PaddleDetection training environment on AWS EC2?
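Problem 1: Which CUDA version does the AWS instance actually have?
Installing PaddlePaddle requires choosing the build that matches the CUDA version, so the first step is to find out which CUDA version the EC2 instance uses.
Running nvidia-smi gives the following: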
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 450.142.00   Driver Version: 450.142.00   CUDA Version: 11.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla T4            On   | 00000000:00:1E.0 Off |                    0 |
| N/A   29C    P8     9W /  70W |      0MiB / 15109MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
That looks like CUDA 11.0.
But nvcc --version reports:
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2018 NVIDIA Corporation
Built on Sat_Aug_25_21:08:01_CDT_2018
Cuda compilation tools, release 10.0, V10.0.130
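The two tools report different things. The "CUDA Version" in the nvidia-smi header is the highest CUDA version the installed driver supports, while nvcc --version reports the CUDA toolkit whose nvcc happens to be first on PATH. A driver that supports 11.0 can therefore coexist with a 10.0 toolkit. The following is a minimal sketch that pulls both numbers out of the pasted text; the function names are made up for illustration, and whether the CUDA 11.0 runtime libraries are actually present on this instance is not something the outputs above confirm.

```python
import re


def driver_cuda_version(nvidia_smi_text):
    """Return the 'CUDA Version' shown in the nvidia-smi header, or None."""
    match = re.search(r"CUDA Version:\s*(\d+\.\d+)", nvidia_smi_text)
    return match.group(1) if match else None


def toolkit_cuda_version(nvcc_text):
    """Return the toolkit release reported by `nvcc --version`, or None."""
    match = re.search(r"release\s+(\d+\.\d+)", nvcc_text)
    return match.group(1) if match else None


# Sample lines copied from the two outputs in this post.
smi_line = "| NVIDIA-SMI 450.142.00 Driver Version: 450.142.00 CUDA Version: 11.0 |"
nvcc_line = "Cuda compilation tools, release 10.0, V10.0.130"

print("driver supports up to CUDA", driver_cuda_version(smi_line))
print("nvcc on PATH belongs to CUDA", toolkit_cuda_version(nvcc_line))
```

Read this way, the driver's number is an upper bound and the nvcc number only describes one installed toolkit, so the PaddlePaddle build should match a toolkit that is really installed and no newer than what the driver supports.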
So which version of PaddlePaddle should I install?
Assuming CUDA 11.0, the installation goes as follows.
First activate an environment; I just "borrowed" the TensorFlow 2 one:
conda activate tensorflow2_p37
Then install PaddlePaddle with this command:
python -m pip install paddlepaddle-gpu==2.1.3.post110 -f https://www.paddlepaddle.org.cn/whl/linux/mkl/avx/stable.html
This step installs without problems.
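A successful pip install does not by itself show that the wheel can use the GPU. As a rough check, assuming the environment activated above, something like the following could be run; check_paddle is a hypothetical helper name, and it relies on paddle.utils.run_check(), PaddlePaddle's own self-test.

```python
import importlib.util


def check_paddle():
    """Report whether PaddlePaddle is importable and built with CUDA."""
    if importlib.util.find_spec("paddle") is None:
        return "paddle is not installed in this environment"
    import paddle

    status = "paddle {} (compiled with CUDA: {})".format(
        paddle.__version__, paddle.is_compiled_with_cuda()
    )
    # run_check() runs a small test program and prints whether the install works.
    paddle.utils.run_check()
    return status


print(check_paddle())
```

If run_check() complains about missing CUDA or cuDNN libraries, that would suggest the chosen wheel does not match what is installed on the instance.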
Next, install PaddleDetection:
git clone https://github.com/PaddlePaddle/PaddleDetection.git
cd PaddleDetection
python setup.py install
Problem 2:
Installing PaddleDetection fails with the following error:
shellcheck-py 0.7.2.1 is already the active version in easy-install.pth
Traceback (most recent call last):
File "setup.py", line 133, in <module>
ext_modules=[])
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/__init__.py", line 153, in setup
return distutils.core.setup(**attrs)
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/distutils/core.py", line 148, in setup
dist.run_commands()
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/distutils/dist.py", line 966, in run_commands
self.run_command(cmd)
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/distutils/dist.py", line 985, in run_command
cmd_obj.run()
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/install.py", line 67, in run
self.do_egg_install()
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/install.py", line 117, in do_egg_install
cmd.run(show_deprecation=False)
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 408, in run
self.easy_install(spec, not self.no_deps)
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 650, in easy_install
return self.install_item(None, spec, tmpdir, deps, True)
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 697, in install_item
self.process_distribution(spec, dist, deps)
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 755, in process_distribution
self.easy_install(dist.as_requirement())
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 669, in easy_install
return self.install_item(spec, dist.location, tmpdir, deps)
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 700, in install_item
self.process_distribution(spec, dists[0], deps, "Using")
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 725, in process_distribution
self.install_egg_scripts(dist)
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 599, in install_egg_scripts
dist.get_metadata('scripts/' + script_name)
File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/pkg_resources/__init__.py", line 1409, in get_metadata
return value.decode('utf-8')
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xfb in position 41: invalid start byte in scripts/shellcheck file at path: /home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/shellcheck_py-0.7.2.1-py3.7-linux-x86_64.egg/EGG-INFO/scripts/shellcheck
I don't know how to proceed from here.
Thanks in advance for any advice!
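This error means the install script ran into a UTF-8 decoding problem. Does PaddleDetection have a way to install directly with pip? Try something like pip install -r requirements.txt.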
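To see why the decode fails: the traceback ends in get_metadata calling value.decode('utf-8') on the scripts/shellcheck file, and byte 0xfb can never start a UTF-8 sequence. One plausible reading, which is a guess rather than something the traceback proves, is that this file is a binary executable rather than a text script. The sketch below shows the check on two stand-in temporary files; is_utf8_text is a hypothetical helper, not part of setuptools.

```python
import os
import tempfile


def is_utf8_text(path):
    """Return True if the whole file decodes as UTF-8, else False."""
    with open(path, "rb") as f:
        data = f.read()
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Stand-in files for the demonstration (not the real shellcheck file).
with tempfile.TemporaryDirectory() as tmp:
    text_path = os.path.join(tmp, "script.sh")
    binary_path = os.path.join(tmp, "binary")
    with open(text_path, "wb") as f:
        f.write(b"#!/bin/sh\necho hello\n")
    with open(binary_path, "wb") as f:
        f.write(b"\x7fELF\xfb\x00\x01")  # contains the byte 0xfb
    text_ok = is_utf8_text(text_path)
    binary_ok = is_utf8_text(binary_path)

print("text file decodes:", text_ok)
print("binary file decodes:", binary_ok)
```

To apply it to the real case, pass the path printed on the last line of the traceback. If that file turns out not to be UTF-8 text, the failure comes from the already-installed shellcheck-py egg being reprocessed by easy_install, not from PaddleDetection's own code, which is consistent with the suggestion to go through pip instead.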