首页 Paddle框架 帖子详情
如何在AWS的EC2中搭建PaddleDetection训练环境?
收藏
快速回复
Paddle框架 问答模型训练 679 1
如何在AWS的EC2中搭建PaddleDetection训练环境?
收藏
快速回复
Paddle框架 问答模型训练 679 1

问题1:AWS中的CUDA到底是哪个版本?

因为安装PaddlePaddle需要选择合适的CUDA版本,所以首先要弄清AWS中的EC2用的是哪个版本的CUDA。

用nvidia-smi查看,得到的是下面的信息:

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 450.142.00   Driver Version: 450.142.00   CUDA Version: 11.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla T4            On   | 00000000:00:1E.0 Off |                    0 |
| N/A   29C    P8     9W /  70W |      0MiB / 15109MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

看起来是CUDA11.0

但用nvcc --version查看,得到的是:

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2018 NVIDIA Corporation
Built on Sat_Aug_25_21:08:01_CDT_2018
Cuda compilation tools, release 10.0, V10.0.130

那么应该安装哪个版本的Paddlepaddle呢?

假设用CUDA11.0,那么使用下面的命令安装

先激活一个环境,就“借宿”在Tensorflow2中吧:

conda activate tensorflow2_p37

然后用下面的命令安装Paddlepaddle

python -m pip install paddlepaddle-gpu==2.1.3.post110 -f https://www.paddlepaddle.org.cn/whl/linux/mkl/avx/stable.html

这步可以正常安装

然后安装PaddleDetection:

git clone https://github.com/PaddlePaddle/PaddleDetection.git
cd PaddleDetection
python setup.py install

问题2:

安装PaddleDetection的时候出现如下错误:

shellcheck-py 0.7.2.1 is already the active version in easy-install.pth
Traceback (most recent call last):
  File "setup.py", line 133, in 
    ext_modules=[])
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/__init__.py", line 153, in setup
    return distutils.core.setup(**attrs)
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/distutils/core.py", line 148, in setup
    dist.run_commands()
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/distutils/dist.py", line 966, in run_commands
    self.run_command(cmd)
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/distutils/dist.py", line 985, in run_command
    cmd_obj.run()
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/install.py", line 67, in run
    self.do_egg_install()
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/install.py", line 117, in do_egg_install
    cmd.run(show_deprecation=False)
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 408, in run
    self.easy_install(spec, not self.no_deps)
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 650, in easy_install
    return self.install_item(None, spec, tmpdir, deps, True)
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 697, in install_item
    self.process_distribution(spec, dist, deps)
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 755, in process_distribution
    self.easy_install(dist.as_requirement())
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 669, in easy_install
    return self.install_item(spec, dist.location, tmpdir, deps)
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 700, in install_item
    self.process_distribution(spec, dists[0], deps, "Using")
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 725, in process_distribution
    self.install_egg_scripts(dist)
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/setuptools/command/easy_install.py", line 599, in install_egg_scripts
    dist.get_metadata('scripts/' + script_name)
  File "/home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/pkg_resources/__init__.py", line 1409, in get_metadata
    return value.decode('utf-8')
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xfb in position 41: invalid start byte in scripts/shellcheck file at path: /home/ubuntu/anaconda3/envs/tensorflow2_p37/lib/python3.7/site-packages/shellcheck_py-0.7.2.1-py3.7-linux-x86_64.egg/EGG-INFO/scripts/shellcheck

不知怎么进行了。

谢谢指教~~

 

0
收藏
回复
全部评论(1)
时间顺序
FutureSI
#2 回复于2021-10

这个报错安装脚本utf8解码有问题。试试pddt有直接用pip安装的方法么?比如 pip install -r requirements.txt 那种

0
回复
需求/bug反馈?一键提issue告诉我们
发现bug?如果您知道修复办法,欢迎提pr直接参与建设飞桨~
在@后输入用户全名并按空格结束,可艾特全站任一用户