在嵌入式开发板上部署深度神经网络（一）：将 PyTorch 模型转换为 NCNN 模型

博主： Angus
发布时间：2023 年 06 月 24 日
2009 次浏览
暂无评论
4857字数
分类： Geek 在嵌入式开发板上部署深度神经网络

在实际场景中，深度学习模型通常通过 PyTorch、TensorFlow 等框架来完成，直接通过这些模型来进行推理效率并不高，特别是对延时要求严格的线上场景。由此，经过工业界和学术界数年的探索，模型部署有了一条流行的流水线：

这一条流水线解决了模型部署中的两大难点：使用对接深度学习框架和推理引擎的中间表示，开发者不必担心如何在新环境中运行各个复杂的框架；通过中间表示的网络结构优化和推理引擎对运算的底层优化，模型的运算效率大幅提升。

在本文中，我们尝试以 ONNX 模型作为中介将 PyTorch 模型转换为 NCNN 模型。

一、将 PyTorch 模型转换为 ONNX 模型

1.1 配置环境

配置 PyTorch 环境的过程可以参考官方网站。

安装 ONNX Runtime 和 ONNX Smiplifier：

pip install onnxruntime
pip install onnx-simplifier

1.2 转换模型

在准备好相关环境后，使用以下 Python 代码即可将 PyTorch 模型转换为 ONNX 模型：

import torch.onnx
import torch
import timm


dummy_input = torch.randn(1, 3, 299, 299)

model = timm.create_model('inception_v3', num_classes=1000, pretrained=True)
model.eval()

torch.onnx.export(model,                     # model being run
                  dummy_input,               # model input (or a tuple for multiple inputs)
                  "./out/inception_v3.onnx", # where to save the model (can be a file or file-like object)
                  export_params=True,        # store the trained parameter weights inside the model file

                  do_constant_folding=True,  # whether to execute constant folding for optimization
                  input_names = ['input'],   # the model's input names
                  output_names = ['output'], # the model's output names
                 )

为了避免出现运行失败的情况，在将 ONNX 模型转换为 NCNN 模型前，我们需要简化ONNX 模型：

onnxsim ./out/inception_v3.onnx ./out/inception_v3-sim.onnx

1.3 验证模型

在转换后，需要对比 PyTorch 模型和转换后得到的 ONNX 模型的推理结果是否一致，以验证转换是否成功，可以使用以下代码：

import torch
import onnxruntime
import numpy as np
import cv2
import timm

#test image
img_path = "59204d6bb0c234eb.png"
img = cv2.imread(img_path)
img = cv2.resize(img, (299, 299))
img = np.transpose(img, (2, 0, 1)).astype(np.float32)
img = torch.from_numpy(img)
img = img.unsqueeze(0)

# pytorch test
model = timm.create_model('inception_v3', num_classes=1000, pretrained=True)
model.eval()

# pytorch test
output = model(img)
val, cls = torch.max(output.data, 1)
print("[PyTorch]--->predicted class:", cls.item())
print("[PyTorch]--->predicted value:", val.item())

# onnx test
sess = onnxruntime.InferenceSession("./out/inception_v3-sim.onnx")
x = "input"
y = ["output"]
output = sess.run(y, {x : img.numpy()})
cls = np.argmax(output[0][0], axis=0)
val = output[0][0][cls]
print("[ONNX]--->predicted class:", cls)
print("[ONNX]--->predicted value:", val)

可见，PyTorch 模型和 ONNX 模型的推理结果一致，说明转换成功：

二、将 ONNX 模型转换为 NCNN 模型

拉取 NCNN 代码：

git clone https://github.com/Tencent/ncnn.git

安装相关依赖：

sudo apt update
sudo apt install build-essential git cmake libprotobuf-dev protobuf-compiler libvulkan-dev vulkan-utils libopencv-dev

编译 Vulkan 后端：

wget https://sdk.lunarg.com/sdk/download/1.2.182.0/linux/vulkansdk-linux-x86_64-1.2.182.0.tar.gz
tar xvf vulkansdk-linux-x86_64-1.2.182.0.tar.gz
export VULKAN_SDK=$(pwd)/1.2.182.0/x86_64

拉取 NCNN 子模块：

cd ncnn
git submodule update --init

编译 NCNN：

mkdir -p build-x86
cd build-x86
cmake -DCMAKE_BUILD_TYPE=Release -DNCNN_VULKAN=ON -DNCNN_SYSTEM_GLSLANG=ON -DNCNN_BUILD_EXAMPLES=ON ..
make -j$(nproc)

如果使用cmake命令时提示：CMake 3.14.0 or higher is required. You are running version 3.10.2，则需要升级cmake，可以执行以下命令：

sudo apt remove cmake  # 删除旧版本的cmake

sudo apt-get install build-essential libssl-dev
wget https://github.com/Kitware/CMake/releases/download/v3.20.0/cmake-3.20.0.tar.gz
tar -zxvf cmake-3.20.0.tar.gz
cd cmake-3.20.0
./bootstrap
make
sudo make install

cmake --version

编译结束之后会在 ./build-x86/tools/onnx/下得到 onnx2ncnn可执行文件，即将 ONNX 模型转换为 NCNN 模型的转换工具。将上面得到的 inception_v3-sim.onnx拷贝至该目录，执行命令：

cd tools/onnx/
./onnx2ncnn inception_v3-sim.onnx inception_v3.param inception_v3.bin

即可转换得到 NCNN 模型。

参考资料

[1] pytorch-＞onnx-＞ncnn模型移植

[2] pytorch转ncnn及其测试

[3] V831上部署resnet18分类网络

[4] onnx2ncnn并在pc端调用ncnn模型

另外，本文只是最基本的部署教程，如果涉及到复杂操作（例如自定义 ONNX 中没有的算子），建议阅读由 MMDeploy 提供的文档：操作概述、第一章：模型部署简介等。

版权属于：Angus
本文链接：https://blog.angustar.com/archives/deploying-DNNs-on-embedded-development-boards-1.html
所有原创文章采用知识共享署名-非商业性使用 4.0 国际许可协议进行许可。您可以自由的转载和修改，但请务必注明文章来源并且不可用于商业目的。

最后修改：2023 年 07 月 29 日

如果觉得我的文章对你有用，请随意赞赏

发表评论取消回复
本站使用 Cookie 技术以便您下次快速评论，继续评论表示您已同意该条款。请您在评论时尽量使用 QQ 邮箱，邮箱信息将严格保密。

评论 *

私密评论

名称 *

🎲

邮箱 *

地址

碱式碳酸铜
博主，自建可以添加密钥之类的吗，担心服务器被人白嫖
pcuveqxlhv
文章紧扣主题，观点鲜明，展现出深刻的思考维度。
bstmvqhdcq
选材新颖独特，通过细节描写赋予主题鲜活生命力。
zvqqssfgjk
作者以简洁明了的语言，传达了深刻的思想和情感。
dgjprgydgq
作者的布局谋篇匠心独运，让读者在阅读中享受到了思维的乐趣。

在嵌入式开发板上部署深度神经网络（一）：将 PyTorch 模型转换为 NCNN 模型

Angus • 2023 年 06 月 24 日

在实际场景中，深度学习模型通常通过 PyTorch、TensorFlow 等框架来完成，直接通过这些模型来进行推理效率并不高，特别是对延时要求严格的线上场景。由此，经过工业界和学术界数年的探索，模型部署有了一条流行的流水线：
<img src="https://blog.angustar.com/usr/themes/handsome/assets/img/loading.svg" alt="" style=""data-original="https://blog.angustar.com/usr/uploads/2023/06/821561550.png">
这一条流水线解决了模型部署中的两大难点：使用对接深度学习框架和推理引擎的中间表示，开发者不必担心如何在新环境中运行各个复杂的框架；通过中间表示的网络结构优化和推理引擎对运算的底层优化，模型的运算效率大幅提升。
在本文中，我们尝试以 <a class="no-external-link" href="https://onnx.ai/" target="_blank">ONNX 模型</a>作为中介将 <a class="no-external-link" href="https://pytorch.org/" target="_blank">PyTorch 模型</a>转换为 <a class="no-external-link" href="https://github.com/Tencent/ncnn" target="_blank">NCNN 模型</a>。
<h1>一、将 PyTorch 模型转换为 ONNX 模型</h1>
<h2>1.1 配置环境</h2>
配置 PyTorch 环境的过程可以参考<a class="no-external-link" href="https://pytorch.org/" target="_blank">官方网站</a>。
安装 ONNX Runtime 和 ONNX Smiplifier：
<pre><code class="language-shell">pip install onnxruntime
pip install onnx-simplifier</code></pre>
<h2>1.2 转换模型</h2>
在准备好相关环境后，使用以下 Python 代码即可将 PyTorch 模型转换为 ONNX 模型：
<pre><code class="language-python">import torch.onnx
import torch
import timm

dummy_input = torch.randn(1, 3, 299, 299)

model = timm.create_model('inception_v3', num_classes=1000, pretrained=True)
model.eval()

torch.onnx.export(model,                     # model being run
                  dummy_input,               # model input (or a tuple for multiple inputs)
                  "./out/inception_v3.onnx", # where to save the model (can be a file or file-like object)
                  export_params=True,        # store the trained parameter weights inside the model file

do_constant_folding=True, # whether to execute constant folding for optimization
 input_names = ['input'], # the model's input names
 output_names = ['output'], # the model's output names
 )</code></pre>
为了避免出现运行失败的情况，在将 ONNX 模型转换为 NCNN 模型前，我们需要简化ONNX 模型：
<pre><code class="language-shell">onnxsim ./out/inception_v3.onnx ./out/inception_v3-sim.onnx</code></pre>
<h2>1.3 验证模型</h2>
在转换后，需要对比 PyTorch 模型和转换后得到的 ONNX 模型的推理结果是否一致，以验证转换是否成功，可以使用以下代码：
<pre><code class="language-python">import torch
import onnxruntime
import numpy as np
import cv2
import timm

#test image
img_path = "59204d6bb0c234eb.png"
img = cv2.imread(img_path)
img = cv2.resize(img, (299, 299))
img = np.transpose(img, (2, 0, 1)).astype(np.float32)
img = torch.from_numpy(img)
img = img.unsqueeze(0)

# pytorch test
model = timm.create_model('inception_v3', num_classes=1000, pretrained=True)
model.eval()

# pytorch test
output = model(img)
val, cls = torch.max(output.data, 1)
print("[PyTorch]---&gt;predicted class:", cls.item())
print("[PyTorch]---&gt;predicted value:", val.item())

# onnx test
sess = onnxruntime.InferenceSession("./out/inception_v3-sim.onnx")
x = "input"
y = ["output"]
output = sess.run(y, {x : img.numpy()})
cls = np.argmax(output[0][0], axis=0)
val = output[0][0][cls]
print("[ONNX]---&gt;predicted class:", cls)
print("[ONNX]---&gt;predicted value:", val)
</code></pre>
可见，PyTorch 模型和 ONNX 模型的推理结果一致，说明转换成功：
<img src="https://blog.angustar.com/usr/themes/handsome/assets/img/loading.svg" alt="" style=""data-original="https://blog.angustar.com/usr/uploads/2023/06/743524568.jpg">
<h1>二、将 ONNX 模型转换为 NCNN 模型</h1>
拉取 NCNN 代码：
<pre><code class="language-shell">git clone https://github.com/Tencent/ncnn.git</code></pre>
安装相关依赖：
<pre><code class="language-shell">sudo apt update
sudo apt install build-essential git cmake libprotobuf-dev protobuf-compiler libvulkan-dev vulkan-utils libopencv-dev</code></pre>
编译 Vulkan 后端：
<pre><code class="language-shell">wget https://sdk.lunarg.com/sdk/download/1.2.182.0/linux/vulkansdk-linux-x86_64-1.2.182.0.tar.gz
tar xvf vulkansdk-linux-x86_64-1.2.182.0.tar.gz
export VULKAN_SDK=$(pwd)/1.2.182.0/x86_64</code></pre>
拉取 NCNN 子模块：
<pre><code class="language-shell">cd ncnn
git submodule update --init</code></pre>
编译 NCNN：
<pre><code class="language-shell">mkdir -p build-x86
cd build-x86
cmake -DCMAKE_BUILD_TYPE=Release -DNCNN_VULKAN=ON -DNCNN_SYSTEM_GLSLANG=ON -DNCNN_BUILD_EXAMPLES=ON ..
make -j$(nproc)</code></pre>
<div class="tip inlineBlock warning">

如果使用<code>cmake</code>命令时提示：<code>CMake 3.14.0 or higher is required. You are running version 3.10.2</code>，则需要升级<code>cmake</code>，可以执行以下命令：
<pre><code class="language-shell">sudo apt remove cmake # 删除旧版本的cmake

sudo apt-get install build-essential libssl-dev
wget https://github.com/Kitware/CMake/releases/download/v3.20.0/cmake-3.20.0.tar.gz
tar -zxvf cmake-3.20.0.tar.gz
cd cmake-3.20.0
./bootstrap
make
sudo make install

cmake --version 
</code></pre>

</div>
编译结束之后会在 <code>./build-x86/tools/onnx/</code>下得到 <code>onnx2ncnn</code>可执行文件，即将 ONNX 模型转换为 NCNN 模型的转换工具。将上面得到的 <code>inception_v3-sim.onnx</code>拷贝至该目录，执行命令：
<pre><code class="language-shell">cd tools/onnx/
./onnx2ncnn inception_v3-sim.onnx inception_v3.param inception_v3.bin
</code></pre>
即可转换得到 NCNN 模型。
<h1>参考资料</h1>
[1] <a class="no-external-link" href="https://blog.csdn.net/litt1e/article/details/116000118" target="_blank">pytorch-＞onnx-＞ncnn模型移植</a>
[2] <a class="no-external-link" href="https://blog.csdn.net/Enchanted_ZhouH/article/details/105861646" target="_blank">pytorch转ncnn及其测试</a>
[3] <a class="no-external-link" href="https://wiki.sipeed.com/soft/maixpy3/zh/develop/resnet.html" target="_blank">V831上部署resnet18分类网络</a>
[4] <a class="no-external-link" href="https://blog.csdn.net/qq_36113487/article/details/100676205" target="_blank">onnx2ncnn并在pc端调用ncnn模型</a>
另外，本文只是最基本的部署教程，如果涉及到复杂操作（例如自定义 ONNX 中没有的算子），建议阅读由 MMDeploy 提供的文档：<a class="no-external-link" href="https://mmdeploy.readthedocs.io/zh_CN/latest/get_started.html#id1" target="_blank">操作概述</a>、<a class="no-external-link" href="https://mmdeploy.readthedocs.io/zh_CN/latest/tutorial/01_introduction_to_model_deployment.html" target="_blank">第一章：模型部署简介</a>等。<hr class="content-copyright" style="margin-top:50px" /><blockquote class="content-copyright" style="font-style:normal">版权属于：Angus本文链接：<a class="content-copyright" href="https://blog.angustar.com/archives/deploying-DNNs-on-embedded-development-boards-1.html">https://blog.angustar.com/archives/deploying-DNNs-on-embedded-development-boards-1.html</a>所有原创文章采用<a class="no-external-link" href="http://creativecommons.org/licenses/by-nc-sa/4.0/" target="_blank">知识共享署名-非商业性使用 4.0 国际许可协议</a>进行许可。 您可以自由的转载和修改，但请务必注明文章来源并且不可用于商业目的。</blockquote>

在嵌入式开发板上部署深度神经网络（一）：将 PyTorch 模型转换为 NCNN 模型

一、将 PyTorch 模型转换为 ONNX 模型

1.1 配置环境

1.2 转换模型

1.3 验证模型

二、将 ONNX 模型转换为 NCNN 模型

参考资料

发表评论取消回复
本站使用 Cookie 技术以便您下次快速评论，继续评论表示您已同意该条款。请您在评论时尽量使用 QQ 邮箱，邮箱信息将严格保密。

你心心念的 Bing 每日壁纸，现在可以打包下载啦！

Tailscale 的 DERP 中继服务搭建与配置

在 Typecho 的评论区中插入 DPlayer 播放器

论文排版神器 —— LaTeX 套装推荐

搭建一个属于自己的Server酱吧！

一键脚本部署 KMS 服务

Procreate 厚涂绘画之 Natasha

六月的天 - 2019年东华大学毕业歌MV

又一个优秀的 Bing 每日壁纸展示项目

Bing 每日一图 API

在嵌入式开发板上部署深度神经网络（一）：将 PyTorch 模型转换为 NCNN 模型

一、将 PyTorch 模型转换为 ONNX 模型

1.1 配置环境

1.2 转换模型

1.3 验证模型

二、将 ONNX 模型转换为 NCNN 模型

参考资料

发表评论 取消回复 本站使用 Cookie 技术以便您下次快速评论，继续评论表示您已同意该条款。请您在评论时尽量使用 QQ 邮箱，邮箱信息将严格保密。

在嵌入式开发板上部署深度神经网络（一）：将 PyTorch 模型转换为 NCNN 模型

发表评论取消回复
本站使用 Cookie 技术以便您下次快速评论，继续评论表示您已同意该条款。请您在评论时尽量使用 QQ 邮箱，邮箱信息将严格保密。