枝刈り、投影、および量子化

深層ニューラルネットワークを圧縮し、ネットワークメモリを削減し、コード生成用にネットワークを準備する

Deep Learning Toolbox™ を Deep Learning Toolbox Model Compression Library サポートパッケージと共に使用し、以下を行うことで、深層ニューラルネットワークのメモリフットプリントの削減と計算要件の緩和を行います。

1 次テイラー近似を使用して畳み込み層からフィルターを枝刈りする。
層の活性化に対し、主成分分析 (PCA) を実行して層を投影する。
層の重み、バイアス、および活性化を、低い精度にスケーリングされた整数データ型に量子化する。

その後、圧縮されたネットワークからコードを生成し、目的のハードウェアに展開できます。

強調表示されたリンク

Reduce Memory Footprint of Deep Neural Networks

Simplified illustration of compression. On the left is a sketch of a large neural network with a label indicating the network is 20 MB. An arrow points to a second sketch on the right, which shows a smaller model inside a box. A label indicates the smaller network is 5 MB.

カテゴリ

ネットワーク圧縮入門
Deep Learning Toolbox Model Compression Library の基礎を学習する
枝刈り
1 次テイラー近似を使用してネットワークフィルターを枝刈りし、学習可能なパラメーターの数を減らす
投影
主成分分析 (PCA) を使用してネットワーク層を投影し、学習可能なパラメーターの数を減らす
量子化
ネットワークパラメーターを低い精度のデータ型に量子化し、固定小数点コード生成用に深層学習ネットワークを準備する
ネットワーク圧縮の用途
エンドツーエンドのワークフローにおける深層学習モデルの圧縮について調査する

注目の例

Analyze and Compress 1-D Convolutional Neural Network

Analyze and Compress 1-D Convolutional Neural Network

Analyze 1-D convolutional network for compression and compress it using Taylor pruning and projection.

R2024b 以降
ライブスクリプトを開く

Compress Sequence Classification Network for Road Damage Detection

Compress Sequence Classification Network for Road Damage Detection

Compress network to meet memory requirement using pruning, projection, and quantization.

R2025a 以降
ライブスクリプトを開く

Compress Deep Learning Network for Battery State of Charge Estimation

Compress Deep Learning Network for Battery State of Charge Estimation

Compress a neural network for predicting the state of charge of a battery using projection.

R2024b 以降
ライブスクリプトを開く