neuronPCA

Principal component analysis of neuron activations

Since R2022b

Description

The compressNetworkUsingProjection function uses principal component analysis (PCA) to identify the subspace of learnable parameters that results in the highest variance in neuron activations by analyzing the network activations using a data set representative of the training data. The PCA step can be computationally intensive. If you expect to compress the same network multiple times (for example, when exploring different levels of compression), then you can perform the PCA step up front using a neuronPCA object.
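The reuse pattern described above can be sketched as follows. This is a minimal illustration rather than a documented workflow: the small network, the random input data, and the two goal values are hypothetical, and the sketch assumes the ExplainedVarianceGoal option of compressNetworkUsingProjection.

```matlab
% Hypothetical small network; the layer sizes are illustrative only.
layers = [
    featureInputLayer(10)
    fullyConnectedLayer(20,Name="fc1")
    reluLayer
    fullyConnectedLayer(3,Name="fc2")];
net = dlnetwork(layers);

% Random stand-in for data representative of the training set
% (10 channels, 64 observations).
X = dlarray(rand(10,64,"single"),"CB");

% Perform the PCA step once.
npca = neuronPCA(net,X);

% Reuse the same neuronPCA object for several compression levels.
% Assumption: these goals lie inside npca.ExplainedVarianceRange.
goals = [0.9 0.95];
netsProjected = cell(size(goals));
for k = 1:numel(goals)
    netsProjected{k} = compressNetworkUsingProjection(net,npca, ...
        ExplainedVarianceGoal=goals(k));
end
```

Because the PCA results are stored in npca, each loop iteration only has to apply the projection, not repeat the activation analysis.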

Depending on the network, projection configuration, and code generation libraries used (including library-free code generation), forward passes of a projected deep neural network can be faster when you deploy the network to embedded hardware.

If you prune or quantize your network, then apply projection after pruning and before quantization.

Creation

Syntax

npca = neuronPCA(net,mbq)

npca = neuronPCA(net,X1,...,XN)

npca = neuronPCA(___,Name=Value)

Description

npca = neuronPCA(net,mbq) computes the principal component analysis of the neuron activations in net using the data in the mini-batch queue mbq. The software analyzes the input and output activations of the projectable layers in net. This feature requires the Deep Learning Toolbox™ Model Quantization Library support package. This support package is a free add-on that you can download using the Add-On Explorer. Alternatively, see Deep Learning Toolbox Model Quantization Library.

npca = neuronPCA(net,X1,...,XN) computes the principal component analysis of the neuron activations using the data in the dlarray objects X1,...,XN, where N is the number of network inputs.

npca = neuronPCA(___,Name=Value) sets the LayerNames property or specifies additional options using one or more name-value arguments.

Input Arguments

`net` — Neural network
initialized `dlnetwork` object

Neural network, specified as an initialized dlnetwork object.

`mbq` — Mini-batch queue
`minibatchqueue` object

Mini-batch queue that outputs data for each input of the network, specified as a minibatchqueue object.

The PCA step typically works best when using the full training set. However, any dataset that is representative of the training data distribution suffices. The input data must contain two or more observations and sequences must contain two or more time steps.

Note

Do not pad sequences, as padding can negatively impact the analysis. Instead, truncate mini-batches of data to have the same length or use mini-batches of size 1.
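The truncation advice in this note can be illustrated without any toolbox functions. The sketch below assumes that each sequence is a numeric matrix with channels along the first dimension and time steps along the second; the variable names and sizes are hypothetical.

```matlab
% Three illustrative sequences with 4 channels and different lengths.
sequences = {rand(4,7), rand(4,5), rand(4,9)};

% Find the shortest sequence length in the mini-batch.
minLength = min(cellfun(@(s) size(s,2),sequences));

% Truncate every sequence to that length instead of padding.
truncated = cellfun(@(s) s(:,1:minLength),sequences,UniformOutput=false);

% Stack into a channel-by-time-by-batch array.
X = cat(3,truncated{:});
```

Truncating discards the extra time steps of the longer sequences, so no artificial padding values enter the activation statistics.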

`X1,...,XN` — Input data
formatted `dlarray`

Input data, specified as a formatted dlarray.

For more information about dlarray formats, see the fmt input argument of dlarray.

Note

Do not pad sequences, as padding can negatively impact the analysis. Instead, truncate mini-batches of data to have the same length or use mini-batches of size 1.

Name-Value Arguments

Specify optional pairs of arguments as Name1=Value1,...,NameN=ValueN, where Name is the argument name and Value is the corresponding value. Name-value arguments must appear after other arguments, but the order of the pairs does not matter.

Example: neuronPCA(net,mbq,SolverMode="direct") specifies to use a direct PCA solve.

`SolverMode` — PCA solver mode
`"auto"` (default) | `"direct"` | `"iterative"`

Since R2023b

PCA solver mode, specified as one of these values:

- "auto" — Automatically select PCA solver mode using network heuristics based on layer input and output sizes.
- "direct" — For all eigenvectors and eigenvalues, perform a direct solve of the in-memory covariance matrix. Use this option when the layer inputs and outputs have small numbers of activations or when the activation covariance matrix fits in memory.
- "iterative" — Perform an iterative solve that determines the fewest eigenvectors such that the cumulative sum of the corresponding eigenvalues, in decreasing order, satisfies the MinExplainedVariance option. This option does not store the covariance matrix in memory. Use this option when layer inputs or outputs have large numbers of activations or when the activation covariance matrix does not fit in memory.
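To make the idea behind these solver modes concrete, the following sketch performs a direct eigendecomposition of an activation covariance matrix in plain MATLAB and then selects the fewest eigenvectors whose cumulative eigenvalue sum reaches a target fraction of the total. It is a conceptual illustration only and is not the implementation used by neuronPCA; the synthetic activations and the 0.9 target are assumptions.

```matlab
rng(0);                                  % reproducible synthetic data
numObservations = 500;

% Synthetic activations: 6 neurons driven mostly by 2 latent factors.
latent = randn(numObservations,2);
A = latent*randn(2,6) + 0.01*randn(numObservations,6);

% Direct solve: eigendecomposition of the in-memory covariance matrix.
C = cov(A);
[V,D] = eig((C + C')/2);                 % symmetrize for numerical safety
[lambda,idx] = sort(diag(D),"descend");
V = V(:,idx);

% Fewest components whose cumulative explained variance meets the target.
target = 0.9;                            % hypothetical threshold
explained = cumsum(lambda)/sum(lambda);
numComponents = find(explained >= target,1);
basis = V(:,1:numComponents);            % retained principal directions
```

An iterative solver would aim for the same basis while computing only the leading eigenvectors, which avoids forming the full covariance matrix when the number of activations is large.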

`NumComponentsPerIteration` — Number of eigenvectors to compute per iteration
`32` (default) | positive integer

Since R2023b

Number of eigenvectors to compute per iteration, specified as a positive integer.

To specify this option, the SolverMode option must be "iterative".

`VerbosityLevel` — Verbosity level
`"summary"` (default) | `"steps"` | `"iterations"` | `"off"`

Verbosity level, specified as one of these values:

- "summary" — Display a summary of the neuron PCA algorithm.
- "steps" — Display information about the steps of the neuron PCA algorithm.
- "iterations" — Display information about the iterations of the neuron PCA algorithm.
- "off" — Do not display information.

Properties

`LayerNames` — Names of layers to analyze
string array | cell array of character vectors | character vector

Names of layers to analyze, specified as a string array, cell array of character vectors, or a character vector containing a single layer name.

The software, by default, analyzes all the layers in the network that support projection.

For the list of layers that support projection, see the compressNetworkUsingProjection function.

To set this property, use the corresponding name-value argument when you create the neuronPCA object. After you create a neuronPCA object, this property is read-only.

Data Types: string | cell
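As a brief illustration, the following sketch restricts the analysis to a single layer by passing LayerNames as a name-value argument at creation. The network, the layer names "fc1" and "fc2", and the random data are hypothetical.

```matlab
% Hypothetical network with two projectable fully connected layers.
layers = [
    featureInputLayer(8)
    fullyConnectedLayer(16,Name="fc1")
    reluLayer
    fullyConnectedLayer(4,Name="fc2")];
net = dlnetwork(layers);

% Random stand-in for representative data (8 channels, 32 observations).
X = dlarray(rand(8,32,"single"),"CB");

% Analyze only the layer named "fc1".
npca = neuronPCA(net,X,LayerNames="fc1",VerbosityLevel="off");

% The property is read-only after creation.
analyzedLayers = npca.LayerNames;
```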

`ExplainedVarianceRange` — Range of explained variance
two-element row vector

Since R2023b

This property is read-only.

Range of explained variance that the principal components computed for the analyzed layers can attain, returned as a two-element row vector.

Data Types: double

`LearnablesReductionRange` — Range of reduction in learnables
two-element row vector

Since R2023b

This property is read-only.

Range of reduction in learnables that the principal components computed for the analyzed layers can attain, returned as a two-element row vector.

If the minimum reduction in learnables is greater than zero, then the explained variance is the same for all learnables reduction levels below the minimum.

Data Types: double
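One possible use of these two range properties is to check whether a compression goal is attainable before calling compressNetworkUsingProjection. The sketch below assumes a hypothetical network, random data, and illustrative goal values.

```matlab
% Hypothetical network and random stand-in data.
layers = [
    featureInputLayer(8)
    fullyConnectedLayer(16,Name="fc1")
    reluLayer
    fullyConnectedLayer(4,Name="fc2")];
net = dlnetwork(layers);
X = dlarray(rand(8,32,"single"),"CB");

npca = neuronPCA(net,X,VerbosityLevel="off");

% Illustrative goals to check against the attainable ranges.
varianceGoal = 0.95;
reductionGoal = 0.5;

vRange = npca.ExplainedVarianceRange;
rRange = npca.LearnablesReductionRange;

varianceGoalAttainable = varianceGoal >= vRange(1) && varianceGoal <= vRange(2);
reductionGoalAttainable = reductionGoal >= rRange(1) && reductionGoal <= rRange(2);
```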

`InputEigenvalues` — Eigenvalues of activations input to projectable layers
cell array

This property is read-only.

Eigenvalues of activations input to projectable layers, returned as a cell array. Each element contains the input eigenvalues for the corresponding projectable layer in LayerNames.

Data Types: cell

`InputEigenvectors` — Eigenvectors of activations input to projectable layers
cell array

This property is read-only.

Eigenvectors of activations input to projectable layers, returned as a cell array. Each element contains the input eigenvectors for the corresponding projectable layer in LayerNames.

Data Types: cell

`OutputEigenvalues` — Eigenvalues of activations output by projectable layers
cell array

This property is read-only.

Eigenvalues of activations output by projectable layers, returned as a cell array. Each element contains the output eigenvalues for the corresponding projectable layer in LayerNames.

Data Types: cell

`OutputEigenvectors` — Eigenvectors of activations output by projectable layers
cell array

This property is read-only.

Eigenvectors of activations output by projectable layers, returned as a cell array. Each element contains the output eigenvectors for the corresponding projectable layer in LayerNames.

Data Types: cell
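The eigenvalue properties can be inspected with ordinary array operations. The sketch below reads the input eigenvalues of the first analyzed layer and computes a cumulative explained-variance curve in the usual PCA sense. The network and data are hypothetical, and the cumulative-ratio interpretation is a standard PCA reading, not necessarily the calculation behind ExplainedVarianceRange.

```matlab
% Hypothetical network and random stand-in data.
layers = [
    featureInputLayer(8)
    fullyConnectedLayer(16,Name="fc1")
    reluLayer
    fullyConnectedLayer(4,Name="fc2")];
net = dlnetwork(layers);
X = dlarray(rand(8,32,"single"),"CB");

% A direct solve is requested so that all eigenvalues are computed.
npca = neuronPCA(net,X,SolverMode="direct",VerbosityLevel="off");

% Input eigenvalues for the first entry of npca.LayerNames.
% Sorting is a precaution; no particular storage order is assumed.
lambda = sort(double(npca.InputEigenvalues{1}),"descend");

% Cumulative fraction of variance captured by the leading components.
explained = cumsum(lambda)/sum(lambda);

% Number of leading components needed to reach 95% (illustrative value).
numNeeded = find(explained >= 0.95,1);
```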

Examples

Analyze Neuron Activations for Compression Using Projection

Load the pretrained network in dlnetJapaneseVowels and the training data in JapaneseVowelsTrainData.

load dlnetJapaneseVowels
load JapaneseVowelsTrainData

Create a mini-batch queue containing the training data. To create a mini-batch queue from in-memory data, convert the sequences to an array datastore.

adsXTrain = arrayDatastore(XTrain,OutputType="same");

Create the minibatchqueue object.

- Specify a mini-batch size of 16.
- Preprocess the mini-batches using the preprocessMiniBatchPredictors function, listed in the Mini-Batch Predictors Preprocessing Function section of the example.
- Specify that the output data has format "CTB" (channel, time, batch).

miniBatchSize = 16;

mbqTrain = minibatchqueue(adsXTrain, ...
    MiniBatchSize=miniBatchSize, ...
    MiniBatchFcn=@preprocessMiniBatchPredictors, ...
    MiniBatchFormat="CTB");

Create the neuronPCA object.

npca = neuronPCA(net,mbqTrain);

Using solver mode "direct".
neuronPCA analyzed 2 layers: "lstm","fc"

View the properties of the neuronPCA object.

npca

npca = 
  neuronPCA with properties:

                  LayerNames: ["lstm"    "fc"]
      ExplainedVarianceRange: [0 1]
    LearnablesReductionRange: [0 0.9690]
            InputEigenvalues: {[12×1 double]  [100×1 double]}
           InputEigenvectors: {[12×12 double]  [100×100 double]}
           OutputEigenvalues: {[100×1 double]  [9×1 double]}
          OutputEigenvectors: {[100×100 double]  [9×9 double]}

Mini-Batch Predictors Preprocessing Function

The preprocessMiniBatchPredictors function preprocesses a mini-batch of predictors by extracting the sequence data from the input cell array and truncating them along the second dimension so that they have the same length.

Note: Do not pad sequence data when doing the PCA step for projection as this can negatively impact the analysis. Instead, truncate mini-batches of data to have the same length or use mini-batches of size 1.

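function X = preprocessMiniBatchPredictors(dataX)

% Truncate the sequences along the second (time) dimension to the length
% of the shortest sequence and concatenate them into a single array.
X = padsequences(dataX,2,Length="shortest");

end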

References

[1] "Compressing Neural Networks Using Network Projection." Accessed July 20, 2023. https://www.mathworks.com/company/technical-articles/compressing-neural-networks-using-network-projection.html.

Extended Capabilities

GPU Arrays
Accelerate code by running on a graphics processing unit (GPU) using Parallel Computing Toolbox™.

The neuronPCA function supports GPU array input with these usage notes and limitations:

This function runs on the GPU if any of these conditions are met:
- Any of the values of the network learnable parameters inside net.Learnables.Value are dlarray objects with underlying data of type gpuArray
- The input argument mbq outputs dlarray objects with underlying data of type gpuArray
- The input arguments X1,...,XN are dlarray objects with underlying data of type gpuArray

For more information, see Run MATLAB Functions on a GPU (Parallel Computing Toolbox).
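As a minimal sketch of the third condition, the following code moves the input data to the GPU when one is available and otherwise falls back to the CPU. The network and random data are hypothetical.

```matlab
% Hypothetical network.
layers = [
    featureInputLayer(8)
    fullyConnectedLayer(16,Name="fc1")
    reluLayer
    fullyConnectedLayer(4,Name="fc2")];
net = dlnetwork(layers);

% Random stand-in data (8 channels, 32 observations).
X = rand(8,32,"single");

% Use a gpuArray as the underlying data only if a supported GPU exists.
if canUseGPU
    X = gpuArray(X);
end
X = dlarray(X,"CB");

% neuronPCA runs on the GPU when X has underlying gpuArray data.
npca = neuronPCA(net,X,VerbosityLevel="off");
```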

Version History

Introduced in R2022b

R2024b: `neuronPCA` supports analyzing 1-D convolution layers

The neuronPCA function now supports analyzing convolution1dLayer objects.

R2023b: Specify PCA solver mode

Specify the PCA solver mode using the SolverMode option. To specify the number of components per iteration of an iterative PCA solve, use the NumComponentsPerIteration option.

Starting in R2023b, the software, by default, automatically selects the PCA solver mode using neural network heuristics based on layer input and output sizes. In previous versions, the software performs a direct PCA solve for all neural networks. This can lead to differences in the numbers of eigenvalues and eigenvectors in the InputEigenvalues, InputEigenvectors, OutputEigenvalues, and OutputEigenvectors properties. To reproduce the behavior of previous versions, set the SolverMode option to "direct".

R2023b: Inspect range of explained variance and reduction of learnables

Inspect the range of explained variance and reduction of learnables that the principal components of the analyzed layers can attain using the ExplainedVarianceRange and LearnablesReductionRange properties, respectively.
