Message Boards Message Boards

What Neural Networks look

Posted 7 years ago

This is the pelican which Neural Networks(VGG-16) look.

enter image description here

I referred this post to the presentation by Markus van Almsick at the 2017 Wolfram Technology Conference. In addition to the information posted here, this presentation contains a variety of interesting and useful information of how to use WL deep learning framework.

Introduction VGG-16

VGG-16 is one of the representative models in deep learning to identify the main object in an image. Fortunately we can get the pre-trained VGG-16 from the Wolfram Neural Net Repository.

vgg16 = NetModel["VGG-16 Trained on ImageNet Competition Data"]

enter image description here

VGG-16 is able to identify an image.

enter image description here

enter image description here

The probability is 99.59%

vgg16[img, {"TopProbabilities", 1}]

enter image description here

What is the pelican of the probability 1(=100%) ?

What does VGG-16 look?

Find an image to maximize the part(145th element) corresponding to the pelicans of the output(size:1000 vector) of the layer fc8 which is the layer before the last of VGG-16.

NetExtract[vgg16, "fc8"]

enter image description here

classes = NetExtract[vgg16, "Output"];
pelican = 
 Position[Normal[classes]["Labels"], 
   Entity["Concept", "Pelican::jpfg7"]][[1, 1]]

145

Start with an image of all 0, then gradually tweak the image towards what VGG-16 looks like pelican. Define a loss function below as each element of the output is 0 or more and NetTrain minimizes the loss.

Loss = NetChain[{ElementwiseLayer[x \[Function] -x]}];

Create a new network using the subnet of VGG-16.

net = NetGraph[<|
   "image" -> 
    ConstantArrayLayer["Array" -> ConstantArray[0., {3, 224, 224}]], 
   "vgg16subnet" -> Take[vgg16, {1, -2}],
   "select" -> PartLayer[pelican],
   "Loss" -> Loss |>,
  {"image" -> "vgg16subnet" -> "select" -> "Loss", 
   "Loss" -> NetPort["Loss"] }]

enter image description here

Train the new network. The training data is dummy as it is unnecessary. And the subnet of VGG-16 does not undergo training and is left unchanged by NetTrain.

Off[First::nofirst]
trainedNet = NetTrain[net, <|0 -> {0}|>, "Loss",
   LearningRateMultipliers -> {"image" -> 1, _ -> None},
   MaxTrainingRounds -> 2000];

Retrieve the image that VGG-16 looks like pelican from the trainedNet.

meanColor = Normal[NetExtract[vgg16, "Input"]]["MeanImage"];
idec = NetDecoder[{"Image", "MeanImage" -> meanColor}];
vgg16pelican = 
 idec[NetExtract[NetExtract[trainedNet, "image"], "Array"]]

enter image description here

vgg16[vgg16pelican]

enter image description here

The probability of this image is 100%!

vgg16[vgg16pelican, {"TopProbabilities", 1}]

enter image description here

Examples

enter image description here

What does each layer of the VGG-16 look?

As the same above, find an image to maximize the activation of the filters in different convolutionLayers of the VGG-16.

There are 13 ConvolutionLayers.

layernames = (NetExtract[vgg16, All] // Keys);
convlist = Select[layernames, StringTake[#, 1] == "c" &]

enter image description here

The number of filters in each convolutionLayer is

(NetExtract[NetExtract[vgg16, #], "Output"] & /@ convlist)[[All, 1]]

enter image description here

Start with an image of all 0, then gradually tweak the image to maximize the activation of each filter. Define a loss function below.

Loss = NetChain[{ElementwiseLayer[x \[Function] x^2], 
    SummationLayer[], ElementwiseLayer[x \[Function] -x]}];

Create new networks and train the networks of the first 10 filters in each layer.

Off[First::nofirst]
result = {};
Do[
 subnet = Take[vgg16, {1, convlist[[j]]}];
 Do[
  net = NetGraph[<|
     "image" -> 
      ConstantArrayLayer["Array" -> ConstantArray[0., {3, 224, 224}]], 
     "vgg16subnet" -> subnet,
     "select" -> PartLayer[i],
     "Loss" -> Loss |>,
    {"image" -> "vgg16subnet" -> "select" -> "Loss", 
     "Loss" -> NetPort["Loss"] }];
  trainedNet = NetTrain[net, <|0 -> {0}|>, "Loss",
    LearningRateMultipliers -> {"image" -> 1, _ -> None},
    MaxTrainingRounds -> 300, BatchSize -> 1];
  AppendTo[result, 
   idec[NetExtract[NetExtract[trainedNet, "image"], "Array"]]];,
  {i, 10}];,
 {j, Length[convlist]}]

Retrieve the image that each filter looks from the trainedNet. These images gradually become more complex patterns as the layer becomes deeper.

Grid[Join[{Join[{""}, Range[10]]}, 
  Flatten[{convlist[[#]], Partition[result, 10][[#]]}] & /@ 
   Range[Length[convlist]]], Frame -> All, Spacings -> {.5, .5}]

enter image description here

POSTED BY: Kotaro Okazaki
Reply to this discussion
Community posts can be styled and formatted using the Markdown syntax.
Reply Preview
Attachments
Remove
or Discard

Group Abstract Group Abstract