
Automatically sliding a conv net onto a larger image

POSTED BY: Matthias Odisio
11 Replies
POSTED BY: Moderation Team

Thanks, this is an acceptable workaround.

May I ask how you derive this formula, (61+16*n)?

By the way, a pair of parentheses is missing in the output size formula in ref/ConvolutionLayer's notes; the Property example gives the correct result.
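For reference, with input size n, kernel size k, total padding p, and stride s (dilation 1), the output size should presumably read Floor[(n + p - k)/s] + 1, with the whole quotient grouped under the floor.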

And I duly note that "the future will be better."

POSTED BY: Matthias Odisio

Thanks, Jerome. So it's not realistically feasible to reduce the stride.

I hope this related topic will also interest you: what about sliding this denoising autoencoder?

size = 157; (* input image size; of the form 61 + 16 n *)
n1 = 32;    (* base number of feature channels *)
k = 5;      (* convolution kernel size *)

(* encoder block: two stride-2 convolutions *)
conv2[n_] := 
  NetChain[{ConvolutionLayer[n, k, "Stride" -> 2], 
    BatchNormalizationLayer[], ElementwiseLayer["ReLU"], 
    DropoutLayer[], ConvolutionLayer[n, k, "Stride" -> 2], 
    BatchNormalizationLayer[], ElementwiseLayer["ReLU"]}];

(* decoder block: two stride-2 deconvolutions, halving the channels *)
deconv2[n_] := 
  NetChain[{DeconvolutionLayer[n, k, "Stride" -> 2], 
    BatchNormalizationLayer[], ElementwiseLayer["ReLU"], 
    DropoutLayer[], DeconvolutionLayer[n/2, k, "Stride" -> 2], 
    BatchNormalizationLayer[], ElementwiseLayer["SoftSign"]}];

(* elementwise sum of two inputs, used for the skip connections *)
sum[] := NetChain[{TotalLayer["Inputs" -> 2]}];

(* learnable power law: Exp[w Log[x]] = x^w, with w initialized to 1 *)
constantPowerLayer[] := NetChain[{
   ElementwiseLayer[Log@Clip[#, {$MachineEpsilon, 1}] &],
   ConvolutionLayer[1, 1, "Biases" -> None, "Weights" -> {{{{1}}}}],
   ElementwiseLayer[Exp]}]

ddae = NetGraph[
  <|
   "bugworkaround" -> ElementwiseLayer[# &],
   "c12" -> conv2[n1],
   "c34" -> conv2[2*n1],

   "d12" -> deconv2[2*n1],
   "d34" -> 
    NetChain[{DeconvolutionLayer[n1, k, "Stride" -> 2], 
      BatchNormalizationLayer[], ElementwiseLayer["ReLU"], 
      DeconvolutionLayer[1, k, "Stride" -> 2], 
      BatchNormalizationLayer[], ElementwiseLayer["SoftSign"]}],

   "sum1" -> sum[],
   "sum2" -> NetChain[{sum[], constantPowerLayer[]}],

   "loss" -> MeanSquaredLossLayer[]
   |>,
  {
   "bugworkaround" -> 
    "c12" -> 
     "c34" -> 
      "d12" -> "sum1" -> "d34" -> "sum2" -> NetPort["loss", "Input"],
   "bugworkaround" -> "sum2",
   NetPort["Noisy"] -> "bugworkaround",
   "c12" -> "sum1",
   NetPort["Target"] -> NetPort["loss", "Target"]
   },
  "Noisy" -> 
   NetEncoder[{"Image", {size, size}, ColorSpace -> "Grayscale"}],
  "Target" -> 
   NetEncoder[{"Image", {size, size}, ColorSpace -> "Grayscale"}]
  ]

trained = NetTake[NetInitialize@ddae, {"bugworkaround", "sum2"}]
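
Incidentally, the (61+16*n) formula asked about above can be reproduced from this architecture. Here is a minimal sketch, under my assumption that the formula enumerates the input sizes for which the four stride-2, kernel-5 convolutions are exactly inverted by the four stride-2 deconvolutions (convOut, deconvOut and roundTrip are just illustrative names):

convOut[s_] := Floor[(s - k)/2] + 1; (* size after a kernel-k, stride-2 convolution *)
deconvOut[s_] := 2 (s - 1) + k;      (* size after a kernel-k, stride-2 deconvolution *)
roundTrip[s_] := Nest[deconvOut, Nest[convOut, s, 4], 4];
Select[Range[61, 200], roundTrip[#] == # &]
(* {61, 77, 93, 109, 125, 141, 157, 173, 189}, i.e. 61 + 16 n *)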

Now I "automatize" the input dimensions:

n3 = NetReplacePart[trained, "Noisy" -> Automatic];

This new network works fine when given the same dimensions, but fails with larger input dimensions. Any idea how to fix this problem?

In[142]:= n3[RandomReal[1, {1, 157, 157}]] // Dimensions

Out[142]= {1, 157, 157}

In[143]:= n3[RandomReal[1, {1, 1570, 1570}]] // Dimensions

During evaluation of In[143]:= NetGraph::tyfail1: Inferred inconsistent value for output size of layer 4 of layer "d34".

Out[143]= {}
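
A possible explanation, following the same size arithmetic as above: 1570 is not of the form 61 + 16*n, so the deconvolution stack cannot exactly reproduce the input size and shape inference fails. A nearby valid size such as 1565 = 61 + 16*94 should presumably go through:

n3[RandomReal[1, {1, 1565, 1565}]] // Dimensions
(* expected: {1, 1565, 1565} *)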
POSTED BY: Matthias Odisio

We should probably advertise

NetReplacePart[net, "Input" -> Automatic]

For LeNet you can do

NetReplacePart[
  NetDrop[
    NetModel["LeNet Trained on MNIST Data", "EvaluationNet"],
    -5
  ],
  "Input" -> Automatic
]

and get

[Image: the resulting net, with no fixed input size]
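
The size-free net should then accept inputs of other dimensions. A quick sketch (lenetConv is just an illustrative name, and the output dimensions assume the standard LeNet layer layout):

lenetConv = NetReplacePart[
  NetDrop[NetModel["LeNet Trained on MNIST Data", "EvaluationNet"], -5],
  "Input" -> Automatic];
lenetConv[RandomReal[1, {1, 60, 60}]] // Dimensions
(* expected: {50, 12, 12}, versus {50, 4, 4} at the native 28x28 *)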

Ah yes. Thanks! This is used in my follow-up question.

POSTED BY: Matthias Odisio
