How to implement a custom regularisation for a given LinearLayer?

Posted 3 years ago

Hi everybody,

I would like to implement a custom norm regularisation on the weights of a LinearLayer. Let's suppose, for ease of discussion, that I want to implement an L1 regularisation. In this case my objective is to minimise a loss of the form: (loss of task) + Total[Abs[weights]].
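
Written out with an explicit regularisation strength λ (a hyperparameter I would eventually want to tune; the expression above corresponds to λ = 1), the objective is:

totalLoss = taskLoss + λ Total[Abs[Flatten[weights]]]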

I tried to play around with NetArray, without much success. Here's an example:

net = NetGraph[
  Association[
   (* linear layer whose weight matrix should be the shared array "c" *)
   "linear" -> LinearLayer[1, "Weights" -> NetArray["c"]],
   (* L1 penalty computed from what I intend to be the same shared array *)
   "reg" ->
    FunctionLayer[{Total[
        Abs[NetArray[<|"Name" -> "c", "Dimensions" -> 100|>]]]} &],
   (* add the penalty to the layer output *)
   "thread" -> ThreadingLayer[#1 + #2 &]
   ],
  {
   NetPort["Input"] -> "linear",
   "reg" -> "thread",
   NetPort["linear", "Output"] -> "thread"
   }]

That can be trained, for example, with:

dataTrain = Table[RandomReal[1, 100] -> {RandomReal[]}, 100];
trained = NetTrain[net, dataTrain]

But when I inspect the actual values of the weights, I get different results. That is,

NetExtract[trained, {"linear", "Weights"}]

is different from

NetExtract[trained, {"reg", "Net", 1, "Array"}]

How can I implement this correctly? Do you have any ideas/comments/observations?

(I'm pretty sure that the example I gave is wrong, as I would like to minimise the loss of the output of the linear layer plus the sum of the absolute values of the weights, but I think the essence is the same.)
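
For concreteness, here is an untested sketch of the structure I am actually after. It assumes that the shared array has to be declared with the same dimensions as the layer's weight matrix (which for LinearLayer[1, "Input" -> 100] is {1, 100}), uses MeanSquaredLossLayer as a stand-in for the task loss, and exposes the combined quantity through an explicit "Loss" port:

netReg = NetGraph[
  Association[
   "linear" -> LinearLayer[1, "Weights" -> NetArray["c"], "Input" -> 100],
   (* task loss *)
   "mse" -> MeanSquaredLossLayer[],
   (* L1 penalty on the shared array; dimensions match the layer's weight matrix *)
   "reg" -> FunctionLayer[
     Total[Abs[NetArray[<|"Name" -> "c", "Dimensions" -> {1, 100}|>]], 2] &],
   (* total loss = task loss + L1 penalty *)
   "total" -> ThreadingLayer[#1 + #2 &]
   ],
  {
   NetPort["Input"] -> "linear",
   {"linear", NetPort["Target"]} -> "mse",
   {"mse", "reg"} -> "total",
   "total" -> NetPort["Loss"]
   }];

trainedReg = NetTrain[netReg, dataTrain, LossFunction -> "Loss"]

If that works, the plain linear layer could then be pulled out for inference with NetExtract[trainedReg, "linear"].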

POSTED BY: Ettore Mariotti
4 Replies

Here's a file. I believe it is close to correct, but I am still trying to figure out why the answer differs from the one obtained using Fit with the Regularization option, so it is quite possible I am doing something wrong. I have not had time to work through the issues, but perhaps this will help you overcome some of the barriers I encountered.
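
The comparison I have in mind is against a plain LASSO fit, for example something like this toy one-dimensional case (the regularisation strength 1 here is arbitrary):

data = Table[{x, 2 x + RandomReal[{-0.2, 0.2}]}, {x, 0., 1., 0.05}];
Fit[data, {1, x}, x, Regularization -> {"LASSO", 1}]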

Attachments:
POSTED BY: Seth Chandler

This question looks very similar to one I posted a few weeks ago: https://community.wolfram.com/groups/-/m/t/2179969. I have made some progress on it but have not had time to post a notebook of results. My approach looks similar to yours, with shared arrays. If you message me, I can send you a no-warranties notebook.

POSTED BY: Seth Chandler

Yes, I would be interested in knowing your approach!

POSTED BY: Ettore Mariotti