Hello everybody!!
I'm not sure whether this should be a question or an idea; I think it depends on whether what I'm looking for already exists or not. Anyway, I have been trying to implement a model which is currently unavailable in the Wolfram Language: Extreme Learning Machines.
They are conceptually simple models that follow these steps (a rough from-scratch sketch is included right after the list):
Standardize the data
Randomly project the data into a high-dimensional space (and apply a non-linear function)
Find the optimal weights that predict the quantity of interest (as LinearRegression does in Predict, or LogisticRegression in Classify).
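Just to fix ideas, here is a minimal from-scratch sketch of those three steps on purely numeric data (elmFit and the default width of 100 are arbitrary illustrative choices, not anything built in):

(* minimal sketch: X is an n x d numeric matrix, y a length-n vector *)
elmFit[X_, y_, h_ : 100] := Module[{means, sds, Xs, w, hidden, beta},
  means = Mean[X]; sds = StandardDeviation[X];
  Xs = Map[(# - means)/sds &, X];                   (* 1. standardize *)
  w = RandomReal[{-1, 1}, {h, Dimensions[X][[2]]}];
  hidden = Tanh[Xs . Transpose[w]];                 (* 2. random nonlinear projection *)
  beta = LeastSquares[hidden, y];                   (* 3. optimal linear read-out *)
  Function[x, Tanh[((x - means)/sds) . Transpose[w]] . beta]]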
In order to implement them in a quick and integrated way (so that I could get a PredictorFunction), I have used the following code:
train = ResourceData["Sample Data: Boston Homes", "TrainingData"];
test = ResourceData["Sample Data: Boston Homes", "TestData"];
numFeats = Length@Keys@train[[1]]  (* number of input features *)
w = RandomReal[{-1, 1}, {100, numFeats}];  (* random projection matrix *)
randomProject[data_] := Tanh[w.data]  (* nonlinear random projection *)
elm = Predict[train, FeatureExtractor -> {Standardize, randomProject},
 Method -> "LinearRegression", PerformanceGoal -> "TrainingSpeed"]
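To quantify what I call good performance below, I am simply looking at the test-set residuals, roughly like this:

(* rough check on the held-out test set; "StandardDeviation" is the RMS residual *)
PredictorMeasurements[elm, test, "StandardDeviation"]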
The code works (and actually performs quite well), but I have a couple of issues:
1) Can I automate the inference of the value numFeats when I pass the extractor to FeatureExtractor?
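The only workaround I can think of is to let the projection read the dimension off the incoming vector and build the matrix lazily, but it feels like a hack (randomProjectAuto and cachedW are made-up names, not built-in functionality):

(* memoize one random matrix per input dimension, built on first use *)
cachedW[d_] := cachedW[d] = RandomReal[{-1, 1}, {100, d}];
randomProjectAuto[data_] := Tanh[cachedW[Length[data]] . data]
elmAuto = Predict[train, FeatureExtractor -> {Standardize, randomProjectAuto},
 Method -> "LinearRegression", PerformanceGoal -> "TrainingSpeed"];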
2) How can I initialize the random matrix w so that I don't have to define it as a global variable?
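One thing I can imagine is closing over a locally generated matrix with With, so that the pure function carries the numeric matrix and no global symbol is left behind, but I am not sure it is the intended pattern (makeProjector is a made-up name):

(* With injects the generated matrix into the returned pure function *)
makeProjector[hidden_, d_] := With[{wLocal = RandomReal[{-1, 1}, {hidden, d}]},
 Tanh[wLocal . #] &]
elmLocal = Predict[train,
 FeatureExtractor -> {Standardize, makeProjector[100, numFeats]},
 Method -> "LinearRegression", PerformanceGoal -> "TrainingSpeed"];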
3) Does Standardize actually compute the mean and variance on the training set and then apply them to the test set? Or does it just standardize each single list of data? If so, how can we explicitly pass an extractor that learns its parameters from the training data (in this case just the mean and variance)? I suspect it is not doing what I think, because performance is better without the Standardize step.
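The kind of explicit extractor I have in mind would compute the parameters from the training inputs once and freeze them, roughly like this (assuming train is a list of input -> output rules whose inputs become numeric vectors; means, sds and standardizeFixed are made-up names):

(* learn mean/sd on the training inputs only, then freeze them *)
trainInputs = Replace[Keys[train], a_Association :> Values[a], {1}];
means = Mean[trainInputs];
sds = StandardDeviation[trainInputs];
standardizeFixed[x_] := (Replace[x, a_Association :> Values[a]] - means)/sds
elmFixed = Predict[train, FeatureExtractor -> {standardizeFixed, randomProject},
 Method -> "LinearRegression", PerformanceGoal -> "TrainingSpeed"];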
4) I know that Classify performs an internal random search in order to find the best set of hyperparameters. My provocative question is: is it possible to include a parameter of the FeatureExtractor in that search space (in this case, for example, the dimension of the final projection space, which has been arbitrarily set to 100)?
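The only thing I can come up with is a manual outer search, completely outside Predict's own search, roughly like the sketch below (the candidate widths, the split ratio and all names are arbitrary choices for illustration):

(* manual search over the projection width on a validation split held out from train *)
{subTrain, validation} =
 TakeDrop[RandomSample[train], Floor[0.8 Length[train]]];
trial[h_] := Module[{wh, pf},
 wh = RandomReal[{-1, 1}, {h, numFeats}];
 pf = Predict[subTrain, FeatureExtractor -> {Standardize, Tanh[wh . #] &},
  Method -> "LinearRegression", PerformanceGoal -> "TrainingSpeed"];
 {h, PredictorMeasurements[pf, validation, "StandardDeviation"]}]
TableForm[trial /@ {25, 50, 100, 200, 400}]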
-- (PS: It has been really exciting to implement all of this in such a succinct and clean way.)