Possible issue in NetTrain? Validation loss lower than training loss

Posted 4 years ago

Hi guys, I'm curious why the validation loss is even lower than the training loss when I use NetTrain. For example, on this page,

https://reference.wolfram.com/language/tutorial/NeuralNetworksSequenceLearning.html#1094728277

for the Q&A RNN trained on the bAbI QA dataset, the validation loss shouldn't be lower than the training loss, according to Goodfellow's Deep Learning book. Right?

Is it possible that these two sets are mistakenly labeled inside NetTrain when it plots the learning curve?

[Attached plot: validation and training loss curves from NetTrain]

POSTED BY: HH C
3 Replies
Posted 4 years ago

Crossposted here.

POSTED BY: Rohit Namjoshi

It is not a bug; this behaviour is explained by dropout. The training loss is computed with dropout enabled (as when evaluating with NetEvaluationMode -> "Train"), while the validation loss is computed with dropout disabled (NetEvaluationMode -> "Test"), so the training loss includes the extra noise that dropout injects.
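
A minimal sketch of that difference, applying a standalone DropoutLayer in both modes (the 0.5 dropout probability and the toy input are just illustrative):

    (* a dropout layer that zeroes each element with probability 0.5 *)
    layer = DropoutLayer[0.5];
    input = {1., 2., 3., 4.};

    (* "Test" mode (the default): dropout is a no-op *)
    layer[input, NetEvaluationMode -> "Test"]
    (* -> {1., 2., 3., 4.} *)

    (* "Train" mode: entries are randomly zeroed, and the survivors
       are scaled by 1/(1 - 0.5) = 2 to preserve the expected value *)
    layer[input, NetEvaluationMode -> "Train"]
    (* e.g. -> {2., 0., 6., 8.}  (random; varies between evaluations) *)

Since the loss reported during training is measured in "Train" mode, it reflects this injected noise, so it can legitimately sit above the validation loss without the two sets being mislabeled.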

Posted 4 years ago

Thanks.

POSTED BY: HH C