Group Abstract

Message Boards

5.4K Views

1 Reply

1 Total Like

View groups...

Follow this post

Share this post:

GROUPS:

Data Science Wolfram Language Machine Learning Neural Networks

Posted 4 years ago

Hello everyone I'm trying to fine-tune GPT-2 (https://resources.wolframcloud.com/NeuralNetRepository/resources/GPT-2-Transformer-Trained-on-WebText-Data) I tried training it like this: gpt = NetModel[{"GPT-2 Transformer Trained on WebText Data", "Task" -> "LanguageModeling", "Size" -> "345M"}] gpt = NetTrain[gpt, {"This is an example"}] But it didn't work, can someone explain me how to train transformers in Mathematica?

POSTED BY: Mike Bark

1 Reply

Sort By:

Posted 4 years ago

Unfortunately, training transformers is slightly more complicated. You would need to add code for the loss function. You could instead train a simple (non-transformer) text classifier in Mathematica using the Classify function: classifier = Classify[{"This is a happy example"->1, "This is a negative or bad example" -> 0}] If you really wanted to train transformers (and do so without implementing a lot of code yourself), you could check out the HiggingFace transformer github repo - which is a Python library implementing training of transformers.

POSTED BY: Alec Graves

Reply to this discussion

Reply Preview

Attachments

Remove Add a file to this post

Follow this discussion

or Discard

Feedback