Shakespearean GPT from scratch: create a generative pre-trained transformer

10 Replies
POSTED BY: Frank Scherbaum
POSTED BY: Jack I Houng

With the new 2.0.2 version of OpenAILink, the poetry is now as I requested!

POSTED BY: Jack I Houng

After reading "What Is ChatGPT Doing … and Why Does It Work?" I really wanted to explore the networks, better understand what they do and play around with them in Mathematica. This post gave me everything in one go... which is awesome!

POSTED BY: Martijn Froeling

Happy to hear that. :)

I'm preparing another post on the original vanilla encoder-decoder transformer architecture (the "Attention Is All You Need" paper from 2017).

You might also be interested in this other recent post on few-shot learning using GPT-3 and WL: https://community.wolfram.com/groups/-/m/t/2848741

This is an excellent hands-on presentation. Thank you for this. I do have a question, though. In the example with:

trainedNet=NetExtract[results["TrainedNet"],"decode"]; 

Instead of using CloudPut and CloudGet, what would be the correct expression to save to a local file and retrieve it from the local file?
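
For reference, one possible local equivalent is to export the network in the WLNet format and import it again later. This is only a minimal sketch: the file name and location are hypothetical, and the small stand-in network is there only so the snippet runs on its own.

```wolfram
(* Stand-in network so this snippet runs by itself (an assumption for illustration).
   In the post, trainedNet comes from NetExtract[results["TrainedNet"], "decode"];
   skip this line if trainedNet is already defined. *)
trainedNet = NetInitialize[NetChain[{LinearLayer[2]}, "Input" -> 3]];

(* Hypothetical file name and location; any writable local path should work *)
file = FileNameJoin[{$HomeDirectory, "trainedNet.wlnet"}];

(* Save the network to the local file in WLNet format *)
Export[file, trainedNet, "WLNet"];

(* Read the network back from the local file *)
restoredNet = Import[file, "WLNet"];
```

Here `Export` and `Import` play the roles that `CloudPut` and `CloudGet` play for cloud objects.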

POSTED BY: Loren Abdulezer
POSTED BY: Loren Abdulezer

Excellent introduction ... thanks!

POSTED BY: Stuart Nettleton
POSTED BY: EDITORIAL BOARD