GANs are notoriously unstable during training because the two networks
depend on each other for improvements in performance. Although several
stabilization methods have been incorporated into the GAN architecture,
training can still fail to converge. Because of this instability,
early stopping fails to retrieve the best-performing GAN model. To
address this, we will save a checkpoint of the model every training
round together with an n-gram performance metric, and use that metric
to select the best-performing model.