
I am wondering why there has not been more usage of GANs for NLP. I know there has been research on the subject (the Google Scholar page for it is here).

Are there any specific reasons why GANs, specifically VQGAN + CLIP variants, do not work for NLP? I do not understand why most AI text generation is done by predicting the next letter or word in a sequence with RNNs, when GANs have had so much success generating deepfakes and the like instead of, say, predicting the next pixel.

catasaurus

1 Answer


A few reasons:

  1. Transformers are already amazing at text generation (e.g. GPT-3, which almost passes the Turing test).
  2. The original GAN formulation requires a continuous data representation (e.g. images) rather than a discrete one (e.g. text), so that small error signals can propagate from the discriminator back through the generator's output (see the sketch after this list).
  3. Empirically speaking, GANs don't seem to work that well on non-image data. I recently applied them to regular tabular data and found autoencoders much more useful.
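
To make point 2 concrete, here is a minimal PyTorch sketch of the gradient problem; the variable names and the Gumbel-Softmax workaround are my own illustrative assumptions, not something from the question:

```python
import torch
import torch.nn.functional as F

vocab_size = 100

# Continuous case (images): the generator's output stays continuous,
# so a discriminator-style loss is differentiable w.r.t. it.
logits = torch.randn(1, vocab_size, requires_grad=True)  # generator's raw scores
fake_pixels = torch.tanh(logits)            # continuous output
loss = fake_pixels.sum()                    # stand-in for a discriminator score
loss.backward()
print(logits.grad is not None)              # True: gradients reach the generator

# Discrete case (text): committing to an actual token id is an argmax/sample,
# which is piecewise constant, so no gradient can flow back to the logits.
logits2 = torch.randn(1, vocab_size, requires_grad=True)
token_id = logits2.argmax(dim=-1)           # integer tensor, not differentiable
# token_id.float().sum().backward()         # would fail: the graph is broken here

# A common workaround is a Gumbel-Softmax relaxation, which gives an
# "almost one-hot" but still differentiable token the discriminator can score.
soft_token = F.gumbel_softmax(logits2, tau=0.5, hard=False)
soft_token.sum().backward()
print(logits2.grad is not None)             # True: the relaxation restores gradients
```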

P.S. It's also my intuition that GANs and denoising diffusion models are popular in the image-generation community because both offer mechanisms that prevent "blurriness" in the generated images (the adversary in the former, the extra denoising iterations in the latter).

profPlum