Making Use of Latent Space in Language GANs for Generating Diverse Text without Pre-training

Takeshi Kojima, Yusuke Iwasawa, Yutaka Matsuo


Abstract
Generating diverse texts is an important factor for unsupervised text generation. One approach is to produce the diversity of texts conditioned by the sampled latent code. Although several generative adversarial networks (GANs) have been proposed thus far, these models still suffer from mode-collapsing if the models are not pre-trained. In this paper, we propose a GAN model that aims to improve the approach to generating diverse texts conditioned by the latent space. The generator of our model uses Gumbel-Softmax distribution for the word sampling process. To ensure that the text is generated conditioned upon the sampled latent code, reconstruction loss is introduced in our objective function. The discriminator of our model iteratively inspects incomplete partial texts and learns to distinguish whether they are real or fake by using the standard GAN objective function. Experimental results using the COCO Image Captions dataset show that, although our model is not pre-trained, the performance of our model is quite competitive with the existing baseline models, which requires pre-training.
Anthology ID:
2021.eacl-srw.23
Volume:
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop
Month:
April
Year:
2021
Address:
Online
Editors:
Ionut-Teodor Sorodoc, Madhumita Sushil, Ece Takmaz, Eneko Agirre
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
175–182
Language:
URL:
https://aclanthology.org/2021.eacl-srw.23
DOI:
10.18653/v1/2021.eacl-srw.23
Bibkey:
Cite (ACL):
Takeshi Kojima, Yusuke Iwasawa, and Yutaka Matsuo. 2021. Making Use of Latent Space in Language GANs for Generating Diverse Text without Pre-training. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pages 175–182, Online. Association for Computational Linguistics.
Cite (Informal):
Making Use of Latent Space in Language GANs for Generating Diverse Text without Pre-training (Kojima et al., EACL 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.eacl-srw.23.pdf
Data
MS COCO