Skip to content

Ksuryateja/DCGAN-MNIST-pytorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

DCGAN-MNIST-pytorch

A DCGAN built on the MNIST dataset using pytorch

DCGAN is one of the popular and successful network designs for GAN. It mainly composes of convolution layers without max pooling or fully connected layers. It uses convolutional stride and transposed convolution for the downsampling and the upsampling. Architecture guidelines for stable Deep Convolutional GANs as mentioned by Soumith Chintala

These are the guidelines for constructing a DCGAN as mentioned by Soumith Chintala (https://arxiv.org/abs/1511.06434)

Replace any pooling layers with strided convolutions(discriminator) and fractional- strided convolutions (generator).

Use batchnorm in both the generator and the discriminator.

Remove fully connected hidden layers for deeper architectures.

Use ReLU activation in generator for all layers except for the output, which uses Tanh.

Use LeakyReLU activation in the discriminator for all layers.

The simplicity of DCGAN contributes to its success. We reach certain bottleneck that increasing the complexity of the generator does not necessarily improve the image quality. Until we identify the bottleneck and know how to train GANs more effective, DCGAN remains a good start point for a new project. I created a DCGAN model for mimicking the data distribution of MNIST dataset

The results from the DCGAN training on MNIST dataset are:

Releases

No releases published

Packages

No packages published

Languages