Discussion: June 15th
Guess what, we’re doing GANs this time.
Implementing the basic GAN logic isn’t too difficult. You will likely want to
use low-level training loops (i.e. GradientTape
) because of the non-trvial
control flow. There are many examples around the web that can help you get
started.
This should be enough for a functional training procedure. Train some models and generate samples for evaluation. They will most likely be terrible.
Take note: Evaluating whether GAN training is progressing/”working” is difficult. The loss values are not very informative. You will most likely want to take some samples and plot them every so often while training is progressing to get an impression of the current state. However, even this can be misleading: You might run into mode collapse problems early on, which you can take as evidence that training is not working, and stop the process early. However, it could actually happen that the mode collapse “magically” gets fixed over the course of a few training iterations, and diverse samples are produced. For this reason, consider always training for a large number of steps (larger than e.g. VAEs) and just see what happens.
GANs are notoriously difficult to train. In the rest of this assignment, you are asked to try out various ways to improve the basic procedure. There are countless advanced GAN variants, but for now you may focus on “tricks” to make the original formulation more stable. Here are some leads:
Include as many of these methods as you want/need into your model and try to achieve some nice samples!