Reading Assignment 5: Evaluating Generative Models

There is no single “correct” method for evaluating generative models. To start, you can get an overview of the different approaches from the following articles:

Beyond that, there is a very influential paper that shows how problematic evaluation really is: for example, different metrics can be effectively independent of one another, so a model that does well on one may do poorly on another.

Finally, here is an interesting article on the concept of typicality and how it relates to likelihood. Taken together, the latter two readings show that maximum likelihood may not be the ideal goal for training generative models.
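To get a feel for typicality before reading, here is a minimal sketch in Python with NumPy. It is not taken from the linked article. The function name `gaussian_typicality_demo` and its parameters are illustrative assumptions. It uses a standard normal distribution in many dimensions, where the density is highest at the origin, yet samples land at a distance of roughly sqrt(dim) from it.

```python
import numpy as np


def gaussian_typicality_demo(dim=1000, n_samples=2000, seed=0):
    """Compare the mode of a standard normal with where its samples fall.

    Hypothetical helper for illustration only; all names are assumptions.
    """
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((n_samples, dim))

    # Distance of each sample from the origin (the mode of the density).
    norms = np.linalg.norm(x, axis=1)

    # Log density of a standard normal: -0.5 * ||x||^2 - 0.5 * dim * log(2 * pi)
    log_norm_const = -0.5 * dim * np.log(2.0 * np.pi)
    log_p_samples = -0.5 * norms**2 + log_norm_const
    log_p_mode = log_norm_const  # density evaluated at the origin

    return {
        "expected_radius": float(np.sqrt(dim)),
        "mean_norm": float(norms.mean()),
        "min_norm": float(norms.min()),
        "log_p_mode": float(log_p_mode),
        "max_log_p_sample": float(log_p_samples.max()),
    }


if __name__ == "__main__":
    for name, value in gaussian_typicality_demo().items():
        print(f"{name}: {value:.2f}")
```

In this setting, even the highest-density sample has a log density far below that of the mode, and no sample comes close to the origin. The point with the highest likelihood is therefore not a typical sample, which is one way to see why likelihood alone can be a misleading guide.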