I'm currently studying the original GAN objective, and I'm struggling to understand that.
$$ \min_{G}\,\max_{D}\;V(D,G) \;=\; \mathbb{E}_{x\sim p_{\mathrm{data}}(x)}\bigl[\log D(x)\bigr] \;+\; \mathbb{E}_{z\sim p_{z}(z)}\bigl[\log\bigl(1 - D(G(z))\bigr)\bigr]$$
And I understand that
- D(x) is the discriminator’s estimate that a real sample x is actually a real example
- D(G(z)) is the discriminator’s estimate that a “fake” sample G(z) is real
So intuitively, I'd say:
- Discriminator’s goal:
- Maximaze D(x).
- Minimaze D(G(z)).
- Generator’s goal
- Maximize D(G(z)) to fool the discriminator
So, why is the formula structured that way with a log(1-)?
Any pointers to an intuitive derivation or references that explain this choice would be greatly appreciated!