Hi,
This isn't a homework question, but a side task given in a machine learning class I am taking.
Question: Using variational calculus, prove that, for a fixed ##p##, the KL-divergence is minimized by choosing ##q## equal to ##p##.
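For concreteness, here is a sketch of how the problem might be set up. The question does not say which direction of the divergence is meant or how ##q## is constrained, so the following are assumptions: the divergence is taken as ##D_{KL}(p\|q)##, both ##p## and ##q## are densities over ##x##, and ##q## must integrate to 1. The multiplier ##\lambda## and the functional ##J## are names introduced only for this sketch.

$$D_{KL}(p\|q) = \int p(x)\,\ln\frac{p(x)}{q(x)}\,dx, \qquad \int q(x)\,dx = 1$$

$$J[q] = \int p(x)\,\ln\frac{p(x)}{q(x)}\,dx + \lambda\left(\int q(x)\,dx - 1\right)$$

$$\frac{\delta J}{\delta q(x)} = -\frac{p(x)}{q(x)} + \lambda = 0 \quad\Rightarrow\quad q(x) = \frac{p(x)}{\lambda}$$

Under these assumptions, the normalization of ##p## and ##q## gives ##\lambda = 1##, so ##q = p##. The second variation, ##p(x)/q(x)^2 \ge 0##, suggests this stationary point is a minimum, and the divergence there is ##0##.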
Attempt:
Unfortunately, I have never seen...