TOG 2022 | ControlVAE: Model-Based Learning of Generative Controllers for Physics-Based Characters


Basic Elements

💡

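A vanilla VAE usually assumes a fixed prior over the latent variable, that is, a distribution that does not depend on the state. Here the state is used as a condition, so different latent variables can be generated depending on the current state.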

💡

The distribution here is an approximation of the posterior. Do not confuse it with .

💡

During inference, the posterior is never used. This mirrors a vanilla VAE, which uses only the decoder at inference time and not the encoder.

💡

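In a vanilla VAE, the encoder is trained to bring the posterior as close as possible to the distribution of the latent variable, which is usually fixed in advance, such as a standard normal distribution. Here the distribution of the latent variable depends on the current state and cannot be defined in advance, so the two distributions have to be trained jointly.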
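As a minimal sketch of the quantity involved when both distributions are learned, the snippet below computes the closed-form KL divergence between two diagonal Gaussians in plain Python. It assumes that both the approximate posterior and the state-conditional prior are diagonal Gaussians; the function name `diag_gaussian_kl` and its arguments are illustrative and not taken from the paper.

```python
import math
from typing import Sequence


def diag_gaussian_kl(
    mu_q: Sequence[float],
    sigma_q: Sequence[float],
    mu_p: Sequence[float],
    sigma_p: Sequence[float],
) -> float:
    """KL(q || p) for two diagonal Gaussians, summed over dimensions.

    q plays the role of the approximate posterior and p the role of the
    state-conditional prior (both assumed Gaussian for this sketch).
    """
    total = 0.0
    for mq, sq, mp, sp in zip(mu_q, sigma_q, mu_p, sigma_p):
        # Per-dimension closed form for univariate Gaussians.
        total += math.log(sp / sq) + (sq**2 + (mq - mp) ** 2) / (2 * sp**2) - 0.5
    return total


# Hypothetical values: in practice the means would come from networks
# that take the current state as input.
kl = diag_gaussian_kl([1.0, 0.0], [0.5, 0.5], [0.0, 0.0], [0.5, 0.5])
```

When the two standard deviations are equal, the log term vanishes and the variance terms cancel, so the result reduces to the squared distance between the means divided by twice the variance.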

For each training epoch

The temporary buffer is merged into the main buffer, replacing the oldest trajectories and keeping the size of the main buffer smaller than its size limit.
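The merge step can be sketched with a bounded FIFO queue. This is only an illustration: the names `merge_buffers` and `max_size` are hypothetical, trajectories are stood in for by plain integers, and the size limit is treated here as an inclusive cap, which is an assumption.

```python
from collections import deque
from typing import Deque, Iterable, TypeVar

T = TypeVar("T")


def merge_buffers(main_buffer: Iterable[T], temp_buffer: Iterable[T],
                  max_size: int) -> Deque[T]:
    """Append new trajectories, discarding the oldest ones beyond max_size."""
    # A deque with maxlen drops items from the left (oldest) when it is full.
    merged: Deque[T] = deque(main_buffer, maxlen=max_size)
    merged.extend(temp_buffer)
    return merged


# Usage: the oldest trajectory (1) is replaced once the cap of 4 is reached.
buffer = merge_buffers([1, 2, 3], [4, 5], max_size=4)
```

Using `deque(maxlen=...)` keeps the eviction policy implicit: the newest trajectories are always retained and the buffer never grows past the cap.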

increases once every 500 training epochs from 0.01 to 0.1 during the training.
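A stepped schedule of this kind could look roughly like the sketch below. The notes only give the interval (500 epochs) and the endpoints (0.01 and 0.1); the equal-sized linear increments, the `total_epochs` value, and the function name `stepped_value` are all assumptions made for illustration.

```python
def stepped_value(epoch: int, start: float = 0.01, end: float = 0.1,
                  step_every: int = 500, total_epochs: int = 5000) -> float:
    """Piecewise-constant schedule that steps from start to end."""
    # Number of increments over the run (total_epochs is assumed, not from the notes).
    num_steps = max(total_epochs // step_every, 1)
    # The value only changes after each full interval of step_every epochs.
    step_index = min(epoch // step_every, num_steps)
    # Assumed equal-sized (linear) increments between the two endpoints.
    return start + (end - start) * step_index / num_steps


# Usage: the value is held constant within each 500-epoch interval.
values = [stepped_value(e) for e in (0, 499, 500, 5000)]
```

The value stays at the upper endpoint for any epoch past `total_epochs`, so the schedule is safe to call for longer runs.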

where is a network. With some algebra, we can make the KL divergence independent of the