π DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation
./hubert_ckpt
: hubert audio encoder, same as hubert-base
./pbnet_both/checkpoint_100000.pth.tar
: checkpoint for the Pose-Blink generation network
DAWN_128.pth
: checkpoint for the A2V-FDM with 128 * 128 resolution
DAWN_256.pth
: checkpoint for the A2V-FDM with 256 * 256 resolution
LFG_128_1000ep.pth
: checkpoint for the LFG with 128 * 128 resolution
LFG_256_400ep.pth
: checkpoint for the LFG with 256 * 256 resolution