Would be cool to see how different FSDP strategies and N of GPUs affects the required memory.
· Sign up or log in to comment