Out of memory when using mergekit

#10
by kongzym - opened

Hey, great work @mlabonne

I am currently using mergekit to merge two 70B models on a node with 8×A800 GPUs, but I am running into out-of-memory errors.
Do you know whether mergekit supports loading a model across multiple GPUs? I checked the code but could not find the relevant part.

Thanks in advance!

Hey, you shouldn't need any GPU to perform the merge. Perhaps you need more RAM?
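
For a rough sense of the RAM involved, here is a back-of-the-envelope sketch in Python. It assumes 16-bit weights (2 bytes per parameter) and counts only the raw weights. The function name `weights_gib` is made up for illustration, and how much of this mergekit actually keeps in memory at once is not confirmed in this thread.

```python
def weights_gib(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate size of a model's raw weights in GiB.

    Assumption: every parameter is stored at the same width
    (2 bytes by default, i.e. fp16/bf16).
    """
    return num_params * bytes_per_param / 2**30


# A 70B-parameter model, assuming 16-bit weights.
one_model = weights_gib(70e9)

# Two input models; the merged output would need space of its own as well.
two_models = 2 * one_model

print(f"one model: {one_model:.1f} GiB, two models: {two_models:.1f} GiB")
```

If the whole merge has to fit in memory at once, that is the order of magnitude to compare against the node's available RAM.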

Thanks for your reply! I am using merge-evolve, which implements CMA-ES. I guess this requires GPUs.

Oh, alright. Sorry, I'm not familiar with this technique.

It's okay. Thank you!
