The 34G model is too large for my machine (16G of CPU RAM) to load. So I wonder whether we can set max_shard_size or something similar to split the checkpoint into smaller files; then we could load the model onto the GPU piece by piece.