Question #4, opened by dondre
Wish I could test out this model, but it needs an A100 to load.
What kind of setup do you have?
Your models are great, but with only 48GB of VRAM I'm on the outside looking in.
Could you add some inference examples for these larger models?
There are quantizations available that need far less VRAM (GPTQ for GPU inference, GGML for llama.cpp on CPU); see the loading sketch after the links:
https://huggingface.co/TheBloke/WizardLM-Uncensored-SuperCOT-StoryTelling-30B-GPTQ
https://huggingface.co/TheBloke/WizardLM-Uncensored-SuperCOT-StoryTelling-30B-GGML
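
Since inference examples were asked for, here's a minimal sketch of loading the GPTQ quant on a single GPU. It assumes the repo works with auto-gptq's standard `from_quantized` API and ships safetensors weights; the prompt and generation settings are purely illustrative. A 4-bit 30B quant should fit comfortably in 48GB.

```python
# Minimal sketch: load the GPTQ quant with auto-gptq + transformers.
# Assumes `pip install auto-gptq transformers` and a CUDA GPU.
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

model_id = "TheBloke/WizardLM-Uncensored-SuperCOT-StoryTelling-30B-GPTQ"

tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=True)
model = AutoGPTQForCausalLM.from_quantized(
    model_id,
    device="cuda:0",
    use_safetensors=True,  # assumption: the repo ships .safetensors weights
)

# Illustrative prompt; adjust to the model's expected prompt format.
prompt = "Write a short story about a lighthouse keeper."
inputs = tokenizer(prompt, return_tensors="pt").to("cuda:0")
output = model.generate(
    **inputs,
    max_new_tokens=200,
    do_sample=True,
    temperature=0.7,
)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```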