library_name: transformers | |
tags: [] | |
AWQ Quantized version of [cognitivecomputations/dolphin-2.9-llama3-70b](/cognitivecomputations/dolphin-2.9-llama3-70b). | |
For use with vllm and other inference engines. |
library_name: transformers | |
tags: [] | |
AWQ Quantized version of [cognitivecomputations/dolphin-2.9-llama3-70b](/cognitivecomputations/dolphin-2.9-llama3-70b). | |
For use with vllm and other inference engines. |