---
license: mit
---

amanpreetsingh459/llama-2-7b-chat_q4_quantized_cpp

  • This repository contains a 4-bit quantized version of the Llama 2 7B Chat model.
  • It can be run locally on a CPU using llama.cpp, available at: https://github.com/ggerganov/llama.cpp
  • The model has been tested on Ubuntu Linux with 12 GB of RAM and an Intel Core i5 processor.
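A minimal sketch of running the model locally with llama.cpp, per the bullets above. The model filename and prompt below are illustrative assumptions (use the actual file from this repository), and the binary name may differ across llama.cpp versions:

```shell
# Build llama.cpp from source (CPU-only build)
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
make

# Run the 4-bit quantized model on the CPU.
# NOTE: the model filename here is an assumption --
# substitute the quantized file downloaded from this repo.
./main -m ./models/llama-2-7b-chat_q4.bin \
       -p "What is 4-bit quantization?" \
       -n 128
```

The `-n` flag caps the number of tokens generated; on a 12 GB machine the 4-bit model should fit comfortably in RAM.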

Credits:

  1. https://github.com/facebookresearch/llama
  2. https://github.com/ggerganov/llama.cpp
  3. https://medium.com/@karankakwani/build-and-run-llama2-llm-locally-a3b393c1570e