code

#4
by rakmik - opened

How do I run it?
Is there Python code?

at676 changed discussion status to closed

On a Colab T4.

RelaxML org

I cannot help you if you are unwilling or unable to read instructions. Our generation pipeline is contained here: https://github.com/Cornell-RelaxML/qtip/blob/main/eval/interactive_gen.py. We have not tested our code with the transformers pipeline class, and it most likely does not work there, as you have experienced.

Basically, it does not install at all.
Do you have a Colab page that has been tested?
A Colab that worked for you will work for others.
The problem is that if you follow the instructions, it does not run.

I tried all the instructions on your code pages and it didn't work, because the core libraries don't install. I suspect they depend on kernels that don't work in Colab.

I cannot help you if you are unwilling or unable to read instructions

Don't blame me for following the instructions or not. If you have an up-to-date Colab page, please share it. That would be much better. Thank you for your effort, but I can't try it.

Install the fast-hadamard-transform package

python -m eval.interactive_gen --hf_path relaxml/Llama-2-13b-QTIP-3Bit --max_new_tokens 256 --streaming
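
For anyone trying the command above from a notebook cell, here is a minimal sketch of how it could be assembled and pre-checked in Python. It has not been tested on a Colab T4. It assumes the qtip repository is already cloned and is the working directory, and that the fast-hadamard-transform package is importable as `fast_hadamard_transform`. The helper names `build_command` and `missing_modules` are made up for this example and are not part of the qtip code.

```python
import importlib.util
import shlex
import sys


def build_command(hf_path, max_new_tokens=256, streaming=True):
    """Build the argument list for the eval.interactive_gen command in this thread."""
    cmd = [
        sys.executable, "-m", "eval.interactive_gen",
        "--hf_path", hf_path,
        "--max_new_tokens", str(max_new_tokens),
    ]
    if streaming:
        cmd.append("--streaming")
    return cmd


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Assumption: these are the import names of the packages the run depends on.
required = ["torch", "fast_hadamard_transform"]
missing = missing_modules(required)
if missing:
    print("Not importable, install these first:", ", ".join(missing))

cmd = build_command("relaxml/Llama-2-13b-QTIP-3Bit")
# Print the command instead of running it; to launch it from the repository
# root, pass cmd to subprocess.run(cmd, check=True).
print(shlex.join(cmd))
```

Checking the imports first separates a failed installation from a failure inside the generation script, which is the distinction this thread keeps running into.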

How can I run these awesome compressed models on a Colab T4 without installation complications?
