MoMonir/gte-Qwen1.5-7B-instruct-GGUF

This model was converted to GGUF format from Alibaba-NLP/gte-Qwen1.5-7B-instruct using llama.cpp.
Refer to the original model card for more details on the model.

Note: This is an embedding model. It maps input text to a vector rather than generating text.

For more background on embeddings, see the OpenAI embeddings documentation.
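Because an embedding model outputs vectors, the usual way to use it is to compare those vectors, most often with cosine similarity. The sketch below illustrates that comparison step only. The vectors are small made-up placeholders rather than real output from this model, and the helper names `cosine_similarity` and `rank_documents` are hypothetical, chosen for illustration.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return (index, score) pairs sorted from most to least similar."""
    scores = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(doc_vecs)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Placeholder vectors (assumption): real embeddings from this model would be
# much longer and would come from whatever runtime loads the GGUF file.
query = [1.0, 0.0, 1.0]
docs = [
    [0.9, 0.1, 1.1],    # points in nearly the same direction as the query
    [0.0, 1.0, 0.0],    # orthogonal to the query
    [-1.0, 0.0, -1.0],  # points in the opposite direction
]

ranking = rank_documents(query, docs)
for index, score in ranking:
    print(f"doc {index}: {score:.3f}")
```

In a real retrieval setup, `query` and `docs` would be replaced by the vectors produced for a query string and for each candidate passage; the ranking step stays the same.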

Format: GGUF
Model size: 7.72B params
Architecture: qwen2
Available quantizations: 4-bit, 5-bit, 6-bit
