MoMonir/gte-Qwen1.5-7B-instruct-GGUF

This model was converted to GGUF format from Alibaba-NLP/gte-Qwen1.5-7B-instruct using llama.cpp.
Refer to the original model card for more details on the model.

Note: This is an embedding model. It produces vector representations of text rather than generated text.

For more information about embeddings, see the OpenAI embeddings documentation.
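
Embedding vectors are usually compared with cosine similarity. The sketch below is only an illustration of that comparison, not part of this repository: the vectors are hypothetical toy values standing in for real model output (which has far more dimensions), and the function name `cosine_similarity` is an assumption made for this example.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical toy embeddings; real embeddings would come from the model.
query = [0.1, 0.3, 0.5]
doc_close = [0.2, 0.6, 1.0]   # points in the same direction as the query
doc_far = [0.5, -0.3, 0.0]    # points in a different direction

# Rank the documents by similarity to the query, highest first.
scores = {
    "doc_close": cosine_similarity(query, doc_close),
    "doc_far": cosine_similarity(query, doc_far),
}
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking)
```

A higher score means the two texts are closer in the embedding space, so in a retrieval setting the documents would be returned in the order given by `ranking`.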

Format: GGUF
Model size: 7.72B params
Architecture: qwen2
Available quantizations: 4-bit, 5-bit, 6-bit
