Exllama v2 defog/llama-3-sqlcoder-8b

Using turboderp's ExLlamaV2 v0.0.21 for quantization.

The "main" branch only contains the measurement.json, download one of the other branches for the model

Each branch contains an individual bits per weight, with the main one containing only the meaurement.json for further conversions.

Original model: defog/llama-3-sqlcoder-8b
Calibration dataset is based on: m-a-p/CodeFeedback-Filtered-Instruction

Available sizes

Branch	Bits	lm_head bits	Description
8_0	8.0	8.0	Maximum quality that ExLlamaV2 can produce, near unquantized performance.
6_5	6.5	8.0	Very similar to 8.0, good tradeoff of size vs performance, recommended.
5_5	5.5	8.0	Slightly lower quality vs 6.5, but usable on 8GB cards.

A capable language model for text to SQL generation for Postgres, Redshift and Snowflake that is on-par with the most capable generalist frontier models.

Model Description

Developed by: Defog, Inc Model type: [Text to SQL] License: [CC-by-SA-4.0] Finetuned from model: [Meta-Llama-3-8B-Instruct]

Demo Page

https://defog.ai/sqlcoder-demo/

Ideal prompt and inference parameters

Set temperature to 0, and do not do sampling.

Prompt

<|begin_of_text|><|start_header_id|>user<|end_header_id|>

Generate a SQL query to answer this question: `{user_question}`
{instructions}

DDL statements:
{create_table_statements}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

The following SQL query best answers the question `{user_question}`:
```sql

Evaluation

This model was evaluated on SQL-Eval, a PostgreSQL based evaluation framework developed by Defog for testing and alignment of model capabilities.

You can read more about the methodology behind SQLEval here.

Apel-sin
/

llama-3-sqlcoder-8b-exl2