Exllama v2 defog/llama-3-sqlcoder-8b
Using turboderp's ExLlamaV2 v0.0.21 for quantization.
The "main" branch only contains the measurement.json, download one of the other branches for the model
Each branch contains an individual bits per weight, with the main one containing only the meaurement.json for further conversions.
Original model: defog/llama-3-sqlcoder-8b
Calibration dataset is based on: m-a-p/CodeFeedback-Filtered-Instruction
Available sizes
Branch | Bits | lm_head bits | Description |
---|---|---|---|
8_0 | 8.0 | 8.0 | Maximum quality that ExLlamaV2 can produce, near unquantized performance. |
6_5 | 6.5 | 8.0 | Very similar to 8.0, good tradeoff of size vs performance, recommended. |
5_5 | 5.5 | 8.0 | Slightly lower quality vs 6.5, but usable on 8GB cards. |
A capable language model for text to SQL generation for Postgres, Redshift and Snowflake that is on-par with the most capable generalist frontier models.
Model Description
Developed by: Defog, Inc Model type: [Text to SQL] License: [CC-by-SA-4.0] Finetuned from model: [Meta-Llama-3-8B-Instruct]
Demo Page
https://defog.ai/sqlcoder-demo/
Ideal prompt and inference parameters
Set temperature to 0, and do not do sampling.
Prompt
<|begin_of_text|><|start_header_id|>user<|end_header_id|>
Generate a SQL query to answer this question: `{user_question}`
{instructions}
DDL statements:
{create_table_statements}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
The following SQL query best answers the question `{user_question}`:
```sql
Evaluation
This model was evaluated on SQL-Eval, a PostgreSQL based evaluation framework developed by Defog for testing and alignment of model capabilities.
You can read more about the methodology behind SQLEval here.
Contact
Contact us on X at @defogdata, or on email at [email protected]