Model Card for Model ID

Model Details

Model Description

TinyLlama/TinyLlama-1.1B-Chat-v1.0 finetuned on cardiffnlp/tweet_eval for sentiment-analysis task with trl's SFTTrainer.

How to Get Started with the Model

Use the code below to get started with the model.


model = AutoModelForCausalLM.from_pretrained(f"efromomr/llm-course-hw3-tinyllama-qlora", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(f"efromomr/llm-course-hw3-tinyllama-qlora")
tokenizer.pad_token = tokenizer.eos_token
tokenizer.padding_side = "left"

input_ids = tokenizer(text, return_tensors="pt").input_ids

output_ids = model.generate(input_ids, max_new_tokens=16)
generated_text = tokenizer.decode(output_ids[0][len(input_ids[0]) :], skip_special_tokens=True)
print(generated_text)
#positive

efromomr
/

llm-course-hw3-tinyllama-qlora

Model Card for Model ID

Model Details

Model Description

How to Get Started with the Model

Training Details

Training Data

Evaluation

Testing Data, Factors & Metrics

Metrics

Model tree for efromomr/llm-course-hw3-tinyllama-qlora

Dataset used to train efromomr/llm-course-hw3-tinyllama-qlora

Collection including efromomr/llm-course-hw3-tinyllama-qlora

LLM_course_hw3