Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
xiryss
/
llm-course-hw2-ppo
like
0
Text Generation
Transformers
Safetensors
llama
conversational
text-generation-inference
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
xiryss
commited on
21 days ago
Commit
45b1897
·
verified
·
1 Parent(s):
b7a8b5f
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+1
-0
README.md
CHANGED
Viewed
@@ -23,6 +23,7 @@ Mean reward on training set is equal to 6.56
23
## Examples
24
25
**Before:**
26
user
27
28
What's your morning routine like?
23
## Examples
24
25
**Before:**
26
+
27
user
28
29
What's your morning routine like?