ArthurFischel committed on
Commit
cb9c6a8
1 Parent(s): 680f0fc

Update README.md

  1. README.md +8 -0
README.md CHANGED
@@ -19,6 +19,14 @@ I want to use small llms and a focused data set so I can really get a good idea

  My rough strategy is to use the TinyStories data set, a trajectory-based data set built only from the human Monk trajectories, and select categories from the NetHack Wiki. I plan to run some ablations to see which data sets are critical. Once I have basic instruction tuning working, so that the model can follow small, basic instructions, I will attempt to implement some combination of ideas from papers that I've been interested in.

+ My justifications for the data sets:
+
+ TinyStories: I want to give the model a basic understanding of the English language so that it can hopefully understand what's happening in the wiki text or in any of the game messages that NetHack produces.
+
+ Trajectory data set: this carefully formatted data set will be used to structure how the agent behaves and how I parse out the states, actions, and other information I'm interested in.
+
+ Subset of the NetHack Wiki: I will build a subset data set containing the categories that I think would be most useful to an agent that needs information about things inside the game.
+
  I am carefully formatting my trajectory data set for two reasons. One, I want to make parsing trivial. Two, I am assuming that a regular, consistent format for the states presented to the LLM will let it generate the output I want more easily. This is just an assumption.
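
To make the "parsing should be trivial" goal concrete, here is a minimal sketch of one possible regular record layout and its parser. The tag names (`<state>`, `<action>`), the `key: value` field layout, and the helpers `format_step` and `parse_steps` are all hypothetical and are not the actual format used in this project; the sketch also assumes field values contain no newlines.

```python
import re
from dataclasses import dataclass

# Hypothetical record layout (an assumption, not the project's real format):
# one <state>...</state> block of "key: value" lines, then one <action> tag.
STEP_RE = re.compile(
    r"<state>\n(?P<state>.*?)\n</state>\n<action>(?P<action>[^<]*)</action>",
    re.DOTALL,
)


@dataclass
class Step:
    state: dict[str, str]
    action: str


def format_step(state: dict[str, str], action: str) -> str:
    """Serialize one (state, action) pair into the fixed layout."""
    body = "\n".join(f"{key}: {value}" for key, value in state.items())
    return f"<state>\n{body}\n</state>\n<action>{action}</action>"


def parse_steps(text: str) -> list[Step]:
    """Recover every (state, action) pair from serialized text."""
    steps = []
    for match in STEP_RE.finditer(text):
        fields = {}
        for line in match.group("state").splitlines():
            # Split on the first ": " only, so values may contain colons.
            key, _, value = line.partition(": ")
            fields[key] = value
        steps.append(Step(state=fields, action=match.group("action")))
    return steps
```

Because every step uses the same layout, a single regular expression is enough both to read the training data back and to pull the action out of model output that follows the same layout.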

  # 7/25/23