Blackroot commited on
Commit
9cea78c
·
verified ·
1 Parent(s): 0512a5a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -1,3 +1,5 @@
 
 
1
 
2
  # Training Metrics
3
 
 
1
+ Test network using differential attention instead of classical attention. Other than some alterations to the attention, this is otherwise the same configuration as https://huggingface.co/HuggingFaceTB/SmolLM2-135M-Instruct
2
+
3
 
4
  # Training Metrics
5