Add links for github repo
README.md
CHANGED
@@ -49,6 +49,9 @@ CUDA_VISIBLE_DEVICES=0 python run_glue.py \
 
 We believe these results could be improved with modest tuning of hyperparameters: `per_device_train_batch_size`, `learning_rate`, `num_train_epochs`, `num_grads` and `damp`. For the sake of fair comparison and a robust default setup we use the same hyperparameters across all models (`bert-tiny`, `bert-mini`) and all datasets (SQuAD version 2 and GLUE).
 
+Our code for M-FAC can be found here: [https://github.com/IST-DASLab/M-FAC](https://github.com/IST-DASLab/M-FAC).
+A step-by-step tutorial on how to integrate and use M-FAC with any repository can be found here: [https://github.com/IST-DASLab/M-FAC/tree/master/tutorials](https://github.com/IST-DASLab/M-FAC/tree/master/tutorials).
+
 ## BibTeX entry and citation info
 
 ```bibtex