Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
Dovakiins
/
qwerrwe
Build error

App Files Files Community
Fetching metadata from the HF Docker repository...
qwerrwe / scripts
Ctrl+K
Ctrl+K
  • 100 contributors
History: 16 commits
winglian's picture
winglian
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
d1aed4c about 2 years ago
  • alpaca_json_to_jsonl.py
    834 Bytes
    black formatting about 2 years ago
  • finetune.py
    16.6 kB
    deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches about 2 years ago