license: mit | |
pipeline_tag: feature-extraction | |
This repository contains the GPT-Soft-1.2b model described in [Cottention: Linear Transformers With Cosine Attention](https://huggingface.co/papers/2409.18747). |
license: mit | |
pipeline_tag: feature-extraction | |
This repository contains the GPT-Soft-1.2b model described in [Cottention: Linear Transformers With Cosine Attention](https://huggingface.co/papers/2409.18747). |