Update README.md
README.md
CHANGED
```diff
@@ -7,7 +7,7 @@ language:
 - tr
 pipeline_tag: text-generation
 widget:
-- text: "Benim adım Zeynep, ve en sevdiğim
+- text: "Benim adım Zeynep, ve en sevdiğim kitabın adı:"
   example_title: "Benim adım Zeynep"
 - text: "Bugünkü yemeğimiz"
   example_title: "Bugünkü yemeğimiz"
@@ -15,7 +15,7 @@ widget:
 
 # Kanarya-750M: Turkish Language Model
 
-<img src="https://asafaya.me/images/kanarya.webp" alt="Kanarya Logo" style="width:
+<img src="https://asafaya.me/images/kanarya.webp" alt="Kanarya Logo" style="width:600px;"/>
 
 **Kanarya** is a pre-trained Turkish GPT-J 750M model. Released as part of the [Turkish Data Depository](https://tdd.ai/) effort, the Kanarya family comes in two sizes: the larger Kanarya-2B and the smaller Kanarya-0.7B. Both models are trained on a large-scale Turkish text corpus filtered from the OSCAR and mC4 datasets; the training data is drawn from news, articles, and websites to form a diverse, high-quality dataset. The models are trained with a JAX/Flax implementation of the [GPT-J](https://github.com/kingoflolz/mesh-transformer-jax) architecture. They are pre-trained only and are intended to be fine-tuned on a wide range of Turkish NLP tasks.
 
```
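Since the front matter declares `pipeline_tag: text-generation`, a minimal usage sketch with the Transformers `pipeline` API might look like the following. The Hub repository id `asafaya/kanarya-750m` is an assumption here (substitute the actual model id), and the prompt is taken from one of the widget examples above.

```python
# Minimal sketch, assuming the model is published on the Hugging Face Hub
# under the (assumed) repository id "asafaya/kanarya-750m".
from transformers import pipeline

# "text-generation" matches the pipeline_tag declared in the README front matter.
generator = pipeline("text-generation", model="asafaya/kanarya-750m")

# Prompt taken from the README's widget examples.
result = generator("Bugünkü yemeğimiz", max_new_tokens=40, do_sample=True)
print(result[0]["generated_text"])
```

Because the model is pre-trained only, the output is a raw continuation of the prompt; for downstream tasks, fine-tuning is expected, as the README notes.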