|
--- |
|
license: openrail |
|
datasets: |
|
- WelfCrozzo/kupalinka |
|
language: |
|
- be |
|
- en |
|
- ru |
|
metrics: |
|
- bleu |
|
library_name: transformers |
|
tags: |
|
- translation |
|
widget: |
|
- text: "<extra_id_1>да зорак праз цяжкасці" |
|
example_title: "be -> ru" |
|
- text: "<extra_id_2>да зорак праз цяжкасці" |
|
example_title: "be -> en" |
|
- text: "<extra_id_3>к звездам через трудности" |
|
example_title: "ru -> be" |
|
- text: "<extra_id_5>к звездам через трудности" |
|
example_title: "ru -> en" |
|
- text: "<extra_id_6>to the stars through difficulties." |
|
example_title: "en -> be" |
|
- text: "<extra_id_7>to the stars through difficulties." |
|
example_title: "en -> ru" |
|
--- |
|
|
|
# T5 for the Belarusian language
|
|
|
![model image](https://camo.githubusercontent.com/623b4dea0b653f2ad3f36c71ebfe749a677ac0a1/68747470733a2f2f6d69726f2e6d656469756d2e636f6d2f6d61782f343030362f312a44304a31674e51663876727255704b657944387750412e706e67) |
|
|
|
This model is based on T5-small with a sequence length of 128 tokens. The model was trained from scratch on a single RTX 3090 (24 GB).
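
Because the model was trained with a 128-token window, longer inputs should be truncated at tokenization time. A minimal sketch of that call, using standard `transformers` tokenizer arguments rather than an author-provided script (the task prefix is explained in the next section):

```python
from transformers import T5TokenizerFast

tokenizer = T5TokenizerFast.from_pretrained("WelfCrozzo/T5-L128-belarusian")

# Truncate inputs to the 128-token window the model was trained with.
inputs = tokenizer(
    "<extra_id_1>да зорак праз цяжкасці",
    max_length=128,
    truncation=True,
    return_tensors="pt",
)
```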
|
|
|
# Supported tasks: |
|
- translation BE to RU: `<extra_id_1>` |
|
- translation BE to EN: `<extra_id_2>` |
|
- translation RU to BE: `<extra_id_3>` |
|
- translation RU to EN: `<extra_id_5>` |
|
- translation EN to BE: `<extra_id_6>` |
|
- translation EN to RU: `<extra_id_7>` |
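
Each direction is selected by prepending the corresponding sentinel token to the source text. A small helper along these lines can keep the mapping in one place (the dictionary and function names are illustrative, not part of the released code):

```python
# Map (source, target) language pairs to the sentinel token the model expects.
TASK_PREFIXES = {
    ("be", "ru"): "<extra_id_1>",
    ("be", "en"): "<extra_id_2>",
    ("ru", "be"): "<extra_id_3>",
    ("ru", "en"): "<extra_id_5>",
    ("en", "be"): "<extra_id_6>",
    ("en", "ru"): "<extra_id_7>",
}

def add_task_prefix(text: str, src: str, tgt: str) -> str:
    """Prepend the sentinel token that selects the translation direction."""
    return TASK_PREFIXES[(src, tgt)] + text

print(add_task_prefix("да зорак праз цяжкасці", "be", "en"))
# -> "<extra_id_2>да зорак праз цяжкасці"
```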
|
|
|
# Metrics: |
|
- [eval/BLEU](https://api.wandb.ai/links/miklgr500/31mq4s36)

- [eval/loss](https://api.wandb.ai/links/miklgr500/rvi2p69n)
|
- [train/loss](https://api.wandb.ai/links/miklgr500/z9alu3n5) |
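
The links above point to the Weights & Biases runs logged during training. To compute a comparable BLEU score on your own held-out pairs, a library such as `sacrebleu` can be used (this is an illustration, not the exact evaluation script behind the numbers above):

```python
import sacrebleu

# Hypothetical model outputs and references; replace with your own data.
hypotheses = ["to the stars through difficulties"]
references = [["to the stars through difficulties"]]  # one reference stream

bleu = sacrebleu.corpus_bleu(hypotheses, references)
print(bleu.score)
```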
|
|
|
# How to Get Started with the Model |
|
|
|
<details> |
|
<summary> Click to expand </summary> |
|
|
|
```python |
|
from transformers import T5TokenizerFast, T5ForConditionalGeneration

# Load the tokenizer and model from the Hub.
tokenizer = T5TokenizerFast.from_pretrained("WelfCrozzo/T5-L128-belarusian")
model = T5ForConditionalGeneration.from_pretrained("WelfCrozzo/T5-L128-belarusian")

# <extra_id_1> selects the BE -> RU translation direction.
x = tokenizer.encode('<extra_id_1>да зорак праз цяжкасці', return_tensors='pt')

result = model.generate(x, return_dict_in_generate=True, output_scores=True, max_length=128)
print(tokenizer.decode(result["sequences"][0], skip_special_tokens=True))
|
``` |
|
</details> |
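
The same prefixed inputs also work with the high-level `text2text-generation` pipeline; a short sketch using standard `transformers` pipeline usage (not something specific to this model's codebase):

```python
from transformers import pipeline

translator = pipeline("text2text-generation", model="WelfCrozzo/T5-L128-belarusian")

# The task prefix selects the direction, exactly as in the example above.
print(translator("<extra_id_2>да зорак праз цяжкасці", max_length=128))
```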
|
|
|
# References |
|
- [Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer](https://jmlr.org/papers/volume21/20-074/20-074.pdf) |