# Model Catalogue

- Repository of boutique AI models
- Still a work in progress

## Pretrained base models for embedding

English models were finetuned on a subset of [Zyphra/Zyda-2](https://huggingface.co/datasets/Zyphra/Zyda-2).
- [snowflake-arctic-embed-xs-zyda-2](https://huggingface.co/agentlans/snowflake-arctic-embed-xs-zyda-2)
- [deberta-v3-xsmall-zyda-2](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2)
- [deberta-v3-base-zyda-2](https://huggingface.co/agentlans/deberta-v3-base-zyda-2)

Multilingual models were aligned using [agentlans/en-translations](https://huggingface.co/datasets/agentlans/en-translations).
- [multilingual-e5-small-aligned](https://huggingface.co/agentlans/multilingual-e5-small-aligned)
- [distilbert-base-multilingual-cased-aligned](https://huggingface.co/agentlans/distilbert-base-multilingual-cased-aligned)

## Text statistics models

- **Input:** text
- **Output:** number

| **Base model**                | **Language** | **Quality**                                                                    | **Readability**                                                                    | **Sentiment**                                                                    |
|:-----------------------------:|:------------:|:------------------------------------------------------------------------------:|:----------------------------------------------------------------------------------:|:--------------------------------------------------------------------------------:|
| deberta-v3-xsmall-zyda-2      | English only | [Link](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2-quality)      | [Link](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2-readability)      | [Link](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2-sentiment)      |
| deberta-v3-base-zyda-2        | English only | [Link](https://huggingface.co/agentlans/deberta-v3-base-zyda-2-quality)        | [Link](https://huggingface.co/agentlans/deberta-v3-base-zyda-2-readability)        | [Link](https://huggingface.co/agentlans/deberta-v3-base-zyda-2-sentiment)        |
| multilingual-e5-small-aligned | Multilingual | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-quality) | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-readability) | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-sentiment) |
| mdeberta-v3-base              | Multilingual | [Link](https://huggingface.co/agentlans/mdeberta-v3-base-quality)              | [Link](https://huggingface.co/agentlans/mdeberta-v3-base-readability)              | [Link](https://huggingface.co/agentlans/mdeberta-v3-base-sentiment)              |

Note: the `mdeberta-v3-base` models were trained on an earlier version of the dataset rather than the complete dataset.
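The links in the table follow a regular naming pattern (`agentlans/<base model>-<statistic>`), so a model can be selected programmatically. The sketch below is a minimal illustration, not the author's documented usage: the helper names `repo_id` and `score` are hypothetical, and it assumes each model loads with `AutoModelForSequenceClassification` from the `transformers` library and exposes a single regression logit. Check the individual model cards before relying on it.

```python
BASES = (
    "deberta-v3-xsmall-zyda-2",
    "deberta-v3-base-zyda-2",
    "multilingual-e5-small-aligned",
    "mdeberta-v3-base",
)
STATISTICS = ("quality", "readability", "sentiment")


def repo_id(base: str, statistic: str) -> str:
    """Build the Hub repository ID using the naming pattern seen in the table."""
    if base not in BASES:
        raise ValueError(f"unknown base model: {base}")
    if statistic not in STATISTICS:
        raise ValueError(f"unknown statistic: {statistic}")
    return f"agentlans/{base}-{statistic}"


def score(text: str, base: str = "deberta-v3-xsmall-zyda-2",
          statistic: str = "quality") -> float:
    """Return the model's numeric output for one piece of text."""
    # Imported lazily so repo_id() works without torch or transformers installed.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    name = repo_id(base, statistic)
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForSequenceClassification.from_pretrained(name)
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits
    # Assumption: a single-logit regression head. item() raises an error
    # if the model returns more than one logit.
    return logits.squeeze().item()
```

Calling `score("Some text to rate.", statistic="readability")` downloads the model on first use and returns one float. The scale of that number is defined by each model card, not by this sketch.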

## Small text-to-text models

- **Input:** text
- **Output:** text

These are English only.

| **Task**           | **Model**                                                                         | **Dataset**                                                                                            |
|:------------------:|:---------------------------------------------------------------------------------:|:------------------------------------------------------------------------------------------------------:|
| Keyword extraction | [flan-t5-small-keywords](https://huggingface.co/agentlans/flan-t5-small-keywords) | [wikipedia-paragraph-keywords](https://huggingface.co/datasets/agentlans/wikipedia-paragraph-keywords) |
| Title generation   | [flan-t5-small-title](https://huggingface.co/agentlans/flan-t5-small-title)       | [wikipedia-paragraph-titles](https://huggingface.co/datasets/agentlans/wikipedia-paragraph-titles)     |
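As a rough illustration of the text-in, text-out interface, the sketch below loads one of the two models with the `transformers` sequence-to-sequence classes. The `MODELS` mapping and the `generate` helper are hypothetical names, and the code assumes the raw paragraph is passed without a task prefix. The model cards may specify a different input format.

```python
MODELS = {
    "keywords": "agentlans/flan-t5-small-keywords",
    "title": "agentlans/flan-t5-small-title",
}


def generate(task: str, text: str, max_new_tokens: int = 64) -> str:
    """Run one of the small text-to-text models on a paragraph."""
    # Imported lazily so the MODELS mapping is usable without transformers.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    name = MODELS[task]  # KeyError for tasks not listed in the table
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForSeq2SeqLM.from_pretrained(name)
    # Assumption: the paragraph is fed as-is, with no instruction prefix.
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

For example, `generate("title", paragraph)` would return a generated title as a plain string, and `generate("keywords", paragraph)` the extracted keywords.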

## Natural language inference (NLI) models

- **Input:** text
- **Output:** label (entailment, neutral, or contradiction)

These are English only.

- [all-MiniLM-L6-v2-nli](https://huggingface.co/agentlans/all-MiniLM-L6-v2-nli)
- [bge-small-en-v1.5-nli](https://huggingface.co/agentlans/bge-small-en-v1.5-nli)
- [e5-small-v2-nli](https://huggingface.co/agentlans/e5-small-v2-nli)
- [mobilebert-uncased-nli](https://huggingface.co/agentlans/mobilebert-uncased-nli)
- [NoInstruct-small-Embedding-v0-nli](https://huggingface.co/agentlans/NoInstruct-small-Embedding-v0-nli)
- [snowflake-arctic-embed-s-nli](https://huggingface.co/agentlans/snowflake-arctic-embed-s-nli)
- [snowflake-arctic-embed-xs-nli](https://huggingface.co/agentlans/snowflake-arctic-embed-xs-nli)
- [TinyBERT_General_4L_312D-nli](https://huggingface.co/agentlans/TinyBERT_General_4L_312D-nli)
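The following sketch shows one way such a model might be called. It assumes the models load with `AutoModelForSequenceClassification`, that the premise and hypothesis are encoded together as a sentence pair, and that the label names come from the model's own `id2label` configuration. The helper names `softmax`, `best_label`, and `classify` are hypothetical, and the default model is an arbitrary pick from the list above.

```python
import math


def softmax(logits):
    """Convert raw logits to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def best_label(logits, id2label):
    """Return the most probable label and its probability."""
    probs = softmax(logits)
    idx = max(range(len(probs)), key=probs.__getitem__)
    return id2label[idx], probs[idx]


def classify(premise: str, hypothesis: str,
             name: str = "agentlans/e5-small-v2-nli"):
    """Predict entailment, neutral, or contradiction for a sentence pair."""
    # Imported lazily so the helpers above work without torch or transformers.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForSequenceClassification.from_pretrained(name)
    # Assumption: premise and hypothesis are passed as a sentence pair.
    inputs = tokenizer(premise, hypothesis, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()
    # The index-to-label mapping is read from the model config, not hard-coded.
    return best_label(logits, model.config.id2label)
```

A call such as `classify("A man is playing a guitar.", "A person is making music.")` returns a `(label, probability)` tuple. Reading the labels from the config avoids guessing the label order, which can differ between models.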