Active filters: RL
Text Classification • Updated • 41 • 9
nvidia/Nemotron-Cascade-8B-Thinking • Text Generation • 8B • Updated • 1.33k • 37
nvidia/Nemotron-Cascade-14B-Thinking • Text Generation • 15B • Updated • 7.67k • 73
prithivMLmods/KAIROS-MM-Qwen2.5-VL-7B-RL • Video-Text-to-Text • 8B • Updated • 43 • 3
stanfordnlp/SteamSHP-flan-t5-xl • Updated • 5 • 43
stanfordnlp/SteamSHP-flan-t5-large • Updated • 87 • 33
SultanR/SmolTulu-1.7b-Reinforced • Text Generation • 2B • Updated • 9 • 5
mradermacher/SmolTulu-1.7b-Reinforced-GGUF • 2B • Updated • 37
Daemontatox/Llama3.3-70B-CogniLink • Text Generation • 71B • Updated • 43 • 3
mradermacher/Llama3.3-70B-CogniLink-GGUF • Text Generation • 71B • Updated • 53
mradermacher/Llama3.3-70B-CogniLink-i1-GGUF • Text Generation • 71B • Updated • 69
JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora • Reinforcement Learning • Updated
JHuel/Mistral-Nemo-Instruct-2407_ORPO • Text Generation • Updated
Ihor/Text2Graph-R1-Qwen2.5-0.5b • Text Generation • 0.5B • Updated • 14 • 24
Reinforcement Learning • Updated
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF • 0.5B • Updated • 111 • 1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF • 0.5B • Updated • 204 • 1
mradermacher/QuadConnect2.5-0.5B-v0.0.3b-GGUF • 0.5B • Updated • 59
Text Generation • 684B • Updated • 47 • 1
mradermacher/QuadConnect2.5-0.5B-v0.0.8b-GGUF • 0.5B • Updated • 161
Lyte/QuadConnect2.5-0.5B-v0.0.9b • Text Generation • 0.5B • Updated • 35
mradermacher/QuadConnect2.5-0.5B-v0.0.9b-GGUF • 0.5B • Updated • 55
Lyte/QuadConnect2.5-1.5B-v0.1.0b • Text Generation • 2B • Updated • 28 • 1
mradermacher/QuadConnect2.5-1.5B-v0.1.0b-GGUF • 2B • Updated • 60 • 1
mradermacher/Zireal-0-GGUF
mradermacher/Magellanic-Qwen-25B-R999-GGUF • 25B • Updated • 42 • 1
mradermacher/Magellanic-Qwen-25B-R999-i1-GGUF • 25B • Updated • 80 • 1
VaidikML0508/Shark-Tank-Offer-Evaluator-llama3.2-3B-Instruct-SFT-DPO-4bits-V1 • Text Generation • 3B • Updated • 1
Teen-Different/squiral_maze • Reinforcement Learning • Updated
Teen-Different/Tabular_RL_For_Multi_Env • Reinforcement Learning • Updated