Nikolay Kozlov

NikolayKozloff

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

ServiceNow-AI/Apriel-5B-Base

liked a model 1 day ago

ServiceNow-AI/Apriel-5B-Instruct

liked a model 2 days ago

OpenGVLab/InternVL3-78B

NikolayKozloff's activity

upvoted 2 collections 4 days ago

Orpheus Multilingual Research Release

Beta Release of multilingual models. • 12 items • Updated 4 days ago • 73

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 2 days ago • 58

upvoted a collection 6 days ago

Cogito v1 Preview

5 items • Updated 7 days ago • 95

upvoted a collection 8 days ago

Minueza-2-96M

The second version of the Minueza series. Base model and its fine-tunings. • 6 items • Updated 1 day ago • 1

upvoted a collection 9 days ago

Llama 4

Llama 4 release • 10 items • Updated 9 days ago • 424

upvoted a collection 11 days ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 8 items • Updated 11 days ago • 116

upvoted a collection 14 days ago

YandexGPT-5-Lite-8B

3 items • Updated 15 days ago • 4

upvoted a collection 19 days ago

Ling

6 items • Updated Mar 10 • 8

upvoted a collection 21 days ago

Tessa-T1 REACT REASONING MODEL

Tessa-T1 is a model that generates stateful React components with Tailwind styling. It also supports features of other libraries. It is based on Qwen2.5-Coder. • 5 items • Updated 21 days ago • 6

upvoted a paper 23 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 25 days ago • 46

upvoted 2 collections 26 days ago

Hamanasu

A brand new series of Models from yours truly, Designed for Intelligence, Creativity and Roleplay - R/Locallama keeps DELETING MY GODDAMN COMMENTS • 31 items • Updated 8 days ago • 8

Llama Nemotron

Open, Production-ready Enterprise Models • 4 items • Updated 10 minutes ago • 36

upvoted a collection 27 days ago

EXAONE-Deep

EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 9 items • Updated 28 days ago • 86

upvoted 5 collections about 1 month ago

DeepHermes

Preview models of hybrid reasoner Hermes series • 6 items • Updated Mar 13 • 27

BD3-LMs

https://m-arriola.com/bd3lms/ • 4 items • Updated 3 days ago • 20

Gemma 3 Release

17 items • Updated 11 days ago • 327

CardProjector-v2

Big update! • 4 items • Updated Mar 10 • 2

D_AU - Thinking / Reasoning Models - Reg and MOEs.

QwQ, DeepSeek, EXAONE, DeepHermes, and other "thinking/reasoning" AIs / LLMs in regular, MoE (mixture of experts), and hybrid model formats. • 55 items • Updated about 16 hours ago • 5