Vlad Kostoglodov

vkost

AI & ML interests

None yet

vkost's activity

upvoted a paper 28 days ago

Forgetting Transformer: Softmax Attention with a Forget Gate

Paper • 2503.02130 • Published Mar 3 • 29

upvoted a paper about 1 month ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25 • 73

upvoted a paper 4 months ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 82

upvoted a paper 5 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 93

liked a model 6 months ago

jxm/cde-small-v1

Feature Extraction • Updated Jan 21 • 648 • 285

upvoted a collection 6 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 586

liked a model 7 months ago

colbert-ir/colbertv2.0

Updated Apr 5, 2024 • 1.34M • 246

liked a Space 7 months ago

8.26k

Kolors Virtual Try-On

👕

Overlay garment on person image

liked a model 8 months ago

vidore/colpali

Visual Document Retrieval • Updated Feb 5 • 10.9k • 432

upvoted a paper 10 months ago

nabla^2DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials

Paper • 2406.14347 • Published Jun 20, 2024 • 101

liked a model 12 months ago

NousResearch/Meta-Llama-3-8B-GGUF

Text Generation • Updated Apr 18, 2024 • 413 • 48

liked a Space over 1 year ago

105

Tatr Demo

👁

Extract tables from images and convert to CSV

liked 2 models over 1 year ago

IlyaGusev/saiga_mistral_7b_lora

Text Generation • Updated Feb 13, 2024 • 97

ai-forever/ruGPT-3.5-13B

Text Generation • Updated Dec 5, 2023 • 3.97k • 281

liked a Space almost 2 years ago

Transfer Learning Time Series

🐠

liked a model about 2 years ago

google/flan-t5-xxl

Text2Text Generation • Updated Jul 27, 2023 • 292k • 1.24k

liked a Space about 2 years ago

337

Chat Llm Streaming

📊

liked 2 Spaces over 2 years ago

11.1k

Stable Diffusion 2-1

🔥

Generate images from text descriptions

2.87k

CLIP Interrogator

🕵

Analyze image to generate descriptive prompt