Mishig Davaadorj's picture

Mishig Davaadorj

mishig

·

AI & ML interests

NP-completeness, grammars, universality

Recent Activity

updated a Space about 14 hours ago

huggingface/inference-playground

liked a Space 4 days ago

enzostvs/deepsite

published an article 5 days ago

The NLP Course is becoming the LLM Course!

View all activity

Organizations

mishig's activity

updated a Space about 14 hours ago

Inference Playground

Set and update website theme based on user preference

liked a Space 4 days ago

DeepSite

Generate any application with DeepSeek

published an article 5 days ago

Article

The NLP Course is becoming the LLM Course!

By

and 9 others •

5 days ago

• 62

updated a dataset 7 days ago

hf-doc-build/doc-build

Updated 8 minutes ago • 378k • 8

upvoted a paper 10 days ago

Universal Language Model Fine-tuning for Text Classification

Paper • 1801.06146 • Published Jan 18, 2018 • 7

upvoted a paper 26 days ago

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Paper • 2309.08600 • Published Sep 15, 2023 • 15

updated a Space about 1 month ago

Visualize Dataset (v2.0+ latest dataset format)

Browse robotic datasets visually

upvoted an article about 1 month ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 170

updated a Space about 1 month ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

New activity in nanotron/ultrascale-playbook about 1 month ago

Make hash section working

#89 opened about 1 month ago by

upvoted an article about 1 month ago

Article

Remote VAEs for decoding with HF endpoints 🤗

Feb 24

• 37

upvoted 2 papers about 2 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 179

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 151

liked a Space about 2 months ago

AI Podcast Generator

Generate Podcast using Kokoro-TTS!

liked a model about 2 months ago

zed-industries/zeta

Updated Feb 27 • 3.17k • 259

upvoted a collection about 2 months ago

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Feb 20 • 50

upvoted an article about 2 months ago

Article

State of open video generation models in Diffusers

Jan 27

• 50

upvoted a paper about 2 months ago

DynVFX: Augmenting Real Videos with Dynamic Content

Paper • 2502.03621 • Published Feb 5 • 29

upvoted a collection 2 months ago

Hibiki fr-en

Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated Feb 6 • 52