17 22 183

NB PRO

Skier8402

https://nyab.notion.site

Shuyib

AI & ML interests

Practicing Computer Vision, Optimization, NLP and multimodal system implementation.

Recent Activity

liked a model 5 days ago

facebook/bart-large-cnn

updated a dataset 16 days ago

Skier8402/prompt-garden

new activity 16 days ago

huggingface/InferenceSupport:Qwen/Qwen2.5-Omni-7B

View all activity

Organizations

Skier8402's activity

liked a model 5 days ago

facebook/bart-large-cnn

Summarization • Updated Feb 13, 2024 • 4.32M • • 1.35k

liked a Space 17 days ago

252

Qwen2.5 Omni 7B Demo

🏆

Submit media inputs to generate text and speech responses

liked a model 17 days ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated 2 days ago • 128k • 1.35k

liked a dataset 19 days ago

glaiveai/glaive-function-calling-v2

Viewer • Updated Sep 27, 2023 • 113k • 1.83k • 425

liked a Space 19 days ago

RF-DETR

🔥

SOTA real-time object detection model

liked a Space 25 days ago

120

OctoTools

🚀

An Agentic Framework with Tools for Complex Reasoning

liked a dataset 26 days ago

FreedomIntelligence/medical-o1-reasoning-SFT

Viewer • Updated Feb 22 • 50.1k • 20.4k • 634

liked a model about 1 month ago

sesame/csm-1b

Text-to-Speech • Updated 28 days ago • 98k • • 1.84k

liked 4 Spaces about 1 month ago

102

Phi 4 Multimodal

🌖

Interact with AI using text, images, or audio

Magma UI

📚

Magma-8B model for UI Agents

402

OmniParser V2

🏢

OmniParser, turn your LLM into GUI agent

153

Agent Dino

🌠

@image @rAgent @web @text @tts1 @tts2 @3d

liked a dataset about 1 month ago

allenai/olmOCR-mix-0225

Viewer • Updated Feb 25 • 259k • 2.81k • 116

liked 2 models about 1 month ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated 4 days ago • 862k • 1.29k

allenai/olmOCR-7B-0225-preview-GGUF

Updated Feb 26 • 2.18k • 23

liked a model about 2 months ago

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated 7 days ago • 1.21M • 325

liked 4 Spaces about 2 months ago

1.43k

Wan2.1

💻

Wan: Open and Advanced Large-Scale Video Generative Models

Talk to OpenAI (Gradio UI)

🗣

Talk to OpenAI (Gradio UI)

Hello Computer (Gradio)

💻

Say computer (Gradio)

Talk to Gemini

♊

Talk to Gemini using Google's multimodal API